Бит (рачунарство)

Умношци бита
Види још: бајт, нибл, неодређеност умножака

Бит (енгл. bit) је најмања јединица информације у рачунарству. Један бит представља количину информације потребну за разликовање два међусобна искључива стања, често представљана као један (1) и нула (0), да/не, тачно/нетачно, има/нема напона, итд. Бит се такође користи као име цифре у бинарном бројном систему (систем са основом 2). Сама реч је први пут употребљена 1948. године у раду Клода Шенона и настала је склапањем почетка енглеске речи „бинарна” и краја речи „цифра” или „јединица” на енглеском језику (binary digit или binary unit). Он је рекао да реч потиче од Џона Такија.

Бит је најосновнија јединица информација у рачунарству и дигиталним комуникацијама.^[1] Име је контракција бинарне цифре (енгл. binary digit).^[2] Бит представља логичко стање са једном од две могуће вредности. Ове вредности се најчешће представљају као „1” или „0”, али се обично користе и друге репрезентације као што су тачно/нетачно, да/не, +/− или укључено/искључено.

Кореспонденција између ових вредности и физичких стања основног складишта или уређаја је ствар конвенције, а различите доделе се могу користити чак и унутар истог уређаја или програма. Може се физички имплементирати са уређајем са два стања.

Симбол за бинарну цифру је или 'бит' по препоруци стандарда IEC 80000-13:2008 или мало слово 'b', као што препоручује стандард IEEE 1541-2002.

Непрекидна група бинарних цифара се обично назива низ битова, вектор бита или једнодимензионални (или вишедимензионални) низ битова. Група од осам бинарних цифара се зове један бајт, али историјски величина бајта није стриктно дефинисана. Често се пола, пуне, двоструке и четвороструке речи састоје више бајтова што су мала степени двојке.

У теорији информација, један бит је информациона ентропија бинарне случајне променљиве која је 0 или 1 са једнаком вероватноћом,^[3] или информација која се добија када вредност такве променљиве постане позната.^[4]^[5] Као јединица информација, бит је познат и као шенон,^[6] назван по Клоду Е. Шенону.

Историја

Кодирање података дискретним битовима коришћено је у бушеним картицама које су измислили Басил Бошон и Жан-Баптист Фалкон (1732), које је развио Жозеф Мари Жакард (1804), а касније су то усвојили Семјон Корсаков, Чарлс Бабиџ, Херман Холерит и рани произвођачи рачунара попут ИБМ-а. Варијанта те идеје била је перфорирана папирна трака. У свим тим системима, медијум (картица или трака) је концептуално носио низ позиција рупа; свака позиција може бити или пробијена или не, носећи тако један бит информације. Кодирање текста по битовима је такође коришћено у Морзеовом коду (1844) и раним дигиталним комуникационим машинама као што су телетипови и тикерне машине (1870).

Ралф Хартли је предложио употребу логаритамске мере информација 1928. године.^[7] Клод Е. Шенон је први пут употребио реч „бит“ у свом семиналном раду из 1948. „Математичка теорија комуникације“.^[8]^[9]^[10] Он је његово порекло приписао Џону В. Тукију, који је 9. јануара 1947. написао меморандум Бел Лабса у коме је „бинарну информацију цифру“ контраховао на једноставно „бит“.^[8] Ваневар Буш је 1936. године писао о „битовима информација“ који су могли да се чувају на бушеним картицама које су се користиле у механичким рачунарима тог времена.^[11] Први програмабилни рачунар, који је направио Конрад Зус, користио је бинарну нотацију за бројеве.^[12]

Јединица и симбол

Бит није дефинисан у Међународном систему јединица (СИ). Међутим, Међународна електротехничка комисија издала је стандард IEC 60027, који прецизира да симбол за бинарну цифру треба да буде 'bit', и то треба да се користи у свим вишекратницима, као што је 'kbit', за килобит.^[13] Међутим, мало слово 'b' се такође широко користи и препоручено од стране IEEE 1541 стандарда (2002). Насупрот томе, велико слово 'B' је стандардни и уобичајени симбол за бајт.

Умношци бита

Више битова се може изразити и представити на неколико начина. За практичност представљања група битова који се често понављају у информационој технологији, традиционално се користи неколико јединица информација. Најчешћа је јединица бајт, који је сковао Вернер Бакхолц у јуну 1956. године. Она је историјски кориштено за представљање групе битова који се користе за кодирање једног карактера текста (док није преузето UTF-8 вишебајтно кодирање) у рачунару^[14]^[15]^[16]^[17]^[18] и из тог разлога је коришћен као основни адресабилни елемент у многим рачунарским архитектурама. Тренд у дизајну хардвера конвергирао је на најчешћу имплементацију коришћења осам битова по бајту, као што се данас широко користи. Међутим, због двосмислености ослањања на основни хардверски дизајн, октет јединица је дефинисан да експлицитно означава секвенцу од осам битова.

Груписање битова

Ради практичности баратања и коришћења информација, битови се групишу у физичке и логичке скупове. Најчешће су то:

Нибл – група од 4 бита, физички скуп битова али не и адресибилни.

Бајт – најмања адресабилна група битова. У почетку је број био врло променљив а касније је скоро потпуно стандардизован на 8.

Октет је група од 8 битова и, за разлику од Француске, често се користи реч бајт уместо октет јер се подразумева да је то исто.

Реч је већа група битова, обично 2 бајта, али није стандардизована (постоје архитектуре са речима од 4, 8 или више бајтова). Реч је најчешћа адресибилна меморијска јединица за податке и за програм. По дужини речи се разликују рачунарске архитектуре, па се говори о шеснаестобитној, тридесет двобитној или шездесет четворобитној архитектури.

Префикси СИ система (k-кило, M-мега, G-гига, итд) су у почетку коришћени да означавају сличне али не исте умношке. Тако је килобит био 2¹⁰=1.024 бита, мегабит 1.024 килобита, итд. јер декадна вредност 1.000 није прилагођена бинарном бројном систему рачунара а 1.024 је приближна вредност. Касније је, међутим дошло до забуне и стандардизовања нових префикса (киби-, меби-, итд) – погледајте табелу.

Брзина преноса

Приликом преноса дигиталног сигнала кроз неки медијум као јединица мере се користи број бита у секунди тј. b/s. Обично су брзине веће па се користе веће јединице као kb/s, Mb/s итд.

Референце

^ Information in small bits Архивирано на сајту Wayback Machine (21. октобар 2021) Information in Small Bits is a book produced as part of a non-profit outreach project of the IEEE Information Theory Society. The book introduces Claude Shannon and basic concepts of Information Theory to children 8 and older using relatable cartoon stories and problem-solving activities.
^ Mackenzie, Charles E. (1980). Coded Character Sets, History and Development. The Systems Programming Series (1 изд.). Addison-Wesley Publishing Company, Inc. стр. x. ISBN 978-0-201-14460-4. LCCN 77-90165. Архивирано из оригинала 2016-11-18. г. Приступљено 2016-05-22. [1]
^ Anderson, John B.; Johnnesson, Rolf (2006), Understanding Information Transmission
^ Haykin, Simon (2006), Digital Communications
^ IEEE Std 260.1-2004
^ „Units: B”. Архивирано из оригинала 2016-05-04. г.
^ Abramson, Norman (1963). Information theory and coding. McGraw-Hill.
^ ^а ^б Shannon, Claude Elwood (јул 1948). „A Mathematical Theory of Communication” (PDF). Bell System Technical Journal. 27 (3): 379—423. doi:10.1002/j.1538-7305.1948.tb01338.x. hdl:11858/00-001M-0000-002C-4314-2  . Архивирано из оригинала (PDF) 1998-07-15. г. „The choice of a logarithmic base corresponds to the choice of a unit for measuring information. If the base 2 is used the resulting units may be called binary digits, or more briefly bits, a word suggested by J. W. Tukey.”
^ Shannon, Claude Elwood (октобар 1948). „A Mathematical Theory of Communication”. Bell System Technical Journal. 27 (4): 623—666. doi:10.1002/j.1538-7305.1948.tb00917.x. hdl:11858/00-001M-0000-002C-4314-2  .
^ Shannon, Claude Elwood; Weaver, Warren (1949). A Mathematical Theory of Communication (PDF). University of Illinois Press. ISBN 0-252-72548-4. Архивирано из оригинала (PDF) 1998-07-15. г.
^ Bush, Vannevar (1936). „Instrumental analysis”. Bulletin of the American Mathematical Society. 42 (10): 649—669. doi:10.1090/S0002-9904-1936-06390-1  . Архивирано из оригинала 2014-10-06. г.
^ "The World's Technological Capacity to Store, Communicate, and Compute Information" Архивирано 2013-07-27 на сајту Wayback Machine, especially Supporting online material Архивирано 2011-05-31 на сајту Wayback Machine, Martin Hilbert and Priscila López (2011), Science, 332(6025), 60-65; free access to the article through here: martinhilbert.net/WorldInfoCapacity.html
^ National Institute of Standards and Technology (2008), Guide for the Use of the International System of Units. Online version. Архивирано 3 јун 2016 на сајту Wayback Machine
^ Bemer, Robert William (2000-08-08). „Why is a byte 8 bits? Or is it?”. Computer History Vignettes. Архивирано из оригинала 2017-04-03. г. Приступљено 2017-04-03. „[…] With IBM's STRETCH computer as background, handling 64-character words divisible into groups of 8 (I designed the character set for it, under the guidance of Dr. Werner Buchholz, the man who DID coin the term "byte" for an 8-bit grouping). […] The IBM 360 used 8-bit characters, although not ASCII directly. Thus Buchholz's "byte" caught on everywhere. I myself did not like the name for many reasons. […]”
^ Buchholz, Werner (1956-06-11). „7. The Shift Matrix” (PDF). The Link System. IBM. стр. 5—6. Stretch Memo No. 39G. Архивирано (PDF) из оригинала 2017-04-04. г. Приступљено 2016-04-04. „[…] Most important, from the point of view of editing, will be the ability to handle any characters or digits, from 1 to 6 bits long […] the Shift Matrix to be used to convert a 60-bit word, coming from Memory in parallel, into characters, or "bytes" as we have called them, to be sent to the Adder serially. The 60 bits are dumped into magnetic cores on six different levels. Thus, if a 1 comes out of position 9, it appears in all six cores underneath. […] The Adder may accept all or only some of the bits. […] Assume that it is desired to operate on 4 bit decimal digits, starting at the right. The 0-diagonal is pulsed first, sending out the six bits 0 to 5, of which the Adder accepts only the first four (0-3). Bits 4 and 5 are ignored. Next, the 4 diagonal is pulsed. This sends out bits 4 to 9, of which the last two are again ignored, and so on. […] It is just as easy to use all six bits in alphanumeric work, or to handle bytes of only one bit for logical analysis, or to offset the bytes by any number of bits. […]”
^ Buchholz, Werner (фебруар 1977). „The Word "Byte" Comes of Age...”. Byte Magazine. 2 (2): 144. „[…] The first reference found in the files was contained in an internal memo written in June 1956 during the early days of developing Stretch. A byte was described as consisting of any number of parallel bits from one to six. Thus a byte was assumed to have a length appropriate for the occasion. Its first use was in the context of the input-output equipment of the 1950s, which handled six bits at a time. The possibility of going to 8 bit bytes was considered in August 1956 and incorporated in the design of Stretch shortly thereafter. The first published reference to the term occurred in 1959 in a paper "Processing Data in Bits and Pieces" by G A Blaauw, F P Brooks Jr and W Buchholz in the IRE Transactions on Electronic Computers, June 1959, page 121. The notions of that paper were elaborated in Chapter 4 of Planning a Computer System (Project Stretch), edited by W Buchholz, McGraw-Hill Book Company (1962). The rationale for coining the term was explained there on page 40 as follows:
Byte denotes a group of bits used to encode a character, or the number of bits transmitted in parallel to and from input-output units. A term other than character is used here because a given character may be represented in different applications by more than one code, and different codes may use different numbers of bits (ie, different byte sizes). In input-output transmission the grouping of bits may be completely arbitrary and have no relation to actual characters. (The term is coined from bite, but respelled to avoid accidental mutation to bit.)
System/360 took over many of the Stretch concepts, including the basic byte and word sizes, which are powers of 2. For economy, however, the byte size was fixed at the 8 bit maximum, and addressing at the bit level was replaced by byte addressing. […]”
^ Blaauw, Gerrit Anne; Brooks, Jr., Frederick Phillips; Buchholz, Werner (1962), „Chapter 4: Natural Data Units” (PDF), Ур.: Buchholz, Werner, Planning a Computer System – Project Stretch, McGraw-Hill Book Company, Inc. / The Maple Press Company, York, PA., стр. 39—40, LCCN 61-10466, Архивирано из оригинала (PDF) 2017-04-03. г., Приступљено 2017-04-03
^ Bemer, Robert William (1959). „A proposal for a generalized card code of 256 characters”. Communications of the ACM. 2 (9): 19—23. S2CID 36115735. doi:10.1145/368424.368435.

Спољашње везе

Bit Calculator – a tool providing conversions between bit, byte, kilobit, kilobyte, megabit, megabyte, gigabit, gigabyte
BitXByteConverter Архивирано на сајту Wayback Machine (6. април 2016) – a tool for computing file sizes, storage capacity, and digital information in various units

[Information_in_small_bits-1] Information in small bits Архивирано на сајту Wayback Machine (21. октобар 2021) Information in Small Bits is a book produced as part of a non-profit outreach project of the IEEE Information Theory Society. The book introduces Claude Shannon and basic concepts of Information Theory to children 8 and older using relatable cartoon stories and problem-solving activities.

[Mackenzie_1980-2] Mackenzie, Charles E. (1980). Coded Character Sets, History and Development. The Systems Programming Series (1 изд.). Addison-Wesley Publishing Company, Inc. стр. x. ISBN 978-0-201-14460-4. LCCN 77-90165. Архивирано из оригинала 2016-11-18. г. Приступљено 2016-05-22. [1]

[Anderson_2006-3] Anderson, John B.; Johnnesson, Rolf (2006), Understanding Information Transmission

[Haykin_2006-4] Haykin, Simon (2006), Digital Communications

[IEEE_260-5] IEEE Std 260.1-2004

[Rowlett-6] „Units: B”. Архивирано из оригинала 2016-05-04. г.

[Abramson_1963-7] Abramson, Norman (1963). Information theory and coding. McGraw-Hill.

[Shannon_1948_1-8] а ^б Shannon, Claude Elwood (јул 1948). „A Mathematical Theory of Communication” (PDF). Bell System Technical Journal. 27 (3): 379—423. doi:10.1002/j.1538-7305.1948.tb01338.x. hdl:11858/00-001M-0000-002C-4314-2  . Архивирано из оригинала (PDF) 1998-07-15. г. „The choice of a logarithmic base corresponds to the choice of a unit for measuring information. If the base 2 is used the resulting units may be called binary digits, or more briefly bits, a word suggested by J. W. Tukey.”

[Shannon_1948_2-9] Shannon, Claude Elwood (октобар 1948). „A Mathematical Theory of Communication”. Bell System Technical Journal. 27 (4): 623—666. doi:10.1002/j.1538-7305.1948.tb00917.x. hdl:11858/00-001M-0000-002C-4314-2  .

[Shannon_1949-10] Shannon, Claude Elwood; Weaver, Warren (1949). A Mathematical Theory of Communication (PDF). University of Illinois Press. ISBN 0-252-72548-4. Архивирано из оригинала (PDF) 1998-07-15. г.

[Bush_1936-11] Bush, Vannevar (1936). „Instrumental analysis”. Bulletin of the American Mathematical Society. 42 (10): 649—669. doi:10.1090/S0002-9904-1936-06390-1  . Архивирано из оригинала 2014-10-06. г.

[Hilbert-Lopez_2011-12] "The World's Technological Capacity to Store, Communicate, and Compute Information" Архивирано 2013-07-27 на сајту Wayback Machine, especially Supporting online material Архивирано 2011-05-31 на сајту Wayback Machine, Martin Hilbert and Priscila López (2011), Science, 332(6025), 60-65; free access to the article through here: martinhilbert.net/WorldInfoCapacity.html

[NIST_2008-13] National Institute of Standards and Technology (2008), Guide for the Use of the International System of Units. Online version. Архивирано 3 јун 2016 на сајту Wayback Machine

[Bemer_2000-14] Bemer, Robert William (2000-08-08). „Why is a byte 8 bits? Or is it?”. Computer History Vignettes. Архивирано из оригинала 2017-04-03. г. Приступљено 2017-04-03. „[…] With IBM's STRETCH computer as background, handling 64-character words divisible into groups of 8 (I designed the character set for it, under the guidance of Dr. Werner Buchholz, the man who DID coin the term "byte" for an 8-bit grouping). […] The IBM 360 used 8-bit characters, although not ASCII directly. Thus Buchholz's "byte" caught on everywhere. I myself did not like the name for many reasons. […]”

[Buchholz_1956-15] Buchholz, Werner (1956-06-11). „7. The Shift Matrix” (PDF). The Link System. IBM. стр. 5—6. Stretch Memo No. 39G. Архивирано (PDF) из оригинала 2017-04-04. г. Приступљено 2016-04-04. „[…] Most important, from the point of view of editing, will be the ability to handle any characters or digits, from 1 to 6 bits long […] the Shift Matrix to be used to convert a 60-bit word, coming from Memory in parallel, into characters, or "bytes" as we have called them, to be sent to the Adder serially. The 60 bits are dumped into magnetic cores on six different levels. Thus, if a 1 comes out of position 9, it appears in all six cores underneath. […] The Adder may accept all or only some of the bits. […] Assume that it is desired to operate on 4 bit decimal digits, starting at the right. The 0-diagonal is pulsed first, sending out the six bits 0 to 5, of which the Adder accepts only the first four (0-3). Bits 4 and 5 are ignored. Next, the 4 diagonal is pulsed. This sends out bits 4 to 9, of which the last two are again ignored, and so on. […] It is just as easy to use all six bits in alphanumeric work, or to handle bytes of only one bit for logical analysis, or to offset the bytes by any number of bits. […]”

[Buchholz_1977-16] Buchholz, Werner (фебруар 1977). „The Word "Byte" Comes of Age...”. Byte Magazine. 2 (2): 144. „[…] The first reference found in the files was contained in an internal memo written in June 1956 during the early days of developing Stretch. A byte was described as consisting of any number of parallel bits from one to six. Thus a byte was assumed to have a length appropriate for the occasion. Its first use was in the context of the input-output equipment of the 1950s, which handled six bits at a time. The possibility of going to 8 bit bytes was considered in August 1956 and incorporated in the design of Stretch shortly thereafter. The first published reference to the term occurred in 1959 in a paper "Processing Data in Bits and Pieces" by G A Blaauw, F P Brooks Jr and W Buchholz in the IRE Transactions on Electronic Computers, June 1959, page 121. The notions of that paper were elaborated in Chapter 4 of Planning a Computer System (Project Stretch), edited by W Buchholz, McGraw-Hill Book Company (1962). The rationale for coining the term was explained there on page 40 as follows:
Byte denotes a group of bits used to encode a character, or the number of bits transmitted in parallel to and from input-output units. A term other than character is used here because a given character may be represented in different applications by more than one code, and different codes may use different numbers of bits (ie, different byte sizes). In input-output transmission the grouping of bits may be completely arbitrary and have no relation to actual characters. (The term is coined from bite, but respelled to avoid accidental mutation to bit.)
System/360 took over many of the Stretch concepts, including the basic byte and word sizes, which are powers of 2. For economy, however, the byte size was fixed at the 8 bit maximum, and addressing at the bit level was replaced by byte addressing. […]”

[Buchholz_1962-17] Blaauw, Gerrit Anne; Brooks, Jr., Frederick Phillips; Buchholz, Werner (1962), „Chapter 4: Natural Data Units” (PDF), Ур.: Buchholz, Werner, Planning a Computer System – Project Stretch, McGraw-Hill Book Company, Inc. / The Maple Press Company, York, PA., стр. 39—40, LCCN 61-10466, Архивирано из оригинала (PDF) 2017-04-03. г., Приступљено 2017-04-03

[Bemer_1959-18] Bemer, Robert William (1959). „A proposal for a generalized card code of 256 characters”. Communications of the ACM. 2 (9): 19—23. S2CID 36115735. doi:10.1145/368424.368435.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

Умношци бита
Одомаћена употреба и (ређе бинарно значење)			Стандард за бинарне префиксе по IEC 60027-2
Назив	Ознака	Количина	Назив	Ознака	Количина
килобит	kb	10³ (2¹⁰)	кибибит	Kibit	2¹⁰
мегабит	Mb	10⁶ (2²⁰)	мебибит	Mibit	2²⁰
гигабит	Gb	10⁹ (2³⁰)	гибибит	Gibit	2³⁰
терабит	Tb	10¹² (2⁴⁰)	тебибит	Tibit	2⁴⁰
петабит	Pb	10¹⁵ (2⁵⁰)	пебибит	Pibit	2⁵⁰
ексабит	Eb	10¹⁸ (2⁶⁰)	ексбибит	Eibit	2⁶⁰
зетабит	Zb	10²¹ (2⁷⁰)	зебибит	Zibit	2⁷⁰
јотабит	Yb	10²⁴ (2⁸⁰)	јобибит	Yibit	2⁸⁰
Види још: бајт, нибл, неодређеност умножака