Data: Difference between revisions

From Wikipedia, the free encyclopedia
m Reverting possible vandalism by 212.121.212.106 to version by Djkernen. False positive? Report it. Thanks, ClueBot NG. (1217304) (Bot)
No edit summary
Line 3: Line 3:


<!-- Editors: please keep the count/mass discussion and etymology in the body, not the intro. -->
<!-- Editors: please keep the count/mass discussion and etymology in the body, not the intro. -->
'''Data''' ({{IPAc-en|icon|ˈ|d|eɪ|t|ə}} {{respell|DAY|tə}}, {{IPAc-en|ˈ|d|æ|t|ə}} {{respell|DA|tə}}, or {{IPAc-en|ˈ|d|ɑː|t|ə}} {{respell|DAH|tə}}) are values of [[Qualitative data|qualitative]] or [[Quantitative data|quantitative]] [[variable and attribute (research)|variable]]s, belonging to a set of items. Data in [[computing]] (or [[data processing]]) are often represented by a combination of items organized in rows and [[multivariate analysis|multiple variables]] organized in columns. Data are typically the results of measurements and can be [[data visualisation|visualised]] using [[graph (data structure)|graph]]s or [[image]]s. Data as an abstract concept can be viewed as the lowest level of [[abstraction]] from which information and then knowledge are derived. ''[[Raw data]]'', i.e., unprocessed data, refers to a collection of [[number]]s, [[character (computing)|characters]] and is a relative term; data processing commonly occurs by stages, and the "processed data" from one stage may be considered the "raw data" of the next. [[Field work|Field data]] refers to raw data collected in an uncontrolled [[in situ]] environment. [[Experimental data]] refers to data generated within the context of a scientific investigation by observation and recording.
'''Data''' ({{IPAc-en|icon|ˈ|d|eɪ|t|ə}} {{respell|DAY|tə}}, {{IPAc-en|ˈ|d|æ|t|ə}} {{respell|DA|tə}}, or {{IPAc-en|ˈ|d|ɑː|t|ə}} {{respell|DAH|tə}}) are values of [[Qualitative data|qualitative]] or [[Quantitative data|quantitative]] [[variable and attribute (research)|variable]]s, belonging to a set of items. Data in [[computing]] (or [[data processing]]) are luke is awful at cod often represented by a combination of items organized in rows and [[multivariate analysis|multiple variables]] organized in columns. Data are typically the results of measurements and can be [[data visualisation|visualised]] using [[graph (data structure)|graph]]s or [[image]]s. Data as an abstract concept can be viewed as the lowest level of [[abstraction]] from which information and then knowledge are derived. ''[[Raw data]]'', i.e., unprocessed data, refers to a collection of [[number]]s, [[character (computing)|characters]] and is a relative term; data processing commonly occurs by stages, and the "processed data" from one stage may be considered the "raw data" of the next. [[Field work|Field data]] refers to raw data collected in an uncontrolled [[in situ]] environment. [[Experimental data]] refers to data generated within the context of a scientific investigation by observation and recording.


The word ''data'' is the plural of ''datum'', [[Grammatical gender|neuter]] [[past participle]] of the [[Latin]] ''dare'', "to give", hence "something given". In discussions of problems in [[geometry]], [[mathematics]], [[engineering]], and so on, the terms ''givens'' and ''data'' are used interchangeably. Such usage is the origin of ''data'' as a concept in [[computer science]] or [[data processing]]: data are numbers, words, images, etc., accepted as they stand.
The word ''data'' is the plural of ''datum'', [[Grammatical gender|neuter]] [[past participle]] of the [[Latin]] ''dare'', "to give", hence "something given". In discussions of problems in [[geometry]], [[mathematics]], [[engineering]], and so on, the terms ''givens'' and ''data'' are used interchangeably. Such usage is the origin of ''data'' as a concept in [[computer science]] or [[data processing]]: data are numbers, words, images, etc., accepted as they stand.

Revision as of 08:37, 17 September 2012

Data (/ˈdeɪtə/ DAY-tə, /ˈdætə/ DA-tə, or /ˈdɑːtə/ DAH-tə) are values of qualitative or quantitative variables, belonging to a set of items. Data in computing (or data processing) are often represented by a combination of items organized in rows and multiple variables organized in columns. Data are typically the results of measurements and can be visualised using graphs or images. Data as an abstract concept can be viewed as the lowest level of abstraction from which information and then knowledge are derived. Raw data, i.e., unprocessed data, refers to a collection of numbers and characters and is a relative term; data processing commonly occurs by stages, and the "processed data" from one stage may be considered the "raw data" of the next. Field data refers to raw data collected in an uncontrolled in situ environment. Experimental data refers to data generated within the context of a scientific investigation by observation and recording.

The word data is the plural of datum, neuter past participle of the Latin dare, "to give", hence "something given". In discussions of problems in geometry, mathematics, engineering, and so on, the terms givens and data are used interchangeably. Such usage is the origin of data as a concept in computer science or data processing: data are numbers, words, images, etc., accepted as they stand.

Usage in English

In English, the word datum is still used in the general sense of "an item given". In cartography, geography, nuclear magnetic resonance and technical drawing it is often used to refer to a single specific reference datum from which distances to all other data are measured. Any measurement or result is a datum, but "data point" is more usual,[1] albeit tautological or, more generously, pleonastic. Both datums (see usage in datum article) and the originally Latin plural data are used as the plural of datum in English, but data is commonly treated as a mass noun and used with a verb in the singular form, especially in day-to-day usage. For example, "This is all the data from the experiment." This usage is inconsistent with the rules of Latin grammar and traditional English ("These are all the data from the experiment"). Even when a very small quantity of data is referenced (one number, for example) the phrase "piece of data" is often used, as opposed to datum. The debate over appropriate usage is ongoing.[2][3][4]

The IEEE Computer Society allows usage of data as either a mass noun or plural based on author preference.[5] Other professional organizations and style guides[6] require that authors treat data as a plural noun. For example, the Air Force Flight Test Center specifically states that the word data is always plural, never singular.[7]

Data is most often used as a singular mass noun in educated everyday usage.[8][9] Some major newspapers such as The New York Times use it either in the singular or plural. In The New York Times the phrases "the survey data are still being analyzed" and "the first year for which data is available" have appeared within one day.[10] The Wall Street Journal explicitly allows this in its style guide.[11] In scientific writing data is often treated as a plural, as in "These data do not support the conclusions", but it is also used as a singular mass entity like information. British usage now widely accepts treating data as singular in standard English,[12] including everyday newspaper usage[13] at least in non-scientific use.[14] UK scientific publishing still prefers treating it as a plural.[15] Some UK university style guides recommend using data for both singular and plural use[16] and some recommend treating it only as a singular in connection with computers.[17]

Meaning of data, information and knowledge

The terms data, information and knowledge are frequently used for overlapping concepts. The main difference is in the level of abstraction being considered. Data is the lowest level of abstraction, information is the next level, and knowledge is the highest level of the three.[18] Data on its own carries no meaning. For data to become information, it must be interpreted and take on a meaning. For example, the height of Mt. Everest is generally considered "data", a book on Mt. Everest's geological characteristics may be considered "information", and a report containing practical information on the best way to reach Mt. Everest's peak may be considered "knowledge".

Information as a concept bears a diversity of meanings, from everyday usage to technical settings. Generally speaking, the concept of information is closely related to notions of constraint, communication, control, data, form, instruction, knowledge, meaning, mental stimulus, pattern, perception, and representation.

Beynon-Davies uses the concept of a sign to distinguish between data and information; data are symbols while information occurs when symbols are used to refer to something.[19]

It is people and computers who collect data and impose patterns on it. These patterns are seen as information which can be used to enhance knowledge. These patterns can be interpreted as truth, and are authorized as aesthetic and ethical criteria. Events that leave behind perceivable physical or virtual remains can be traced back through data. Marks are no longer considered data once the link between the mark and observation is broken.[20]

Mechanical computing devices are classified according to the means by which they represent data. An analog computer represents a datum as a voltage, distance, position, or other physical quantity. A digital computer represents a datum as a sequence of symbols drawn from a fixed alphabet. The most common digital computers use a binary alphabet, that is, an alphabet of two characters, typically denoted "0" and "1". More familiar representations, such as numbers or letters, are then constructed from the binary alphabet.
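The construction of familiar representations from a binary alphabet can be illustrated with a minimal Python sketch. The function names (`to_bits`, `from_bits`, `int_to_binary`) are hypothetical, and the choice of UTF-8 with 8 symbols per byte is an assumption made for illustration; it is only one of many possible encodings.

```python
def to_bits(text: str) -> str:
    """Encode text as '0'/'1' symbols, 8 per UTF-8 byte (assumed encoding)."""
    return " ".join(format(byte, "08b") for byte in text.encode("utf-8"))


def from_bits(bits: str) -> str:
    """Decode a space-separated string of 8-symbol groups back into text."""
    data = bytes(int(chunk, 2) for chunk in bits.split())
    return data.decode("utf-8")


def int_to_binary(n: int) -> str:
    """Write a non-negative integer using only the symbols '0' and '1'."""
    return format(n, "b")


if __name__ == "__main__":
    encoded = to_bits("data")
    print(encoded)             # the letters as sequences of binary symbols
    print(from_bits(encoded))  # round-trips back to the original text
    print(int_to_binary(5))    # the number five in the binary alphabet
```

The same sequence of symbols can stand for a letter or a number; which one it is depends on the interpretation applied to it.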

Some special forms of data are distinguished. A computer program is a collection of data, which can be interpreted as instructions. Most computer languages make a distinction between programs and the other data on which programs operate, but in some languages, notably Lisp and similar languages, programs are essentially indistinguishable from other data. It is also useful to distinguish metadata, that is, a description of other data. A similar yet earlier term for metadata is "ancillary data." The prototypical example of metadata is the library catalog, which is a description of the contents of books.
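The idea that a program can be treated as ordinary data, as in Lisp, can be sketched in Python by writing a small arithmetic "program" as a nested list and interpreting it. This is an illustrative toy under stated assumptions, not how Lisp itself is implemented; the names `OPS`, `evaluate` and `program` are hypothetical.

```python
import operator

# Hypothetical table mapping operator symbols to Python functions.
OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul}


def evaluate(expr):
    """Interpret a nested list such as ["+", 1, ["*", 2, 3]] as instructions."""
    if isinstance(expr, (int, float)):
        return expr  # a bare number evaluates to itself
    op, *args = expr
    values = [evaluate(arg) for arg in args]
    result = values[0]
    for value in values[1:]:
        result = OPS[op](result, value)
    return result


if __name__ == "__main__":
    program = ["+", 1, ["*", 2, 3]]  # the program is an ordinary list
    print(evaluate(program))         # interpreted as instructions
    program[0] = "*"                 # edited like any other data
    print(evaluate(program))
```

Because the program is just a list, it can be inspected, stored, or modified by another program before it is interpreted.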

References

This article is based on material taken from the Free On-line Dictionary of Computing prior to 1 November 2008 and incorporated under the "relicensing" terms of the GFDL, version 1.3 or later.

  1. ^ Matt Dye (2001). "Writing Reports". University of Bristol.
  2. ^ "Data is a singular noun".
  3. ^ "Grammarist: Data".
  4. ^ "Dictionary.com Data".
  5. ^ "IEEE Computer Society Style Guide, DEF". IEEE Computer Society.
  6. ^ "WHO Style Guide" (PDF). Geneva: World Health Organization. 2004. p. 43.[dead link]
  7. ^ The Author's Guide to Writing Air Force Flight Test Center Technical Reports. Air Force Flight Test Center.
  8. ^ New Oxford Dictionary of English, 1999
  9. ^ "...in educated everyday usage as represented by the Guardian newspaper, it is nowadays most often used as a singular." http://www.eisu2.bham.ac.uk/johnstf/revis006.htm
  10. ^
  11. ^ "Is Data Is, or Is Data Ain't, a Plural?". Wall Street Journal. 2012.
  12. ^ New Oxford Dictionary of English. 1999.
  13. ^ Tim Johns (1997). "Data: singular or plural?". ...in educated everyday usage as represented by The Guardian newspaper, it is nowadays most often used as a singular.
  14. ^ "Data". Compact Oxford Dictionary.
  15. ^ "Data: singular or plural?". Blair Wisconsin International University.
  16. ^ "Singular or plural". University of Nottingham Style Book. University of Nottingham.[dead link]
  17. ^ "Computers and computer systems". OpenLearn.[dead link]
  18. ^ Akash Mitra (2011). "Classifying data for successful modeling".
  19. ^
  20. ^ Sharon Daniel. The Database: An Aesthetics of Dignity.