Data is increasingly cheap and ubiquitous. We are now digitizing analog content that was created over centuries and collecting myriad new types of data from web logs, mobile devices, sensors, instruments, and transactions. IBM estimates that 90 percent of the data in the world today has been created in the past two years.
At the same time, new technologies are emerging to organize and make sense of this avalanche of data. We can now identify patterns and regularities in data of all sorts that allow us to advance scholarship, improve the human condition, and create commercial and social value. The rise of "big data" has the potential to deepen our understanding of phenomena ranging from physical and biological systems to human social and economic behavior [1].
A Challenge Identified
Virtually every sector of the economy now has access to more data than would have been imaginable even a decade ago. Businesses today are accumulating new data at a rate that exceeds their capacity to extract value from it. The question facing every organization that wants to attract a community is how to use data effectively: not just its own data, but all of the data that is available and relevant.
Our ability to derive social and economic value from the newly available data is limited by the lack of expertise. Working with this data requires distinctive new skills and tools. The corpuses are often too voluminous to fit on a single computer, to manipulate with traditional databases or statistical tools, or to represent using standard graphics software. The data is also more heterogeneous than the highly curated data of the past. Digitized text, audio, and visual content, like sensor and web log data, is typically messy, incomplete, and unstructured; is often of uncertain provenance and quality; and frequently must be combined with other data to be useful. Working with user-generated data sets also raises challenging issues of privacy, security, and ethics.
The field of data science is emerging at the intersection of the fields of social science and statistics, information and computer science, and design. The UC Berkeley School of Information is ideally positioned to bring these disciplines together and to provide students with the research and professional skills to succeed in leading-edge organizations.