Big data becomes a new focus of information technology
Recently, in the field of information technology, following the cloud computing, the word "big data" has become the focus of media chasing. In response, Academician Li Guojie, academician of the Chinese Academy of Engineering and chief scientist of the Institute of Computing Technology, Chinese Academy of Sciences, said in an interview with reporters: “The scientific and technological community should pay close attention to the new development direction of big data research, and find challenging scientific issues from big data applications. The fourth scientific paradigm based on big data promotes the formation of a new type of interdiscipline: cyberdata science."
Changes in the information society
"60 years ago digital computers made information readable. The Internet made information available 20 years ago. Search engine crawlers turned the Internet into a database 10 years ago. Now Google and similar companies are processing massive corpora as a human social laboratory." At the 424th colloquium held at the recent Xiangshan Science Conference, Li Guojie cited a quote from Anderson, editor-in-chief of Wired magazine, as the opening speech of his speech.
Wikipedia defines: "Big data is a collection of data that cannot be captured, managed, and processed by conventional software tools within a certain period of time." "Big data" is characterized by large amounts of data, variety, and speed. Involved in the Internet, economy, biology, medicine, astronomy, meteorology, physics and many other fields.
IDC’s digital universe research report stated that the total amount of data created and copied in the world in 2011 was 1.8ZB, and it is predicted that by 2020, the world will have 35ZB of data volume.
“The decline in data costs has led to a dramatic increase in the amount of data, and the emergence of new data sources and data acquisition technology has increased the types of data,†Li Guojie told reporters. “Unstructured data has added to the complexity of big data.â€
On March 29, 2012, the U.S. government allocated 200 million U.S. dollars to launch the Big Data Research and Development Initiative. Li Guojie believes that this is a landmark event that shows that big data has become the focus of information technology after the IC and the Internet.
Pay attention to the technical challenges raised by big data
In response to the United States plan for big data research, Li Guojie told reporters that this big data plan is most concerned with data engineering rather than data science. It mainly considers the efficiency of big data analysis algorithms and systems. For our country, the technical challenges of big data projects should also be taken seriously.
For hundreds of years, scientific research has been doing "from thin to thick" things, turning "small data" into "big data." Li Guojie believes that what we must do now is "from thick to thin" and we must turn "big data" into "small data." "A lot of data is repetitive or worthless. In the future, our task is not to acquire more and more data, but to re-classify data and to refine it," he said.
He further pointed out that the existing data center technology is difficult to meet the application needs of big data, and the revolutionary reconstruction of the entire IT architecture is imperative. First, the growth of storage capacity is far behind the growth of data. Designing the most reasonable hierarchical storage architecture has become the key to information systems. Second, the movement of data has become the largest overhead of information systems. Information systems need to change from data around processors to processing power around data. In addition, highly scalable and highly available data analysis techniques, new data representation methods, and high-throughput computers are all technical issues that need to be addressed.
Basic scientific issues still have no consensus
Although the academic community has taken note of the scientific challenges brought about by big data, there is still no consensus on some basic scientific issues.
Many scholars believe that computer science is the science of algorithms, and data science is the science of data. Some scholars have tried to study "data" as a "natural body", that is, the "data world."
However, in Li Guojie's view, the common problem of the "data world" as a form of indirect existence of objective things is not clear from the "physical world" of various fields.
He believes that unlike data mining and statistics, scholars engaged in big data research should pay more attention to the knowledge and laws behind statistical distribution.
The complexity of "big data" comes mainly from the connections between individuals. “The data is behind the network. The people behind the network are people. Researching the network data is actually a social network of researchers.†Li Guojie pointed out that “the 'network data science' should be a science that studies society as a whole and its focus is on research. The social network behind the data."
Therefore, Big Data has become the link connecting human society, the physical world, and the information space. It is necessary to build a unified information system that integrates the three worlds of humans, machines, and things.
Li Guojie called for an upsurge in big data research and academics to stay awake. “First, we must clarify the most valuable application areas of big data research, and clarify the boundaries and research objects of data science. Only by clarifying the scientific issues to be studied, will the online data science be on the track of sound development,†he said.
Anti Static Flooring,Anti Static Floor Tiles,Anti Static Vinyl Flooring,Anti Static Tiles
JIANGSU HUAJING FLOOR TECHNOLOGY CO.,LTD , http://www.huajing-floor.com