1、中文 5850 字, 3500 英文单词, 18500 英文字符 文献出处: Torrecilla J L, Romo J. Data learning from big dataJ. Statistics & Probability Letters, 2018. Data learning from big data Jos L. Torrecilla , Juan Romo Abstract Technology is generating a huge and growing availability of observations of diverse nature. This big
2、 data is placing data learning as a central scientific discipline. It includes collec - tion, storage, preprocessing, visualization and, essentially, statistical analysis of enormous batches of data. In this paper, we discuss the role of statistics regarding some of the issues raised by big data in
3、this new paradigm and also propose the name of data learning to describe all the activities that allow to obtain relevant knowledge from this new source of information. Keywords: Big data, Data learning, Statistics 1. Introduction Big data is one of the most fashionable concepts nowadays: everybody
4、talks about it, is permanently in the media, and companies and governments try to exploit the new amount of available information (Lohr, 2012; John Walker, 2014; James, 2018). The ideas behind this interest are mainly two. First, the fact that at present, most activities generate data (with very low
5、 cost) that contains (potentially valuable) information. The second one is well summarized in John Walker (2014): Data- driven decisions are better decisions - it is as simple as that. Using big data enables managers to decide on the basis of evidence rather than intuition. The opportunities offered
6、 by big data are undeniable, but there is still a debate about the scope and usefulness of this (Secchi, 2018; Bhlmann and van de Geer, 2018). The opinions of the most fervent followers speak of the end of the theory and the models and, in articles like the controversial The end of theory (Anderson, 2008) they argue that with enough data, the numbers speak for themselves. On the other hand, there have been more critical voices that question whether the optimism and the faith tha