Statistics and Big Data

We have lots of data these days: vast numbers of zeros and ones stored in our computers. We have far more data than in the old days, when measuring the orbit of a single planet took weeks. Then, around 1800, Gauss developed the least-squares method. As soon as there are data, a statistical method follows. Think of the data sitting on your computer. If you do not get them out of there, what is the sense of it all?

The scientific motivation for analyzing data is to increase knowledge. A classical statistical measure of uncertainty is the p-value. If it is small, typically below a preset threshold such as 0.05, one has found something: the result is called significant and is published in scientific journals. False positives are part of the process of gaining knowledge and insight. Significant findings therefore have to be subjected to further studies, and sense has to be made of them. Thus statistics plays its role in understanding algorithms, the value of their outcomes, and the consequent decisions for new research directions.
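To make the remark about false positives concrete, here is a minimal sketch in Python. It is an illustration only and not part of the talk: the two-sided z-test, the threshold of 0.05, the sample size, and the function names are all assumptions chosen for the example. It simulates many experiments in which there is truly nothing to find and counts how often the p-value is nevertheless small.

```python
import math
import random


def two_sided_p_value(sample, sigma=1.0):
    """p-value of a two-sided z-test of H0: mean = 0, with known std. dev. sigma."""
    n = len(sample)
    z = (sum(sample) / n) / (sigma / math.sqrt(n))
    # P(|Z| >= |z|) for a standard normal Z
    return math.erfc(abs(z) / math.sqrt(2))


def false_positive_rate(n_experiments=10_000, n_obs=30, alpha=0.05, seed=0):
    """Fraction of pure-noise experiments that are declared significant."""
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    hits = 0
    for _ in range(n_experiments):
        # Data generated under the null hypothesis: the true mean is 0.
        sample = [rng.gauss(0.0, 1.0) for _ in range(n_obs)]
        if two_sided_p_value(sample) < alpha:
            hits += 1
    return hits / n_experiments


if __name__ == "__main__":
    print(false_positive_rate())
```

Under these assumptions the printed fraction should be close to the chosen threshold, which is why a significant result is a starting point for further study and not a final answer.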

Dr Rajeeva Laxman Karandikar is an Indian mathematician, statistician and psephologist. He has been director of the Chennai Mathematical Institute, Chennai, since 2010. He is a Fellow of the Indian Academy of Sciences and of the Indian National Science Academy.


