Skip to main content

Big Data - Brian Clegg ****

I first became involved with what we now term big data when providing some mathematical assistance to a major supermarket.  They wanted to know what products would suffer, or benefit, if another product were put on special offer – the victims and victors as they called them.  As an example, if fresh meat pies are put on buy one get one free, should the supermarket plan on stocking more fresh vegetables? That sort of thing.  The supermarket in question had a lot of data concerning historical sales, and what had previously been put on special offer, so it was just a case of designing a set of algorithms to analyse this data to provide the necessary forecast, and also to have the system learn through what we would now call reinforcement learning over time.  This was back in the mid 90s. One can imagine how things - in all camps - should have vastly improved since then.  That’s just one example of where Big Data transparently impacts our lives.

In Big Data, Clegg sets out an assortment of examples from the success of Netflix and the prediction of crime locations to algorithms that have lost people their jobs or caused stock market crashes, examining the mechanisms and implications of each.  Taking the supermarket example - although this is my example and not his - we might ask ourselves who really benefits here – who exactly are the victims and victors (or villains perhaps) in real life?

Big Data is here to stay - should we be afraid of it or embrace it?  As always, Clegg writes with an easy clarity that draws us in - no technical expertise required to understand his exploration of this essential subject - and throughout Big Data’s highly enjoyable pages, the spread and range of material is highly impressive – dizzying in fact.  I personally found entirely new perspectives on the subject that will keep me pondering for quite some time.  

I should add that, if I were still a statistics lecturer at Oxford, I would recommend the book to my students as bedside reading.

Paperback:  

Kindle:  

Review by Peet Morris
Please note, this title is written by the editor of the Popular Science website. Our review is still an honest opinion – and we could hardly omit the book – but do want to make the connection clear.

Comments

Popular posts from this blog

Beyond Weird - Philip Ball *****

It would be easy to think 'Surely we don't need another book on quantum physics.' There are loads of them. Anyone should be happy with The Quantum Age on applications and the basics, Cracking Quantum Physics for an illustrated introduction or In Search of Schrödinger's Cat for classic history of science coverage. Don't be fooled, though - because in Beyond Weird, Philip Ball has done something rare in my experience until Quantum Sense and Nonsense came along. It makes an attempt not to describe quantum physics, but to explain why it is the way it is.

Historically this has rarely happened. It's true that physicists have come up with various interpretations of quantum physics, but these are designed as technical mechanisms to bridge the gap between theory and the world as we see it, rather than explanations that would make sense to the ordinary reader.

Ball does not ignore the interpretations, though he clearly isn't happy with any of them. He seems to come clo…

The AI Delusion - Gary Smith *****

This is a very important little book ('little' isn't derogatory - it's just quite short and in a small format) - it gets to the heart of the problem with applying artificial intelligence techniques to large amounts of data and thinking that somehow this will result in wisdom.

Gary Smith as an economics professor who teaches statistics, understands numbers and, despite being a self-confessed computer addict, is well aware of the limitations of computer algorithms and big data. What he makes clear here is that we forget at our peril that computers do not understand the data that they process, and as a result are very susceptible to GIGO - garbage in, garbage out. Yet we are increasingly dependent on computer-made decisions coming out of black box algorithms which mine vast quantities of data to find correlations and use these to make predictions. What's wrong with this? We don't know how the algorithms are making their predictions - and the algorithms don't kn…

Five Photons - James Geach ****

It is generally acknowledged that Stephen Hawking's A Brief History of Time is one of the most common books to be bought but not read beyond the first few pages. If you are the kind of popular science reader who found Hawking hard going, you can stop now - Five Photons is not for you. If, on the other hand, you found A Brief History of Time a piece of cake and wished you could get into more depth without resorting to heavy mathematics or a tedious textbook style, Five Photons could be just up your street.

Astrophysicist James Geach starts of fairly gently with a chapter on the nature of light that mostly sets aside quantum physics, leading up to the observation that light is our vehicle for for stripping back the history of the universe to its earliest times (or, at least, the point where the universe became transparent). From here on, the five photons of the title take us on different journeys, from the oldest surviving light of the cosmic microwave background radiation to that fr…