Skip to main content

Can Computers write Science Books? - Brian Clegg

The German academic publisher Springer has for some time been using automated editing software (with mixed results) - but recently has brought out a whole book written by a piece of AI software called Beta Writer. The book, Lithium-Ion Batteries: a machine generated summary of current research, can be downloaded free of charge as a PDF. But is this a serious challenge for science writers?

It's certainly interesting. If I'm honest, this is hardly a book at all - it's more the output of an automated abstract generator pulled together in book form, where frankly this information would be far better just as a web page. However, there's no doubt that there is some interesting work going on here, particularly in the introduction and conclusion sections of the 'book'.

The whole thing starts with a (human written) preface explaining the technology - by far the most readable part of the text. We then get four 'chapters' of machine-generated content, which each have the format introduction/ set of abstracts / conclusion. Obviously it's the introduction and conclusion that provide the most interest.

I'll focus on the first introduction, though the same criticisms apply throughout. The first test of a piece of scientific writing meant to be readable is to take a step back and get an overview of a chunk of text - does it look like English or is it dominated by acronyms and numbers? A chunk out of the first page shows that this is very dense technical text, extremely low on readability:



The other two significant indicators of readability are whether the text is a collection of fact statements or is written using connectives and summary to give flow, and whether or not overall there is a structure that takes the reader by the hand and leads them through a communication process. On both tests, the book falls down in a big way. Pretty well every sentence is a standalone fact statement that could be a bullet point: there is no flow whatsoever. And although some attempt has been made to group these statements effectively, there is no sense of a thought-through structure. In the interminable-seeming introductions - the first one runs to 22 dense pages - there is no sense that we are going anywhere, just that we are experiencing randomly thrown together bits of data.

Inevitably, an automated process will produce some sentences that don't quite work, so one essential here is to see whether these have been captured and fixed. A reasonably high percentage of the content does make grammatical sense, but there are regular hiccups - for example we get: 

  • 'That sort of research's principal aim...' - it should be 'principle' not 'principal'. 
  • 'Materials, a number of metal oxides with high theoretical capacity have aroused more and more attention including...' - that 'Materials,' start makes no sense.
  • 'Through Tang and others, mesoporous nanosheet is synthesized...' - sounds painful.
  • 'It is still maintained the huge capacity of 611 mAg-1... when utilized as an anode.' - doesn't make any sense.
  • 'Apart from, few-layer nanosheets enhance a fast insertion...' - apart from what?
  • And so on for many, many more examples.

Going on comments I've had from some Springer authors, the level of uncaught or automatic-editing-generated errors is fairly high in their human-authored publications - these books tend not to be heavily edited - but because they are starting with far more readable text, this is less of an issue.

So, should science writers be worried? Obviously, as a professional writer myself I'm biassed, but I would say 'No' - at least, not yet. The text in the introductions and conclusions is nowhere near the readability of a decent technical science book, let alone the far higher writing quality required for a good popular science book. And the outcome also emphasises that even if, long-term, automated writing becomes more common, it is always likely to need a look over by a human editor to avoid errors creeping in. However, this is a fascinating experiment and Springer should be congratulated for getting this far.

Comments

Popular posts from this blog

The God Game (SF) - Danny Tobey *****

Wow. I'm not sure I've ever read a book that was quite such an adrenaline rush - certainly it has been a long time since I've read a science fiction title which has kept me wanting to get back to it and read more so fiercely. 

In some ways, what we have here is a cyber-SF equivalent of Stephen King's It. A bunch of misfit American high school students face a remarkably powerful evil adversary - though in this case, at the beginning, their foe appears to be able to transform their worlds for the better.

Rather than a supernatural evil, the students take on a rogue AI computer game that thinks it is a god - and has the powers to back its belief. Playing the game is a mix of a virtual reality adventure like Pokemon Go and a real world treasure hunt. Players can get rewards for carrying out tasks - delivering a parcel, for example, which can be used to buy favours, abilities in the game and real objects. But once you are in the game, it doesn't want to let you go and is …

Uncertainty - Kostas Kampourakis and Kevin McCain ***

This is intended as a follow-on to Stuart Firestein's two books, the excellent Ignorance and its sequel, Failure, which cut through some of the myths about the nature of science and how it's not so much about facts as about what we don't know and how we search for explanations. The authors of Uncertainty do pretty much what they set out to do in explaining the significance of uncertainty and why it can make it difficult to present scientific findings to the public, who expect black-and-white facts, not grey probabilities, which can seem to some like dithering.

However, I didn't get on awfully well with the book. A minor issue was the size - it was just too physically small to hold comfortably, which was irritating. More significantly, it felt like a magazine article that was inflated to make a book. There really was only one essential point made over and over again, with a handful of repeated examples. I want something more from a book - more context and depth - that …

The Ascent of Gravity - Marcus Chown ****

Marcus Chown is one of the UK's best writers on physics and astronomy - it's excellent to see him back on what he does best. Here we discover our gradual approach to understanding the nature of gravity - the 'ascent' of the title - which, though perhaps slightly overblown in the words 'the force that explains everything' (quantum physics does quite a lot too, for example), certainly makes us aware of the importance of this weakest of fundamental forces. Chown's approach to gravity is a game of three halves, as they say, broadly covering Newton, Einstein and where we go from general relativity.
As far as the first two sections go, with the exception of the 2015 gravitational waves detection, there's not much that's actually new - if you want a popular science exploration of these aspects of the topic with more depth see this reviewer's Gravity - but no one has covered the topic with such a light touch and joie de vivre as Chown. 
Although Chown doe…