Skip to main content

Can Computers write Science Books? - Brian Clegg

The German academic publisher Springer has for some time been using automated editing software (with mixed results) - but recently has brought out a whole book written by a piece of AI software called Beta Writer. The book, Lithium-Ion Batteries: a machine generated summary of current research, can be downloaded free of charge as a PDF. But is this a serious challenge for science writers?

It's certainly interesting. If I'm honest, this is hardly a book at all - it's more the output of an automated abstract generator pulled together in book form, where frankly this information would be far better just as a web page. However, there's no doubt that there is some interesting work going on here, particularly in the introduction and conclusion sections of the 'book'.

The whole thing starts with a (human written) preface explaining the technology - by far the most readable part of the text. We then get four 'chapters' of machine-generated content, which each have the format introduction/ set of abstracts / conclusion. Obviously it's the introduction and conclusion that provide the most interest.

I'll focus on the first introduction, though the same criticisms apply throughout. The first test of a piece of scientific writing meant to be readable is to take a step back and get an overview of a chunk of text - does it look like English or is it dominated by acronyms and numbers? A chunk out of the first page shows that this is very dense technical text, extremely low on readability:



The other two significant indicators of readability are whether the text is a collection of fact statements or is written using connectives and summary to give flow, and whether or not overall there is a structure that takes the reader by the hand and leads them through a communication process. On both tests, the book falls down in a big way. Pretty well every sentence is a standalone fact statement that could be a bullet point: there is no flow whatsoever. And although some attempt has been made to group these statements effectively, there is no sense of a thought-through structure. In the interminable-seeming introductions - the first one runs to 22 dense pages - there is no sense that we are going anywhere, just that we are experiencing randomly thrown together bits of data.

Inevitably, an automated process will produce some sentences that don't quite work, so one essential here is to see whether these have been captured and fixed. A reasonably high percentage of the content does make grammatical sense, but there are regular hiccups - for example we get: 

  • 'That sort of research's principal aim...' - it should be 'principle' not 'principal'. 
  • 'Materials, a number of metal oxides with high theoretical capacity have aroused more and more attention including...' - that 'Materials,' start makes no sense.
  • 'Through Tang and others, mesoporous nanosheet is synthesized...' - sounds painful.
  • 'It is still maintained the huge capacity of 611 mAg-1... when utilized as an anode.' - doesn't make any sense.
  • 'Apart from, few-layer nanosheets enhance a fast insertion...' - apart from what?
  • And so on for many, many more examples.

Going on comments I've had from some Springer authors, the level of uncaught or automatic-editing-generated errors is fairly high in their human-authored publications - these books tend not to be heavily edited - but because they are starting with far more readable text, this is less of an issue.

So, should science writers be worried? Obviously, as a professional writer myself I'm biassed, but I would say 'No' - at least, not yet. The text in the introductions and conclusions is nowhere near the readability of a decent technical science book, let alone the far higher writing quality required for a good popular science book. And the outcome also emphasises that even if, long-term, automated writing becomes more common, it is always likely to need a look over by a human editor to avoid errors creeping in. However, this is a fascinating experiment and Springer should be congratulated for getting this far.

Comments

Popular posts from this blog

The Decline and Fall of the Human Empire - Henry Gee ****

In his last book, Henry Gee impressed with his A (Very) Short History of Life on Earth - this time he zooms in on one very specific aspect of life on Earth - humans - and gives us not just a history, but a prediction of the future - our extinction. The book starts with an entertaining prologue, to an extent bemoaning our obsession with dinosaurs, a story that leads, inexorably towards extinction. This is a fate, Gee points out, that will occur for every species, including our own. We then cover three potential stages of the rise and fall of humanity (the book's title is purposely modelled on Gibbon) - Rise, Fall and Escape. Gee's speciality is palaeontology and in the first section he takes us back to explore as much as we can know from the extremely patchy fossil record of the origins of the human family, the genus Homo and the eventual dominance of Homo sapiens , pushing out any remaining members of other closely related species. As we move onto the Fall section, Gee gives ...

Pagans (SF) - James Alistair Henry *****

There's a fascinating sub-genre of science fiction known as alternate history. The idea is that at some point in the past, history diverged from reality, resulting in a different present. Perhaps the most acclaimed of these books is Kingsley Amis's The Alteration , set in a modern England where there had not been a reformation - but James Alistair Henry arguably does even better by giving us a present where Britain is a third world country, still divided between Celts in the west and Saxons in the East. Neither the Normans nor Christianity have any significant impact. In itself this is a clever idea, but what makes it absolutely excellent is mixing in a police procedural murder mystery, where the investigation is being undertaken by a Celtic DI, Drustan, who has to work in London alongside Aedith, a Saxon reeve of equivalent rank, who also happens to be daughter of the Earl of Mercia. While you could argue about a few historical aspects, it's effectively done and has a plot...

Amazing Worlds of Science Fiction and Science Fact: Keith Cooper ****

There's something appealing (for a reader like me) about a book that brings together science fiction and science fact. I had assumed that the 'Amazing Worlds' part of the title suggested a general overview of the interaction between the two, but Keith Cooper is being literal. This is an examination of exoplanets (planets that orbit a different star to the Sun) as pictured in science fiction and in our best current science, bearing in mind this is a field that is still in the early phases of development. It becomes obvious early on that Cooper, who is a science journalist in his day job, knows his stuff on the fiction side as well as the current science. Of course he brings in the well-known TV and movie tropes (we get a huge amount on Star Trek ), not to mention the likes of Dune, but his coverage of written science fiction goes into much wider picture. He also has consulted some well-known contemporary SF writers such as Alastair Reynolds and Paul McAuley, not just scient...