
Category: Updates

Glossary System. What’s next?

A personal glossary system can be a powerful way to help writers write less while communicating more.

A good example is an author writing about going ‘to work’ and then defining what that work is, so that it does not need to be repeated in future writing. This raises the immediate question of what happens when the author changes their work. This may well be an intractable problem, but having both the creation date of the glossary term and a link to an online master glossary where terms can be checked can be a useful tool. At the very least this shows where the author was working at the point when the document was published, unless the author made the mistake of not updating the term.

The question comes up regularly as to whether the glossary should automatically tag text on export. The answer seems to be to show the document with all suggested tags marked, each with the first sentence of its definition highlighted in a colour, so that the author can reject a suggestion with a click and manually add a tag with a click.
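As a rough sketch of how such suggestions could be generated, the snippet below scans a text for glossary terms and attaches the first sentence of each definition as a preview. The function names, the dictionary layout and the sample definition are all assumptions made for illustration; none of this is taken from Liquid | Author.

```python
import re


def first_sentence(definition: str) -> str:
    """Return the first sentence of a definition (up to the first . ! or ?)."""
    match = re.search(r"(.+?[.!?])(\s|$)", definition, flags=re.DOTALL)
    return match.group(1) if match else definition


def suggest_tags(text: str, glossary: dict) -> list:
    """Find every whole-word occurrence of each glossary term in the text.

    Each suggestion starts out accepted, so the author only has to
    click on the ones they want to reject.
    """
    suggestions = []
    for term, definition in glossary.items():
        pattern = r"\b" + re.escape(term) + r"\b"
        for m in re.finditer(pattern, text, flags=re.IGNORECASE):
            suggestions.append({
                "term": term,
                "start": m.start(),
                "end": m.end(),
                "preview": first_sentence(definition),
                "accepted": True,
            })
    suggestions.sort(key=lambda s: s["start"])
    return suggestions


def toggle(suggestion: dict) -> None:
    """Simulate one click: accept a rejected suggestion or reject an accepted one."""
    suggestion["accepted"] = not suggestion["accepted"]


# Hypothetical sample data, for illustration only.
sample_text = "I went to work early. Work was busy."
sample_glossary = {"work": "My work is at a hypothetical lab. It is in London."}
sample_suggestions = suggest_tags(sample_text, sample_glossary)
```

Starting every suggestion as accepted mirrors the idea that the author says no with a click; a manual add would simply append another suggestion of the same shape.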

The glossary should follow the document rather than live only on a server, that much is clear, but what travels with the document should be instantiations of the terms it uses, not the master list. This should be possible with WordPress as host for the master terms, with an ability for the user to update definitions, either with a trail of changes by default or as corrections if the user so wishes.
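One way to picture the split between a master list and the instantiations that travel with a document is sketched below. The class names, fields and the example.com address are hypothetical, and the sketch does not talk to WordPress at all; it only models the trail-versus-correction choice and the dated snapshot.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List, Tuple


@dataclass(frozen=True)
class TermSnapshot:
    """The instantiation of a term that is embedded in one document."""
    term: str
    definition: str
    defined_on: date
    master_url: str  # hypothetical link back to the online master glossary


@dataclass
class MasterEntry:
    """One term in the master glossary, with its trail of definitions."""
    term: str
    master_url: str
    history: List[Tuple[date, str]] = field(default_factory=list)

    def update(self, definition: str, on: date, as_correction: bool = False) -> None:
        if as_correction and self.history:
            # A correction overwrites the latest definition and leaves no trail.
            self.history[-1] = (self.history[-1][0], definition)
        else:
            # Default behaviour: keep the old definition and add to the trail.
            self.history.append((on, definition))

    def snapshot(self, published: date) -> TermSnapshot:
        """Return the definition that was in force on the publication date."""
        valid = [(d, text) for d, text in self.history if d <= published]
        if not valid:
            raise ValueError(f"'{self.term}' was not defined by {published}")
        defined_on, definition = max(valid, key=lambda item: item[0])
        return TermSnapshot(self.term, definition, defined_on, self.master_url)


# Hypothetical usage: the author changes employer between two documents.
entry = MasterEntry("work", "https://example.com/glossary/work")
entry.update("Employer A", date(2018, 1, 1))
entry.update("Employer B", date(2020, 1, 1))
old_doc_term = entry.snapshot(date(2019, 6, 1))
new_doc_term = entry.snapshot(date(2021, 1, 1))
```

Because each snapshot carries its own date and a link to the master entry, an older document keeps saying what ‘work’ meant when it was published, while a reader can still check the current definition online.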

This is a basic hypertext and it can be read in documents or spread out in graphs and is what I would like to tackle next, if I can ever get visual-meta well enough into the world to focus on this :-)


elo2019

I attended http://elo2019.ucc.ie as a sponsor for Liquid | Author and to promote and get feedback for the Visual-Meta proposal.

After a short flight from Heathrow, a brief taxi ride (cash only, as was much of Cork, so I had to stop at a gas station to get some Euros) and quite a bit of walking back and forth between the charming B&B and downtown, which resulted in some serious blisters, I was settled for the three days of the Electronic Literature Organisation Festival.

Emily & Edgar did not join me, and I missed them so much that using FaceTime felt immersive enough to put me in a TV commercial for video communication. Here is a FaceTime screenshot:

The campus was beautiful and the people charming.

My host James Sullivan gave me 5 minutes to present Liquid during the opening ceremony. I prepared the presentation in advance as two edited videos to make sure I didn’t waffle on:

My view:

And the view of us:

I felt the reception to the presentation and the software was good and there were three days of inspirational presentations on both theory and artistic practice. To see so many experimentations with the medium of text was exhilarating.

I invited a few people to contribute to The Future of Text book.

There was even a fantastic coffee shop in the art gallery, which I discovered a bit late:

And then it was all over, and back to London.


Making Information Self Aware

We can fight fake news and find more useful information in the academic and scientific publishing tsunami if we make the information self aware, that is, if the information knows what it is. This is not a suggestion of Harry Potter level magical fantasy but a concrete act we can start on today to lay the foundation for massive future improvement.

the intelligent environment

Many years ago I read an interview with one of the developers of the computer game Crysis in which he was lauded for the quality of the AI of the opponents in the game. He said that making the AI was not really the hard part; making the different parts of the environment aware of their attributes was key. If a tree trunk is thick, then the enemy can hide behind it. If it is dense then it will also serve as a shield, up to a point.

the self aware document

This is what we can and must do to documents. We must encode the meaning in documents as clearly as possible so that the document may be read by software and human. The document must be aware of who authored it, when, what its title is and so on, to at least provide the minimal context for useful citations.

It should also know what citations it contains, what any charts and graphs mean, what glossary terms are used and how they connect. Of course, we call this ‘metadata’, information about information, and the term has been used in many ways for many years now, but the metadata has so far been hidden inside the document, away from direct human and system interaction. We should maybe instead call it ‘hiddendata’. For some media this is actively used, such as the EXIF data in photographs, but it is lost when the photograph changes format, is inserted into other media or is printed. For text-based documents embedding metadata is certainly possible today, but it is seldom actually used, not usefully read by reader software, and lost on printing.

bibtex foundation

You may well feel that this is simply a call for yet another document format but it is not. This is simply a call for a new way to add academic ‘industry-standard’ BibTeX style formatting of metadata to any document, starting with PDFs, in a robust, useful and legacy friendly way, by simply adding a final appendix to the document which follows a visually human-readable (hence BibTeX) and therefore also machine parseable format.
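To make the idea of a final appendix concrete, here is a minimal sketch that renders document metadata as a BibTeX-style entry in plain text. The start and end marker lines, the function name and the sample values are invented for this illustration and should not be read as the Visual-Meta specification.

```python
# Hypothetical marker lines so reader software can locate the appendix.
START_MARKER = "--- metadata appendix start ---"
END_MARKER = "--- metadata appendix end ---"


def build_appendix(entry_type: str, key: str, fields: dict) -> str:
    """Render metadata as a BibTeX-style entry wrapped in marker lines.

    The result is ordinary visible text, meant to be appended as the
    last page of the document, so it survives format changes and printing.
    """
    lines = [START_MARKER, f"@{entry_type}{{{key},"]
    for name, value in fields.items():
        lines.append(f"  {name} = {{{value}}},")
    lines.append("}")
    lines.append(END_MARKER)
    return "\n".join(lines)


# Hypothetical sample values, for illustration only.
sample_appendix = build_appendix(
    "article",
    "doe2019",
    {"author": "Jane Doe", "title": "A Hypothetical Paper", "year": "2019"},
)
print(sample_appendix)
```

The same text a person can read on the last page is what software would parse, which is the point of keeping the format visual.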

As this will include who authored the information in a form the reading software can ‘understand’, the user can simply copy text from the document and paste it as a full citation into a new document in one operation, making citations easier, quicker and more robust. Further information can be described for reader-software parsing, such as how the headings are formatted (so that the reader software can re-format the document if required, for example to show academic citation styles in the preference of the reader if they differ from the preference of the author), what citations are used, what glossary terms are used, what the data in tables etc. contains, and more.
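The copy-as-citation step could look roughly like the following sketch, which reads a BibTeX-style entry back into a dictionary and wraps a selected passage in a citation. The parser only handles simple, un-nested `name = {value}` fields, and the citation layout, function names and sample entry are assumptions for illustration rather than how Liquid | Reader works.

```python
import re

# Hypothetical appendix text as it might be extracted from the end of a document.
SAMPLE_APPENDIX = """@article{doe2019,
  author = {Jane Doe},
  title = {A Hypothetical Paper},
  year = {2019},
}"""


def parse_appendix(appendix: str) -> dict:
    """Parse one simple BibTeX-style entry (no nested braces) into a dict."""
    header = re.search(r"@(\w+)\{([^,\s]+),", appendix)
    if header is None:
        raise ValueError("no BibTeX-style entry found")
    fields = {
        name.lower(): value.strip()
        for name, value in re.findall(r"(\w+)\s*=\s*\{([^{}]*)\}", appendix)
    }
    return {"type": header.group(1), "key": header.group(2), "fields": fields}


def copy_as_citation(selected_text: str, metadata: dict) -> str:
    """Combine copied text with the document's own metadata in one step."""
    fields = metadata["fields"]
    author = fields.get("author", "Unknown author")
    year = fields.get("year", "n.d.")
    title = fields.get("title", "Untitled")
    return f'"{selected_text}" ({author}, {year}, {title})'


metadata = parse_appendix(SAMPLE_APPENDIX)
print(copy_as_citation("documents should know what they are", metadata))
```

A reader application would swap the fixed layout in `copy_as_citation` for whichever citation style the reader prefers, since the underlying fields stay the same.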

more connected texts

This is making the document say what it is, where it comes from, how it’s connected, what it means, and what data it contains. This is, in effect, making the document self aware and able to communicate with the world. These are truly augmented documents.

This will power simple parsing today and enable more powerful AI in the future in order to much better ‘understand’ the ‘intention’ of the author producing the document, by making documents readable.

This explicitly applies to documents and has the added benefit that even if they are turned into different formats and even if they are printed and scanned they will still retain the metadata. The concept is extensible to other textual media, but that is beyond this proposal.

visual-meta

I call this approach Visual-Meta and it is presented in more detail at liquid.info/visual-meta.html. I believe this is important, so I have started the process of hosting a dialog with industry and have produced two proof-of-concept applications, one for authoring Visual-Meta documents and one for reading and parsing them: Liquid | Author and Liquid | Reader: www.liquid.info

paper

Digital capabilities run deeper than those of previous substrates, but even in the pursuit of more liquid information environments we should not ignore the power of the visual symbolic layer. We hide the meta at our peril. If we reveal it and include it in the visual document, we gain robustness through document format changes and even printing and scanning, gaining archival strength without any loss of deep digital interactivity. This matters more and more as we discover how brittle our digital data is and how important rich interactivity is to enable the deeper literacy required to fight propaganda and to propagate academic discoveries that are often lost in the sheer volume of documents.

Furthermore, with the goal of more robust formats and of supporting the reading of printed books and documents, addressing information (as discussed in the Visual-Meta addressability post) can be printed in the footer of each page, so that hand-annotated texts can easily be scanned, OCR’d and entered into the user’s digital workflow automatically. Digital is magic. Paper is also magic. One day they will merge, but until then there is value to be had in using both to their strengths.
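As a small sketch of what such a footer could carry, the snippet below writes a document identifier and page number into a short text line and recovers them from OCR output. The bracketed layout, the function names and the sample identifier are hypothetical; the addressability post may well describe a different scheme.

```python
import re

# Hypothetical footer layout: [doc:<id> page:<n>/<total>]
FOOTER_RE = re.compile(
    r"\[doc:(?P<doc_id>[\w.-]+) page:(?P<page>\d+)/(?P<total>\d+)\]"
)


def footer_line(doc_id: str, page: int, total: int) -> str:
    """Build the address line to print in the footer of one page."""
    return f"[doc:{doc_id} page:{page}/{total}]"


def find_address(ocr_text: str):
    """Recover the document id and page number from scanned, OCR'd text.

    Returns None when no footer address is found on the page.
    """
    match = FOOTER_RE.search(ocr_text)
    if match is None:
        return None
    return {
        "doc_id": match.group("doc_id"),
        "page": int(match.group("page")),
        "total": int(match.group("total")),
    }


# Hypothetical scanned page: a handwritten note followed by the printed footer.
scanned_page = "note in the margin: check this claim\n" + footer_line("doe2019", 3, 12)
address = find_address(scanned_page)
```

With the address recovered, a scanned annotation can be filed against the right page of the right digital document without the user typing anything.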

 

As we make our information aware,
we increase the potential of our own awareness

 

 
