Skip to content

Closing the IMPACT Project blog

3 January, 2012

The IMPACT project has now officially finished and been superseded by the Centre of Competence.

This blog has now been frozen. Comments have been disabled and we do not intend to publish further posts. We have published the following statistics for future reference. They are intended to inform others about the lifecycle of the blog and assist people wishing to reuse resources by identifying the authors of articles etc.

Active Dates: From 10 December 2009 to 31 December 2011
Number of posts:   117
Number of comments:  16
Akismet statistics: 1750 spams caught and an overall accuracy rate of 100%.
Details of contributors: The IMPACT project (used as a generic log-in for IMPACT staff, impacteib, mariekeguy, Nora Daly, Greta Franzini, simonaitken
Categories used: admin, Bratislava (May 2010), British Library, conference, Demo Day, Deutsch (German), English, Final Conference 2011, hackday, Munich (March 2010), Munich (October 2011), myGrid – Taverna Hackathon, Nederlands (Dutch), Rouen (March 2011), taverna, The Hague (Feb 2011)
Details of blog theme: Vigilance with 4 Widgets
Details of type and version of software used: This blog was run on the free hosted version of WordPress at
Blog licence: All items on the blog are copyright of the IMPACT project and unless otherwise stated have been released under the Creative Commons License:  Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License

BSB/ÖNB Demo Day – Videos online!

21 November, 2011

Es hat dann doch etwas länger gedauert, aber jetzt sind alle Vorträge unserer Doppel-Veranstaltung “Historische Dokumente auf dem Weg zum digitalen Volltext” (11. – 12. Oktober 2011) und in die entsprechenden Blog-Artikel eingebunden.

Wie gehabt finden sich alle Informationen zum ersten Tag, dem “IMPACT Demo Day”, hier auf dem IMPACT-Blog, während Sie sich alles Wissenswerte zum zweiten Tag, den “Erfahrungen aus der Digitalisierungspraxis”, auf dem Blog des Münchener DigitalisierungsZentrums zu Gemüte führen können.

Viel Vergnügen beim Ansehen!


It took us a bit longer than expected, but all videos of our dual event “Turning Historical Documents into Digital Full Texts” (11 – 12 October 2011) are now online and embedded into the relevant blog posts.

For the firstday, you’ll find them here on the IMPACT blog. For the second day, please visit the blog of the Munich DigitiZation Center.

Have fun watching!

Mark-Oliver Fischer (BSB)

IMPACT/myGrid Hackathon – Taverna Roadmap

14 November, 2011
Shoaib Sufi talks about the Taverna Roadmap

Shoaib Sufi talks about the Taverna Roadmap

In the afternoon, after everyone had worked through the 3 group tasks in the practical session: ‘Workflow Development in Digitisation’, we returned to hear from the Taverna Manager – Shoaib Sufi.

Shoib gave an interesting talk about where he sees Taverna going in the next few years and the further development of Taverna 3, including some of the projects that they hope to work with.

IMPACT/myGrid Taverna Hackathon – Taverna Server as a Portal

14 November, 2011

Clemens Neudecker leads a session on using Taverna Server as a portal, using IMPACT workflows to demonstrate the functionality.

This was followed by Rob Haines from myGrid who gave more examples of Taverna Server Interfaces.

The IMPACT Framework – From Tools to Workflows

14 November, 2011

This practical session started with the attendees introducing themselves and splitting up into 3 groups, so that each could work on a different set of tasks based on a Case Study:

Sven Schlarb at IMPACT/myGrid Hackathon

Sven Schlarb at IMPACT/myGrid Hackathon

Case Study:

A collection holder wants to reduce storage costs for his collections that
are currently available as TIFF master files. She/he heard that JPEG2000 is
a good candidate for storing digital master files, and she/he heard about
the efficiency of image compression when using lossy compression.

She/he knows that JPEG2000 compression can be “visually lossless”, so that
the compression is reversible, but she/he is still concerned about the
impact the JPEG2000 compression could have on OCR.

We suggest a Taverna workflow that creates an executable processing pipeline
for studying the results.

The workflow should have 1 TIFF image as input and a list of increasing
compression parameters which are used when encoding the image. The image
should then be decompressed before applying the OCR. Finally, the impact
of the compression on the OCR should be measured by comparing the original
OCR output to the OCR output of the compressed images.

IMPACT myGrid Taverna Hackathon

IMPACT myGrid Taverna Hackathon

The Three Groups:

Group 1

Use the toolwrapper for providing access to a JPEG2000 encoding/decoding tool:

Group 2

Use Taverna for creating the workflow:

Group 3

Use a Taverna beanshell for creating the Text comparison

  • commons-lang-2.4.jar (/home/<youruser>/.taverna-home/lib/commons-lang-2.4.jar)
Carl Wilson from the BL concentrates on Taverna

Carl Wilson from the BL concentrates on Taverna

The selection of groups has shown a definite preference for the more ‘user’ based tasks rather than ‘developer’ tasks, with 12 working on Group 1, 6 on Group 2 and only 3 on Group3.  However, quite a few attendees seemed happy to be involved in more than one group, or work in one, but support users in another.

General feeling is that this bodes well for tomorrow which has a more ‘practical’ based timetable.

IMPACT/myGrid Taverna Hackathon

14 November, 2011

Full details of this workshop are available through the workshop wiki at:

Clemens Neudecker at the IMPACT myGrid Taverna Hackathon

Clemens Neudecker at the IMPACT myGrid Taverna Hackathon

The day started with an introduction to IMPACT from Clemens Neudecker:

and then an introduction to Taverna from Katy Wolstencroft:

Katy Wolstencroft gives an introduction to Taverna

Katy Wolstencroft gives an introduction to Taverna

IMPACT Final Conference – Blog-index

26 October, 2011

The whole conference was blogged and photographed with presentations uploaded to Slideshare and videos to Vimeo.

These are also embedded within the blogs on this site.

This post contains direct links to all posts made at the Final Conference.  Please do feel free to add comments or thoughts below the posts.

Monday 24 October 2011




Tuesday 25 October 2011




  • Research Session: Presentation and discussion of state of the art research tools for document analysis and OCR, hosted by Apostolos Antonacopoulos (University of Salford).
  • Language Session: Presentation and demonstration of the IMPACT language tools & resources in further detail, hosted by Katrien Depuydt (INL)
  • Digitisation Tips Session: Meet the expert: questions & answers on digitisation issues, hosted by Aly Conteh (The British Library)