6th Framework

Hello all,

I would like to draw your attention to taxonomies. Applied to DC, this means a controlled vocabulary whose terms are used as the values assigned to the metadata attributes.
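To make this concrete, here is a minimal sketch of checking the subject values of a DC-style record against a controlled vocabulary. The vocabulary terms, the record layout, and the function name `validate_subjects` are assumptions made up for illustration; they are not taken from GEMET or from any existing tool.

```python
# Hypothetical controlled vocabulary; these terms are placeholders,
# not real thesaurus entries.
CONTROLLED_VOCABULARY = {"air pollution", "waste water", "soil erosion"}


def validate_subjects(record, vocabulary):
    """Split a record's 'subject' values into accepted and rejected terms.

    A value is accepted only if it matches a vocabulary term
    (case-insensitive comparison, surrounding whitespace ignored).
    """
    accepted, rejected = [], []
    for value in record.get("subject", []):
        if value.strip().lower() in vocabulary:
            accepted.append(value)
        else:
            rejected.append(value)
    return accepted, rejected


# Assumed record layout: one list of values per metadata attribute.
record = {
    "title": ["Sample report"],
    "subject": ["Air pollution", "dirty rivers"],
}
accepted, rejected = validate_subjects(record, CONTROLLED_VOCABULARY)
print("accepted:", accepted)
print("rejected:", rejected)
```

Free-text values such as "dirty rivers" are rejected here, which is the point of a controlled vocabulary: only agreed terms end up in the metadata.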

You all know we have GEMET
http://www.mu.niedersachsen.de/cds/etc-cds_neu/library/select.html,
a 19-lingual thesaurus.

Those who were in Thun (CH) last year, or at the Expo00 the year before, may remember my presentations about the thesaurus-based auto-classification we use in the German Environmental Information Network http://www.gein.de/index_en.html. At the time I asked for people who could provide (or develop) the same (or a similar) auto-classification in their own language.

Now I think the time has come. The EU is preparing the IST Programme in FP6 http://www.cordis.lu/ist/fp6/workshops.htm. This week I attended the Knowledge Management Workshop in Luxembourg, and I think this is exactly the kind of project they are looking for.

I already contributed to the 6th Framework Consultation Meeting 'Technologies for Major Societal Challenges' last year in Brussels, with a proposal named "European Environmental Topic Map". It has been accepted as one of the subjects to be sponsored, though not yet as a project.

The final call for proposals will be published in Q4 this year. I propose to do the following:

1. Select a set of sample documents (test cases)
2. Have them translated into all languages (currently 19)
3. Re-organize & enhance GEMET as a topic map
4. Discuss some common classification methods
5. Find the language-specific challenges
6. Develop the (currently 19) language-specific text analysis modules
7. Apply them to the test cases: Each of the languages should result in exactly the same GEMET descriptors.
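
The acceptance criterion in step 7 can be sketched as follows. This is only an illustration under stated assumptions: the descriptor IDs ("D1", "D2"), the terms, and the functions `classify` and `consistent` are invented for the example and are not real GEMET data or the method used in GEIN. The matching is deliberately naive whole-word lookup.

```python
import re

# Hypothetical mini-thesaurus: language-independent descriptor IDs mapped
# to per-language terms. IDs and terms are invented, not real GEMET entries.
THESAURUS = {
    "D1": {"en": ["waste water"], "de": ["abwasser"]},
    "D2": {"en": ["air pollution"], "de": ["luftverschmutzung"]},
}


def classify(text, lang):
    """Return the set of descriptor IDs whose terms occur in the text.

    Naive approach: lowercase the text, reduce it to words, and look for
    each term as a whole-word phrase.
    """
    words = re.findall(r"\w+", text.lower())
    padded = " " + " ".join(words) + " "
    found = set()
    for descriptor_id, terms_by_lang in THESAURUS.items():
        for term in terms_by_lang.get(lang, []):
            if " " + term + " " in padded:
                found.add(descriptor_id)
    return found


def consistent(translations):
    """Classify every translation of one test case.

    Returns (ok, results), where ok is True only if all languages
    produced exactly the same descriptor set.
    """
    results = {lang: classify(text, lang) for lang, text in translations.items()}
    distinct = {frozenset(ids) for ids in results.values()}
    return len(distinct) == 1, results


test_case = {
    "en": "Waste water and air pollution are topics.",
    "de": "Abwasser und Luftverschmutzung sind Themen.",
}
ok, results = consistent(test_case)
print(ok, results)
```

Note that this naive matcher would miss "Abwasser" inside a German compound such as "Abwasserbehandlung". That is the kind of language-specific challenge step 5 refers to, and the reason step 6 calls for a separate text analysis module per language.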

What do you think about this?
I can imagine the reaction of my US parent company Schlumberger when I ask for some funding for this project: they'll die laughing about us Europeans going bananas with our 19 languages.

Well, let's give them an example of what will be "KM made in Europe"!

Cheers

Thomas Bandholtz
CM / KM Division Manager; XML Network Moderator
Competence Center Content Management
SchlumbergerSema
http://www.schlumbergersema.com

Kaltenbornweg 3
D50679 Köln / Cologne
Germany
+49 221 8299 264