[Apologies for cross posting]
Good morning,
CERN is hiring three developers with experience in text-mining and/or digital libraries, through our fellowship program. Details are appended and can also be found at http://cern.ch/go/9z69
Thanks in advance for disseminating the information to those who might be interested,
Salvatore Mele
--
Dr. Salvatore Mele
CERN - Head of Open Access - http://www.cern.ch/oa
SCOAP3 - Interim Project Manager - http://scoap3.org
INSPIRE - Strategic Director - http://inspirehep.net
Voice: + 41 22 767 8603 - E-mail: [log in to unmask]
Postal address: C27900, CERN, CH1211 Geneva 23, Switzerland
======
The CERN Scientific Information Service is looking for three enthusiastic and motivated developers with experience in text-mining or digital libraries. You will join a dynamic team which is leveraging the Invenio software ( http://invenio-software.org/ ) in two R&D projects underpinning production services. (1) We build, enhance and operate the INSPIRE information service ( https://inspirehep.net/ ), a digital library with one million records, which is a key working tool used by 50’000 scientists worldwide in their cutting-edge research in High-Energy Physics. (2) We design, implement and operate innovative services to connect 13’000 CERN users and the information they need daily, and in particular a vast topical collection of books and e-books ( http://cds.cern.ch/collection/Books/ ).
We have three positions available: on text-mining in INSPIRE, on author disambiguation in INSPIRE, and on innovative (e-)book services.
Positions
1-Text-mining fellowship
INSPIRE hosts over one million records, complete with metadata on authors, affiliations, keywords and, most relevant, citations to other scholarly material, within INSPIRE and beyond. The citation network counts over 16 million cites-cited pairs, and you will:
• Develop and expand our current suite for text-mining documents to extract all possible metadata: authors, affiliations, references and additional scientific content, both from material ingested daily and from hundreds of thousands of existing documents.
• Explore solutions to enrich and complete the INSPIRE citation network, mining both the existing INSPIRE records, and other existing citation databases, as well as the open web.
• Leverage the INSPIRE citation network to deliver additional discovery and analysis services to the INSPIRE community.
2-Author disambiguation and management fellowship
INSPIRE records have about 8 million name strings in the ‘author’ field: it is crucial to correctly attribute each and every article. An advanced disambiguation algorithm, and crowd-sourcing approaches, have unambiguously identified about 100’000 author profiles. You will:
• Expand the current infrastructure and improve the system performance for algorithmic disambiguation and crowd-sourced curation
• Deliver an integrated experience around the concept of a ‘person’ (retrieval of information and creation/maintenance of profiles and the UX/UI of associated bio- and bibliographic information)
• Ensure seamless interoperability and bulk-data exchange with other relevant partners such as NASA-ADS, arXiv, ORCID and leading publishers in the field.
3-Innovative (e-)book services fellowship
CERN users have access to almost 100’000 (e-)books in an integrated environment, with material provided by dozens of external sources and including a physical collection of titles. As the physical and digital collections get integrated, and to take full advantage of the opportunities of the digital texts, you will:
• Design, implement and deploy solutions to search, within the CERN library catalogue, across the full-text of e-books from several sources, hosted both locally and, most often, on external vendor platforms.
• Propose and develop analytical tools to leverage information on the usage of individual (e-)books and suggest improvements in the handling of the existing collection and its development.
• Expand the existing suite of (e-)book services (crowd-sourcing suggestion of new material, linking to external sources, management of individual user accounts) leveraging the functionalities of the ‘next’ version of Invenio.
Other things you will do (for all vacancies):
• According to your inclination and abilities, help out on other projects, such as: crowdsourcing aspects of digital library curation; integrating our services with other data sources; UI/UX (re-)design; operation of the production infrastructure; log-analysis to improve service offering.
• Participate in stand-by duty for hot-fixes in the operation of our information web services (which could include evenings, weekends and public holidays).
Expertise required for the three vacancies:
• Proven experience within a LAMP environment (Linux, Apache, MySQL, Python) and Distributed Version Control Systems (e.g. Git), preferably in open source projects.
• Good understanding of software performance measurement, analysis and optimization procedures as well as software testing and quality assurance.
• Comfortable working in both small and medium-sized software teams, collecting and analyzing multiple requirements.
• Familiarity with web development technologies (Flask, Bootstrap), database technologies (MongoDB, Redis, PostgreSQL), protocols in digital libraries (MARC21, OAI-PMH, RDF, XML, XSLT), and standards in scholarly communication (DOI, ORCID) is an advantage.
• Experience in the maintenance, development or operation of advanced information systems or digital libraries, possibly within the Invenio framework, is an asset.
Eligibility and benefits:
These vacancies are within the CERN Fellowship program. All information is available at: https://jobs.web.cern.ch/join-us/fellowship-programme. A shortened summary:
• Candidates should have either an MSc (Computer Science or equivalent) level diploma or above with no more than 10 years of relevant experience; OR a BSc (Computer Science or equivalent) with no more than 4 years of relevant experience.
• Citizens of CERN Member States and some other countries are eligible for the program: https://jobs.web.cern.ch/content/member-states/.
• Stipend range: 5165 CHF to 8043 CHF per month, calculated individually and net of tax.
• Contract duration is one to three years; the typical duration is two years. Exceptionally, it may be possible to extend for all or part of a third year.
• Family allowance, child allowance, infant allowance.
• Pension Fund and Health Insurance scheme membership.
• 40-hour working week with 2.5 days paid annual leave per month.
How to apply:
• Please follow the instructions at https://jobs.web.cern.ch/job/10948/.
• Important: please indicate “Scientific Information Service” in the field “Miscellaneous information: Please give details of the work you are interested in doing at CERN”. We might not be able to process your application otherwise.
• In addition, please indicate in the same field which of the three vacancies interests you and describe why it matches your career development.
Further information:
• For the text-mining and author-management vacancies please contact: [log in to unmask]
• For the innovative (e-)books services vacancy please contact: [log in to unmask]
• Answers to most questions on the CERN Fellowship program are available at https://jobs.web.cern.ch/join-us/fellowship-programme