From WJHutchins@compuserve.com Wed Jan 7 10:12:54 2004 From: WJHutchins@compuserve.com (John Hutchins) Date: Wed, 7 Jan 2004 05:12:54 -0500 Subject: [MT-List] first public demonstration of MT Message-ID: <200401070513_MC3-1-64B2-F621@compuserve.com> Fifty years ago today (7 January 1954) the first public demonstration of a MT system was held in New York at the head office of IBM. The demonstration was widely reported in the = newspapers and received much public interest. The system itself, however, was no more than what today would be called a "toy" system, having just 250 words and translating just 49 carefully selected Russian sentences into English -- mainly in the field of chemistry. Nevertheless it encouraged the view that MT was imminent -- and in particular stimulated the financing of MT research, not just in the US but worldwide. To find out more see the article on my website http://ourworld.compuserve.com/WJHutchins. You will find there also further references, and the sentences used in the original demo: http://.../WJHutchins/Russian-GU-1954.doc John Hutchins 7 Jan 2004 = From WJHutchins@compuserve.com Thu Jan 8 10:27:18 2004 From: WJHutchins@compuserve.com (John Hutchins) Date: Thu, 8 Jan 2004 05:27:18 -0500 Subject: [MT-List] RE: first public demonstration of MT - website address Message-ID: <200401080527_MC3-1-64DD-9AB5@compuserve.com> As a number of you have pointed out, the address I gave yesterday was incomplete. It should be: http://ourworld.compuserve.com/homepages/WJHutchins My apologies to all ... John Hutchins 8 Jan =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D >From: John Hutchins [mailto:WJHutchins@compuserve.com] = Sent: Wednesday, January 07, 2004 11:13 AM To: MT-list Subject: [MT-List] first public demonstration of MT Fifty years ago today (7 January 1954) the first public demonstration of a MT system was held in New York at the head office of IBM. The demonstration was widely reported in the = newspapers and received much public interest. The system itself, however, was no more than what today would be called a "toy" system, having just 250 words and translating just 49 carefully selected Russian sentences into English -- mainly in the field of chemistry. Nevertheless it encouraged the view that MT was imminent -- and in particular stimulated the financing of MT research, not just in the US but worldwide. John Hutchins 7 Jan 2004 = < From jeff.allen@free.fr Wed Jan 14 10:48:02 2004 From: jeff.allen@free.fr (Jeff Allen) Date: Wed, 14 Jan 2004 11:48:02 +0100 Subject: [MT-List] 2nd Call for Contributions TAMA 2004 Message-ID: <1074077282.40051e62237a6@imp2-l.free.fr> Hello listers, See conf announcement below. Jeff ----- Forwarded message from TermNet ----- Date: Tue, 13 Jan 2004 14:25:55 +0100 From: TermNet Reply-To: TermNet Subject: 2nd Call for Contributions TAMA 2004 To: tama-2004-infot@rundlemall.progipark.com TermNet, International Network for Terminology, would like to inform you that the deadline for the submission of abstracts for TAMA 2004 Conference (Terminology in Advanced Management Applications) has been extended. 2nd Call for Contributions Please submit your abstracts of 200-300 words until 28 February. The abstract should be either sent as an e-mail text or an attached Word document. Visit our Events pages at linux.termnet.org for further information about TAMA 2004 and the preliminary programme or contact Anja Drame (termnet@termnet.at) ----------------------------------------------------------------------------- TermNet - International Network for Terminology Aichholzgasse 6/12, 1120 Vienna, Austria Tel +43 1 817 44 99 - Fax +43 1 817 44 99-44 ----------------------------------------------------------------------------- Office Cologne University of Applied Sciences Cologne Ubierring 40 (Room 245), 50678 Cologne, Germany GSM: +49 171 5723 750 - Fax +49 221 8275-3615 Termnet@termnet.at http://linux.termnet.org ------------------------------------------------------------------------------ TermNet - There is no Knowledge without Terminology ----- End forwarded message ----- From jeff.allen@free.fr Wed Jan 14 12:12:58 2004 From: jeff.allen@free.fr (Jeff Allen) Date: Wed, 14 Jan 2004 13:12:58 +0100 Subject: [MT-List] Thesis on MT Postediting vs Human Translation In-Reply-To: <1074077282.40051e62237a6@imp2-l.free.fr> References: <1074077282.40051e62237a6@imp2-l.free.fr> Message-ID: <1074082378.4005324a5dc86@imp2-l.free.fr> Hello MT-listers, The link indicated in a previous post concerning the following Master's thesis seems to be inaccurate. GUERRA, Lorena. 2003. Human Translation versus Machine Translation and Full Post-Editing of Raw Machine Translation Output. Master's Thesis. Dublin City University. Corrected URL should be: http://www.promt.ru/news-e/news.phtml?id=293 The thesis is also referenced on the LISA web site and with mention at the Localisation Research Centre website: http://lrc.csis.ul.ie/research/projects/Elect_Project_Web/winner_bt.htm http://leit.lisa.org/studentPublications.html Many thanks to Lorena for making the results of her study available to the community at large. MT user organizations should hopefully be able to benefit much from this comparative study for their implementation decisions and efforts. Regards, Jeff Jeff Allen Editorial Advisory Board MultiLingual Computing & Technology http://www.multilingual.col/editorialBoard/ jeff@multilingual.com OR jeff.allen@free.fr From dekai@cs.ust.hk Thu Jan 15 06:52:04 2004 From: dekai@cs.ust.hk (Dekai Wu) Date: Thu, 15 Jan 2004 14:52:04 +0800 Subject: [MT-List] EMNLP 2004 - Preliminary Call for Papers Message-ID: <021d01c3db34$19b39e10$6401a8c0@riff> 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP 2004) Preliminary Call for Papers SIGDAT, the Association for Computational Linguistics' special interest group on linguistic data and corpus-based approaches to NLP, invites submissions to EMNLP 2004. The conference will be held on July 25-26 in Barcelona, Spain, immediately following the 42nd meeting of the ACL (ACL 2004). We are interested in papers from academia, government, and industry on all areas of traditional interest to the SIGDAT community and aligned fields, including but not limited to: o information extraction o information retrieval o language and dialogue modeling o lexical acquisition o machine translation o multilingual technologies o question answering o statistical parsing o summarization o generation o tagging o term and named entity extraction o word sense disambiguation o word, term, and text segmentation o general NLP-related machine learning techniques: theory, methods and algorithms Submissions Requirements: Submissions must describe original, completed, unpublished work, and include concrete evaluation results when appropriate. Papers being submitted to other meetings must provide this information (see submission format). In the event of multiple acceptances, authors are requested to immediately notify the EMNLP program chair (lindek@cs.ualberta.ca) and to choose which meeting to present and publish the work at as soon as possible. EMNLP cannot accept for publication or presentation work that will be (or has been) published elsewhere. Format: Submissions should take the form of full papers (up to 8 pages in two-column format). Authors are strongly encouraged to use the style files originally provided for ACL 2004. We strongly prefer submissions to be as PS files. Any author who submits in PDF must assume the responsibility for ensuring that fonts are treated properly so that the paper will print (not just view) anywhere. (This may involve reading the manual.) DOC/RTF formats cannot be accepted. Reviewing will be blind. No information identifying the authors should be in the paper: this includes not only the authors' names and affiliations, but also self-references that reveal authors' identities; for example, "We have previously shown (Smith 1999)" should be changed to "Smith (1999) has previously shown". A separate identification email is required: see below. Procedure: Submissions will be entered via a website: http://www.cs.ualberta.ca/~lindek/emnlp04/submit A PostScript file of the paper must be uploaded onto the system by the date of the deadline (12 midnight GMT). In addition, information about each paper must be entered on the website. This information includes: * Paper title * Contact author name, affiliation, and email address * A short list of keywords (selected from a predefined list) * Abstract (no more than 300 words) * The conferences that the paper has also been submitted to (if any). Important Dates Submission deadline: April 28, 2004 Acceptance notification: May 26, 2004 Camera-ready copy due: June 15, 2004 Conference: July 25-26, 2004 Program Co-Chairs Dekang Lin, University of Alberta, (lindek@cs.ualberta.ca) Dekai Wu, The Hong Kong University of Science & Technology, (dekai@cs.ust.hk) Conference URL http://www.cs.ualberta.ca/~lindek/emnlp04 From malek.boualem@rd.francetelecom.com Fri Jan 16 18:06:48 2004 From: malek.boualem@rd.francetelecom.com (BOUALEM Malek FTRD/DMI/LAN) Date: Fri, 16 Jan 2004 19:06:48 +0100 Subject: [MT-List] Last CFP - Arabic NLP Text & Speech - deadline January 20 Message-ID: <749A556ABA92204FA7E566D8CA6A169BBF094D@ftrdmel1.rd.francetelecom.fr> !!!! Deadline extended to January 20, 2004 !!!! **************************************************** J E P 2 0 0 4 - T A L N 2 0 0 4 - Special Session - =20 ARABIC LANGUAGE PROCESSING TEXT & SPEECH -------- Last call for Papers =20 Palais des Congr=E8s Fez (Morocco) 19-22 April 2004 http://www.lpl.univ-aix.fr/jep-taln04/ http://www.fsdmfes.ac.ma/jep-taln04/ **************************************************** Due to its morphological, syntactic, phonetic and phonologic=20 properties, the Arabic language is considered to be one of the most difficult languages for written and spoken language processing. Research on written Arabic language processing started in the 1970s, even before the problems of Arabic text editing were completely solved. The first studies focused primarily on lexicons and morphology. In the past ten years, the internationalisation of the WWW and the proliferation of communication tools in Arabic have led to the need for a large number of Arabic NLP applications. As a result, research activity has extended to address more general areas of Arabic language processing, including syntactic analysis, machine translation, document indexing, information retrieval, etc. Research on Arabic speech processing has made significant progress due to more improved signal processing technologies, and to recent advances in the knowledge of the prosodic and the segmental characteristics of Arabic and the acoustic modelling of Arab schemes. These results should make it possible to further progress in more innovative areas, such as Arabic speech recognition and synthesis, speech translation and automatic identification of a speaker and his/her geographic origin discrimination, etc. The aim of the joint session is to gather and reinforce collaboration between researchers from both the written and spoken Arabic language processing communities. It will also offer the opportunity to discuss recent advances on both the scientific and application sides of the problem, in monolingual and multilingual contexts. TOPICS This special session on written and spoken Arabic processing includes (but is not limited to) the following topics : - Speech recognition and comprehension, - Text to speech synthesis, - Automatic prosody generation, - Automatic speaker and language identification, - Geographic origin discrimination of Arabic speakers, - Arabic corpora & resources, - Speech acquisition for ASR and TTS systems, - Morphology, - Syntax, - Semantics, - Text parsing and generation, - Discourse analysis, - Text summarization, - Dialogue, - Machine translation. IMPORTANT DATES Submission deadline : extended to January 20, 2004 Notification to authors : 20 February 2004=20 Camera-ready : 8 March 2004=20 Conference : 19-22 April 2004 SELECTION CRITERIA=20 Authors are invited to submit original, previously unpublished research = work. Submissions will be reviewed by at least two experts. Decisions will be based on the following criteria : - importance and originality of the paper, - soundness of the scientific and technical content, - comparison of the results obtained with other relevant works, - clarity of the exposition, - relevance to the topics of the conference. LANGUAGES All papers should be written in English or French. PAPER FORMAT Submitted papers should be about 6 to 10 pages in Times 12pt, single spaced, including figures, examples and references. Papers MUST be sent in PDF format. In particular cases, we might accept submissions in RTF (Word) format. All the PostScript versions must be in A4 format, and not US Letter. - Download the LaTeX stylesheet - Download the Word template (English version) SUBMISSION PROCEDURE Electronic submissions with the message object "JEP-TALN-2004-Arabic" should be sent to the following email address : < jep-taln04-arabic@fsdmfes.ac.ma > In case electronic submission is not possible, printed versions might be accepted. In this case, three hard-copies of the paper together with a floppy disk, should be sent to : Malek Boualem France Telecom R&D - DMI/GRI 2, avenue Pierre Marzin 22307 Lannion - France or to Noureddine Chenfour D=E9partement de Math. et Informatique Facult=E9 des Sciences Dhar El Mahraz, F=E8s BP : 1796 Atlas, Fez - Morocco PROGRAMME COMMITTEE - Abderrahim Benabbou, FST, Fez, Morocco. - Mohammed Benkhalifa, Facult=E9 des Sciences, Rabat, Morocco. - Thami Benkirane, Sidi Mohammed University, Morocco. - Malek Boualem, France Telecom R&D, France. - Achraf Chalabi, Sakhr, Egypt. - Noureddine Chenfour, Sidi Mohammed University, Fez, Morocco. - Khalid Choukri, ELRA/ELDA, France. - Fethi Debili, CNRS, Paris, France. - Emilie De Neef, France Telecom R&D, France. - Joseph Dichy, Lumi=E8re-Lyon 2 University, France. - Everhard Ditters, University of Nijmegen, The Netherlands. - Mohamed Embarki, Laboratoire de Phon=E9tique, Montpellier, France. - Mohammed Hassoun, ENSSIB, Lyon, France. - Med Tayeb Laskri, Badji Mokhtar University, Algeria. - Fabrice Lefevre, LIMSI, Paris-Sud Orsay University, France. - Chafic Mokbel, Balimand University, Lebanon. - Abdelhak Mouradi, ENSIAS Rabat, Morocco. - Omar Nouali, CERIST, Algeria. - Abdenbi Rajouani, ENS, Fez, Morocco. - Mustafa Yaseen, ATS Online, Jordan. - Mohamed Yeou, Chouaib Doukkali El-Jadida University, Morocco. - Chakir Zeroual, Sidi Mohamed University, Fez, Morocco. - Adnane Zribi, ISG, Tunis University, Tunisia. **************************************************** ------------------------------------------------------ Malek Boualem France Telecom R&D - DMI/GRI Natural Language Processing group 2, avenue Pierre Marzin - 22307 Lannion - France Tel: (33)(0)2.96.05.29.83 Fax: (33)(0)2.96.05.32.86 Email: malek.boualem@rd.francetelecom.com ------------------------------------------------------ From WJHutchins@compuserve.com Mon Jan 26 23:26:31 2004 From: WJHutchins@compuserve.com (John Hutchins) Date: Mon, 26 Jan 2004 18:26:31 -0500 Subject: [MT-List] EAMT Workshop 2004 Message-ID: <200401261826_MC3-1-69DA-354D@compuserve.com> European Association for Machine Translation Workshop, Malta, 26-27 April 2004 MISSED THE DEADLINE? It is not too late. We are leaving the door open for last minute submissions until the end of this week (31 January). Remember that extended abstracts will = be accepted. -------------------------- The European Association for Machine Translation (EAMT) will be holding i= ts Ninth Workshop in Malta on 26-27 April 2004. The meeting is one of a seri= es of annual workshops acting as a forum for the exchange of ideas concernin= g all aspects of machine translation within the European and adjacent regions. (For information about previous workshops see the EAMT website: www.eamt.org.) As in previous EAMT workshops, contributions will be particularly welcomed from developers, vendors and users of MT systems an= d translation aids. = The SPECIAL THEME for this workshop will be machine-translation-related issues concerning = = * Semitic languages, * Languages of the newly accessioned states of the European Union. Papers on machine translation systems and computer-based translation aids= relating to these languages will be of particular interest. = However, papers relating to other aspects of machine translation, and oth= er languages, will also be considered such as: * Translation Memories * Web-based MT services = * Issues for developers and users of MT and translation aids = * Economic aspects of MT * Evaluation of MT systems = SUBMISSION of papers: We invite submissions as either extended abstracts (minimum 2 pages) or a= s full papers (maximum 10 pages). Submissions should be sent before 25th January 2004, by email as attachments in DOC, RTF, or plain text, to the chairman John Hutchins (WJHutchins@compuserve.com), with the words 'EAMT 2004 Workshop' in the subject heading). = Notification of acceptance will be made on or before 15 February, togethe= r with any comments from reviewers for revision. FINAL VERSIONS of papers Final versions of papers (maximum 10 pages, DOC or RTF files) should be sent for inclusion in the proceedings before 12th March. (Any papers received after that date will not be included in the printed proceedings)= . A stylesheet for accepted submissions will be available in due course on the EAMT website. It is anticipated that papers will be made available in electronic form o= n the EAMT website after the conference -- for this purpose, authors may al= so send PDF or PS versions of their papers. PROGRAMME COMMITTEE: John Hutchins (chair, EAMT president) Michael Rosner (University of Malta) Anthony Clarke (CLS Corporate Language Services) Jaro Lajovic (Ljubljana, Slovenia) Mustafa Yaseen (University of Amman) Jan Hajic (Charles University, Prague) Violetta Cavalli-Sforza (San Francisco State University) VENUE, registration: Details about the venue in MALTA, registration fees, hotel accommodation,= etc. will be circulated from mid January or early February. Full details,= including a list of all speakers and an online registration form, will be= posted on the EAMT website (www.eamt.org) in early March. Local arrangements organiser is Mike Rosner (mike.rosner@um.edu.mt) from whom further details can be obtained. IMPORTANT DATES: Deadline for submissions: 25 January 2004 Acceptance notification: 15 February 2004 Final copies due: 12 March 2004 Registration: early March 2004 Conference dates: 26-27 April 2004 From WJHutchins@compuserve.com Thu Jan 29 16:35:57 2004 From: WJHutchins@compuserve.com (John Hutchins) Date: Thu, 29 Jan 2004 11:35:57 -0500 Subject: [MT-List] new edition of Compendium Message-ID: <200401291136_MC3-1-6A87-2718@compuserve.com> The latest edition (8th ed., January 2004) of the "Compendium of translation software" is now available on the EAMT website (www.eamt.org). As before, there are considerable changes from the previous edition -- including the reappearances of T1 and Power Translator. As always, do not forget to let me know of any changes which we have missed and to keep us up to date with new systems. John Hutchins 29 Jan = From karinem@inxight.com Thu Jan 29 21:42:01 2004 From: karinem@inxight.com (Karine Megerdoomian) Date: Thu, 29 Jan 2004 13:42:01 -0800 Subject: [MT-List] CFP: Workshop on Arabic Script Languages Message-ID: <13CB2B479B43D411BAC600B0D04947B40DF5DED8@ascension.inxight.com> =============================================================== COLING 2004 WORKSHOP Computational Approaches to Arabic Script-based Languages Saturday, August 28, 2004 University of Geneva Geneva, Switzerland =============================================================== CONFERENCE WEBSITE: http://www.issco.unige.ch/coling2004/ WORKSHOP DESCRIPTION Recently, there has been a surge of interest in the study of the languages of the Middle East, especially Arabic, Persian (Farsi), Pashto and Urdu. This sudden and urgent interest is manifested by the availability of funding for rapid development of practical systems for processing large volumes of data in these languages. Computational applications for proper name identification, entity recognition, categorization, information retrieval, summarization, machine translation and other implementations are currently in high demand. This comes at a time when advances in formal and computational linguistics over the last fifty years are being consolidated, while work on machine learning and statistical methods has been showing great promise. Although there exists a considerable body of work in computational linguistics specifically targeted to these middle eastern languages, much of the research and development has been the result of initiatives by individual research establishments or industry firms. Furthermore, the usage of the Arabic script gives rise to certain issues that are common to all these languages despite their being of distinct language families. Hence, these languages share properties such as the absence of capitalization, right to left direction, lack of clear word boundaries, complex word structure, a high degree of ambiguity due to non-representation of short vowels in the writing system, and related encoding issues. The goal of this workshop is to provide a forum for those involved in the development of NLP systems in Arabic script languages to exchange ideas, approaches and implementations of computational systems; to discuss the common challenges faced by all practitioners; and to assess the state of the art in the field. In addition, one of the aims of the workshop is to identify promising areas for future collaborative research in the development of NLP systems for Arabic script languages. Solutions that are designed to solve the specific problems of these languages could very well have wider applications and relevance to the rest of the NLP community. WORKSHOP TOPICS Authors of papers in any area of NLP in Arabic script-based languages are encouraged to apply. We encourage submissions dealing with language-specific issues, as well as discussions of challenges imposed by the usage of the Arabic script. Papers could be on - but not limited to - any of the following topics: * Morphological analysis * Syntactic ambiguity resolution * Relevance of shallow parsing * Machine translation from and to Arabic script languages * Sense disambiguation * Homograph resolution * Semantic analysis * Entity recognition * Information retrieval * Classification of documents * Text mining * Summarization * Statistical approaches * Speech recognition and generation * Lexical databases * Knowledge and domain representation * Spelling and grammar checking tools SUBMISSION REQUIREMENTS Papers should be original, previously unpublished work and should not identify the author(s). They should emphasize completed work rather than intended work. Papers that are being submitted to other conferences must reflect this fact on the title page. Submissions should be no longer than 8 pages (including figures and references). Email submissions (ps or pdf) are preferred and should be sent to both AliFarghaly@aol.com and karinem@inxight.com by midnight of the due date. Submissions should be in English. The papers should be attached to an email indicating contact information for the author(s) and paper's title. Formatting requirements for the final version of accepted papers will be posted as soon as they become available. Hardcopy submissions should be sent to: Ali Farghaly SYSTRAN Software, Inc. 9333 Genesee Ave, Pl 1 San Diego, CA 92121 USA IMPORTANT DATES Submissions due: March 25th, 2004 Notification date: April 25th, 2004 Deadline for camera ready copy: May 25th, 2004 ORGANIZING COMMITTEE This workshop is organized by Ali Farghaly (SYSTRAN Software, Inc.) Karine Megerdoomian (Inxight Software and University of California San Diego) The call for papers as well as future information on the workshop can be found at http://members.cox.net/karinem/COLING2004 PROGRAM COMMITTEE Jan W. Amtrup, Bowne Global Solutions Tim Buckwalter, Linguistic Data Consortium Violetta Cavalli-Sforza, Carnegie Mellon University Joseph Dichy, Lyon University Andrew Freeman, University of Washington Nizar Habash, University of Maryland, College Park Masayo Iida, Inxight Software, Inc. Simin Karimi, University of Arizona Martin Kay, Stanford University Kevin Knight, USC/Information Sciences Institute Farhad Oroumchian, University of Wollongong in Dubai Ahmed Rafea, The American University in Cairo Jean Senellart, SYSTRAN Software Rémi Zajac, SYSTRAN Software From esteam@otenet.gr Fri Jan 30 14:11:30 2004 From: esteam@otenet.gr (ESTeam) Date: Fri, 30 Jan 2004 16:11:30 +0200 Subject: [MT-List] LREC Workshop on the Amazing Utility of Parallel and Comparable Corpora References: <13CB2B479B43D411BAC600B0D04947B40DF5DED8@ascension.inxight.com> Message-ID: <401A6611.163DAC04@otenet.gr> Workshop on the Amazing Utility of Parallel and Comparable Corpora “The Re-Use of Parallel and Comparable Corpora including Translation Memories for Applications such as Machine Translation, Knowledge Structuring and Multilingual Information Browsing” Lisbon, Portugal, 25 May 2004 LREC 2004 Conference on Language Resources and Evaluation Second call for papers Workshop description and aim: Parallel and comparable corpora, including translation memories, contain a large amount of information for translation, multilingual lexicon building and multilingual knowledge structuring. In human language technology these resources can be used for multiple applications in speech or text processing. This workshop addresses the versality and utility of parallel and comparable corpora in the expanding and multilingual space of the internet. It aims at exchanging ideas concerning all aspects of re-usability of parallel and comparable corpora including translation memories for applications such as machine translation, knowledge structuring, multilingual information browsing and so on. Abstract submission: We invite submission of abstracts on the topics below. Abstracts should be approximately 600 words. Abstracts outlining live demonstrations are encouraged. Submissions should be sent before 16 February 2004 by email as attachments in doc, rtf, or plain text, to the contact person, Anna Samiotou esteam@otenet.gr, with the words ‘LREC2004 Workshop on the Amazing Utility of Parallel and Comparable Corpora’ in the subject heading. Notification of acceptance will be made on or before the 8 March 2004, together with any comments from reviewers from revision. Workshop topics: * re-use of parallel and comparable corpora including translation memories for: o machine translation o knowledge structuring o multilingual information browsing o speech multilingual applications * re-usability of generic vs task/domain specific existing parallel and comparable corpora * building language resourses for new applications from existing ones * methods for the extraction and acquisition of knowledge (e.g. terms, lexical information) from existing parallel and comparable corpora for re-using purposes in new applications * methods for identifying comparable corpora on the web * methods for quantifying comparability Organising Committee: Dr. Lambros Kranias, Chief Technical Manager ESTeam AB, Greece/Sweden Dr. Nicoletta Calzolari, Director, Istituto di Linguistica Computazionale del CNR, Italy Dr. Gregor Thurmair, R&D Manager Linguatec, Germany Dr. Yorick Wilks, Professor, University of Sheffield, UK Dr. Eduard Hovy, Information Sciences Institute, University of Southern California, USA Gudrun Magnusdottir, Managing Director ESTeam AB, Greece/Sweden Anna Samiotou, R&D Manager ESTeam AB, Greece/Sweden Dr. Khalid Choukri, CEO ELDA/ELRA Important Dates: Deadline for submissions: 16 February 2004 Notification of acceptance: 8 March 2004 Deadline final paper: 22 March 2004 Workshop date: full day 25 May 2004 From esteam@otenet.gr Fri Jan 30 14:39:53 2004 From: esteam@otenet.gr (ESTeam) Date: Fri, 30 Jan 2004 16:39:53 +0200 Subject: [MT-List] LREC Workshop on the Amazing Utility of Parallel and Comparable Corpora Message-ID: <401A6CB9.CE26FEB8@otenet.gr> ************APOLOGIES FOR RECEIVING MULTIPLE COPIES************ Workshop on the Amazing Utility of Parallel and Comparable Corpora "The Re-Use of Parallel and Comparable Corpora including Translation Memories for Applications such as Machine Translation, Knowledge Structuring and Multilingual Information Browsing" Lisbon, Portugal, 25 May 2004 LREC 2004 Conference on Language Resources and Evaluation Second call for papers Workshop description and aim: Parallel and comparable corpora, including translation memories, contain a large amount of information for translation, multilingual lexicon building and multilingual knowledge structuring. In human language technology these resources can be used for multiple applications in speech or text processing. This workshop addresses the versality and utility of parallel and comparable corpora in the expanding and multilingual space of the internet. It aims at exchanging ideas concerning all aspects of re-usability of parallel and comparable corpora including translation memories for applications such as machine translation, knowledge structuring, multilingual information browsing and so on. Abstract submission: We invite submission of abstracts on the topics below. Abstracts should be approximately 600 words. Abstracts outlining live demonstrations are encouraged. Submissions should be sent before 16 February 2004 by email as attachments in doc, rtf, or plain text, to the contact person, Anna Samiotou esteam@otenet.gr, with the words ‘LREC2004 Workshop on the Amazing Utility of Parallel and Comparable Corpora’ in the subject heading. Notification of acceptance will be made on or before the 8 March 2004, together with any comments from reviewers from revision. Workshop topics: - re-use of parallel and comparable corpora including translation memories for: o machine translation o knowledge structuring o multilingual information browsing o speech multilingual applications - re-usability of generic vs task/domain specific existing parallel and comparable corpora - building language resourses for new applications from existing ones - methods for the extraction and acquisition of knowledge (e.g. terms, lexical information) from existing parallel and comparable corpora for re-using purposes in new applications - methods for identifying comparable corpora on the web - methods for quantifying comparability Organising Committee: Dr. Lambros Kranias, Chief Technical Manager ESTeam AB, Greece/Sweden Dr. Nicoletta Calzolari, Director, Istituto di Linguistica Computazionale del CNR, Italy Dr. Gregor Thurmair, R&D Manager Linguatec, Germany Dr. Yorick Wilks, Professor, University of Sheffield, UK Dr. Eduard Hovy, Information Sciences Institute, University of Southern California, USA Gudrun Magnusdottir, Managing Director ESTeam AB, Greece/Sweden Anna Samiotou, R&D Manager ESTeam AB, Greece/Sweden Dr. Khalid Choukri, CEO ELDA/ELRA Important Dates: Deadline for submissions: 16 February 2004 Notification of acceptance: 8 March 2004 Deadline final paper: 22 March 2004 Workshop date: full day 25 May 2004 From dzo@bisharat.net Sat Jan 31 21:38:51 2004 From: dzo@bisharat.net (Don Osborn) Date: Sat, 31 Jan 2004 22:38:51 +0100 Subject: [MT-List] MT & African languages Message-ID: <00fe01c3e842$b8186d40$abe4fbc1@gktg001> This is a multi-part message in MIME format. ------=_NextPart_000_00FB_01C3E84B.0019FC00 Content-Type: text/plain; charset="Windows-1252" Content-Transfer-Encoding: quoted-printable I'm building a (very humble) page on MT and African languages at = http://www.bisharat.net/Trans/ and would be interested in any = suggestions or pointers. If it needs to be added, I am not an expert in = MT. Thanks in advance for any feedback and all the best! Don Osborn, Ph.D. dzo@bisharat.net *Bisharat! A language, technology & development initiative *Bisharat! Initiative langues - technologie - d=E9veloppement http://www.bisharat.net ------=_NextPart_000_00FB_01C3E84B.0019FC00 Content-Type: text/html; charset="Windows-1252" Content-Transfer-Encoding: quoted-printable
I'm building a (very = humble) page=20 on MT and African languages at http://www.bisharat.net/Trans/ and=20 would be interested in any suggestions or pointers. If it needs to be = added,=20 I am not an expert in MT. Thanks in advance for any feedback and = all the=20 best!
 
Don Osborn,=20 Ph.D.         dzo@bisharat.net
*Bisharat! A = language,=20 technology & development initiative
*Bisharat! Initiative langues = -=20 technologie - d=E9veloppement
http://www.bisharat.net
=
 
 
------=_NextPart_000_00FB_01C3E84B.0019FC00-- From jeff.allen@free.fr Sat Jan 31 23:36:10 2004 From: jeff.allen@free.fr (Jeff Allen) Date: Sun, 1 Feb 2004 00:36:10 +0100 Subject: [MT-List] MT Postediting publications In-Reply-To: <00fe01c3e842$b8186d40$abe4fbc1@gktg001> References: <00fe01c3e842$b8186d40$abe4fbc1@gktg001> Message-ID: <1075592170.401c3bea434bb@imp5-q.free.fr> Dear all, I've finally centralized the access to all of my publications on Machine Translation, Postediting, Controlled languages, etc at the following URL: http://www.geocities.com/mtpostediting/ During the coming days, I will be adding an entire section with links to publications/presentations by other people on the specific topic of MT Postediting. If you have written a paper, thesis, dissertation or conference presentation on MT Postediting and would like it to be listed at this web site, please contact me offline with a link to a URL, or a copy of it. Regards, Jeff Jeff Allen jeff.allen@free.fr or jeff@multilingual.com From cri@nats.informatik.uni-hamburg.de Tue Feb 3 14:24:21 2004 From: cri@nats.informatik.uni-hamburg.de (Cristina Vertan) Date: Tue, 3 Feb 2004 15:24:21 +0100 Subject: [MT-List] LREC 2004 workshop Message-ID: Our apologies for multiple postings ************************************************************************* DEADLINE EXTENSION 15th FEBRUARY ! International Workshop on LANGUAGE RESOURCES: INTEGRATION AND DEVELOPMENT IN E-LEARNING AND IN TEACHING COMPUTATIONAL LINGUISTICS. http://nats-wiki.informatik.uni-hamburg.de/~cri/lrecwp04/index.html Centro Cultural de Belem, LISBON, Portugal, 24th May 2004 *********************************************************************** In association with 4th INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION LREC 2004 http://www.lrec-conf.org/lrec2004/index.php Main conference 26-27 May 2004 MOTIVATION ========= Language resources (LRs) are of crucial importance not only for research and development in language and speech technology but also for teaching purposes. E-learning added a new dimension to the usability of language resources and made them interesting outside the area of computational linguistics. This one-day workshop will focus on the integration of LRs in the educational process and the relation between LRs and e-learning. Special attention will be dedicated to the use of LRs in the curriculum of computational linguistics. Its goal is four-fold: 1. to provide a forum for presentation, discussion and case collections of the use of LRs in in Linguistics and Computational Linguistics 2. to discuss the additional skills acquired by the students when using or developing LRs (e.g. how to acquire standards); 3. to discuss how the competence built up in the development of LRs could be used for creating better interfaces for educational material; 4. to discuss how LRs could be included in electronic scientific handbooks, which are crucial for hands-on experience in e-learning. Topics of interest include, but are not limited to: a. necessity of LRs in the curriculum; b. types of LRs used in the curriculum; c. integration of LRs in the curriculum; d. format and storage of LRs; e. access to and distribution of multilingual LRs for different purposes (e.g. Machine Translation, E-learning lessons, scientific manuals or electronic handbooks) The workshop will bring together (computational) linguists, LRs developers, developers of e-learning material, e-publishers and teachers. It will provide a forum for interaction among members of possibly disparate research communities, and a means for attendees to increase their knowledge and understanding of the potential of LRs in teaching computational linguistics and e-learning. The workshop will feature also a panel discussing longer-term activities such as a mailing list, an archive of educational materials based on LRs, etc. However the main goal of this workshop would be laying the groundwork for further professional collaboration in integrating LRs within different types of e-learning. Important dates =============== Abstract Submissions: February 15th, 2004 (extended) Notification of Acceptance: February 28th, 2004 Final Versions: April 1st, 2004 Workshop: May 24th, 2004 Submissions =========== The workshop will consist of paper presentation, demos and panel discussions. Abstracts for workshop contributions should not exceed two A4 pages for paper presentation and one A4 page for demo presentation. Each submission should show: the title; author(s); affiliation(s); and contact author's e-mail address, as well as postal address, telephone and fax numbers. Submissions are to be sent by email to Cristina Vertan (cri@nats.informatik.uni-hamburg.de), in plain text or PDF format. The final version of the accepted papers should not be longer than 6 pages. Instructions for formatting and presentation of the final version will be sent to authors upon notification of acceptance. Proceedings of the workshop will be printed by the LREC Local Organizing Committee. TIME SCHEDULE and REGISTRATION FEE ================================== The workshop will consist of a morning session and an afternoon session. The afternoon session will end with a panel discussion. For this full-day Workshop, the registration fee is 100 EURO for LREC Conference participants and 170 EURO for other participants. These fees will include coffee breaks and the Proceedings of the workshop. Organizing Committee =================== Paola Monachesi, University of Utrecht, Paola.Monachesi@let.uu.nl Cristina Vertan, University of Hamburg, cri@nats.informatik.uni-hamburg.de Walther von Hahn, University of Hamburg, vhahn@nats.informatik.uni-hamburg.de Susanne Jekat, Zuerich University of Applied Sciences Winterthur, jes@zhwin.ch From Andrejs@Tilde.lv Tue Feb 3 14:23:58 2004 From: Andrejs@Tilde.lv (=?windows-1257?Q?Andrejs_Vasi=EFjevs?=) Date: Tue, 3 Feb 2004 16:23:58 +0200 Subject: [MT-List] [E-CFP] Categorial Grammars 2004 HLT - the Baltic Perspective Message-ID: <89BDC0B404731447812DCBE6AEAE7BE021DE7F@PASTS.Tilde.lv> CALL FOR PAPERS The First Baltic Conference HUMAN LANGUAGE TECHNOLOGIES =96 THE BALTIC PERSPECTIVE RIGA, Latvia, April 21 =96 22, 2004 The conference will provide a significant push for research, development = and implementation of computer technologies for languages of the Baltic = countries. The main goals of the conference include raising public = awareness of the important role of the human language technologies in = information society, sharing information about latest developments in = the area of computer linguistics, building interdisciplinary ties = between specialists of computational science, linguistic and information = management, raising awareness about development of smaller languages in = the European Union and sharing experience and establishing co-operation = between different institutions of the Baltic countries and the European = Union. The conference will bring together researchers and IT companies = developing HLT and users of these technologies.=20 The conference will be one of the major events of Baltic IT&T 2004 Forum = (www.ebaltics.com) which has long traditions and important parallel = events: - The 8th International Conference "ICT in the Baltic Sea and CEE = Region" - Exhibition "Baltic IT&T 2004" - Global Cities Dialogue Workshop - Baltic ICT Partneriat Workshop - Meeting of Senior Officials of the Information Society of the Baltic = region countries=20 Combining the conference with a forum provides an excellent opportunity = to obtain a comprehensive overview about the activities in HLT = particularly and within ICT sector overall in the Baltic Sea region and = the new European Union member states. VENUE **************************************************** The Conference will be held at Radisson SAS Daugava Hotel, Kugu 24, = Riga, LATVIA MOTIVATIONS ********************************************** HLT is a new and developing field in Baltic countries. So far the Baltic = specialists in computer linguistics and HLT have had only one common = dedicated event - seminar supported by European Union =93Language and = Technology 2000=94 in November 1994.=20 The main goals of the 2004 conference ten after years after the first = event are: * to raise public awareness about important role of the HLT in = information society; * to share information about latest developments in the area of HLT; * to build interdisciplinary ties between specialists of computational = science, linguistic and information management; * to promote public-private partnership in development of HLT = applications; * to introduce Baltic participants to programs and activities of the = European Union in this field; * to raise awareness about development of languages of the new member = states in the European Union; * to share experience and establish cooperation between different = institutions of the Baltic countries and the European Union. CONFERENCE THEMES **************************************** The conference is the first event in 10 years to provide a wide coverage = of HLT-related research and development in Baltics. The wide range of = topics to be addressed in the conference includes, but is not limited = to: HLT in Baltic countries =96 overview, policy, cooperation, opportunities * Language policy and new technologies =96 Baltic perspective * Current situation in the area of HLT in Baltic countries * European Union HLT policy, programs and supported activities * Towards building of global policy on promotion of language diversity = in cyberspace * Baltic Sea region and European Cooperation: opportunities, existing = and potential projects Research and Development of HLT * Machine Translation and inter-language communication systems * Speech technologies * Information retrieval systems * Multimodal dialog systems * Methods and tools for language processing Language Corpuses and Content Development * Promoting native-language content in Baltic cyberspace * Terminology databases * Text corpora, thesauruses, ontologies * Electronic dictionaries * Speech corpuses * Methods for development and maintenance of corpuses * Corpuses and Copyrights Demonstrations and user studies * Demonstration of practical applications in different HLT areas * User needs, user experience in practical application of HLT IMPORTANT DATES **************************************** * February 20, 2004: Deadline for abstract submission * March 5, 2004: Acceptance notifications and preliminary program * April 5, 2004: Deadline final version of accepted papers * April 21 =96 22, 2004: Conference SUBMISSIONS ******************************************** The conference will consist primarily of paper presentations, panel and = demonstration sessions. Papers of the conference will be published in = official conference proceedings. Abstract of presentation (not exceeding 500 characters) should be sent = till February 20, 2004 to the following e-mail: bitt.paper@dtmedia.lv Full papers in English (not exceeding 15000 characters) should be sent = after acceptance notification until April 5, 2004.=20 Demonstrations of HLT tools and applications are encouraged (a = demonstration outline of 2 pages can be submitted). LANGUAGE ************************************************ The conference language for presentations and publications is English. SPONSORSHIP OPPORTUNITIES ******************************* Please contact conference organizers for sponsorship opportunities. CONFERENCE ORGANISERS *********************************** The initiator and host of the conference is the Commission of the = Official Language at the Chancellery of the President of Latvia. = Partners in organisation of the conference are Latvian Information = Technology and Telecommunications Association (LITTA) and Data Media = Group. Among institutions to be invited to participate in the = organisation of conference and program committee are Estonian Language = Commission, Lithuanian Language Commission, European Network of Language = Excellence, University of Latvia, University of Kaunas, Tartu = University, SICS, IT Committee of Cabinet of Ministers of Lithuania, = Tampere University and others. From jeff.allen@free.fr Thu Feb 12 18:47:25 2004 From: jeff.allen@free.fr (Jeff Allen) Date: Thu, 12 Feb 2004 19:47:25 +0100 Subject: [MT-List] IN MEMORIAM: Emilio Benito In-Reply-To: <200401291136_MC3-1-6A87-2718@compuserve.com> References: <200401291136_MC3-1-6A87-2718@compuserve.com> Message-ID: <1076611645.402bca3d96d77@imp4-q.free.fr> MT listers, please see the sorrowful announcement below. This was just published the LANTRA-L translator and interpreter list, however I did not see it at the ATRIL site. I am relying on this being accurate information as posted to LANTRA. Jeff >>>>>>>>>>>>>>>>>>>> >From ATRIL: Madrid, February 12, 2004 It is with the greatest sorrow that we announce the passing away of Emilio Benito, President and Founder of ATRIL and original author of the DEjA Vu software concept. Emilio died aged 56 last Sunday morning in Madrid, Spain, after a long struggle with a cruel disease. For more than a decade prior to his effective retirement in early 2003, Emilio's legendary work inspired the translation software industry, setting the standard for translation software quality, speed, reliability and, perhaps more than anything else, customer care. Following a smooth, seamless transfer of roles over the past two years, Emilio's work has been continued by ATRIL's team of software and support engineers led by his son, Daniel Benito. From the solid DEjA Vu platform designed by Emilio, Daniel has taken the DEjA Vu approach to new heights with the D=E9j=E0 Vu X software range, marking the beginning of the new DEjA era-- an era that has just only begun. Emilio's life and work is the legend of a humble man with the mind of a genius who practised software programming as a form of art and customer support as an act of love. His example will not be in vain. While we mourn Emilio's death, we shall use the inspiration he left us with to reach the goals that he visualized for Atril and those of D=E9j=E0 Vu users worldwide. Emilio will be remembered forever in our hearts. May he rest in peace. ATRIL From coolspeech@hotmail.com Fri Feb 13 12:24:36 2004 From: coolspeech@hotmail.com (ByteCool Software) Date: Fri, 13 Feb 2004 12:24:36 +0000 Subject: [MT-List] BabelCode Project Message-ID: BabelCode Project investigates a new methodology of controlled translation and makes it available for practical use. By-products such as foreign language writing assistants and learning tools are also useful applications based on BabelCode databases. Website: http://www.babelcode.org Z.Y. Yao, Principal Investigator of BabelCode Project info@babelcode.org _________________________________________________________________ Choose now from 4 levels of MSN Hotmail Extra Storage - no more account overload! http://click.atdmt.com/AVE/go/onm00200362ave/direct/01/ From esteam@otenet.gr Mon Feb 16 15:23:41 2004 From: esteam@otenet.gr (ESTeam) Date: Mon, 16 Feb 2004 17:23:41 +0200 Subject: [MT-List] Extended Deadline: LREC2004 Workshop Call on "the Amazing Utility of Parallel and Comparable Corpora" Message-ID: <4030E07D.D31B4F66@otenet.gr> *************ATTENTION PLEASE !!! IF YOU HAVE SENT AN ABSTRACT BUT HAVE NOT RECEIVED A CONFIRMATION OF RECEIVAL PLEASE RE-SENT IT TO BOTH: esteam@otenet.gr and esteam@telia.com*************** ************APOLOGIES FOR RECEIVING MULTIPLE COPIES************ Workshop on the Amazing Utility of Parallel and Comparable Corpora "The Re-Use of Parallel and Comparable Corpora including Translation Memories for Applications such as Machine Translation, Knowledge Structuring and Multilingual Information Browsing" Lisbon, Portugal, 25 May 2004 LREC 2004 Conference on Language Resources and Evaluation Extension of Deadline for Call for Papers Workshop description and aim: Parallel and comparable corpora, including translation memories, contain a large amount of information for translation, multilingual lexicon building and multilingual knowledge structuring. In human language technology these resources can be used for multiple applications in speech or text processing. This workshop addresses the versality and utility of parallel and comparable corpora in the expanding and multilingual space of the internet. It aims at exchanging ideas concerning all aspects of re-usability of parallel and comparable corpora including translation memories for applications such as machine translation, knowledge structuring, multilingual information browsing and so on. Abstract submission: We invite submission of abstracts on the topics below. Abstracts should be approximately 600 words. Abstracts outlining live demonstrations are encouraged. Submissions should be sent before 16 February 2004 by email as attachments in doc, rtf, or plain text, to the contact person, Anna Samiotou esteam@otenet.gr, with the words ‘LREC2004 Workshop on the Amazing Utility of Parallel and Comparable Corpora’ in the subject heading. Notification of acceptance will be made on or before the 8 March 2004, together with any comments from reviewers from revision. Workshop topics: - re-use of parallel and comparable corpora including translation memories for: o machine translation o knowledge structuring o multilingual information browsing o speech multilingual applications - re-usability of generic vs task/domain specific existing parallel and comparable corpora - building language resourses for new applications from existing ones - methods for the extraction and acquisition of knowledge (e.g. terms, lexical information) from existing parallel and comparable corpora for re-using purposes in new applications - methods for identifying comparable corpora on the web - methods for quantifying comparability Organising Committee: Dr. Lambros Kranias, Chief Technical Manager ESTeam AB, Greece/Sweden Dr. Nicoletta Calzolari, Director, Istituto di Linguistica Computazionale del CNR, Italy Dr. Gregor Thurmair, R&D Manager Linguatec, Germany Dr. Yorick Wilks, Professor, University of Sheffield, UK Dr. Eduard Hovy, Information Sciences Institute, University of Southern California, USA Gudrun Magnusdottir, Managing Director ESTeam AB, Greece/Sweden Anna Samiotou, R&D Manager ESTeam AB, Greece/Sweden Dr. Khalid Choukri, CEO ELDA/ELRA Important Dates: Extended Deadline for submissions: 1 March 2004 Notification of acceptance: 8 March 2004 Deadline final paper: 22 March 2004 Workshop date: full day 25 May 2004 From WJHutchins@compuserve.com Fri Feb 20 10:11:23 2004 From: WJHutchins@compuserve.com (John Hutchins) Date: Fri, 20 Feb 2004 05:11:23 -0500 Subject: [MT-List] EAMT Workshop 2004 registration and hotels Message-ID: <200402200511_MC3-1-70C7-50F9@compuserve.com> Information about the EAMT Workshop in Malta, 26-27 April 2004 is now available at: http:// www.cs.um.edu.mt/~mros/EAMT04 = The information includes details of registration fees, a registration form, information about hotels and booking of accommodation, and links to other information about the venue and about Malta. A link to this site from the EAMT website will appear shortly. We look forward to seeing you in Malta. From: John Hutchins 20 February 2004 From jeff.allen@free.fr Sat Feb 21 11:35:39 2004 From: jeff.allen@free.fr (Jeff Allen) Date: Sat, 21 Feb 2004 12:35:39 +0100 Subject: [MT-List] what happened to EAMT website In-Reply-To: <200402200511_MC3-1-70C7-50F9@compuserve.com> References: <200402200511_MC3-1-70C7-50F9@compuserve.com> Message-ID: <1077363339.4037428bc6070@imp6-q.free.fr> Quoting John Hutchins wrote on 20 February 2004 > A link to this site from the EAMT website will appear shortly. > We look forward to seeing you in Malta. Help!!!! What happened to the EAMT website? I'm trying to access the archives for an early 1990s paper. http://www.eamt.org now redirects to the Dublin City Univ website with a login and password request. http://www.eamt.org/mt-list.html gives page not found. Jeff jeff.allen@free.fr From Andrejs@Tilde.lv Mon Feb 23 20:04:20 2004 From: Andrejs@Tilde.lv (=?windows-1257?Q?Andrejs_Vasi=EFjevs?=) Date: Mon, 23 Feb 2004 22:04:20 +0200 Subject: [MT-List] [E-CFP] HLT Baltic Conference - Deadline Extension Message-ID: <89BDC0B404731447812DCBE6AEAE7BE021DF2A@PASTS.Tilde.lv> Deadline extension: CALL FOR PAPERS Extended deadline: February 29, 2004 --------------------------- The First Baltic Conference HUMAN LANGUAGE TECHNOLOGIES =96 THE BALTIC PERSPECTIVE RIGA, Latvia, April 21 =96 22, 2004 The conference will provide a significant push for research, development = and implementation of computer technologies for languages of the Baltic = countries. The main goals of the conference include raising public = awareness of the important role of the human language technologies in = information society, sharing information about latest developments in = the area of computer linguistics, building interdisciplinary ties = between specialists of computational science, linguistic and information = management, raising awareness about development of smaller languages in = the European Union and sharing experience and establishing co-operation = between different institutions of the Baltic countries and the European = Union. The conference will bring together researchers and IT companies = developing HLT and users of these technologies.=20 The conference will be one of the major events of Baltic IT&T 2004 Forum = (www.ebaltics.com) which has long traditions and important parallel = events: - The 8th International Conference "ICT in the Baltic Sea and CEE = Region" - Exhibition "Baltic IT&T 2004" - Global Cities Dialogue Workshop - Baltic ICT Partneriat Workshop - Meeting of Senior Officials of the Information Society of the Baltic = region countries=20 Combining the conference with a forum provides an excellent opportunity = to obtain a comprehensive overview about the activities in HLT = particularly and within ICT sector overall in the Baltic Sea region and = the new European Union member states. VENUE The Conference will be held at Radisson SAS Daugava Hotel, Kugu 24, = Riga, LATVIA MOTIVATIONS HLT is a new and developing field in Baltic countries. So far the Baltic = specialists in computer linguistics and HLT have had only one common = dedicated event - seminar supported by European Union =93Language and = Technology 2000=94 in November 1994.=20 The main goals of the 2004 conference ten after years after the first = event are: * to raise public awareness about important role of the HLT in = information society; * to share information about latest developments in the area of HLT; * to build interdisciplinary ties between specialists of computational = science, linguistic and information management; * to promote public-private partnership in development of HLT = applications; * to introduce Baltic participants to programs and activities of the = European Union in this field; * to raise awareness about development of languages of the new member = states in the European Union; * to share experience and establish cooperation between different = institutions of the Baltic countries and the European Union. CONFERENCE THEMES The conference is the first event in 10 years to provide a wide coverage = of HLT-related research and development in Baltics. The wide range of = topics to be addressed in the conference includes, but is not limited = to: HLT in Baltic countries =96 overview, policy, cooperation, opportunities * Language policy and new technologies =96 Baltic perspective * Current situation in the area of HLT in Baltic countries * European Union HLT policy, programs and supported activities * Towards building of global policy on promotion of language diversity = in cyberspace * Baltic Sea region and European Cooperation: opportunities, existing = and potential projects Research and Development of HLT * Machine Translation and inter-language communication systems * Speech technologies * Information retrieval systems * Multimodal dialog systems * Methods and tools for language processing Language Corpora and Content Development * Promoting native-language content in Baltic cyberspace * Terminology databases * Text corpora, thesauruses, ontologies * Electronic dictionaries * Speech corpora * Methods for development and maintenance of corpora * Corpora and Copyrights Demonstrations and user studies * Demonstration of practical applications in different HLT areas * User needs, user experience in practical application of HLT IMPORTANT DATES * February 29, 2004: Extended deadline for abstract submission * March 5, 2004: Acceptance notifications * April 5, 2004: Deadline final version of accepted papers * April 21 =96 22, 2004: Conference SUBMISSIONS The conference will consist primarily of paper presentations, panel and = demonstration sessions. Papers of the conference will be published in = official conference proceedings. Abstract of presentation (not exceeding 1000 characters) should be sent = till February 29, 2004 to the following e-mail: bitt.paper@dtmedia.lv Full papers in English (not exceeding 15000 characters) should be sent = after acceptance notification until April 5, 2004.=20 Demonstrations of HLT tools and applications are encouraged (a = demonstration outline of 2 pages can be submitted). LANGUAGE The conference language for presentations and publications is English. SPONSORSHIP OPPORTUNITIES Please contact conference organizers for sponsorship opportunities. CONFERENCE ORGANISERS The initiator and host of the conference is the Commission of the = Official Language at the Chancellery of the President of Latvia. = Partners in organization of the conference are Latvian Information = Technology and Telecommunications Association (LITTA) and Data Media = Group.=20 PROGRAMME COMMITTEE Jussi Karlgren, Swedish Institute of Computer Science (SICS), Sweden Steven Krauwer, European Network of Language Excellence (Elsnet), The = Netherlands Ruta Marcinkeviciene, Vytautas Magnus University, Lithuania Tiit Roosma, University of Tartu, Estonia Inguna Skadina, University of Latvia, Tilde, Latvia Irena Smetoniene, Lithuanian Language Commission, Lithuania Eero Sormunen, Tampere University, Finland Andrejs Spektors, University of Latvia, Latvia Andrejs Vasiljevs, Tilde, Commission of the Official Language, Latvia From Michel.Simard@xrce.xerox.com Tue Feb 24 09:48:09 2004 From: Michel.Simard@xrce.xerox.com (Michel Simard) Date: Tue, 24 Feb 2004 10:48:09 +0100 Subject: [MT-List] Journal of Natural Language Engineering: Special Issue on Parallel Texts -- 2nd Call for Papers Message-ID: <403B1DD9.2020801@xrce.xerox.com> (apologies for multiple postings) S E C O N D C A L L F O R P A P E R S Journal of Natural Language Engineering Special Issue on PARALLEL TEXTS Guest Editors: Rada Mihalcea, University of North Texas Michel Simard, Xerox Research Centre Europe http://www.cs.unt.edu/~rada/jnle OBJECTIVE OF THIS SPECIAL ISSUE Parallel texts have become a vital resources for efficiently deriving many multi-lingual text processing tools. This special issue is devoted to advances in building and using parallel corpora. We invite papers on all topics related to parallel texts, including but not limited to: The collection, organization and processing of parallel corpora: - Identifying and harvesting parallel texts from the Web and other large collections - Evaluating the quality of parallel corpora (e.g. detecting omissions and gaps, translation errors or inconsistencies, etc.) - Sentence-, phrase- and word-level alignment - Alignment evaluation metrics and methods Active uses of parallel corpora for: - Building multilingual lexical resources - Deriving language processing tools and resources for new languages - Annotating corpora (e.g. word-sense disambiguation) - Machine translation (e.g. statistical and example-based MT) - Machine-assisted translation (e.g. translation memories and interactive MT) - Cross-linguistic information retrieval and information extraction While we invite submissions addressing any of the above topics, or related issues, we particularly welcome work involving parallel corpora addressing languages with scarce resources. SUBMISSION FORMAT We are expecting full papers to describe original, previously unpublished research, addressing issues related to the construction and use of parallel texts. Papers should be formatted according to the NLE journal instructions, and should not exceed 15 pages. The preferred formatting system is LaTeX, which can be used for direct typesetting, and a style file is available through anonymous ftp from the following address: ftp.cup.cam.ac.uk/pub/texarchive/journals/latex/nle-sty/. In case of difficulty there is a helpline available on e-mail: texline@cup.cam.ac.uk. Send your submission (a PostScript or PDF file), prepared for anonymous review, to both: Rada Mihalcea, University of North Texas, rada@cs.unt.edu and Michel Simard, Xerox Research Centre Europe, Michel.Simard@xrce.xerox.com IMPORTANT DATES Paper submissions: May 1, 2004 Notification of acceptance: August 30, 2004 Final versions due: November 30, 2004 Journal publication: June, 2005 PROGRAM COMMITTEE Lars Ahrenberg, Linkoping University Susan Armstrong, ISSCO Michael Barlow, Rice University Ken Church, AT&T Labs Research Ido Dagan, Bar-Ilan University Jason Eisner, Johns Hopkins University George Foster, University of Montreal Pascale Fung, University of Science and Technology, Hong Kong Eric Gaussier, Xerox Research Centre Europe Ulrich Germann, University of Toronto Daniel Gildea, University of Rochester Julio Gonzalo, UNED Cyril Goutte, Xerox Research Centre Europe Gregory Grefenstette, Clairvoyance Corporation Eduard Hovy, University of Southern California / Information Sciences Institute Pierre Isabelle, Xerox Research Centre Europe Hitoshi Iida, Tokyo University of Technology Wessel Kraaij, TNO/TPD Netherlands Philippe Langlais, University of Montreal Elliot Macklovitch, University of Montreal Dan Melamed, New York University Ruslan Mitkov, University of Wolverhampton Hermann Ney, RWTH Aachen Franz Och, Information Sciences Institute Kemal Oflazer, Sabanci University Kishore Papineni, IBM Ted Pedersen, University of Minnesota, Duluth Jessie Pinkham, University of Chicago Andrei Popescu-Belis, ISSCO/TIM/ETI University of Geneva Florence Reeder, MITRE Philip Resnik, University of Maryland Harold Somers, University of Manchester Institute of Science and Technology Hideki Tanaka, ATR Spoken Language Translation Research Laboratories Arturo Trujillo, Canon Research Centre Europe Jean Veronis, University of Provence Clare Voss, Army Research Lab Yorick Wilks, University of Sheffield Dekai Wu, University of Science and Technology, Hong Kong Kenji Yamada, Xerox Research Centre Europe ABOUT THE JOURNAL Natural Language Engineering is an international journal designed to meet the needs of professionals and researchers working in all areas of computerized language processing, whether from the perspective of theoretical or descriptive linguistics, lexicology, computer science or engineering. Its principal aim is to bridge the gap between traditional computational linguistics research and the implementation of practical applications with potential real-world use. As well as publishing research articles on a broad range of topics from text analysis, machine translation and speech generation and synthesis to integrated systems and multi modal interfaces the journal also publishes book reviews. Its aim is to provide the essential link between industry and the academic community. Natural Language Engineering encourages papers reporting research with a clear potential for practical application. Theoretical papers that consider techniques in sufficient detail to provide for practical implementation are also welcomed, as are shorter reports of on-going research, conference reports, comparative discussions of NLE products, and policy-oriented papers examining e.g. funding programs or market opportunities. All contributions are peer reviewed. Edited by John I. Tait University of Sunderland, UK Branimir K. Boguraev IBM Thomas J. Watson Research Center, New York, USA Christian Jacquemin CNRS-LIMSI, France .