From ram@srv.net Sat Jan 5 20:16:03 2002
From: ram@srv.net (Rick Morneau)
Date: Sat, 5 Jan 2002 13:16:03 -0700
Subject: [MT-List] Generation of English genitives
Message-ID: <200201052016.g05KG3A17707@localhost.localdomain>

Does anyone have any pointers to algorithms that can be used to determine whether to generate an apostrophe-s genitive or a genitive using "of"? I'm currently using an ad hoc approach that doesn't always work very well, but I'd really like to use something more principled.

Any help would be greatly appreciated.

Regards,

Rick Morneau
ram@srv.net
ram@axxess.net

From steven.krauwer@let.uu.nl Tue Jan 15 15:22:23 2002
From: steven.krauwer@let.uu.nl (Steven Krauwer)
Date: Tue, 15 Jan 2002 16:22:23 +0100 (MET)
Subject: [MT-List] CfP: MT Roadmap Workshop at TMI2002, Keihanna, Japan
Message-ID: <200201151522.g0FFMNZ03799@sfinx.let.uu.nl>

_________________________________________________________________

Call for Papers

MT Roadmap Workshop (March 16)
at TMI2002 (March 13-17)
Keihanna (near Kyoto), Japan

Organized by ELSNET

Background:

Since 2000, ELSNET (The European Network of Excellence in Human Language Technologies) has organised a series of workshops aimed at the creation of a broadly supported technological roadmap for various subfields of language and speech technology. A technology roadmap comprises an analysis of the present situation, a vision of where we want to be in e.g. ten years from now, and a number of intermediate milestones that would help in setting intermediate goals and in measuring our progress towards our goals. The function of the road map is not to impose anything on anyone, but rather to provide a broadly supported definition of a context in which to position the MT community's efforts, which would allow us to identify common priorities for joint activities in e.g. research, resources and training.
ELSNET aims at (co-)organizing roadmap workshops at all major events in order to encourage continuous reflection on where we stand, where we want to go, and, most importantly, how we can get there.

Format:

It will be a one-day workshop consisting of three components:
* The first component will aim at giving a critical analysis of the present state of Machine Translation in the broadest sense.
* The second component will be dedicated to visions of the future.
* The third component will aim at identifying major research challenges and establishing intermediate milestones on our way towards our goals.

Each of the components could contain
* an invited speaker or panel,
* submitted papers (reviewed),
* and ample space for discussion.

The results of the discussions will be published in a report on ELSNET's website, which will serve as a starting point for a broad consultation of the MT community on the future directions of Machine Translation in the broadest sense.

Papers:

We invite papers that
* give critical analyses of the present state of the art in machine translation of written and spoken language,
* present visions of the future of machine translation, from a theoretical, a technological, or an application point of view, or
* identify major milestones and challenges (theoretical, practical and organisational) on our way towards the future.

Submission:

Abstracts must be submitted in English, and should be no more than 4 pages long and in single column format. Submissions should be sent electronically in plain text, MS Word or PDF to steven.krauwer@elsnet.org.
Important dates:
* Submission deadline: 13 February
* Notification: 22 February
* Final papers due: 5 March
* Workshop: 16 March

Proceedings:

Participants will receive proceedings, containing
* a summary of the results of the previous MT Roadmap workshop (at the MT Summit)
* summaries of the invited talks
* full versions of the submitted papers

Audience:

The primary audience consists of people with an analytical or future oriented, programmatic interest, both from research and from industry.

Registration:

The workshop registration fee is 5000 yen (approx. 38 USD or 43 Euro). Registration details can be found on the TMI 2002 website (see below).

URLs:

Main conference: http://www.kecl.ntt.co.jp/events/tmi/
This workshop: http://www.elsnet.org/roadmap-tmi2002.html

Contact point:

Steven Krauwer (steven.krauwer@elsnet.org)
ELSNET / Utrecht University
Trans 10, 3512 JK Utrecht, NL
phone +31 30 253 6050
fax +31 30 253 6000

Core Programme Committee:
* Steven Krauwer
* Laurie Gerber

From steveri@microsoft.com Mon Jan 21 20:24:05 2002
From: steveri@microsoft.com (Steve Richardson)
Date: Mon, 21 Jan 2002 12:24:05 -0800
Subject: [MT-List] AMTA-2002 Call for Participation
Message-ID: <0FDD2891FCDF6E42891AEDE2198E5F7C0425D66F@red-msg-04.redmond.corp.microsoft.com>

--- CALL FOR PARTICIPATION ---

The Association for Machine Translation in the Americas

AMTA-2002 Conference
Location: Tiburon, California
Dates: October 8-12, 2002

The Association for Machine Translation in the Americas (AMTA) is pleased to announce its fifth biennial conference, planned for October 8-12, 2002, in Tiburon (near San Francisco), California.
Conference theme: From Research to Real Users

Ever since the showdown between Empiricists and Rationalists a decade ago at TMI-92, MT researchers have hotly pursued promising paradigms for MT, including data-driven approaches (e.g., statistical, example-based) and hybrids that integrate these with more traditional rule-based components.

During the same period, commercial MT systems with standard transfer architectures have evolved along a parallel and almost unrelated track, increasing their coverage (primarily through manual update of their lexicons, we assume) and achieving much broader acceptance and usage, principally through the medium of the Internet. Web page translators have become commonplace; a number of online translation services have appeared, including in their offerings both raw and post-edited MT; and large corporations have been turning increasingly to MT to address the exigencies of global communication. Still, the output of the transfer-based systems employed in this expansion represents but a small drop in the ever-growing translation marketplace bucket.

Now, 10 years later, we wonder if this mounting variety of MT users is any better off, and if the promise of the research technologies is being realized to any measurable degree. In this regard, we pose the following questions:

Why aren't any current commercially available MT systems primarily data-driven?

Do any commercially available systems integrate (or plan to integrate) data-driven components?

Do data-driven systems have significant performance or quality issues?

Can such systems really provide better quality to users, or is their main advantage one of fast, facilitated customization?
If any new MT technology could provide such benefits (somewhat higher quality, or facilitated customization), would that be the key to more widespread use of MT, or are there yet other more relevant unresolved issues, such as system integration?

If better quality, customization, or system integration aren't the answer, then what is it that users really need from MT in order for it to be more useful to them?

We solicit participation on these and other topics related to the research, development, and use of MT in the form of original papers, demonstrations, workshops, tutorials, and panels. We invite all who are interested in MT to participate, including developers, researchers, end users, professional translators, managers, and marketing experts. We especially invite users to share their experiences, developers to describe their novel systems, managers and marketers to talk about what is happening in the marketplace, researchers to detail new capabilities or methods, and visionaries to describe the future as they see it. We also welcome and encourage participation by members of AMTA's sister organizations, AAMT in Asia and EAMT in Europe.

Details regarding the conference may be found on the AMTA Web site:
http://www.amtaweb.org/AMTA2002/

CONFERENCE ORGANIZERS
Elliott Macklovitch, General Chair
Stephen D. Richardson, Program Chair
Violetta Cavalli-Sforza, Local Arrangements Chair
Bob Frederking, Workshops and Tutorials
Laurie Gerber, Exhibits Coordinator

AMTA-2002: PAPER AND SYSTEM DESCRIPTION/DEMONSTRATION SUBMISSIONS.

Authors/system developers are invited to submit presentations in English in any of the following three categories:

1. Theoretical papers: Unpublished papers describing original work on all aspects of Machine Translation.
Preference will be given to papers that include concrete results and that address the theme of moving MT research technology (including, but not limited to, data-driven systems or components) into real use. Papers should not be longer than 10 pages, with minimum character font size of 11 pt.

2. User studies: Studies of users' experiences with implementing MT or testing its applicability to some task. Of particular interest are experiences deploying new or advanced MT technology in a production context. Users, managers, and sales/marketing professionals are especially welcome to submit. Studies should not be longer than 8 pages, with minimum character font size of 11 pt.

3. System descriptions with optional system demonstrations: Approx. 25 minutes will be allocated per system description/demo. Submissions should not be longer than 4 pages. The goal of system descriptions is to educate participants about the features and functionality of current and emerging MT systems. Sales presentations are not appropriate. The following information should be provided in each system description:
- name and contact information of system builder
- system category (research, pre-market prototype, or commercially available)
- system characteristics (e.g., languages, domains, integration/networking features)
If a system demonstration is included, please provide the following information:
- hardware platform and operating system
- name and contact information of system operations specialist

First page: All submissions should include a separate title page with the following information:
- paper title,
- author(s)' name(s), address(es), telephone and fax numbers, email address(es),
- one-paragraph abstract,
- for theoretical papers: subject area keyword(s)
- for user studies: the words "User study"
- for system descriptions/demos: the words "System description/demo"

DEADLINES and SCHEDULE:
Submissions due at address below: April 15, 2002 (Monday)
Notification of acceptance: May 31, 2002 (Friday)
Final versions of papers due: July 15, 2002 (Monday)

Electronic submissions are strongly preferred. They should be sent to:
email address: steveri@microsoft.com
subject line: AMTA-2002 submission
in one of the following formats:
- Microsoft Word (RTF format)
- PostScript
- ASCII plain text

Hardcopy submissions (please send four (4) copies):
AMTA-2002: Stephen D. Richardson
Microsoft Research
One Microsoft Way
Redmond, WA 98052
USA
Dear all:
Does anybody know of a tool that allows the 'translation' of Cyrillic
characters to Latin characters via mouse click?
I truly appreciate your input.
Best regards
Kerstin Sinautzki
--
BMW Group - Service
Translation (VS-41)
80788 Muenchen - Germany
phone.: +49 (0)89 382 - 25831
fax: +49 (0)89 382 - 41675
email: Kerstin.Sinautzki@bmw.de
Hi,
My name is Mohammad Hosin Mehraby Aghdam. I live in Iran (West Azarbaijan).
I am a computer engineer with 15 years of experience. From 1987 to 2002 I designed six software packages (MT and other applications). I am now designing a large software system (three versions designed so far) for translation from English into Turkish, Farsi, and Arabic, and I am ready to extend it to other fields.
My skills:
1- Fully proficient in morphology.
2- Fully proficient in semantics.
3- Fully proficient in analysis.
4- Modern mathematical methods for machine translation data structures.
5- New algorithms for parsing, pragmatics, and other fields in translation.
6- Fully proficient in Delphi, very strong in C++, and professional in data structures and IT systems.
Sincerely.
My e-mail addresses: DILM21@msn.com, hosinagdam2002@msn.com, and hosinagdam@msn.com
The Association for Machine Translation in the Americas
*** SUBMISSION GUIDELINES HAVE CHANGED ***
*** SEE UPDATED GUIDELINES BELOW ***
AMTA-2002 Conference
Location: Tiburon, California
Dates: October 8-12, 2002
The Association for Machine Translation in the Americas (AMTA) is pleased to announce its fifth biennial conference, planned for October 8-12, 2002, in Tiburon (near San Francisco), California.
CONFERENCE THEME: From Research to Real Users
Ever since the showdown between Empiricists and Rationalists a decade ago at TMI-92, MT researchers have hotly pursued promising paradigms for MT, including data-driven approaches (e.g., statistical, example-based) and hybrids that integrate these with more traditional rule-based components.
During the same period, commercial MT systems with standard transfer architectures have evolved along a parallel and almost unrelated track, increasing their coverage (primarily through manual update of their lexicons, we assume) and achieving much broader acceptance and usage, principally through the medium of the Internet. Web page translators have become commonplace; a number of online translation services have appeared, including in their offerings both raw and post-edited MT; and large corporations have been turning increasingly to MT to address the exigencies of global communication. Still, the output of the transfer-based systems employed in this expansion represents but a small drop in the ever-growing translation marketplace bucket.
Now, 10 years later, we wonder if this mounting variety of MT users is any better off, and if the promise of the research technologies is being realized to any measurable degree. In this regard, we pose the following questions:
Why aren't any current commercially available MT systems primarily data-driven?
Do any commercially available systems integrate (or plan to integrate) data-driven components?
Do data-driven systems have significant performance or quality issues?
Can such systems really provide better quality to users, or is their main advantage one of fast, facilitated customization?
If any new MT technology could provide such benefits (somewhat higher quality, or facilitated customization), would that be the key to more widespread use of MT, or are there yet other more relevant unresolved issues, such as system integration?
If better quality, customization, or system integration aren't the answer, then what is it that users really need from MT in order for it to be more useful to them?
We solicit participation on these and other topics related to the research, development, and use of MT in the form of original papers, demonstrations, workshops, tutorials, and panels. We invite all who are interested in MT to participate, including developers, researchers, end users, professional translators, managers, and marketing experts. We especially invite users to share their experiences, developers to describe their novel systems, managers and marketers to talk about what is happening in the marketplace, researchers to detail new capabilities or methods, and visionaries to describe the future as they see it. We also welcome and encourage participation by members of AMTA's sister organizations, AAMT in Asia and EAMT in Europe.
INVITED SPEAKERS
We are pleased to announce that invited speakers for the conference will include Yorick Wilks and Ken Church, both notable participants at TMI-92, and Jaap van der Meer, former president of ALPNET. We anticipate that the speakers will provide a sharp and stimulating focus on the theme of the conference.
Further details regarding the conference, including a call for Tutorial and Workshop proposals, may be found on the AMTA Web site at:
http://www.amtaweb.org/AMTA2002/
CONFERENCE ORGANIZERS
Elliott Macklovitch, General Chair
Stephen D. Richardson, Program Chair
Violetta Cavalli-Sforza, Local Arrangements Chair
Bob Frederking, Workshops and Tutorials
Laurie Gerber, Exhibits Coordinator
*** PLEASE NOTE THAT THE SUBMISSION GUIDELINES HAVE CHANGED ***
*** UPDATED GUIDELINES ARE PROVIDED BELOW ***
PAPER AND SYSTEM DESCRIPTION/DEMONSTRATION SUBMISSIONS GUIDELINES
We are pleased to announce that the AMTA-2002 conference proceedings will be published in the Lecture Notes in Artificial Intelligence series by Springer-Verlag. (LNCS/LNAI series home page is located at: http://www.springer.de/comp/lncs/index.html)
It is therefore recommended that initial submissions to AMTA-2002 adhere as closely as possible to the formatting guidelines for authors located at:
http://www.springer.de/comp/lncs/authors.html
These guidelines will need to be strictly adhered to for the final versions of submissions that are accepted for publication in the proceedings.
All submissions should be in English, and it is recommended that they be prepared using Latex2e or Microsoft Word, per instructions at the authors' web site given above (see site for details on using other text processing systems). Once prepared, they should be submitted electronically for review in one of the following three formats:
PDF (recommended)
PostScript
Microsoft Word
All submissions will be received and processed using the Conference Management Toolkit (CMT), located at: http://cmt.research.microsoft.com/AMTA2002.
Authors should follow the instructions at the CMT web site to register, enter information about themselves and their paper, and upload a copy of their paper in one of the acceptable formats by the submission deadline.
Any questions regarding submissions or the use of this web site should be directed in email to:
Important SUBMISSION DEADLINES are as follows:
Submissions uploaded at CMT web site: April 15, 2002 (Monday)
Notification of acceptance: May 31, 2002 (Friday)
Final versions of papers due: July 15, 2002 (Monday)
At the CMT web site, authors will be asked to designate their submissions for one of the three conference tracks listed below. Again, initial submissions are expected to adhere as closely as possible to the guidelines found at http://www.springer.de/comp/lncs/authors.html. Information regarding submission length and additional requirements is also provided below.
Conference tracks:
1. Theoretical papers: Unpublished papers describing original work on all aspects of Machine Translation. Preference will be given to papers that include concrete results and that address the theme of moving MT research technology (including, but not limited to, data-driven systems or components) into real use. Papers may not be longer than 10 pages.
2. User studies: Studies of users' experiences with implementing MT or testing its applicability to some task. Of particular interest are experiences deploying new or advanced MT technology in a production context. Users, managers, and sales/marketing professionals are especially welcome to submit. Studies may not be longer than 8 pages.
3. System descriptions with optional system demonstrations: Approx. 25 minutes will be allocated per system description/demo. Descriptions may not be longer than 4 pages. The goal of system descriptions is to educate participants about the features and functionality of current and emerging MT systems. Sales presentations are not appropriate. The following additional information should be provided in each system description:
- name and contact information of system builder
- system category (research, pre-market prototype, or commercially available)
- system characteristics (e.g., languages, domains, integration/networking features)
If a system demonstration is included, please provide the following information:
- hardware platform and operating system
- name and contact information of system operations specialist
Dear Sears,
Greetings.
I'm Mohamad Hosein Mehrabi, a computer engineer from Tehran University (Iran), with 12 years of experience in software design for MT (1990-2002). Below I describe my MT activities.
My request is:
Please tell me how I can cooperate with your company or teams, if that is possible.
During 1994-2000, I designed and produced three versions of a translator from English into Farsi/Arabic. I am now completing a software package that uses modern methods. Over these 12 years I solved many problems and found some innovative approaches to MT design (target only). I wrote two books about "ambiguity (or indeterminacies & unsnarl) and Data structure for MT with high coefficient".
I have been fully successful with MT design for English into Turkish, Azari, Persian, and Arabic.
I can produce an MT system (English into Turkish/Azari/Arabic/Persian) under the following terms.
I need:
- two lexicologists
- one typist
- 5-6 months for one target language.
I can explain all my findings at a conference or in meetings.
My skills in software and computational linguistics:
- Algorithm design with AI techniques & fuzzy logic
- Expert in pragmatic & semantic patterns
- Data structure analysis
- Design and production management
- Programming with C++, Delphi & Pascal
My phone: 0098 411 6580048
My e-mails: hosinagdam2002@msn.com
DILM21@msn.com
Accept my best wishes,
Sincerely,
Mohamad Hosein Mehrabi.
From m.shuttleworth@ic.ac.uk Mon May 27 13:46:00 2002
From: m.shuttleworth@ic.ac.uk (Shuttleworth, Mark)
Date: Mon, 27 May 2002 13:46:00 +0100
Subject: [MT-List] job opportunity
Message-ID:
-----Original Message-----
From: hannouna [mailto:hannouna@uruklink.net]
Sent: Thursday, January 18, 2001 3:20 PM
To: mt-list@eamt.org
Subject: [MT-List] MT Evaluation History
Dear Friends of MT-List,
I am a Ph.D. student carrying out research on the "Evaluation of MT Systems that go from English into Arabic". I need information on the history of MT evaluation; the theoretical background of MT evaluation metrics, scales, and measurements; and related issues. If you are willing to share information on these topics, please send it to the following e-mail address: hannouna@uruklink.net.
You can also contact me at this postal address:
Miss Yasmin H. Hannouna
AL-Karkh Post Office
P.O. Box 28322
Baghdad - Iraq
I highly appreciate your interest and cooperation.
Awaiting your kind reply.
Best regards.
Sincerely,
Yasmin
Online registration is now available. Register before July 31 and save!
For more details, visit the AMTA Web site at:
http://www.amtaweb.org
*****************************************
Elliott Macklovitch
Laboratoire RALI, Dept. d'informatique et
de recherche operationnelle
Universite de Montreal, C.P. 6128, succursale
Centre-ville
Montreal, Quebec, Canada H3C 3J7
tel: (514) 343-7535
fax: (514) 343-2496
http://www-rali.iro.umontreal.ca
From deborahc@microsoft.com Thu Jun 27 23:15:20 2002
From: deborahc@microsoft.com (Deborah Coughlin)
Date: Thu, 27 Jun 2002 15:15:20 -0700
Subject: [MT-List] Posting for Customization Strategies for MT Workshop
Message-ID: <9795C27BA2AECB4BA2A99CFD6068237B01C5DDB5@RED-MSG-10.redmond.corp.microsoft.com>
Customization Strategies for MT
Chairs: Jessie Pinkham, Deborah Coughlin, Bill Dolan
Machine translation systems must have customization capabilities in
order to claim success as a commercial product or a research prototype.
Minimally, these include the ability to add translations for new words
and phrases, but may also include more sophisticated functionality such
as adapting to new syntactic structures or writing styles, and may even
be the means of acquiring all the system's translation knowledge (e.g.,
in statistical systems). We propose to bring MT developers and
researchers together to discuss the customization capabilities of their
systems, with an emphasis on using common data for the discussion.
This workshop is intended to cover all types of customization
strategies, although preference in selection will be given to novel
strategies. We encourage participation from MT developers of commercial
systems, and researchers working on all types of MT systems (traditional
transfer, interlingua, example-based, or statistical).
To make the discussion more interesting, we request that participants
demonstrate the customization capabilities of their system using data
freely available to everyone, such as Hansard data for French-English or
other data available from ELRA or LDC. Microsoft has also agreed to
make technical manual data available for several language pairs (English
and any of these: French, Spanish, German, Japanese) for purposes of
research related to this workshop. (For access to this data please send
email to CustomWS@microsoft.com.)
We request that interested parties submit a two-page abstract with the
following information:
* overview of your MT system
* description of customization capabilities
* comparison of this strategy to other known strategies
* data that will be used to test capability
* proposed evaluation to determine the effectiveness of the
customization
* estimate of time that would be required to customize for the
chosen domain, based on your sample run.
Dates:
Call for participation: June 8
Abstract submission deadline: July 7
Acceptance notification: July 22
Early registration for AMTA: July 31
Papers due: September 6
AMTA workshop: October 8
Instructions for submission:
All submissions should be in English, and it is recommended that they be
submitted in one of the following three formats: PDF (preferred);
PostScript; Microsoft Word. All submissions will be received and
processed using the Conference Management Toolkit (CMT) located at
http://cmt.research.microsoft.com/CustomWS.
Authors should follow the instructions at the CMT web site to register,
enter information about themselves and their abstract, and upload a copy
of their abstract in one of the acceptable formats by the submission
deadline. Report any problems with the website to
CustomWS@microsoft.com.
For information on obtaining Microsoft Data to participate in the
workshop, please contact CustomWS@microsoft.com
From Lori Levin
Hello,
Has anyone by chance drawn up a test suite (or evaluation method) for testing and evaluating machine translation software and/or summarization software?
Thank you in advance for any useful information on this subject.
English query:
Looking for evaluation methods for translation and/or summarization software.
Thank you in advance.
Malek Boualem
France Télécom R&D
With the early registration date for the AMTA conference pushed back, and fewer abstracts submitted by the deadline than anticipated (and hoped for), the organizers of this workshop are extending the deadlines. They are also inviting specific groups and individuals working with various MT systems to participate, in the hope that they will take the opportunity to submit an abstract and join in what promises to be a very enlightening workshop.
We invite your participation--please review the Call below and consider submitting an abstract by July 24. Also, please forward this notice to anyone you feel may be interested in participating.
Thanks,
Steve Richardson, program chair, AMTA 2002
P.S. Online registration for AMTA 2002 is now available at www.amtaweb.org/AMTA2002.
********************************************************************************************
Call for Abstracts for the AMTA workshop on Customization Strategies for MT
**New abstract submission deadline: July 24**
Machine translation systems must have customization capabilities in order to claim success as a commercial product or a research prototype. Minimally, these include the ability to add translations for new words and phrases, but may also include more sophisticated functionality such as adapting to new syntactic structures or writing styles, and may even be the means of acquiring all the system's translation knowledge (e.g., in statistical systems). We propose to bring MT developers and researchers together to discuss the customization capabilities of their systems, with an emphasis on using common data for the discussion.
This workshop is intended to cover all types of customization strategies, although preference in selection will be given to novel strategies. We encourage participation from MT developers of commercial systems, and researchers working on all types of MT systems (traditional transfer, interlingua, example-based, or statistical).
To make the discussion more interesting, we would prefer (although it is not required) that participants demonstrate the customization capabilities of their system using data freely available to everyone, such as Hansard data for French-English or other data available from ELRA or LDC. Microsoft has also agreed to make technical manual data available for several language pairs (English and any of these: French, Spanish, German, Japanese) for purposes of research related to this workshop. (For access to this data please send email to CustomWS@microsoft.com.)
We request that interested parties submit a two-page abstract with the following information:
- overview of your MT system
- description of customization capabilities
- comparison of this strategy to other known strategies
- data that will be used to test capability
- proposed evaluation to determine the effectiveness of the customization
- estimate of time that would be required to customize for the chosen domain, based on your sample run.
Dates:
Call for participation: June 8
Abstract submission deadline: July 24
Acceptance notification: Aug 7
Early registration for AMTA: Aug 11
Papers due: September 6
AMTA workshop: October 8
Instructions for submission:
All submissions should be in English, and it is recommended that they be submitted in one of the following three formats: PDF (preferred); PostScript; Microsoft Word. All submissions will be received and processed using the Conference Management Toolkit (CMT) located at http://cmt.research.microsoft.com/CustomWS.
Authors should follow the instructions at the CMT web site to register, enter information about themselves and their abstract, and upload a copy of their abstract in one of the acceptable formats by the submission deadline. Report any problems with the website to CustomWS@microsoft.com.
For information on obtaining Microsoft Data to participate in the workshop, please contact CustomWS@microsoft.com
Organizers: Jessie Pinkham, Deborah Coughlin, Bill Dolan
Hello,
Is anyone conducting research on bilingual text/topic/product categorization?
What approaches have been used or proposed?
Thanks & regards
Hi,
We are in the process of developing the first Pashto-English-Pashto Machine Translation software. Does anyone know whether such databases have previously been developed by anyone? Please respond to hatamin@ciyasoft.com
Regards
Naquib Hatami
EVP Business Development
Ciyasoft Corporation, Inc.
(703) 899-6060
Hi,
Ciyasoft has developed the first English-Farsi-English Machine Translation software. Can anyone help us in selling these products?
Regards
Naquib Hatami
EVP Business Development
Ciyasoft Corporation, Inc.
(703) 899-6060
Is Latent Semantic Indexing just for information retrieval?
Can it be used for product categorization?
Let's say I have an English query: how can I conduct product categorization if the documents are in Malay?
Is there anyone doing any research on enriching the concepts of a category?
E.g., organizing mass information by categorizing documents according to their topics in a bilingual corpus.
Hello,
Does anybody have a solution or suggestion on how to build the logical form used by the Microsoft Research Centre to perform grammar checking? It has been implemented inside the Microsoft Word 95 Spelling Checker.
Thank you.
Does anyone know how to categorize Malay and English text/documents, or any equivalent bilingual languages?
If so, please send me some references.
Thank you.