GSoC/GCI Archive
Google Code-in 2012 Apertium

Extract Armenian adverb translations from Wiktionary

completed by: Denis Nikolov

mentors: Francis Tyers, Jonathan

Wiktionary has lots of translations for Armenian adverbs, for example consider the page:

http://en.wiktionary.org/wiki/%D5%A1%D5%A3%D5%A1%D5%B0%D5%A1%D5%A2%D5%A1%D6%80

 

Adverb

ագահաբար (agahabar)

  1. greedily, avidly
  2. eagerly

 

...

 

The idea of this task is to extract these translations into lttoolbox XML format as follows:

<e c=""><p><l>ագահաբար<s n="adv"/></l><r>greedily<s n="adv"/></r></p></e>
<e c=""><p><l>ագահաբար<s n="adv"/></l><r>avidly<s n="adv"/></r></p></e>
<e c=""><p><l>ագահաբար<s n="adv"/></l><r>eagerly<s n="adv"/></r></p></e>

 

You will need to look out for:

 

* Make sure that the translations are from the "Adverb" section -- not the "Adjective" section. Many Armenian adverbs can also be adjectives.

 

 

For further information about this task, join us on IRC: irc.freenode.net #apertium