
Lucas Werkmeister

@LucasWerkmeistr

I built a tool to create a @wikidata lexeme with a set of forms, e. g. declensions of a noun: https://tools.wmflabs.org/lexeme-forms/

6/13/2018, 1:38:41 PM

Favs: 42

Retweets: 9

Lucas Werkmeister

@LucasWerkmeistr

It currently supports German and Latin nouns, but should be able to support other languages and word types (e. g. verb conjugations) – please get in touch!

6/13/2018, 1:38:41 PM

Favs: 1

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

If a lexeme with the same lemma and language already exists, the tool warns you about it (this essential feature was suggested by @nightrose – thanks!):

6/13/2018, 1:38:42 PM

Favs: 4

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

translating that warning into Latin was a very special kind of fun :D

6/13/2018, 1:38:43 PM

Favs: 3

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

One limitation is that this doesn’t support variations of forms, e. g. „dem Kind/Kinde“ in German – you’ll have to do that manually afterwards.

6/13/2018, 1:38:43 PM

Favs: 1

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

And just to be clear, this is a private project of mine, just like the Wikidata Lexeme Graph Builder two weeks ago. Any problems with it are purely my fault, not the fault of the Wikidata team!

6/13/2018, 1:38:43 PM

Favs: 1

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

The Wikidata Lexeme Forms tool now offers an “advanced” mode, where you can leave out some forms, e. g. for words like “milk” that don’t have a plural form. Thanks @EvoMRI for the suggestion!

6/14/2018, 12:46:33 AM

Favs: 1

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

(These words are called “pluralia tantum”, by the way, which incidentally is one of my favorite plural forms – the singular is “plurale tantum”. It’s even better in German, where it’s a single word: „Pluraletantum“ becomes „Pluraliatantum“.)

6/14/2018, 12:46:34 AM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

wait, no, fuck, I got that all mixed up

6/14/2018, 3:26:12 AM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

CORRECTION: “milk” is a singulare tantum, not a plurale tantum. I got so excited about the words I mixed up the meanings :D

6/14/2018, 11:41:43 AM

Favs: 1

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

New Wikidata Lexeme Forms tool feature: in advanced mode, you can also specify a lexeme ID to add some forms to that lexeme instead of creating a new one. (Use with caution: the tool doesn’t check for duplicate forms!)

6/14/2018, 5:35:51 PM

Favs: 2

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

New Wikidata Lexeme Forms tool feature: if JavaScript is enabled, duplicates are searched for as soon as you’ve entered the first form (and any time you edit it). Thanks @EvoMRI for the suggestion! https://twitter.com/EvoMRI/status/1006974984617590784

6/17/2018, 11:44:53 PM

Favs: 1

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

The Wikidata Lexeme Forms tool now supports French nouns! Thanks User:Djiboun :)

7/15/2018, 7:09:04 PM

Favs: 0

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

(And just to be clear, the tool should be able to support verbs, adjectives and other kinds of words as well, as long as someone provides the template… nouns just seem to be easiest for now.)

7/15/2018, 7:09:05 PM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

The Wikidata Lexeme Forms tool now supports Finnish nouns, thanks to User:Shinnin! Apparently it’s not complete yet (modeling of comitative case is unclear), but still, at 22 forms it’s already the longest template :)

7/17/2018, 10:31:52 PM

Favs: 1

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

By the way, would anyone like to be co-maintainer of the tool? So it’s at least not just me who can restart the service if it goes down ;)

7/17/2018, 10:44:44 PM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

New Wikidata Lexeme Forms feature: if there are multiple “variants” of a form, you can separate them with slashes (e. g. „dem Kind/Kinde“ in German) and multiple forms will be created.

7/21/2018, 10:33:16 PM

Favs: 0

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

(This is not to be confused with multiple *spelling variations* of the same form, which should be added as additional representations with different language codes to the same form, not as separate forms. “colour/color” would be an example of that.)

7/21/2018, 10:33:16 PM

Favs: 1

Retweets: 0
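
The slash convention described in the two tweets above can be sketched in a few lines of Python. This is only an illustration, not the tool's actual code: the function name `split_variants` is hypothetical, and it assumes that each input field holds the bare form text (e.g. "Kind/Kinde").

```python
from typing import List


def split_variants(field: str) -> List[str]:
    """Split one input field into its slash-separated variants.

    Hypothetical helper: each returned string would become its own form.
    Empty pieces (e.g. from a trailing slash) are dropped.
    """
    return [part.strip() for part in field.split("/") if part.strip()]


# One field with two variants yields two separate forms.
print(split_variants("Kind/Kinde"))
```

Spelling variations such as "colour/color" would not go through a split like this, since they belong on one form as representations with different language codes.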

Lucas Werkmeister

@LucasWerkmeistr

The Wikidata Lexeme Forms tool can now create lexemes for Estonian and Russian nouns, thanks to User:Reosarevok and @infovarius! See the index at https://tools.wmflabs.org/lexeme-forms/.

8/2/2018, 1:28:03 PM

Favs: 2

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

And since 'ast' sorts before 'de', this means that German is finally not the first language on the index page anymore, yay :D https://tools.wmflabs.org/lexeme-forms/

8/27/2018, 12:32:05 PM

Favs: 0

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

Over 1000 lexemes have been created using the Wikidata Lexeme Forms tool! That’s mind-blowing. https://quarry.wmflabs.org/query/28791

8/29/2018, 1:52:56 AM

Favs: 1

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

The Wikidata Lexeme Forms tool now supports creating Czech lexemes, thanks to User:Lexicolover!

noun
masculine
animate https://tools.wmflabs.org/lexeme-forms/template/czech-noun-masculine-animate/
inanimate https://tools.wmflabs.org/lexeme-forms/template/czech-noun-masculine-inanimate/
feminine https://tools.wmflabs.org/lexeme-forms/template/czech-noun-feminine/
neuter https://tools.wmflabs.org/lexeme-forms/template/czech-noun-neuter/
adverb https://tools.wmflabs.org/lexeme-forms/template/czech-adverb/

10/17/2018, 9:48:42 PM

Favs: 0

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

I’m thinking of adding proper edit summaries to the tool – the most important element to include is probably the template name (see screenshot for example). Could be plain text, could be a link ([[toolforge:lexeme-forms/template/{{ template_name }}]])… thoughts?

10/24/2018, 12:50:53 PM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

The Wikidata Lexeme Forms tool now adds a link to the template as the edit summary, so you can easily see which template was used to create a lexeme.

10/24/2018, 5:14:14 PM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

The Wikidata Lexeme Forms tool can now create lexemes for Ukrainian nouns, thanks to User:Tohaomg!

• masculine: https://tools.wmflabs.org/lexeme-forms/template/ukrainian-noun-masculine/
• feminine: https://tools.wmflabs.org/lexeme-forms/template/ukrainian-noun-feminine/
• neuter: https://tools.wmflabs.org/lexeme-forms/template/ukrainian-noun-neuter/
• plurale tantum: https://tools.wmflabs.org/lexeme-forms/template/ukrainian-noun-pluraletantum/

10/30/2018, 8:01:10 PM

Favs: 2

Retweets: 2

Lucas Werkmeister

@LucasWerkmeistr

As of today, 10827 lexemes on @wikidata have been created using the Wikidata Lexeme Forms tool, out of 37886 lexemes total (29%). That’s amazing. https://quarry.wmflabs.org/query/28791

11/24/2018, 6:09:40 PM

Favs: 9

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

The Wikidata Lexeme Forms tool now supports Dutch nouns – thanks to User:MarcoSwart for the templates!
• neuter: https://tools.wmflabs.org/lexeme-forms/template/dutch-neuter-noun/
• masculine: https://tools.wmflabs.org/lexeme-forms/template/dutch-masculine-noun/
• feminine: https://tools.wmflabs.org/lexeme-forms/template/dutch-feminine-noun/
• f/m: https://tools.wmflabs.org/lexeme-forms/template/dutch-fem2masc-noun/

6/8/2019, 4:11:28 PM

Favs: 2

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

Wikidata Lexeme Forms can now show you the number of forms and senses of a potential duplicate lexeme – this should hopefully help you decide whether you want to complete the existing lexeme (e. g. add more forms in advanced mode).

6/10/2019, 12:59:02 AM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

Note that due to Unfortunate Technical Reasons™, for some lexemes these numbers aren’t yet available. Hopefully this will resolve itself eventually.

6/10/2019, 12:59:02 AM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

The Unfortunate Technical Reasons™ have been resolved (https://phabricator.wikimedia.org/T225510), number of forms and senses should now be available for all lexemes \o/

6/14/2019, 7:18:03 PM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

This is also another record-breaking template – 73 forms, well beyond the 56 forms of Czech adjectives a bit earlier in the thread.

Short templates are also great, by the way! Feel free to contribute more of those! :D

6/24/2019, 11:17:39 PM

Favs: 1

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

(As a side effect of this work, using advanced mode to edit existing Lexemes will now add missing Statements – previously, the example edit above would not have added the missing grammatical gender to the Lexeme.)

7/7/2019, 4:10:48 PM

Favs: 1

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

(other adjectives are, to quote them, “horrible” and will have to wait for now ^^)

8/11/2019, 1:37:05 PM

Favs: 1

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

Thanks a lot to User:JTs for inspiration and discussion at #Wikimania2019, and her and @fagerving for beta testing!

8/23/2019, 1:55:11 AM

Favs: 4

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

@fagerving Wikidata Lexeme Forms now supports Czech adjectives more fully – instead of 56 positive forms, it’s now 168 forms for positive, comparative, and superlative, thanks to User:Adrijaned! https://tools.wmflabs.org/lexeme-forms/template/czech-adjective/

10/17/2019, 12:31:09 AM

Favs: 4

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

@fagerving Wikidata Lexeme Forms’ bulk mode now supports adding forms to existing lexemes. Can be used with lexemes that don’t have any forms yet –

L2713|Wissenschaft|…|Wissenschaften|…

– or to add only missing ones:

L37530||conjugations

(gap between |s = existing singular form)

10/28/2019, 12:18:44 AM

Favs: 3

Retweets: 3

Lucas Werkmeister

@LucasWerkmeistr

@fagerving (note that this won’t fix the existing forms, such as adding missing grammatical features to them – we still need tools for such tasks, I think)

10/28/2019, 12:22:33 AM

Favs: 0

Retweets: 0
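
The bulk-mode line format shown in the two tweets above can be read roughly as follows. This is a sketch inferred only from the two examples, not the tool's real parser: the function name `parse_bulk_line` is hypothetical, and it assumes that a leading field of the shape `L<number>` is a lexeme ID and that an empty field means "leave this form alone".

```python
import re
from typing import List, Optional, Tuple


def parse_bulk_line(line: str) -> Tuple[Optional[str], List[Optional[str]]]:
    """Parse one pipe-separated bulk-mode line (hypothetical helper).

    Returns (lexeme_id, forms). lexeme_id is None when the line does not
    start with something that looks like a lexeme ID. Empty fields become
    None, meaning that form is skipped (e.g. because it already exists).
    """
    fields = line.split("|")
    lexeme_id = None
    if re.fullmatch(r"L[1-9][0-9]*", fields[0]):
        lexeme_id = fields.pop(0)
    forms = [field if field else None for field in fields]
    return lexeme_id, forms


# The gap between the two pipes marks the existing singular form.
print(parse_bulk_line("L37530||conjugations"))
```

A lemma that itself looks like a lexeme ID would be misread by this sketch; how the real tool handles that case is not stated in the thread.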

Lucas Werkmeister

@LucasWerkmeistr

New Wikidata Lexeme Forms feature: *edit* a lexeme’s forms. Append /edit/<lexeme ID> to a template URL to access. Loads forms of the lexeme and sorts them into the input fields (where possible; not all might match), updates the lexeme on submit (edit/add/remove forms as needed).

4/14/2020, 3:01:15 AM

Favs: 6

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

I consider this experimental still and was going to keep it more on the down-low for now, but @vrandezo already retweeted it, so here we go :D let me know how it works for you!

4/14/2020, 3:01:16 AM

Favs: 2

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

Wikidata Lexeme Forms now supports Bengali nouns – thanks to User:Bodhisattwa, User:Tanay barisha and @mahir256 for the templates and translations! (links in next tweet)

6/7/2020, 3:39:26 PM

Favs: 5

Retweets: 2

Lucas Werkmeister

@LucasWerkmeistr

Wikidata Lexeme Forms can now group forms into sections, and (on wide screens) distribute sections across two columns! This should hopefully make the tool easier to use, especially for long templates.

6/13/2020, 1:59:15 PM

Favs: 8

Retweets: 4

Lucas Werkmeister

@LucasWerkmeistr

For now I’ve only defined sections for the German templates, I’ll look over other languages later (going home for lunch now, I think). If you want, you can edit the on-wiki pages to add ---- where it makes sense (as in https://www.wikidata.org/wiki/Wikidata:Wikidata_Lexeme_Forms/German), and I’ll use those divisions :)

6/13/2020, 2:00:24 PM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

New Wikidata Lexeme Forms feature: the warning about duplicate lexemes now links to edit mode, where you can fix any forms that the existing lexeme might be missing :)

6/14/2020, 12:29:46 PM

Favs: 8

Retweets: 5

Lucas Werkmeister

@LucasWerkmeistr

Wikidata Lexeme Forms now supports Norwegian Bokmål lexemes, thanks to @danmichaelo and @jhsoby! (links in next tweet)

7/6/2020, 1:00:35 AM

Favs: 5

Retweets: 2

Lucas Werkmeister

@LucasWerkmeistr

• noun, masculine: https://lexeme-forms.toolforge.org/template/bokm%C3%A5l-noun-masculine/
• noun, feminine: https://lexeme-forms.toolforge.org/template/bokm%C3%A5l-noun-feminine/
• noun, neuter: https://lexeme-forms.toolforge.org/template/bokm%C3%A5l-noun-neuter/
• adjective: https://lexeme-forms.toolforge.org/template/bokm%C3%A5l-adjective/
• verb: https://lexeme-forms.toolforge.org/template/bokm%C3%A5l-verb/

7/6/2020, 1:00:36 AM

Favs: 2

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

Wikidata Lexeme Forms now supports Bengali adjectives and verbs, thanks to @mahir256, User:Bodhisattwa and User:Tanay barisha!

• adjective: https://lexeme-forms.toolforge.org/template/bengali-adjective-tatsama-property/ and https://lexeme-forms.toolforge.org/template/bengali-adjective-others/
• verb: https://lexeme-forms.toolforge.org/template/bengali-verb/ and https://lexeme-forms.toolforge.org/template/bengali-verb-ano/

8/5/2020, 12:45:08 AM

Favs: 1

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

My present for Wikidata’s eighth birthday: the Wikidata Lexeme Forms tool now supports editing form representations in other language codes! Add ?language_code=___ to the edit mode URL, e. g. for en-gb/en-ca or de-at/de-ch/de-1901.

10/29/2020, 2:36:34 AM

Favs: 11

Retweets: 2
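
Putting the edit-mode tweets together (append /edit/<lexeme ID> to a template URL, then optionally add ?language_code=___), an edit-mode link could be assembled like this. This is a sketch under assumptions: the helper name `edit_url` is hypothetical, the lexeme ID `L123` is a placeholder, and whether the tool expects a trailing slash after the lexeme ID is not stated in the thread.

```python
from typing import Optional
from urllib.parse import urlencode


def edit_url(template_url: str, lexeme_id: str,
             language_code: Optional[str] = None) -> str:
    """Build an edit-mode URL from a template URL (hypothetical helper)."""
    # Template URLs in the thread end in "/", so normalise before appending.
    url = template_url.rstrip("/") + "/edit/" + lexeme_id
    if language_code:
        url += "?" + urlencode({"language_code": language_code})
    return url


# Placeholder values, chosen only to show the shape of the resulting URL.
print(edit_url("https://lexeme-forms.toolforge.org/template/czech-adjective/", "L123"))
print(edit_url("https://lexeme-forms.toolforge.org/template/czech-adjective/", "L123", "de-at"))
```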

Lucas Werkmeister

@LucasWerkmeistr

Wikidata Lexeme Forms now lists languages on the index page using their autonyms (language name in that language)! Seems more user-friendly than the language code :)

12/12/2020, 8:01:48 PM

Favs: 3

Retweets: 2

Lucas Werkmeister

@LucasWerkmeistr

Thanks to User:Nikki, there’s now a gadget that adds links to the Wikidata Lexeme Forms tool on Lexeme pages! Go try it out :)

2/18/2021, 11:27:24 PM

Favs: 3

Retweets: 2

Lucas Werkmeister

@LucasWerkmeistr

You can now link to a group of templates / a language code on the Wikidata Lexeme Forms index page, suggested by @jhsoby – for example: https://lexeme-forms.toolforge.org/#eu

3/23/2021, 8:43:17 PM

Favs: 3

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

Wikidata Lexeme Forms can now mark forms as optional, which means they’re not required even in non-advanced mode. The first template to use this is Norwegian Bokmål adjectives (optional comparative/superlative), but feel free to suggest others as well.

3/25/2021, 8:30:40 PM

Favs: 0

Retweets: 2

Lucas Werkmeister

@LucasWerkmeistr

Wikidata Lexeme Forms now supports additional Breton templates, thanks to @belett:
• nouns with mutation: “ktp” https://lexeme-forms.toolforge.org/template/breton-noun-with-mutation-ktp/, “gdb” https://lexeme-forms.toolforge.org/template/breton-noun-with-mutation-gdb/, “m” https://lexeme-forms.toolforge.org/template/breton-noun-with-mutation-m/
• adverbs: https://lexeme-forms.toolforge.org/template/breton-adverb/

4/3/2021, 11:39:32 PM

Favs: 5

Retweets: 3

Lucas Werkmeister

@LucasWerkmeistr

(The second alert is maybe a bit confusing, and also the contrast on the disabled inputs isn’t great, so I may still add a dedicated “view” mode.)

4/14/2021, 11:22:39 PM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

I’ve renamed the Dutch templates in Wikidata Lexeme Forms from dutch-{gender}-noun to dutch-noun-{gender}, to match the order used in other languages; the old names (example: https://lexeme-forms.toolforge.org/template/dutch-feminine-noun/) redirect to the new ones.

4/26/2021, 9:26:07 PM

Favs: 2

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

I’ve also renamed swedish-absolute-adjective to swedish-adjective-absolute, similar to two tweets above (the old name redirects)

5/13/2021, 4:02:56 PM

Favs: 1

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

I’m thinking about extracting the i18n functionality from the Wikidata Lexeme Forms tool into a library, so that other tools can use it as well; feel free to provide feedback here: https://phabricator.wikimedia.org/T283376

5/21/2021, 6:46:55 PM

Favs: 2

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

New “expert” Wikidata Lexeme Forms feature: You can now add ?target_hash=something to the tool’s URL and #something will be added to the lexeme URL when redirecting you; can be useful with ?form_representation to pass extra data “through” the tool, to be processed by user scripts

6/6/2021, 4:51:09 PM

Favs: 2

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

Another power user feature: when bulk mode refuses to create a lexeme because it looks like a duplicate, there’s now a link to create it anyway. (Following it doesn’t immediately create the lexeme, you still need to check the “not a duplicate” box and then submit the form.)

6/7/2021, 11:37:07 PM

Favs: 1

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

Templates in Wikidata Lexeme Forms can now list some grammatical features of a form as optional, which means forms can still be matched in edit mode even if they’re missing those features (and the features will then be added on save).

6/15/2021, 9:37:14 PM

Favs: 0

Retweets: 1

Lucas Werkmeister

@LucasWerkmeistr

For now, I’ve mainly added this to singularia and pluralia tantum templates, e. g. if a form is marked as “nominative”, edit mode can still match it to “nominative singular”, and add the “singular” grammatical feature on save. Feel free to suggest others.

6/15/2021, 9:37:14 PM

Favs: 0

Retweets: 0
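
The matching rule described in the two tweets above can be sketched with plain sets. This is an illustration only, not the tool's implementation: the function name `missing_features` is hypothetical, and grammatical features are written as plain strings here rather than as Wikidata items.

```python
from typing import Optional, Set


def missing_features(form_features: Set[str],
                     template_features: Set[str],
                     optional_features: Set[str]) -> Optional[Set[str]]:
    """Match an existing form against one template form (hypothetical helper).

    Returns the features to add on save if the form matches, else None.
    A form matches when it has no features outside the template form and
    lacks only features that the template lists as optional.
    """
    if not form_features <= template_features:
        return None  # the form has a feature the template form lacks
    missing = template_features - form_features
    if not missing <= optional_features:
        return None  # a required feature is missing
    return missing


# A form marked only "nominative" still matches "nominative singular"
# when "singular" is optional; "singular" would then be added on save.
print(missing_features({"nominative"}, {"nominative", "singular"}, {"singular"}))
```

An exact match returns an empty set (nothing to add), which is why callers of this sketch should compare the result against `None` rather than test its truthiness.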

Lucas Werkmeister

@LucasWerkmeistr

Wikidata Lexeme Forms users will have to log into the tool again, since I’ve had to replace the OAuth consumer (see https://phabricator.wikimedia.org/T286414 for details).

7/11/2021, 7:07:14 PM

Favs: 0

Retweets: 0

Lucas Werkmeister

@LucasWerkmeistr

Wikidata Lexeme Forms has a bunch of Mandarin templates now (https://lexeme-forms.toolforge.org/#zh) – all with just one form, but perhaps still useful

4/27/2022, 10:18:21 PM

Favs: 0

Retweets: 1