ratcreature: RatCreature's toon avatar (Default)
RatCreature ([personal profile] ratcreature) wrote in [site community profile] dw_suggestions2010-10-18 05:49 am

spellcheck should recognize the language

Title:
spellcheck should recognize the language

Area:
entries, comments

Summary:
The site's spellcheck should be smarter. It should make a best guess of what language it likely is from the text entered, and apply the right kind of dictionary.

Description:
I know some browsers have a spellcheck of their own, but I always use the site's spellchecker. Only it is not as smart as a spellcheck should be. Usually I type English comments, and that's mostly fine (though the dictionary could use some upgrade too), but today I hit spellcheck as I always do automatically, but happened to have typed a German comment.

The output was a ridiculous, annoying wall of red. But computers these days are not so bad at recognizing common languages (at least if I let that google translator guess on a language bit rather than choosing, it seems to guess sensibly and the translations actually have gotten less ridiculous than a few years ago too), so why couldn't the DW spellchecker see that the comment clearly wasn't English, recognize the language, and pick the correct dictionary automatically?

I don't know how much effort it would be technically to get text based language recognition, and I wouldn't want the dictionary be chosen dumbly based on location settings or language settings in the browser, because then I might get the red wall for English comments and that would be worse. But a spellchecker would be much more convenient if it was smarter and did things it is supposed to do automatically.

I don't see any downsides, because if the language recognition should fail for some languages, you are not worse of than now, as I assume then it would just default to English as the site language.

Poll #4880 spellcheck should recognize the language
Open to: Registered Users, detailed results viewable to: All, participants: 54


This suggestion:

View Answers

Should be implemented as-is.
24 (44.4%)

Should be implemented with changes. (please comment)
4 (7.4%)

Shouldn't be implemented.
4 (7.4%)

(I have no opinion)
22 (40.7%)

(Other: please comment)
0 (0.0%)

travelingmonkey: Chimp w/ glasses (Default)

[personal profile] travelingmonkey 2010-10-27 11:06 am (UTC)(link)
Personally, I don't like the detect language thing, and it only "detects" by pulling key words (if you remove specific words from things, it'll change its guess, even though the rest is the same). I think the better option is to simply choose to tell it which dictionary you want to use. Firefox has this ability, hence why I never use dictionaries specific to particular sites.
axiom_of_stripe: DC Comics: Kory cries "X'Hal!" (Default)

[personal profile] axiom_of_stripe 2010-10-27 02:53 pm (UTC)(link)
I'd like this if there were a way to override it if it guessed wrong.
azurelunatic: Vivid pink Alaskan wild rose. (Default)

[personal profile] azurelunatic 2010-10-27 10:20 pm (UTC)(link)
Something like: Language autodetected: English (and that's a dropdown list, so you can change it, similar to Google translate)?
axiom_of_stripe: DC Comics: Kory cries "X'Hal!" (Default)

[personal profile] axiom_of_stripe 2010-10-28 12:33 am (UTC)(link)
Yes, exactly!
zeborah: Map of New Zealand with a zebra salient (approval)

[personal profile] zeborah 2010-10-28 09:32 am (UTC)(link)
I like this!

(My own experience is that it's annoying to be told that I've misspelled "colour" when no, actually, I've spelled it exactly as I've always been taught. So if it could manage to autodetect British vs USan English that'd be awesome. But having everything underlined would be even more annoying so seems the main priority.)
msilverstar: (corset)

[personal profile] msilverstar 2010-10-27 04:14 pm (UTC)(link)
Probably better all around if it's easy to set the language, maybe even a dialog box question the first time the user checks the spelling.
azurelunatic: Vivid pink Alaskan wild rose. (Default)

[personal profile] azurelunatic 2010-10-27 10:31 pm (UTC)(link)
My head, working sideways as usual, has gone off on a tangent about assorted loan-words and other instances of primarily speaking in one language but having words from others sprinkled throughout, and has hit on the (likely search-expensive) concept of looking twice at the misspelled words, seeing if they are joined by punctuation to something around them to make a complete word (like words with apostrophes, which do not get parsed correctly in this spellchecker), and then looking through other dictionaries to see if the word that is misspelled in the primary language of the thing is actually a word in another language, then flagging it (in a separate section, or with different markings, to make it clear that it wasn't flagged as a misspelling) so that in case it was a misspelling instead of a deliberate word from another language, that it would be not just ignored. The language would be identified. If it is a word in another language that looks like a typical misspelling of a word in the primary language, both spellcheck suggestions and whatever stuff that's associated with word-in-another-language should be offered.

For bonus shiny, there might be a link to the definition in the other language, and a list of suggested translations into the primary language. That might be useful in the case I've seen every now and then, where someone's talking in one of their non-primary languages, and the word for what they're thinking about in the language they're talking isn't coming to mind readily, but they know the word they're thinking of in a different language. With a tool like that (which is probably more of a language-focused-company type tool, a little beyond Dreamwidth's scope) the word in the language they're talking in that they knew they were looking for might be offered, leading to less headpounding when communicating.
azurelunatic: Vivid pink Alaskan wild rose. (Default)

[personal profile] azurelunatic 2010-10-28 09:58 am (UTC)(link)
Like [personal profile] zeborah, it really annoys me when a spellchecker tells me that a valid word in another language is misspelled when it is a word and I meant to use that word. On the other hand, in situations like you describe, you want to be told when you've used a word from the wrong language.

So what I was trying to describe up there was a sort of compromise between the two: a spellchecker that would tell you when you've used a word in a language that didn't match the main body of text you were writing in, but also wouldn't call it misspelled. It would merely label it as in another language, and let you determine whether it was a misspelling or deliberate.