Profile
Sentences
Vocabulary
Reviews
Lists
Favorites
Comments
Comments on CK's sentences
Wall messages
Logs
Audio
Transcriptions
Translate CK's sentences
jpn
[#9472440] 日本では、八月八日はヒゲの日です。
eng
[#9673010] The eighth of August is Beard Day in Japan.
deu
[#9672988] In Japan ist der 8.8. der „Tag des Bartes“.
ber
[#9916106] Ass n 8 Ɣuct d Ass n Tamart deg Japun.
🍎 Stats - Sentences per Language - 2022-08-04 compared with one year ago
http://tatoeba.ueuo.com/stats-2...languages.html
🍎 Link Counts 2022-07-30
100 or more links = 169
50 or more links = 1,842
25 or more links = 10,474
20 or more links = 40,765
15 or more links = 91,988
10 or more links = 255,671
9 or more links = 326,627
8 or more links = 422,658
7 or more links = 557,945
6 or more links = 753,609
5 or more links = 1,050,043
1 or more links = 9,029,194
Note that some of these with high counts have multiple alternative translations into just a few languages
If you are interested, you can download the following file with the sentence IDs and the counts for all sentences that have links.
http://study.aitech.ac.jp/link_...2022-07-30.zip (22MB)
sentence_ID + TAB + count
🍎 Stats - 2022-07-30 - Native Speakers with Contributions
http://tatoeba.ueuo.com/stats-2022-07-30.html
Find sentences by native speakers of the languages you are studying.
🍎 Dashboard of Useful Links for Translators
I moved this to an https URL for those who were uncomfortable using this from the previous http URL.
Here are a few examples on how to use it.
German to English
https://domore.web.fc2.com/dash/?f=deu&t=eng
French to Kabyle
https://domore.web.fc2.com/dash/?f=fra&t=kab
Just add the 3-letter codes.
f = the "from" language; t = the "to" language
Optionally, you can also add &u=YOUR_USERNAME for links related to your username.
For example, English to Japanese, set for "KK_kaku_" would look like this.
https://domore.web.fc2.com/dash...jpn&u=KK_kaku_
../dash/?f=eng&t=jpn&u=KK_kaku_
I did a case-insensitive comparison of the English in their "raw" file with last week's exported data from the Tatoeba Corpus.
There were 42,014 case-insensitive matches out of the 2,379,684 unique English lines in the "raw" file,
Note that their "raw" file has a lot of non-sentences such as the following
! country bumpkin!
: no, no, no, no!
,ferris?
,a new species, lived.
, but it also
that no one will ever see?
zut alors! i'm okay!
zoo transfer!
only fitness.
There were also over 20,000 items that had no spaces -- 1 word, etc, like the following
curious,smart,annoying.
fourfourbravothreezero.
howaboutaftermycartoon?
okay,multitasking,boss.
surroundedbyaswarmof...
threeafoursixnineromeo.
wearebothstudentsofwar.
objection.argumentative.
threenewworldrecordsset.
Here are 10 consecutive "pairs" from the "test" file, for those of you who want to quickly get a taste of this without needing to download everything. I just jumped to a random line in the file and grabbed ten items, and then formatted the data so it's a little easier to compare.
i'll never forgive you for that!
ぜったいに ゆるさないんだから!
so, that's it then?
他に話したいことは?
burn a little?
痛みますか? はい
especially for the adults working at mcsweeney's
こんなこととは知らずに契約して
i've got wildling blood in my veins.
俺の血筋には野人の血が流れている
we don't have much time!
時間がない!
but i also am in the fueling business.
ただ私は燃料供給ビジネスもやっている
i could. stay over tonight, if to make you feel any better.
一晩一緒にいてもいいよ それがよければ
i think you're enjoying this.
私はこれを楽しんでいると思います。
and cut!
はい カット
I was just suggesting an alternative way to the way lbdx suggested.
=== begin quote
(7) sentences with a majority of "not OK" reviews are now searchable in the "Rated as 'not OK' 🔴" list at https://tatoeba.org/en/sentences_lists/show/170380
[snip] ... if we update the list of flagged sentences every 15 minute (instead of once a week)...
=== end quote
Instead of creating a list and updating a list, and then selecting that list in the advanced search to find such sentences, to have the search engine refer to such ratings, and give people various query options would be better, in my opinion. The search engines indexes are automatically updated quite often.
People using the search engine could ignore this option. There are likely other options that members ignore.
> we're depending on you to produce and update this list.
A better solution would be make it so that we could limit searches based on ratings, the same way we can limit searches to a certain tag or to a certain list. Perhaps it is too difficult to have this data indexed to use with the search engine.
The way things could be automated.
It perhaps would be possible to perform advanced searches like that following.
* must have at least 3 OK ratings, the number could be decided by the person making the query.
* must have at least one OK rating and no Not OK ratings.
* must have more OK ratings than Not OK ratngs.
* must have a Not OK rating. (This would be good for corpus maintainers.)
* must have at least 1 (at least 2, at least 3, etc.) OK ratings by native speakers.
* etc.
🍎 Stats - Members That Use the Rating System
http://tatoeba.ueuo.com/stats-r...022-07-16.html
There are 299 usernames on this list.
I suspect that if you also removed a number of the very long items that it would reduce the false positives.
Just start browsing, by showing the longest items first and I think you'll see what I mean.
https://tatoeba.org/en/sentence...roved=no&user=
I don't think most people think of these types of things when they want to study quotes.
[Deleted, since lbdx didn't like the post and seemed to think it wasn't an appropriate post for the Wall.]
I moved this to the the bottom of my profile for now, for those who may be interested
Go to https://tatoeba.org/en/user/profile/CK
I think not adding the "quote" tag to all of these would have been a better idea. I wonder if there is any way to undo this, assuming TRANG and other members think it would be a good idea to do so.
We used to be able to browse sentences tagged "quote" and be able to see things we'd more likely think of as quotes.
You can download all the English sentences that are currently tagged "quote" to see how many there are that may be inappropriately tagged.
currently-tagged-as-quote-english.zip
https://gofile.io/d/B9e2GT
I sorted the sentences from longest to shortest.
You could browse all of these using the tatoeba.org interface, but it would be a lot slower.
You'll notice that a lot of these are not what most people would traditionally consider quotes.
For example, look of the sentences owned by Hybrid that are now tagged "quote."
https://tatoeba.org/en/sentence...no&user=Hybrid
🍎 Stats - Audio File Contributors
http://tatoeba.ueuo.com/stats-audio.html
Get the "tatoeba.org/audio/of/USERNAME" links for over 200 voices.
I agree that you should go ahead and add such translations. Language learners often like to know various correct ways to say the same thing. If you, as a native speaker, add a variation that is what a certain language learner has been saying, he/she can then know for certain that what he/she has been saying is correct.
This is the same search as a template, which means you can insert a search query, or just click the "search" button to see any German sentence without any links to English.
This is set for random output, but you can easily change to another "sort," if you like.
https://tatoeba.org/en/sentence...roved=no&user=
A 1,000 sentence limit shouldn't be a problem, since as you add translations, you'll get a different set of 1,000 sentences.
Perhaps you would also like to turn on the "Owned by a self-identified native" option.
Note that when you add an English translation to a German sentence that is not directly-linked to the same pre-existing English sentence, a direct link is created, so doing so can benefit our project. This will happen in cases where the existing English sentence is not even a visible indirect translation, too.
This is because all the sentences with audio files were not in the previous week's exported data (sentences_with_audio.csv), but they are in the most-recent export.
This should only be a temporary problem for this week's "Tatoeba Contributors Chart."
https://tatoeba.j-langtools.com...art/chart2.php
Try this link.
https://tatoeba.org/sentences/s...s_user=maaster
This will show the sentences most-recently added that have been linked to one of your Hungarian sentences. I realize that this isn't exactly what you want. However, this is something that is currently possible.
If you like the results, copy and paste the link into your profile page and you can easily access it whenever you want.
For others who want to do something similar, here are the parts of the URL, which you can change to match your needs.
.../sentences/search?
&sort=created
&trans_filter=limit
&trans_link=direct
&to=hun
&trans_to=hun
&trans_user=maaster
[added 30 minutes later]
Just for fun, here are similar links already set up for some other members.
New sentences linked to ...
Pfirsichbaeumchen's German sentences
https://tatoeba.org/sentences/s...rsichbaeumchen
marafon's Russian sentences
https://tatoeba.org/sentences/s...s_user=marafon
lbdx's French sentences
https://tatoeba.org/sentences/s...rans_user=lbdx