Profile
Sentences
Vocabulary
Reviews
Lists
Favorites
Comments
Comments on CK's sentences
Wall messages
Logs
Audio
Transcriptions
Translate CK's sentences
It looked like your comment was added to this comment.
https://tatoeba.org/en/wall/sho...#message_38552
So, I thought you were commenting about having sentences on that list show in a random order.
It's not possible to set lists to display sentences in a random order at this time. That's why I suggested using the advanced search showing results in a random order.
You can get that list in random order using the following advanced search.
https://tatoeba.org/sentences/s...59&sort=random
list=169859
sort=random
At this time, my hope is to get audio on all sentences on List 907, which is still growing.
Here are some stats that I put together yesterday, before I imported about another 250 audio files today.
Click the links if you want to translate these.
◼ Sentences that I've added AUDIO to (both mine and other members') that have not yet been translated into any language.
https://tatoeba.org/en/sentence...ated&list=4000
79,270 / 707,137 (11.21%) on April 1, 2022
◼ List 907, not yet with audio
https://tatoeba.org/en/sentence...io=no&list=907
155,426 / 867,078 (17.92%) on April 1, 2022 (About 82% complete)
I would also love to have a native Japanese-speaking member go through these and adopt the natural-sounding Japanese sentences.
◼ Japanese Orphan Sentences from the Tanaka Corpus, Linked directly to English with Audio - Random Selection
https://tatoeba.org/sentences/s...ct&sort=random
28,537 on April 1 2022
Note that I personally don't record audio for items that should be recorded by a female voice, a young child's voice, two or more voices, or a non-American voice.
** Dashboard for Translating English Sentences with Audio
http://a4esl.org/temporary/tato...e/searches.php
We now have over 700,000 English sentences with audio files.
If you want to directly help my projects (http://bit.ly/tatoebaprojects), then ...
1. Translate some of these sentences into your own native language.
2. If your native language is English, then translate sentences owned by native speakers of another language into natural-sounding English.
You don't need to use it if you don't want to. I just thought that you might find it useful, and the list was already created.
The following link will give you a random selection of English sentences with translations into 9 or more languages that are not yet translated into Portuguese.
https://tatoeba.org/eng/sentenc..._to=por&to=und
This is listed as "method 7" on the following page.
http://study.aitech.ac.jp/tatoe...hp?f=eng&t=por
Note that List 7055 was last updated in February of 2020 with data I received from sharptoothed.
https://tatoeba.org/en/sentence.../show/7055/und
English Sentences Linked to More Than 9 Languages (Updated Feb. 2020) - Over 40,000 Sentences - List 7055
** Parts of Speech Tagged Sentences
Today I stumbled on a website that has tagged parts of speech for many sentences from the Tatoeba Corpus
http://study.aitech.ac.jp/wordhelp/
This is a set of pages that I created, so you can easily jump to some of these pages.
/// I grabbed all the English queries, and then counted them.
/// This list is sorted by the counts (column 1)
Here are the top 20 English queries.
Looking at this data, I sort of wonder how useful these counts are. Perhaps some kind of bot did these searches.
65468 AARAMBH
64324 AARDWOLF
62180 AARDWOLVES
57822 A BATTERY
56590 A BIT RATHER THICK
56384 A BITTER CUP
56176 A BOIL
55274 A BUBBLE
54982 A CAPPELLA
53798 BACK PAGE
53486 A CRITICAL
52960 A DAUGHTER OF EVE
52872 Above
52820 Argument
52374 bootstrap__
52298 environment
51968 A FORTIORI
51132 A FUNCTION
50728 A GOOD DEAL
50281 sad
If you want to see them all, you can download the file here.
https://aitstudy.com/temp/queri...2022-03-08.zip (6.2MB)
Since these usernames are not in alphabetical order, I wonder if they are in the order of who has most moved your project forward this week, in another order, or just in a random order.
The average is 11 sentences per username.
Note that there is a GitHub issue related to this.
https://github.com/Tatoeba/tatoeba2/issues/1613
Just for fun, here is the Top 50 list of the days with the most Tatoeba Corpus contributors' listed birthdays.
January 1 (47 contributors)
May 1 (16 contributors)
March 18 (12 contributors)
September 20 (12 contributors)
April 8 (11 contributors)
August 17 (11 contributors)
December 24 (11 contributors)
February 7 (11 contributors)
February 18 (11 contributors)
January 19 (11 contributors)
January 31 (11 contributors)
July 19 (11 contributors)
June 3 (11 contributors)
March 3 (11 contributors)
August 16 (10 contributors)
December 30 (10 contributors)
February 4 (10 contributors)
January 9 (10 contributors)
July 10 (10 contributors)
March 15 (10 contributors)
September 22 (10 contributors)
September 29 (10 contributors)
April 20 (9 contributors)
December 1 (9 contributors)
February 1 (9 contributors)
February 2 (9 contributors)
February 21 (9 contributors)
January 22 (9 contributors)
June 18 (9 contributors)
March 10 (9 contributors)
May 3 (9 contributors)
November 23 (9 contributors)
October 10 (9 contributors)
September 1 (9 contributors)
September 16 (9 contributors)
September 17 (9 contributors)
September 21 (9 contributors)
April 17 (8 contributors)
April 26 (8 contributors)
August 1 (8 contributors)
August 5 (8 contributors)
August 15 (8 contributors)
June 5 (8 contributors)
June 7 (8 contributors)
June 9 (8 contributors)
March 31 (8 contributors)
May 11 (8 contributors)
May 12 (8 contributors)
May 15 (8 contributors)
May 25 (8 contributors)
This list is limited to those on my list of native speaker contributors and those who had their birthdays listed in their profiles.
** New Dutch Voice **
Rose_d has contributed over 1,000 audio files.
https://tatoeba.org/en/sentence...how/169964/und
Happy birthday.
This example finds English sentences beginning with "Tom" and ending with "Mary".
^Tom Mary$
Source: https://en.wiki.tatoeba.org/art...ow/text-search
(Click the "help" in the search bar.)
** Something to Consider for 2022 **
10 English Words Per Day (with links to search the Tatoeba Corpus)
* https://bit.ly/tatoebadaily
The 10 words are roughly in the order of frequency of use, based on the NGSL, NAWL and TSL. I divided the words into 10 groups by level and included one word from each of those levels every day for 366 days.
Perhaps this is something you might want to try in order to focus on translating sentences with different vocabulary every day. Out of the 10 words each day, maybe you will find something interesting to you.
** 5,000 Recently-added English Sentences with Audio **
http://tatoeba.ueuo.com/audio-no-links/1.html
* All of these did not have links to any other sentences in the 2021-12-11 exported data.
* Quickly see 1,000 sentences per page. These are static pages, not needing a connection to the database.
* You can listen to the audio files and choose which sentences you want to click to and translate.
* From 10543840 down to 10185355
** Milestone **
Sentences with audio (total 900,000)
https://tatoeba.org/en/audio/index
2021-12-02 08:55 UTC
Screenshot: https://imgur.com/a/0q8Rey9
Like I said, I don't know if it would be possible or even advisable. However this database was created for a purpose, whether it's for students or researchers, or for some other purpose, or for all purposes.
A database could include a way to put alternative translations in an order of usefulness, naturalness or on how well they match the meaning of the sentences they are attached to, or in some other useful order.
This way the database could perhaps be more effectively utilized by those who want to use it.
I don't know if it would be possible or even advisable, but perhaps the Tatoeba Project needs some method to indicate which of multiple alternative translations would be best for language learners to focus on, or learn first.