Profile
Sentences
Vocabulary
Reviews
Lists
Favorites
Comments
Comments on CK's sentences
Wall messages
Logs
Audio
Transcriptions
Translate CK's sentences
π Here are files with only 4 fields, omitting the dates.
sentence_ID + tab + language_code + tab + text + tab + sentence_owner
π₯ All the exported sentences
http://study.aitech.ac.jp/4flds...2022-12-17.zip
228 MB
π₯ Just the English sentences
http://study.aitech.ac.jp/4flds...2022-12-17.zip
27 MB
π₯ Just the 891,125 sentences on List 907
http://study.aitech.ac.jp/4flds...2022-12-17.zip
12 MB
Perhaps one of these will help you.
You can more easily just use the advanced search, chose your desired list in the "Belongs to list:" option, set the sort order to random and then click the "search" button.
Here is a template set up with the "7&8-word English Sentences with Audio" list chosen. Just change that to your desired list.
https://tatoeba.org/en/sentence...roved=no&user=
[deleted by CK -- not needed anymore]
π For Those Studying Japanese
http://study.aitech.ac.jp/recent-jpn/
This should use the text-to-speech (TTS) Japanese voice you have setup for your computer, if you have done so. When you click the "next" button, you will hear the sentence being spoken as the next sentence is being loaded.
This does not work on an iPhone, at least not on mine.
It worked in the 9 browsers I tested it with on a Macintosh.
I haven't yet checked it on Windows or Linux.
These were the 10,000 most-recently added Japanese sentences in the 2022-11-26 exported data that were owned by 3 of our prolific native-speaker contributors,bunbuku, small_snow, and KK_kaku_.
This starts with the most-recent sentence in the 2022-11-26 exported data and goes in reverse numerical order.
I suspect that it still doesn't actually work.
https://github.com/Tatoeba/tato...ent-1133129204
> ... substituting the names Tom and Mary with random common male and female names ...
You can also try some of these demos that I made in 2017 to see names being randomly substituted.
http://aitstudy.com/sub/
Each time you load the pages, you'll get another random selection.
Perhaps, ...
dated
* the information in the sentence is dated
* the usage is dated
* the kinds of sentences a language textbook editor might recommend not be included. (Textbook editors often will even suggest not including things that are likely to soon become dated, since they want the help increase the shelf life of the materials they are editing.)
old-fashioned
* similar to "dated," I think.
* words, phrases and sentence patterns that your grandparents might say, but that you wouldn't say.
* things that we don't hear often today, if at all, that you might hear in movies or read in books from or about previous generations. These would be useful for students of a language to understand passively, but should likely not use themselves.
archaic
* things that are very old and maybe even difficult for currently-living native speakers to understand.
* things, even though understood by native speakers, would sound very strange if they said them.
These are idea just off the top my head. The Tatoeba Project has no clear guidelines on usage of these tags.
Here are other related tags.
https://tatoeba.org/en/tags/sho...s_with_tag/412
old-fashioned
And, you can find tags with the word "dated."
https://tatoeba.org/en/tags/view_all/dated
π http://tatoeba.ueuo.com/roulette/
Get a random selection of 10 words chosen from a list of 6,130 English Words
Add your native language's code to the end of the URL like one of those below to also get links to searches limited to sentences that are not yet translated into your language.
http://tatoeba.ueuo.com/roulette/?t=ber
http://tatoeba.ueuo.com/roulette/?t=deu
http://tatoeba.ueuo.com/roulette/?t=epo
http://tatoeba.ueuo.com/roulette/?t=fin
http://tatoeba.ueuo.com/roulette/?t=fra
http://tatoeba.ueuo.com/roulette/?t=hun
http://tatoeba.ueuo.com/roulette/?t=ita
http://tatoeba.ueuo.com/roulette/?t=jpn
http://tatoeba.ueuo.com/roulette/?t=kab
http://tatoeba.ueuo.com/roulette/?t=nld
http://tatoeba.ueuo.com/roulette/?t=pol
http://tatoeba.ueuo.com/roulette/?t=por
http://tatoeba.ueuo.com/roulette/?t=rus
http://tatoeba.ueuo.com/roulette/?t=spa
http://tatoeba.ueuo.com/roulette/?t=tur
http://tatoeba.ueuo.com/roulette/?t=ukr
http://tatoeba.ueuo.com/roulette/?t=heb
To be fair to Synthesia, they didn't advertise that switching languages mid-video in their demo version was a possibility. I just did that as a test, wondering if they were set up to do that.
Interestingly, the iPhone text-to-speech (TTS) will switch languages between English and Japanese without manually choosing to do so, though sometimes if it's just a word or short phrase, the Japanese gets read as Chinese instead of Japanese. On a Mac, the TTS voice needs to be manually changed, depending on the text to be read. I think it's that way on Windows, too, but I haven't recently checked.
π Just for Fun:
AI Voices & Talking Heads
https://domore.web.fc2.com/synthesia.html
Tatoeba Corpus Sentences in Videos Generated by Synthesia
3 Videos: Japanese, English, and Japanese-English
There is a Github issue requesting this.
https://github.com/Tatoeba/tatoeba2/issues/2889
Allow filtering out / excluding of search results based on tag
Anything that is under a license that has a "share alike" condition is NOT compatible with CC-BY. CC-BY-SA is not compatible with CC-BY. That means you can't copy text from Wikipedia into Tatoeba
Source: https://blog.tatoeba.org/2011/0...d-content.html
Though not limited to dialogs, we have a number of sentences that have been tagged that you might find useful.
airport
shopping
taxi
bank
post office
restaurant
bar
hotel
travel
hospital
telephone
at the table
etc.
Using this link, you can search our tags.
https://tatoeba.org/en/tags/view_all
You can also use this old demo page that was used to demonstrate how it might be useful to be able to sort tags by category.
* Go to the following page.
* Click the "Category" header.
* Scroll down to to find the category "situation".
All Tags on One Page
http://tatoeba.ueuo.com/stats-190323-tags.html
This page was created for this Github issue: https://github.com/Tatoeba/tatoeba2/issues/333
Note that you can download all the sentences and all the tags, and if you know how, you can create files of sentences with each tag that you are interested in. (See the "Downloads" link at the bottom of the page.)
All the Esperanto sentences in that file are also on the following list, which should keep growing since PaulP has been very actively rating Esperanto sentences.
https://tatoeba.org/en/sentence.../show/3136/und
OK-rated Esperanto Linked to OK-rated English
π 89,983 Esperanto-English pairs for Anki
http://study.aitech.ac.jp/anki/
epo-eng.zip
Esperanto rated OK by PaulP linked to English rated OK by CK
3 fields
Esperanto
English
Attribution
This is similar to what I have at http://www.manythings.org/anki for other languages.
In fairness to Clozemaster or others who use data from us for commercial uses, our license, CC-BY, allows this as long as attribution is given.
Problems can arise for websites or apps that use our data without carefully harvesting sentences to use since the Tatoeba Corpus contains a number of errors. Using the complete database is not a good idea.
We have a wiki page with suggestions for those who want to use our data in their projects.
https://en.wiki.tatoeba.org/art...tatoeba-corpus
Do you mean that if you look at the following, that you only see "one minute ago" for all comments?
https://tatoeba.org/en/sentence_comments/index
This is how I see it.
https://ibb.co/7Kpc0Px
Of this series of sentences you mention, these are the only directly-linked translations. The others are translations of translations.
ron
Nu pot sΔ vorbesc.
https://tatoeba.org/en/sentences/show/5676528
deu
Ich kann nicht sprechen.
eng
I can't talk.
eng
I can't speak.
epo
Mi ne povas paroli.
fra
Je ne peux pas parler.
nds
Ik kann nich snacken.
nld
Ik kan niet spreken.
por
Não posso falar.
spa
No puedo hablar.
tur
KonuΕamam.
As long as this Romanian sentence can be correctly translated by all the directly-linked sentences, then there is no problem.