menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
CK {{ icon }} keyboard_arrow_right

Profile

keyboard_arrow_right

Sentences

keyboard_arrow_right

Vocabulary

keyboard_arrow_right

Reviews

keyboard_arrow_right

Lists

keyboard_arrow_right

Favorites

keyboard_arrow_right

Comments

keyboard_arrow_right

Comments on CK's sentences

keyboard_arrow_right

Wall messages

keyboard_arrow_right

Logs

keyboard_arrow_right

Audio

keyboard_arrow_right

Transcriptions

translate

Translate CK's sentences

CK's messages on the Wall (total 1,238)

CK CK December 20, 2022, edited December 20, 2022 December 20, 2022 at 8:38:55 AM UTC, edited December 20, 2022 at 8:49:34 AM UTC link Permalink

🍎 Here are files with only 4 fields, omitting the dates.

sentence_ID + tab + language_code + tab + text + tab + sentence_owner


πŸ₯ All the exported sentences
http://study.aitech.ac.jp/4flds...2022-12-17.zip
228 MB

πŸ₯ Just the English sentences
http://study.aitech.ac.jp/4flds...2022-12-17.zip
27 MB

πŸ₯ Just the 891,125 sentences on List 907
http://study.aitech.ac.jp/4flds...2022-12-17.zip
12 MB


Perhaps one of these will help you.

CK CK December 15, 2022 December 15, 2022 at 10:26:22 PM UTC link Permalink

rom = https://en.wikipedia.org/wiki/Romani_language

CK CK December 10, 2022, edited December 11, 2022 December 10, 2022 at 11:55:49 PM UTC, edited December 11, 2022 at 12:14:04 AM UTC link Permalink

You can more easily just use the advanced search, chose your desired list in the "Belongs to list:" option, set the sort order to random and then click the "search" button.

Here is a template set up with the "7&8-word English Sentences with Audio" list chosen. Just change that to your desired list.

https://tatoeba.org/en/sentence...roved=no&user=

CK CK November 28, 2022, edited February 19, 2023 November 28, 2022 at 1:37:31 AM UTC, edited February 19, 2023 at 11:18:37 PM UTC link Permalink

[deleted by CK -- not needed anymore]

CK CK November 26, 2022, edited November 26, 2022 November 26, 2022 at 8:56:47 AM UTC, edited November 26, 2022 at 10:12:18 AM UTC link Permalink

🍎 For Those Studying Japanese

http://study.aitech.ac.jp/recent-jpn/

This should use the text-to-speech (TTS) Japanese voice you have setup for your computer, if you have done so. When you click the "next" button, you will hear the sentence being spoken as the next sentence is being loaded.

This does not work on an iPhone, at least not on mine.
It worked in the 9 browsers I tested it with on a Macintosh.
I haven't yet checked it on Windows or Linux.

These were the 10,000 most-recently added Japanese sentences in the 2022-11-26 exported data that were owned by 3 of our prolific native-speaker contributors,bunbuku, small_snow, and KK_kaku_.

This starts with the most-recent sentence in the 2022-11-26 exported data and goes in reverse numerical order.

CK CK November 19, 2022 November 19, 2022 at 8:24:15 AM UTC link Permalink

I suspect that it still doesn't actually work.

https://github.com/Tatoeba/tato...ent-1133129204

CK CK November 12, 2022, edited November 12, 2022 November 12, 2022 at 12:57:43 AM UTC, edited November 12, 2022 at 12:58:43 AM UTC link Permalink

> ... substituting the names Tom and Mary with random common male and female names ...

You can also try some of these demos that I made in 2017 to see names being randomly substituted.

http://aitstudy.com/sub/

Each time you load the pages, you'll get another random selection.

CK CK November 3, 2022, edited November 3, 2022 November 3, 2022 at 10:47:10 PM UTC, edited November 3, 2022 at 10:50:18 PM UTC link Permalink

Perhaps, ...

dated
* the information in the sentence is dated
* the usage is dated
* the kinds of sentences a language textbook editor might recommend not be included. (Textbook editors often will even suggest not including things that are likely to soon become dated, since they want the help increase the shelf life of the materials they are editing.)

old-fashioned
* similar to "dated," I think.
* words, phrases and sentence patterns that your grandparents might say, but that you wouldn't say.
* things that we don't hear often today, if at all, that you might hear in movies or read in books from or about previous generations. These would be useful for students of a language to understand passively, but should likely not use themselves.

archaic
* things that are very old and maybe even difficult for currently-living native speakers to understand.
* things, even though understood by native speakers, would sound very strange if they said them.

These are idea just off the top my head. The Tatoeba Project has no clear guidelines on usage of these tags.

CK CK November 2, 2022, edited November 2, 2022 November 2, 2022 at 8:20:22 PM UTC, edited November 2, 2022 at 8:21:12 PM UTC link Permalink

Here are other related tags.

https://tatoeba.org/en/tags/sho...s_with_tag/412
old-fashioned

And, you can find tags with the word "dated."
https://tatoeba.org/en/tags/view_all/dated

CK CK October 23, 2022, edited October 24, 2022 October 23, 2022 at 11:59:12 PM UTC, edited October 24, 2022 at 12:02:33 AM UTC link Permalink

🍎 http://tatoeba.ueuo.com/roulette/

Get a random selection of 10 words chosen from a list of 6,130 English Words

Add your native language's code to the end of the URL like one of those below to also get links to searches limited to sentences that are not yet translated into your language.

http://tatoeba.ueuo.com/roulette/?t=ber
http://tatoeba.ueuo.com/roulette/?t=deu
http://tatoeba.ueuo.com/roulette/?t=epo
http://tatoeba.ueuo.com/roulette/?t=fin
http://tatoeba.ueuo.com/roulette/?t=fra
http://tatoeba.ueuo.com/roulette/?t=hun
http://tatoeba.ueuo.com/roulette/?t=ita
http://tatoeba.ueuo.com/roulette/?t=jpn
http://tatoeba.ueuo.com/roulette/?t=kab
http://tatoeba.ueuo.com/roulette/?t=nld
http://tatoeba.ueuo.com/roulette/?t=pol
http://tatoeba.ueuo.com/roulette/?t=por
http://tatoeba.ueuo.com/roulette/?t=rus
http://tatoeba.ueuo.com/roulette/?t=spa
http://tatoeba.ueuo.com/roulette/?t=tur
http://tatoeba.ueuo.com/roulette/?t=ukr
http://tatoeba.ueuo.com/roulette/?t=heb

CK CK October 23, 2022, edited October 23, 2022 October 23, 2022 at 9:51:00 PM UTC, edited October 23, 2022 at 11:08:00 PM UTC link Permalink

To be fair to Synthesia, they didn't advertise that switching languages mid-video in their demo version was a possibility. I just did that as a test, wondering if they were set up to do that.

Interestingly, the iPhone text-to-speech (TTS) will switch languages between English and Japanese without manually choosing to do so, though sometimes if it's just a word or short phrase, the Japanese gets read as Chinese instead of Japanese. On a Mac, the TTS voice needs to be manually changed, depending on the text to be read. I think it's that way on Windows, too, but I haven't recently checked.

CK CK October 23, 2022 October 23, 2022 at 5:39:55 AM UTC link Permalink

🍎 Just for Fun:

AI Voices & Talking Heads

https://domore.web.fc2.com/synthesia.html

Tatoeba Corpus Sentences in Videos Generated by Synthesia

3 Videos: Japanese, English, and Japanese-English

CK CK October 18, 2022 October 18, 2022 at 10:11:17 PM UTC link Permalink

There is a Github issue requesting this.

https://github.com/Tatoeba/tatoeba2/issues/2889
Allow filtering out / excluding of search results based on tag

CK CK October 15, 2022 October 15, 2022 at 6:38:10 AM UTC link Permalink

Anything that is under a license that has a "share alike" condition is NOT compatible with CC-BY. CC-BY-SA is not compatible with CC-BY. That means you can't copy text from Wikipedia into Tatoeba

Source: https://blog.tatoeba.org/2011/0...d-content.html

CK CK October 12, 2022, edited October 12, 2022 October 12, 2022 at 9:24:38 PM UTC, edited October 12, 2022 at 11:55:33 PM UTC link Permalink

Though not limited to dialogs, we have a number of sentences that have been tagged that you might find useful.

airport
shopping
taxi
bank
post office
restaurant
bar
hotel
travel
hospital
telephone
at the table
etc.

Using this link, you can search our tags.

https://tatoeba.org/en/tags/view_all


You can also use this old demo page that was used to demonstrate how it might be useful to be able to sort tags by category.

* Go to the following page.
* Click the "Category" header.
* Scroll down to to find the category "situation".

All Tags on One Page
http://tatoeba.ueuo.com/stats-190323-tags.html


This page was created for this Github issue: https://github.com/Tatoeba/tatoeba2/issues/333


Note that you can download all the sentences and all the tags, and if you know how, you can create files of sentences with each tag that you are interested in. (See the "Downloads" link at the bottom of the page.)

CK CK September 22, 2022 September 22, 2022 at 10:50:30 PM UTC link Permalink

All the Esperanto sentences in that file are also on the following list, which should keep growing since PaulP has been very actively rating Esperanto sentences.

https://tatoeba.org/en/sentence.../show/3136/und
OK-rated Esperanto Linked to OK-rated English

CK CK September 22, 2022, edited September 22, 2022 September 22, 2022 at 6:53:05 AM UTC, edited September 22, 2022 at 6:58:34 AM UTC link Permalink

🍎 89,983 Esperanto-English pairs for Anki

http://study.aitech.ac.jp/anki/

epo-eng.zip

Esperanto rated OK by PaulP linked to English rated OK by CK

3 fields
Esperanto
English
Attribution

This is similar to what I have at http://www.manythings.org/anki for other languages.

CK CK September 20, 2022, edited September 20, 2022 September 20, 2022 at 11:35:24 PM UTC, edited September 20, 2022 at 11:39:08 PM UTC link Permalink

In fairness to Clozemaster or others who use data from us for commercial uses, our license, CC-BY, allows this as long as attribution is given.

Problems can arise for websites or apps that use our data without carefully harvesting sentences to use since the Tatoeba Corpus contains a number of errors. Using the complete database is not a good idea.

We have a wiki page with suggestions for those who want to use our data in their projects.

https://en.wiki.tatoeba.org/art...tatoeba-corpus

CK CK August 20, 2022 August 20, 2022 at 9:13:43 PM UTC link Permalink

Do you mean that if you look at the following, that you only see "one minute ago" for all comments?

https://tatoeba.org/en/sentence_comments/index

This is how I see it.

https://ibb.co/7Kpc0Px

CK CK August 8, 2022 August 8, 2022 at 5:13:49 PM UTC link Permalink

Of this series of sentences you mention, these are the only directly-linked translations. The others are translations of translations.

ron
Nu pot să vorbesc.
https://tatoeba.org/en/sentences/show/5676528

deu
Ich kann nicht sprechen.
eng
I can't talk.
eng
I can't speak.
epo
Mi ne povas paroli.
fra
Je ne peux pas parler.
nds
Ik kann nich snacken.
nld
Ik kan niet spreken.
por
Não posso falar.
spa
No puedo hablar.
tur
Konuşamam.

As long as this Romanian sentence can be correctly translated by all the directly-linked sentences, then there is no problem.