menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
CK {{ icon }} keyboard_arrow_right

Profile

keyboard_arrow_right

Sentences

keyboard_arrow_right

Vocabulary

keyboard_arrow_right

Reviews

keyboard_arrow_right

Lists

keyboard_arrow_right

Favorites

keyboard_arrow_right

Comments

keyboard_arrow_right

Comments on CK's sentences

keyboard_arrow_right

Wall messages

keyboard_arrow_right

Logs

keyboard_arrow_right

Audio

keyboard_arrow_right

Transcriptions

translate

Translate CK's sentences

CK's messages on the Wall (total 1,238)

CK CK April 25, 2022 April 25, 2022 at 11:47:03 PM UTC link Permalink

It looked like your comment was added to this comment.

https://tatoeba.org/en/wall/sho...#message_38552

So, I thought you were commenting about having sentences on that list show in a random order.

CK CK April 25, 2022 April 25, 2022 at 11:26:05 PM UTC link Permalink

It's not possible to set lists to display sentences in a random order at this time. That's why I suggested using the advanced search showing results in a random order.

CK CK April 25, 2022 April 25, 2022 at 10:50:15 PM UTC link Permalink

You can get that list in random order using the following advanced search.

https://tatoeba.org/sentences/s...59&sort=random

list=169859
sort=random

CK CK April 2, 2022 April 2, 2022 at 1:07:03 AM UTC link Permalink

At this time, my hope is to get audio on all sentences on List 907, which is still growing.

Here are some stats that I put together yesterday, before I imported about another 250 audio files today.

Click the links if you want to translate these.

◼ Sentences that I've added AUDIO to (both mine and other members') that have not yet been translated into any language.

https://tatoeba.org/en/sentence...ated&list=4000

79,270 / 707,137 (11.21%) on April 1, 2022

◼ List 907, not yet with audio

https://tatoeba.org/en/sentence...io=no&list=907

155,426 / 867,078 (17.92%) on April 1, 2022 (About 82% complete)



I would also love to have a native Japanese-speaking member go through these and adopt the natural-sounding Japanese sentences.

◼ Japanese Orphan Sentences from the Tanaka Corpus, Linked directly to English with Audio - Random Selection

https://tatoeba.org/sentences/s...ct&sort=random

28,537 on April 1 2022


Note that I personally don't record audio for items that should be recorded by a female voice, a young child's voice, two or more voices, or a non-American voice.

CK CK April 1, 2022, edited April 1, 2022 April 1, 2022 at 7:26:57 AM UTC, edited April 1, 2022 at 7:29:11 AM UTC link Permalink

** Dashboard for Translating English Sentences with Audio

http://a4esl.org/temporary/tato...e/searches.php

We now have over 700,000 English sentences with audio files.


If you want to directly help my projects (http://bit.ly/tatoebaprojects), then ...

1. Translate some of these sentences into your own native language.

2. If your native language is English, then translate sentences owned by native speakers of another language into natural-sounding English.

CK CK March 23, 2022 March 23, 2022 at 11:57:01 PM UTC link Permalink

You don't need to use it if you don't want to. I just thought that you might find it useful, and the list was already created.

CK CK March 23, 2022, edited March 23, 2022 March 23, 2022 at 11:31:32 PM UTC, edited March 23, 2022 at 11:37:49 PM UTC link Permalink

The following link will give you a random selection of English sentences with translations into 9 or more languages that are not yet translated into Portuguese.

https://tatoeba.org/eng/sentenc..._to=por&to=und

This is listed as "method 7" on the following page.

http://study.aitech.ac.jp/tatoe...hp?f=eng&t=por


Note that List 7055 was last updated in February of 2020 with data I received from sharptoothed.

https://tatoeba.org/en/sentence.../show/7055/und
English Sentences Linked to More Than 9 Languages (Updated Feb. 2020) - Over 40,000 Sentences - List 7055

CK CK March 18, 2022, edited March 18, 2022 March 18, 2022 at 6:05:02 AM UTC, edited March 18, 2022 at 6:05:53 AM UTC link Permalink

** Parts of Speech Tagged Sentences

Today I stumbled on a website that has tagged parts of speech for many sentences from the Tatoeba Corpus

http://study.aitech.ac.jp/wordhelp/

This is a set of pages that I created, so you can easily jump to some of these pages.

CK CK March 8, 2022, edited March 8, 2022 March 8, 2022 at 1:16:13 AM UTC, edited March 8, 2022 at 1:17:44 AM UTC link Permalink

/// I grabbed all the English queries, and then counted them.
/// This list is sorted by the counts (column 1)

Here are the top 20 English queries.
Looking at this data, I sort of wonder how useful these counts are. Perhaps some kind of bot did these searches.

65468 AARAMBH
64324 AARDWOLF
62180 AARDWOLVES
57822 A BATTERY
56590 A BIT RATHER THICK
56384 A BITTER CUP
56176 A BOIL
55274 A BUBBLE
54982 A CAPPELLA
53798 BACK PAGE
53486 A CRITICAL
52960 A DAUGHTER OF EVE
52872 Above
52820 Argument
52374 bootstrap__
52298 environment
51968 A FORTIORI
51132 A FUNCTION
50728 A GOOD DEAL
50281 sad

If you want to see them all, you can download the file here.

https://aitstudy.com/temp/queri...2022-03-08.zip (6.2MB)

CK CK February 20, 2022, edited February 20, 2022 February 20, 2022 at 12:57:47 AM UTC, edited February 20, 2022 at 12:58:07 AM UTC link Permalink

Since these usernames are not in alphabetical order, I wonder if they are in the order of who has most moved your project forward this week, in another order, or just in a random order.

The average is 11 sentences per username.

CK CK February 18, 2022 February 18, 2022 at 7:10:37 AM UTC link Permalink

Note that there is a GitHub issue related to this.

https://github.com/Tatoeba/tatoeba2/issues/1613

CK CK January 24, 2022 January 24, 2022 at 3:14:55 AM UTC link Permalink

Just for fun, here is the Top 50 list of the days with the most Tatoeba Corpus contributors' listed birthdays.

January 1 (47 contributors)
May 1 (16 contributors)
March 18 (12 contributors)
September 20 (12 contributors)
April 8 (11 contributors)
August 17 (11 contributors)
December 24 (11 contributors)
February 7 (11 contributors)
February 18 (11 contributors)
January 19 (11 contributors)
January 31 (11 contributors)
July 19 (11 contributors)
June 3 (11 contributors)
March 3 (11 contributors)
August 16 (10 contributors)
December 30 (10 contributors)
February 4 (10 contributors)
January 9 (10 contributors)
July 10 (10 contributors)
March 15 (10 contributors)
September 22 (10 contributors)
September 29 (10 contributors)
April 20 (9 contributors)
December 1 (9 contributors)
February 1 (9 contributors)
February 2 (9 contributors)
February 21 (9 contributors)
January 22 (9 contributors)
June 18 (9 contributors)
March 10 (9 contributors)
May 3 (9 contributors)
November 23 (9 contributors)
October 10 (9 contributors)
September 1 (9 contributors)
September 16 (9 contributors)
September 17 (9 contributors)
September 21 (9 contributors)
April 17 (8 contributors)
April 26 (8 contributors)
August 1 (8 contributors)
August 5 (8 contributors)
August 15 (8 contributors)
June 5 (8 contributors)
June 7 (8 contributors)
June 9 (8 contributors)
March 31 (8 contributors)
May 11 (8 contributors)
May 12 (8 contributors)
May 15 (8 contributors)
May 25 (8 contributors)

This list is limited to those on my list of native speaker contributors and those who had their birthdays listed in their profiles.

CK CK January 23, 2022 January 23, 2022 at 12:58:02 AM UTC link Permalink

** New Dutch Voice **

Rose_d has contributed over 1,000 audio files.

https://tatoeba.org/en/sentence...how/169964/und

CK CK January 22, 2022, edited January 24, 2022 January 22, 2022 at 2:11:36 AM UTC, edited January 24, 2022 at 2:22:39 AM UTC link Permalink

Happy birthday.

CK CK January 18, 2022 January 18, 2022 at 7:13:21 AM UTC link Permalink

This example finds English sentences beginning with "Tom" and ending with "Mary".

^Tom Mary$

Source: https://en.wiki.tatoeba.org/art...ow/text-search
(Click the "help" in the search bar.)

CK CK December 31, 2021, edited December 31, 2021 December 31, 2021 at 3:01:44 AM UTC, edited December 31, 2021 at 3:02:46 AM UTC link Permalink

** Something to Consider for 2022 **

10 English Words Per Day (with links to search the Tatoeba Corpus)

* https://bit.ly/tatoebadaily

The 10 words are roughly in the order of frequency of use, based on the NGSL, NAWL and TSL. I divided the words into 10 groups by level and included one word from each of those levels every day for 366 days.

Perhaps this is something you might want to try in order to focus on translating sentences with different vocabulary every day. Out of the 10 words each day, maybe you will find something interesting to you.

CK CK December 11, 2021 December 11, 2021 at 10:27:12 AM UTC link Permalink

** 5,000 Recently-added English Sentences with Audio **

http://tatoeba.ueuo.com/audio-no-links/1.html

* All of these did not have links to any other sentences in the 2021-12-11 exported data.

* Quickly see 1,000 sentences per page. These are static pages, not needing a connection to the database.

* You can listen to the audio files and choose which sentences you want to click to and translate.

* From 10543840 down to 10185355

CK CK December 2, 2021 December 2, 2021 at 9:00:24 AM UTC link Permalink

** Milestone **

Sentences with audio (total 900,000)
https://tatoeba.org/en/audio/index

2021-12-02 08:55 UTC

Screenshot: https://imgur.com/a/0q8Rey9

CK CK November 25, 2021 November 25, 2021 at 2:44:00 AM UTC link Permalink

Like I said, I don't know if it would be possible or even advisable. However this database was created for a purpose, whether it's for students or researchers, or for some other purpose, or for all purposes.

A database could include a way to put alternative translations in an order of usefulness, naturalness or on how well they match the meaning of the sentences they are attached to, or in some other useful order.

This way the database could perhaps be more effectively utilized by those who want to use it.

CK CK November 24, 2021 November 24, 2021 at 12:24:12 AM UTC link Permalink

I don't know if it would be possible or even advisable, but perhaps the Tatoeba Project needs some method to indicate which of multiple alternative translations would be best for language learners to focus on, or learn first.