LLODO – Education and technology


Subtitles filled with obscene language

03/08/2022 by admin

Nearly 400,000 people subscribe to a YouTube account called Rob the Robot – Learning Videos For Children. In a 2020 animated video, the protagonist and his friends visit a stadium-themed planet and attempt heroic feats inspired by Heracles. Their adventures are suitable for elementary-age children, but young readers who turn on YouTube’s automatic captions may expand their vocabulary in unexpected ways. At one point, YouTube’s algorithms mishear the word “brave” as “rape” and caption a scene in which a character aspires to be “strong and rape like Heracles.”


Screenshot from the Rob the Robot – Learning Videos For Children YouTube channel.

A recent study of YouTube’s algorithmic captioning on videos aimed at children documented how the text sometimes veers into very adult language. In a sample of more than 7,000 videos from 24 top-rated children’s channels, 40% displayed subtitles containing words from a list of 1,300 “taboo” terms related to swearing. In about 1% of videos, the subtitles included words from a list of “highly inappropriate” terms.
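The kind of measurement described above can be sketched roughly as follows. This is a minimal illustration, not the researchers' actual method: the term list, the sample captions, and the function names `has_taboo_word` and `taboo_share` are all made up for the example, and whole-word matching is an assumption.

```python
import re

# Hypothetical stand-in for the study's list of 1,300 "taboo" terms.
TABOO_TERMS = {"crap", "bastard"}


def has_taboo_word(caption: str, taboo_terms: set[str]) -> bool:
    """Return True if any whole word in the caption is on the taboo list."""
    words = re.findall(r"[a-z']+", caption.lower())
    return any(word in taboo_terms for word in words)


def taboo_share(captions_by_video: dict[str, str], taboo_terms: set[str]) -> float:
    """Fraction of videos whose caption text contains at least one taboo term."""
    if not captions_by_video:
        return 0.0
    flagged = sum(
        has_taboo_word(text, taboo_terms) for text in captions_by_video.values()
    )
    return flagged / len(captions_by_video)


# Made-up caption text, for illustration only.
sample = {
    "video_a": "look at the crap on the beach",
    "video_b": "let's count to ten together",
    "video_c": "the scrap of paper is blue",  # "scrap" is not a whole-word match
    "video_d": "what a lovely day",
    "video_e": "that old bastard sword",
}
share = taboo_share(sample, TABOO_TERMS)
```

Here two of the five made-up captions contain a listed word, so `share` comes out at 0.4. Matching whole words rather than substrings keeps innocent words such as “scrap” from being counted.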

Several videos posted on Ryan’s World, a leading children’s channel with over 30 million subscribers, illustrate the problem. In one video, the phrase “You should also buy corn” appears in the captions as “You should also buy p*rn”, because the system misheard “corn” as “p*rn”. In other videos, “beach towel” is transcribed as “b*tch towel”, “buster” becomes “bastard”, “crab” becomes “crap”, and a craft video on making a monster-themed dollhouse features a “bed for p*nis”.


Screenshot from video on Ryan’s World channel.

“It is surprising and disturbing,” said Ashique KhudaBukhsh, an assistant professor at Rochester Institute of Technology who has researched the issue.

Automatic captions are not available on YouTube Kids, the child-directed version of the platform. But many families use the standard version of YouTube, where the captions can be viewed. The Pew Research Center reported in 2020 that 80% of parents with children 11 years of age or younger said their children watched YouTube content, and more than 50% of children did so on a daily basis.

KhudaBukhsh hopes the study will draw attention to a phenomenon that he says has received little notice from tech companies and researchers. He calls it “inappropriate content hallucination”: algorithms adding inappropriate material that was not present in the original content. It is the flip side of the familiar complaint that smartphone autocomplete filters out adult language to an annoying degree.

YouTube spokeswoman Jessica Gibby said children under 13 should use YouTube Kids, where automatic captions cannot be viewed. On the standard version of YouTube, she said, the feature improves accessibility. “We’re constantly working to improve automatic captioning and reduce errors,” she said.

Alafair Hall, a spokeswoman for Pocket.watch, a children’s entertainment studio that publishes Ryan’s World content, said in a statement that the company “is in close and immediate contact with our platform partners, such as YouTube, to update any inaccurate video subtitles.”

“The benefits of speech-to-text are undeniable, but there are blind spots in these systems that need to be checked and rebalanced,” KhudaBukhsh said.

Those blind spots can seem surprising to humans, who make sense of speech partly by grasping the broader context and meaning of a person’s words. Algorithms are different: although their language processing has improved, they still lack the capacity for fuller understanding. This has caused problems for other companies that rely on machines to process text. One startup had to overhaul its adventure game after it was found to sometimes describe sexual scenarios involving minors.

Machine learning algorithms “learn” a task by processing large amounts of training data, in this case audio files paired with matching transcripts. KhudaBukhsh said that YouTube’s system sometimes inserts profanity because its training data consists mainly of adult speech and includes little speech from children. When the researchers manually examined examples of inappropriate words in the captions, they found the words often appeared alongside the speech of children or of people who appeared to be non-native English speakers. Previous studies have also found that transcription services from Google and other big tech companies make more errors for non-white speakers, and fewer errors for standard American English than for other US dialects.


Children quickly pick up everything they see on YouTube.

Rachael Tatman, a linguist, says a simple block list of words that should never appear in captions on children’s YouTube videos would solve many of the problems. But “clearly no one is supervising the engineering,” she said.

Still, Tatman says a block list would be an imperfect solution, because inappropriate phrases can be constructed from individually innocuous words. A more sophisticated approach would be to tune the captioning system to avoid adult language when it transcribes children’s content, but Tatman says that would not be perfect either. Machine learning software that works with language can be steered statistically in certain directions, but it is not easily programmed to respect context. According to Tatman, “Language models are not precise tools.”
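A word-level block list of the kind Tatman describes might look something like the sketch below. It is an assumption-laden illustration rather than anything YouTube is known to use: the list contents and the function name `mask_blocked_words` are hypothetical.

```python
import re

# Hypothetical block list; a real one would be far longer.
BLOCK_LIST = {"crap", "bastard"}


def mask_blocked_words(caption: str, block_list: set[str]) -> str:
    """Replace each block-listed word with asterisks of the same length."""

    def mask(match):
        word = match.group(0)
        # Compare case-insensitively, but leave unlisted words untouched.
        return "*" * len(word) if word.lower() in block_list else word

    return re.sub(r"[A-Za-z']+", mask, caption)


masked = mask_blocked_words("The Crap walked along the beach", BLOCK_LIST)
unchanged = mask_blocked_words("a bed for the monster", BLOCK_LIST)
```

In this sketch `masked` becomes “The **** walked along the beach”, while the second caption passes through unchanged. The filter only ever inspects one word at a time, which is exactly the weakness Tatman points out: a phrase assembled from words that are each harmless on their own would never be caught.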

KhudaBukhsh and his collaborators have devised and tested systems to correct taboo words in transcripts, but even the best of them succeed less than 30% of the time. The team also ran audio from children’s YouTube videos through an automatic transcription service provided by Amazon. It, too, sometimes made mistakes that introduced inappropriate language. Amazon spokeswoman Nina Lindsey declined to comment on the matter, but did provide links to developer documentation on how to correct or filter unwanted words.

Source: Wired


https://genk.vn/can-than-khi-cho-tre-xem-video-tren-youtube-phu-de-chen-day-ngon-ngu-tuc-tiu-20220308170737071.chn


Copyright (c) 2022 · LLODO.COM - About Us - Privacy Policy - Contact Us - Site map