Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multi-language support #11

Open
2 of 3 tasks
Tracked by #2
akdasa opened this issue Jul 29, 2024 · 0 comments
Open
2 of 3 tasks
Tracked by #2

Multi-language support #11

akdasa opened this issue Jul 29, 2024 · 0 comments
Assignees
Labels
enhancement New feature or request

Comments

@akdasa
Copy link
Member

akdasa commented Jul 29, 2024

Problem Statement:

A track can be available in multiple languages: the original speaker's language, a translator's language, and translated transcripts in various languages. Users should be able to find a track in any language it is presented.

Proposed Solution:

Extend the TrackInfo model to incorporate all information about available languages.

Implementation Plan:

Update the languages field of the TrackInfo model. It should contain an array of the following objects:

{ 
    language: string,
    source: 'track'       // lecture was given or translated into this language
          | 'transcript', // app has transcript on that language
    type: 'original'     // it is original audio
        | 'generated'    // machine-generated
        | 'redacted'     // machine-generated but redacted by human
}

For example, the lecture was given in English and translated into Russian by a translator, and we have transcriptions for Russian and Serbian (automatically translated) it will look like this:

[
  // Lector speaks in English
  { language: 'en', source: 'track', type: 'original' }, 
  
  // Translator translated into his native Russian 
  { language: 'ru', source: 'track', type: 'original' }, 
  
  // Transcript was generated but redacted
  { language: 'ru', source: 'transcript', type: 'redacted' }  
  
  // Transcript was generated, but not redacted
  { language: 'sr', source: 'transcript', type: 'generated' } 
]

So users can find that lecture in Serbian, and we can inform them that only automatically translated transcription is available for that track.

Implementation Plan:

@akdasa akdasa added documentation Improvements or additions to documentation enhancement New feature or request labels Jul 29, 2024
@akdasa akdasa changed the title Update languages field of the TrackInfo model Update languages field of the TrackInfo model Jul 29, 2024
@akdasa akdasa changed the title Update languages field of the TrackInfo model Update languages field of the TrackInfo model Jul 29, 2024
@akdasa akdasa changed the title Update languages field of the TrackInfo model Update languages field of the TrackInfo model Jul 29, 2024
@akdasa akdasa self-assigned this Jul 29, 2024
@akdasa akdasa changed the title Update languages field of the TrackInfo model Extend languages model Jul 29, 2024
@akdasa akdasa added feature New feature request and removed documentation Improvements or additions to documentation enhancement New feature or request labels Jul 29, 2024
@akdasa akdasa changed the title Extend languages model Multi-language support Sep 19, 2024
@akdasa akdasa removed the feature New feature request label Sep 19, 2024
@akdasa akdasa added the enhancement New feature or request label Sep 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Status: Backlog
Development

No branches or pull requests

1 participant