Search

Search Funnelback University

Search powered by Funnelback
1 - 20 of 34 search results for `news corpora` |u:mi.eng.cam.ac.uk
  1. Fully-matching results

  2. Bin Jia, Khe Chai Sim et al: CU-HTK RT03 ...

    mi.eng.cam.ac.uk/research/projects/EARS/pubs/jia_rt03s.pdf
    23 Jun 2003: Language Model. • Sources of data (using LDC character-to-word segmentor)– Acoustic training data (modifier Kneser-Ney)– News corpora: TDT[2,3,4], China Radio, People’s Daily, Xinhua (Good-. ... Acoustic 206.6 190.8AcousticNews Corpora 199.6 179.8
  3. Experiments in Broadcast News Transcription

    mi.eng.cam.ac.uk/reports/full_html/woodland_icassp98.html/
    1 Mar 2000: Experiments in Broadcast News Transcription. P.C. Woodland, T. Hain, S.E. Johnson, T. ... FX. all other speech (e.g. spontaneous non-native). Table 1: Broadcast news focus conditions.
  4. 29 Apr 2024: 19] uFACT: Unfaithful alien-corpora training for semantically consistent data-to-text generation. ... We propose uFACT (Un-Faithful Alien Corpora Training), a training corpus construction method for data-to-text (d2t) generation models.
  5. 22 Nov 2006: The two acoustictraining data sources, and each of the news corpora, were kept asdistinct sources for language model (LM) generation. ... The total contributionfrom all the news corpora was about 0.12, with the majority fromPeople’s Daily (0.09).
  6. 20 Feb 2018: in-domainutterance pairs, and up to 91.4% when adding the out-of-domainbilingual corpora detailed in Section 2.2. ... 11] J. Tiedemann, “News from OPUS - A collection of multi-lingual parallel corpora with tools and interfaces,” in Re-cent Advances
  7. 15 Jun 2018: Experimentswere conducted on the Penn Tree Bank and BBC Multi-GenreBroadcast News (MGB) corpora, where the proposed approachsignificantly outperforms standard forms of recurrent models inperplexity. ... PTB consists mainly oftext related to finance,
  8. IMPROVING BROADCAST NEWS TRANSCRIPTION BY LIGHTLY…

    mi.eng.cam.ac.uk/reports/svr-ftp/chan_icassp2004.pdf
    27 May 2004: The rest of the paper is organised as follows. In Section 2, wedescribe the English broadcast news corpora that used in this work.Then, our lightly supervised discriminative training approach ispresented ... Rich Tran-scription Workshop, 2003. [4] D.
  9. 3 Nov 2023: 13] uFACT: Unfaithful alien-corpora training for semantically consistent data-to-text generation. ... We propose uFACT (Un-Faithful Alien Corpora Training), a training corpus construction method for data-to-text (d2t) generation models.
  10. 23 Dec 2004: The total contribution fromall the news corpora was about 0.12, with the majority from Peo-ple’s Daily (0.09). ... All experiments use the interpolated language modelwith the news corpora. Language Model System (S3) CER (%)dev04.
  11. The 1997 HTK Broadcast News Transcription System

    mi.eng.cam.ac.uk/reports/full_html/woodland_darpa98.html/
    1 Mar 2000: 41-48 (Lansdowne,VA, Feb. 1998). The 1997 HTK BROADCAST NEWS TRANSCRIPTION SYSTEM. ... using the broadcast news training texts, the acoustic training data and 1995 Marketplace transcriptions.
  12. Abstract for evermann_icassp00

    mi.eng.cam.ac.uk/reports/abstracts/evermann_icassp00.html
    27 Jul 2020: The effectiveness of these techniques is demonstrated on the broadcast news and the conversational telephone speech corpora where improvements both in terms of word error rate and normalised cross entropy were
  13. sig-004.dvi

    mi.eng.cam.ac.uk/~sjy/papers/gayo07.pdf
    20 Feb 2018: The reviewconcludes with a case study of LVCSR for Broadcast News andConversation transcription in order to illustrate the techniquesdescribed. ... The N -gram parameters areestimated by counting N -tuples in appropriate text corpora.
  14. 19 Jul 2006: Text data: used to train the ASR language model:– large news corpora available;– systems built on > 1 billion words of data. •
  15. STRUCTURAL METADATA RESEARCH IN THE EARS PROGRAM Yang Liu1,5 ...

    mi.eng.cam.ac.uk/reports/svr-ftp/tomalin_icassp05.pdf
    12 May 2005: 2.3. MDE Corpora. Conversational telephone speech (CTS) and broadcast news (BN)are used for the structural event detection tasks in EARS. ... The MDE effort in theEARS program aims to explore these tasks more extensively, us-ing different corpora and
  16. Bitext Alignment forStatistical Machine Translation Yonggang Deng A…

    mi.eng.cam.ac.uk/~wjb31/ppubs/YDengDissertationDec05.pdf
    16 Feb 2008: 72. 5.9 Percentage of Usable Arabic-English Bitext. English tokens for Arabic-English news and UN parallel corpora under different alignment pro-cedures. ... in real data, for example, parallel corpora mined from web pages, automatic bitext.
  17. 9 Jul 2024: This paradigm has shownimpressive results on standard summarization tasks such as news summarization [159, 340].However, there is a challenge in applying a large foundation model to long-documentsummarization such as
  18. EXPERIMENTS IN BROADCAST NEWS TRANSCRIPTION P.C. Woodland, T. Hain,…

    mi.eng.cam.ac.uk/reports/svr-ftp/woodland_icassp98.pdf
    10 Apr 2000: EXPERIMENTS IN BROADCAST NEWS TRANSCRIPTION. P.C. Woodland, T. Hain, S.E. Johnson, T.R. ... Young S.J. (1997) TheDevelopment of the 1996 Broadcast News Transcription Sys-tem.
  19. Bitext Alignment for Statistical Machine Translation

    mi.eng.cam.ac.uk/~wjb31/ppubs/YDengDefenseDec05.pdf
    16 Feb 2008: English Arabic-English. Used all parallel corpora available from LDCC-E: 200M En. ... words (news, all UN bitexts). Y. Deng (Johns Hopkins) Bitext Alignment for SMT 39 / 42.
  20. THE DEVELOPMENT OF THE1996 HTK BROADCAST NEWS TRANSCRIPTION SYSTEM ...

    mi.eng.cam.ac.uk/reports/svr-ftp/woodland_darpa97.pdf
    8 Mar 2000: THE DEVELOPMENT OF THE1996 HTK BROADCAST NEWS TRANSCRIPTION SYSTEM. P.C. Woodland, M.J.F. ... 5. CONCLUSIONThis paper has described our initial efforts to develop systemsfor broadcast news transcription.
  21. paper.dvi

    mi.eng.cam.ac.uk/~mjfg/liao_ICASSP07.pdf
    15 Aug 2007: Experiments are conductedon theResource Management and Broadcast News corpora. Index Terms— Speech recognition, Robustness. ... 4. EXPERIMENTS. A simplified Broadcast News system based on the 2003 CU-HTKsystem [11] was evaluated.

Related searches for `news corpora` |u:mi.eng.cam.ac.uk

Search history

Recently clicked results

Recently clicked results

Your click history is empty.

Recent searches

Your search history is empty.