Search

Search Funnelback University

Search powered by Funnelback
1 - 50 of 154 search results for `broadcast news data` |u:mi.eng.cam.ac.uk
  1. Fully-matching results

  2. Abstract for woodland_darpa98

    mi.eng.cam.ac.uk/reports/abstracts/speech/woodland_darpa98.html
    27 Jul 2020: Abstract for woodland_darpa98. Proc. 1998 DARPA Broadcast News Transcription and Understanding Workshop. ... The complete system yields an overall word error rate of 22.0% on the 1996 unpartitioned broadcast news development test data and just 15.8% on
  3. MultiMedia Document Retrieval (1997-2000)

    mi.eng.cam.ac.uk/research/projects/Multimedia_Document_Retrieval/
    7 Oct 2001: The final HTK system yielded an overall word error rate of 22.0% on the 1996 unpartitioned broadcast news development test data and just 16.2% on the evaluation test set - ... the lowest overall word error rate in the 1997 DARPA broadcast news evaluation,
  4. Abstract for mrva_icslp06

    mi.eng.cam.ac.uk/reports/abstracts/mrva_icslp06.html
    27 Jul 2020: Moreover, using broadcast news language model alone trained on large data under-performs a model that includes additional small amount of broadcast conversations by 1.8% absolute character error rate. ... In addition, it was found that it is possible to
  5. MultiMedia Document Retrieval (1997-2000) - Progress

    mi.eng.cam.ac.uk/research/projects/Multimedia_Document_Retrieval/progress.html
    7 Oct 2001: The final HTK system designed from the results of these experiments yielded an overall word error rate of 22.0% on the 1996 unpartitioned broadcast news development test data and just ... This consisted of two parts. Firstly the automatic transcription
  6. Abstract for tranter_tr476

    mi.eng.cam.ac.uk/reports/abstracts/tranter_tr476.html
    27 Jul 2020: CLUSTER VOTING FOR SPEAKER DIARISATION. S. E. Tranter. May 2004. It is often important to be able to automatically detect 'who spoke when' in audio data. ... The speaker diarisation task attempts to address this problem on Broadcast News data by defining
  7. Abstract for tranter_icassp05

    mi.eng.cam.ac.uk/reports/abstracts/tranter_icassp05.html
    27 Jul 2020: Results are presented on the 6-show RT-03 Broadcast News evaluation data, showing the DER can be reduced by 1.64% and 2.56% absolute using this method when combining
  8. Abstract for chan_icassp2004

    mi.eng.cam.ac.uk/reports/abstracts/chan_icassp2004.html
    27 Jul 2020: Abstract for chan_icassp2004. Proc. ICASSP 2004. IMPROVING BROADCAST NEWS TRANSCRIPTION BY LIGHTLY SUPERVISED DISCRIMINATIVE TRAINING. ... H.Y. Chan and P.C. Woodland. May 2004. In this paper, we present our experiments on lightly supervised
  9. Woodland et al.: English CTS Systems 2003 CU-HTK English ...

    mi.eng.cam.ac.uk/research/projects/EARS/pubs/woodland_rt03s.pdf
    24 Jul 2003: Woodland et al.: English CTS Systems. Automatic Segmentation. • Need to automatically segment the input data this year. • ... 2003 language models. • Training data in 5 portions:– Revised MSU transcripts CHE [3MW]– broadcast news setup (BN
  10. Abstract for woodland_darpa99

    mi.eng.cam.ac.uk/reports/abstracts/speech/woodland_darpa99.html
    27 Jul 2020: Abstract for woodland_darpa99. Proc DARPA Broadcast News Workshop, March 1999 (Herndon, VA). ... interpolated word level language models to combine text sources; increased broadcast news language model training data; and an extra adaptation stage using a
  11. Abstract for woodland_icassp98

    mi.eng.cam.ac.uk/reports/abstracts/speech/woodland_icassp98.html
    27 Jul 2020: Abstract for woodland_icassp98. Proc ICASSP'98, Seattle. EXPERIMENTS IN BROADCAST NEWS TRANSCRIPTION. ... Young. May 1998. This paper presents the recent development of the HTK broadcast news transcription system.
  12. K. Yu, M.J.F. Gales and P.C. Woodland Cambridge University ...

    mi.eng.cam.ac.uk/research/projects/AGILE/publications/yu-interspeech07.pdf
    10 Oct 2007: Experi-ments were carried out on a Mandarin broadcast transcription taskusing both Broadcast News (BN) and Broadcast Conversation (BC)data. ... 9] J. Ma, S. Matsoukas, O. Kimball, & R. Schwartz, “Unsuper-vised Training on Large Amounts of Broadcast
  13. Abstract for woodland_darpa97

    mi.eng.cam.ac.uk/reports/abstracts/speech/woodland_darpa97.html
    27 Jul 2020: THE DEVELOPMENT OF THE 1996 HTK BROADCAST NEWS TRANSCRIPTION SYSTEM. P.C. ... Woodland, M.J.F. Gales, D. Pye & S.J. Young. April 1997. This paper describes our efforts in extending a large vocabulary speech recognition system to handle broadcast news
  14. Experiments in Broadcast News Transcription

    mi.eng.cam.ac.uk/reports/full_html/woodland_icassp98.html/
    1 Mar 2000: That system was constructed using HMMs trained on the Wall Street Journal (WSJ) corpus as a base and then adapted to individual data types of broadcast news data using supervised maximum ... Siegler M.A., Jain U., Raj B. & Stern R.M. (1997).
  15. The 1998 HTK Broadcast News Transcription System: Development and…

    mi.eng.cam.ac.uk/reports/full_html/woodland_darpa99.html/
    2 Mar 2000: Significant progress in the accurate transcription of broadcast news data has been made over the last few years so that we are now at a point where such systems can be ... The soft-clustering technique developed at JHU [9] had shown worthwhile reductions
  16. The 1997 HTK Broadcast News Transcription System

    mi.eng.cam.ac.uk/reports/full_html/woodland_darpa98.html/
    1 Mar 2000: We have found that using the full system with adaptation results in a 20-25% decrease in word error rate on broadcast news data. ... Siegler M.A., Jain U., Raj B. & Stern R.M. (1997). Automatic Segmentation, Classification and Clustering of Broadcast
  17. The Cambridge University Multimedia Document Retrieval Demo System

    mi.eng.cam.ac.uk/reports/full_html/tuerk_sigir00demo.html/
    14 Aug 2000: This system gives a word error rate of 15.9% on the 1998 Hub4 broadcast news evaluation data. ... J. J. Odell, P. C. Woodland, and T. Hain. The CUHTK-Entropic 10xRT Broadcast News Transcription System.
  18. The Cambridge University March 2005 Speaker Diarisation System R. ...

    mi.eng.cam.ac.uk/reports/svr-ftp/sinha_eurospeech05.pdf
    23 Jun 2005: 4. Experiments4.1. Data and Scoring Metric. The experiments reported in this paper use a development set of24 US broadcast news shows, denoteddev24 and a 12-showsubset of this, denoteddev12 which ... 953–956. [5] J.-L. Gauvain, L. Lamel, and G. Adda,
  19. UNSUPERVISED TRAINING FOR MANDARIN BROADCAST NEWS AND…

    mi.eng.cam.ac.uk/research/projects/AGILE/publications/wang_ICASSP07.pdf
    10 Oct 2007: Experi-ments were carried out on a Mandarin transcriptions task. Two typesof test data were considered, Broadcast News (BN) and BroadcastConversations (BC). ... 8] J. Ma, S. Matsoukas, O. Kimball, and R. Schwartz, “Unsu-pervised training on large
  20. IMPLEMENTATION OF AUTOMATIC CAPITALISATIONGENERATION SYSTEMS FOR…

    mi.eng.cam.ac.uk/reports/svr-ftp/auto-pdf/kim_icassp02.pdf
    9 Aug 2005: The correlation between pause lengthsand sentence boundary marks was studied for broadcast news datain [8]. ... Table 2. Database descriptions. 3. EXPERIMENTS. Broadcast News data provides a good test-bed for speech recog-nition, because it requires
  21. Two-way Cluster Voting to Improve Speaker Diarisation Performance

    mi.eng.cam.ac.uk/reports/full_html/tranter_icassp05.html/
    31 Mar 2005: Results are presented on the 6-show RT-03 Broadcast News evaluation data, showing the DER can be reduced by 1.64% and 2.56% absolute using this method when combining ... The Rich Transcription diarisation evaluations[provide a framework to analyse the
  22. The CUHTK-Entropic 10xRT Broadcast News Transcription System

    mi.eng.cam.ac.uk/reports/full_html/odell_darpa99.html/
    1 Mar 2000: The CUHTK-Entropic 10xRT Broadcast News Transcription System. J.J. Odell , P.C. ... Cepstral mean normalisation of each segment is applied. Two sets of cross word triphone context dependent HMMs were produced from the 1997 and 1998 Broadcast news
  23. SPEECH RECOGNITION SYSTEM COMBINATION FOR MACHINE TRANSLATION M.J.F.…

    mi.eng.cam.ac.uk/research/projects/AGILE/publications/gales_ICASSP07.pdf
    10 Oct 2007: 3. STT POST PROCESSING. In processing data such as Broadcast News (BN) or Broadcast Con-versations (BCs) for an STT system, the first stage is to segment thedata into homogeneous blocks, ... The language models for each of the systems were trainedon over
  24. Speaker Diarisation for Broadcast News S. E. Tranter† and ...

    mi.eng.cam.ac.uk/reports/svr-ftp/tranter_odyssey04.pdf
    25 Mar 2004: Each data set consists of one 30 minute extract from 6different US broadcast news shows. ... A library of broadcast news shows was made1 using theEnglish TDT-4 training data, excluding the shows from theRT-03s development sets.
  25. Spoken Document Retrieval for TREC-7 at Cambridge University

    mi.eng.cam.ac.uk/reports/full_html/johnson_trec7.html/
    30 Mar 2000: The input data is presented to our HTK transcription system as complete episodes of broadcast news shows and these are first converted to a set of segments for further processing. ... This was useful, but we went further and developed a new pair of
  26. Who Spoke When? - Automatic Segmentation and Clustering for…

    mi.eng.cam.ac.uk/reports/full_html/johnson_eurospeech99.html/
    2 Mar 2000: For the task of identifying potentially unknown anchor speakers within broadcast news shows, the frame classification error rate is very important. ... The 1996 Hub-4 Broadcast News Transcription development data was used for all the experiments reported
  27. CONSENSUS NETWORK DECODING FOR STATISTICAL MACHINE TRANSLATIONSYSTEM…

    mi.eng.cam.ac.uk/research/projects/AGILE/publications/sim-icassp07.pdf
    26 Mar 2008: Results for both Chinese-English and Arabic-English on two text development sets as well asa broadcast news development set used by the AGILE team in theGALE program4. ... Furthermore the actual evaluation results from the2006 test are included. The 2006
  28. THE CU-HTK MANDARIN BROADCAST NEWS TRANSCRIPTION SYSTEM R. Sinha, ...

    mi.eng.cam.ac.uk/research/projects/AGILE/publications/rs_ICASSP06.pdf
    23 Feb 2006: This data was split between about 34Mwords from broadcast sources (CCTV, NTDTV, VOA) and about6M words from news paper sources. ... Jin, M. Noamany, andT. Shultz, “The ISL RT-04 Mandarin Broadcast News evaluationsystem,” inProc.
  29. TRAINING AND ADAPTING MLP FEATURES FOR ARABIC SPEECH RECOGNITION

    mi.eng.cam.ac.uk/research/projects/AGILE/publications/park_icassp09.pdf
    7 Oct 2009: Allthe schemes are evaluated in a common framework using an ArabicBroadcast News/Conversation transcription task. ... All thesetest sets consist of both Broadcast News and Broadcast Conversationstyles of data.
  30. THE DEVELOPMENT OF THE CAMBRIDGE UNIVERSITY RT-04 DIARISATION SYSTEM…

    mi.eng.cam.ac.uk/reports/svr-ftp/tranter_rt04.pdf
    10 Jan 2005: The Rich Transcription diarisation evaluations[1, 2, 3] providea framework to analyse the performance of such speaker diarisa-tion systems on Broadcast News (BN) data. ... 5] J.-L. Gauvain, L. Lamel, and G. Adda, “Partitioning andTranscription of
  31. Spoken Document Retrieval for TREC-8 at Cambridge University

    mi.eng.cam.ac.uk/reports/full_html/johnson_trec8.html/
    30 Mar 2000: Since a substantial portion of the data to be transcribed was known to be commercials and thus irrelevant to broadcast news queries, an automatic method of detecting and eliminating such commercials ... Three fixed backoff word-based language models were
  32. is2008.dvi

    mi.eng.cam.ac.uk/research/projects/AGILE/publications/raut-interspeech08.pdf
    3 Dec 2008: 1. IntroductionSpeech recognition systems are increasingly being built withfound data such as broadcast news and conversational tele-phone speech recordings. ... The speech data wasparameterised using 12 PLP Cepstral coefficients plus the0thorder (C0)
  33. Audio Indexing and Retrieval of Complete Broadcast News Shows

    mi.eng.cam.ac.uk/reports/full_html/johnson_riao00.html/
    19 Apr 2000: This paper describes a system for retrieving relevant portions of complete broadcast news shows starting with only the audio data. ... Conclusions. This paper has described a system for retrieving relevant portions of complete broadcast news shows when
  34. The Development of the Cambridge University RT-04 Diarisation System

    mi.eng.cam.ac.uk/reports/full_html/tranter_rt04.html/
    10 Jan 2005: The Rich Transcription diarisation evaluations[provide a framework to analyse the performance of such speaker diarisation systems on Broadcast News (BN) data. ... J.-L. Gauvain, L. Lamel, and G. Adda,. Partitioning and Transcription of Broadcast News Data
  35. DEVELOPMENT OF A PHONETIC SYSTEM FOR LARGE VOCABULARY ARABICSPEECH ...

    mi.eng.cam.ac.uk/research/projects/AGILE/publications/gales-asru07.pdf
    26 Mar 2008: The performance and combination ofphonetic and graphemic acoustic models are then compared on bothBroadcast News (BN) and Broadcast Conversation (BC) data. ... Schwartz, “Unsu-pervised training on large amount of broadcast news data,” inProc.
  36. Class-based language model adaptation using mixtures of word-class…

    mi.eng.cam.ac.uk/reports/full_html/moore_icslp00.html/
    2 Nov 2000: A mixture of broadcast news and newswire text was used as training data for the topic model, with 144 million words of Broadcast News text and 25 million words of Los ... Whittaker and S.J. Young, The 1997 HTK Broadcast News Transcription System''; DARPA
  37. INVESTIGATION OF ACOUSTIC MODELLING TECHNIQUES FOR LVCSR SYSTEMS M.…

    mi.eng.cam.ac.uk/reports/svr-ftp/gales_rt04_modelling.pdf
    19 May 2005: Exper-imental results are presented on both broadcast news (BN) andconversational telephone speech (CTS) transcription tasks. ... 1. INTRODUCTION. For many years automatic transcription of broadcast news (BN)and conversational telephone speech (CTS) data
  38. POSTERIOR PROBABILITY DECODING, CONFIDENCE ESTIMATION AND SYSTEM…

    mi.eng.cam.ac.uk/reports/full_html/evermann_stw00.html/
    5 Oct 2000: All the experiments reported are based on this system. The acoustic models used are triphone and quinphone HMMs trained on data from the Switchboard and CallHome corpora. ... A 4-gram language model was trained on the transcripts of the acoustic training
  39. CAMBRIDGE UNIVERSITYENGINEERING DEPARTMENT Automatic Transcription…

    mi.eng.cam.ac.uk/reports/svr-ftp/hain_tr465.pdf
    18 Dec 2003: 50000 most frequent words occurring in 204 million words (MW)of Broadcast News (BN) training data, yielding a vocabulary size of around 55000. ... Again modified Kneser-Ney discountingwas used. The BNLM model was trained on 204MW of Broadcast News data
  40. paper-mdeval-v19_revised.dvi

    mi.eng.cam.ac.uk/reports/svr-ftp/tomalin_rt04.pdf
    12 Jan 2005: Sim, and P. C. Woodland, “Recent Devel-opments at Cambridge in Broadcast News Transcription,”in Proc. ... F. Gales,D. Mrva, K. C. Sim, and P. C. Woodland, “Development ofthe CU-HTK 2004 Broadcast News Transcription Systems,”in Proc.
  41. Deep Activation Mixture Model for Speech Recognition

    mi.eng.cam.ac.uk/UKSpeech2017/posters/c_wu.pdf
    17 Nov 2017: l)k. ). 6. Experiment. I Data and setupI 144-hour English broadcast news dataset (LDC97S44, LDC98S71)I DNN-HMM hybrid ASR frameworkI 5 hidden layers with 1024 units for both DNN
  42. Recent Developments at Cambridgein Broadcast News Transcription D.Y.…

    mi.eng.cam.ac.uk/research/projects/EARS/pubs/kim_rt04.pdf
    15 Feb 2005: Presentation Overview. • RT03 Broadcast News System Review• Training & Test Data• Improved Acoustic Model Building. – ... dev04f representative of the extended broadcast news corpus• No epoch overlap with the acoustic training data.
  43. The Cambridge University March 2005 Speaker Diarisation System

    mi.eng.cam.ac.uk/reports/full_html/sinha_eurospeech05.html/
    22 Sep 2005: J.-L. Gauvain, L. Lamel, and G. Adda,. ''Partitioning and Transcription of Broadcast News Data,'' [ ps ]. in Proc. ICSLP, December 1998, vol. 4, pp. 1335-1338. 6. ... S. E. Tranter and D. A. Reynolds,. ''Speaker Diarisation for Broadcast News,'' [in Proc.
  44. paper.dvi

    mi.eng.cam.ac.uk/~ar527/ragni_is2018a.pdf
    15 Jun 2018: This provides a contrastto conversational telephone speech (CTS), broadcast news andvoice search style data for which numerous systems have beendeveloped [1, 2, 3, 4, 5, 6]. ... This pa-per looked at a particular scenario where the development train-ing
  45. IEEE TRANS. ON SAP, VOL. ?, NO. ??, ????? ...

    mi.eng.cam.ac.uk/research/projects/AGILE/publications/mjfg_ASL.pdf
    23 Feb 2006: II. SEGMENTATION AND CLUSTERING. For Broadcast News transcription, the first stage of pro-cessing is to partition the incoming audio data stream intohomogeneous segments (the segmentation) and to group thesesegments into ... A. Training and Test Data Sets
  46. ./plot_entropy.eps

    mi.eng.cam.ac.uk/~ar527/chen_is2017.pdf
    15 Jun 2018: The performance of the bi-RNNLMs is evaluatedon three speech recognition tasks: broadcast news; meetingtran-scription (AMI); and low-resource systems (Babel data). ... 5. EXPERIMENTS. The performance of the bi-RNNLMs was evaluated on three cor-pora:
  47. Cluster Voting for Speaker Diarisation S.E.…

    mi.eng.cam.ac.uk/reports/svr-ftp/tranter_tr476.pdf
    13 May 2004: 32.2 The Broadcast News Data for Diarisation. 32.3 Diarisation Scoring. 4. ... as convincing on the Broadcast News data, and the system restrictedthe two input segmentations to have the same number of speakers.
  48. K. Yu, M.J.F. Gales and P.C. Woodland Cambridge University ...

    mi.eng.cam.ac.uk/~mjfg/yu-interspeech07.pdf
    11 Jan 2008: Experi-ments were carried out on a Mandarin broadcast transcription taskusing both Broadcast News (BN) and Broadcast Conversation (BC)data. ... 9] J. Ma, S. Matsoukas, O. Kimball, & R. Schwartz, “Unsuper-vised Training on Large Amounts of Broadcast
  49. 22 Feb 2007: Last three are important to achieve good generalisation. • Example Broadcast News LVCSR gains ( 500 1000 hours training data)– typically 200K-300K Gaussian components for each system. ... Tl(w) may be replaced by log(P (w))– allows LM text training
  50. JOURNAL OF IEEE TRANS. ACOUST., SPEECH, SIGNAL PROCESSING, JULY ...

    mi.eng.cam.ac.uk/research/projects/AGILE/publications/sim_SAP06.pdf
    10 Oct 2007: Discriminative training of precision matrices was evaluatedon an English conversational telephone speech (CTS) task,which consists of multi-speaker spontaneous telephone conver-sational speech, and an English broadcast news (BN) task, ... 1.1% and
  51. SPEAKER CLUSTERING USING DIRECT MAXIMISATION OFTHE MLLR-ADAPTED…

    mi.eng.cam.ac.uk/reports/svr-ftp/johnson_icslp98.pdf
    10 Apr 2000: This paper presents two strategies forclustering broadcast news data segments (found by an au-tomatic segmentation algorithm) for subsequent MLLRadaptation. ... 5. EXPERIMENTS. Experiments on various sets of broadcast news data havebeen carried out to

Search history

Recently clicked results

Recently clicked results

Your click history is empty.

Recent searches

Recent searches

Your search history is empty.