The GV-LEx corpus of tales in French: Text and speech corpora enriched with lexical, discourse, structural, phonemic and prosodic annotations
A corpus of French tales is presented. Its two parts, a text corpus and a speech corpus, were designed for studying the relationships between the textual structures of tales and speech prosody, with the targeted application of an expressive text-to-speech synthesis system embedded in a humanoid robo...
        Saved in:
      
    
          | Published in | Language Resources and Evaluation Vol. 49; no. 3; pp. 521 - 547 | 
|---|---|
| Main Authors | , , , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
        Dordrecht
          Springer
    
        01.09.2015
     Springer Netherlands Springer Nature B.V Springer Verlag  | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 1574-020X 1572-8412 1574-0218  | 
| DOI | 10.1007/s10579-015-9306-7 | 
Cover
| Abstract | A corpus of French tales is presented. Its two parts, a text corpus and a speech corpus, were designed for studying the relationships between the textual structures of tales and speech prosody, with the targeted application of an expressive text-to-speech synthesis system embedded in a humanoid robot. The 89-tale text corpus, and the 12-tale speech corpus were annotated using a common tale description framework. Lexical level annotations include extended definitions of enumerations, time, place and person named entities, as well as part of speech tags. Supra-lexical level annotations include the segmentation of tales into a sequence of episodes, the localization and attribution of direct quotations, together with tale protagonists co-references. Annotation distributions and inter-annotator agreement were analyzed. The largest coverage and strongest agreement were observed for person named entities, characters' direct quotations, and their associated coreference chains. Speech corpus annotations were extended to allow the analysis of the relations between tale linguistic information and prosodic properties observed in associated speech. Word and phoneme boundaries were inferred through semi-automatic procedures, resulting in linguistic annotations aligned with the speech signal. Intonation stylization models were used to ease the visual and statistical analysis of tale's prosody. Additional meta-information is provided with the speech corpus, allowing describing tale characters according to their gender, age, size, valence and kind. The corpora described in this article are publicly available through the European Language Resources Association catalog. | 
    
|---|---|
| AbstractList | A corpus of French tales is presented. Its two parts, a text corpus and a speech corpus, were designed for studying the relationships between the textual structures of tales and speech prosody, with the targeted application of an expressive text-to-speech synthesis system embedded in a humanoid robot. The 89-tale text corpus, and the 12-tale speech corpus were annotated using a common tale description framework. Lexical level annotations include extended definitions of enumerations, time, place and person named entities, as well as part of speech tags. Supra-lexical level annotations include the segmentation of tales into a sequence of episodes, the localization and attribution of direct quotations, together with tale protagonists co-references. Annotation distributions and inter-annotator agreement were analyzed. The largest coverage and strongest agreement were observed for person named entities, characters' direct quotations, and their associated coreference chains. Speech corpus annotations were extended to allow the analysis of the relations between tale linguistic information and prosodic properties observed in associated speech. Word and phoneme boundaries were inferred through semi-automatic procedures, resulting in linguistic annotations aligned with the speech signal. Intonation stylization models were used to ease the visual and statistical analysis of tale's prosody. Additional meta-information is provided with the speech corpus, allowing describing tale characters according to their gender, age, size, valence and kind. The corpora described in this article are publicly available through the European Language Resources Association catalog. A corpus of French tales is presented. Its two parts, a text corpus and a speech corpus, were designed for studying the relationships between the textual structures of tales and speech prosody, with the targeted application of an expressive text-to-speech synthesis system embedded in a humanoid robot.The 89-tale text corpus, and the 12-tale speech corpus were annotated using a common tale description framework. Lexical level annotations include extended definitions of enumerations, time, place and person named entities, as well as part of speech tags. Supra-lexical level annotationsinclude the segmentation of tales into a sequence of episodes, the localization and attribution of direct quotations, together with tale protagonists co-references. Annotation distributions and inter-annotator agreement were analyzed. The largest coverage and strongest agreement were observed for person named entities, characters’ direct quotations, and their associated coreference chains. Speech corpus annotations were extended to allow the analysis of the relations between tale linguistic information and prosodic properties observed in associated speech.Word and phoneme boundaries wereinferred through semi-automatic procedures, resulting in linguistic annotations aligned with the speech signal. Intonation stylization models were used to ease the visual and statistical analysis of tale’s prosody. Additional meta-information is provided with the speech corpus, allowing describing tale characters according to their gender, age, size, valence and kind. The corpora described in this article are publicly available through the European Language Resources Association catalog.  | 
    
| Author | Adda-Decker, Martine Doukhan, David Rilliard, Albert Rosset, Sophie d'Alessandro, Christophe  | 
    
| Author_xml | – sequence: 1 givenname: David surname: Doukhan fullname: Doukhan, David – sequence: 2 givenname: Sophie surname: Rosset fullname: Rosset, Sophie – sequence: 3 givenname: Albert surname: Rilliard fullname: Rilliard, Albert – sequence: 4 givenname: Christophe surname: d'Alessandro fullname: d'Alessandro, Christophe – sequence: 5 givenname: Martine surname: Adda-Decker fullname: Adda-Decker, Martine  | 
    
| BackLink | https://shs.hal.science/halshs-01251140$$DView record in HAL | 
    
| BookMark | eNp9kc1u1DAUhSNUJNrCA7BAssSGxRiu7ThO2FVV_6SR2AyIneVxHJJRxg6-DgwPwTvXM0EVYsHGP1ffPT6-56I488G7onjN4D0DUB-QgVQNBSZpI6Ci6llxzqTitC4ZPzudSwocvr4oLhB3ACUvVX1e_N70jtx9oeubA7EhTjOS0JFkRodk8OQ2Om_7j2TjDokY3xKcnLP9CQ3REOfjYHvXkp9D6snoDoM144q0A9owR3QrginONs3xWJ76bHo_2JPSFAOG9nTxIZk0BI8vi-edGdG9-rNfFp9vbzbX93T96e7h-mpNbSkgUQtcqdJWZWtkszWt6GQNIFum8sq6RrbbipV1A7xjVcuFE4LXvKoAmkZsOReXxWrR7c2opzjsTfylgxn0_dVa5xr2qIFxyVgJP1jG3y149vx9dpj0Pn_QjaPxLsyomRJwHL-CjL79B93lQfgsqVnVqErJDGWKLZTNQ8DouicTDPQxT73kmU1IfcxTq9zDlx7MrP_m4l_K_2l6szTtMIX49ErOnoGopXgECleszA | 
    
| CODEN | COHUAD | 
    
| Cites_doi | 10.1007/11573548_86 10.1109/TASL.2006.876129 10.1016/S0167-6393(99)00032-1 10.1121/1.1458024 10.1016/0304-422X(85)90016-6 10.21437/ICSLP.2000-520 10.1007/978-3-540-30228-5_8 10.1145/2361354.2361394 10.1159/000261938 10.1109/ICHR.2006.321322 10.1016/0167-6393(94)90047-7 10.3115/1220575.1220648 10.21437/Interspeech.2011-783 10.2307/469357 10.21437/Eurospeech.1997-684 10.1037/e309842005-008 10.1177/002383098202500104 10.1016/j.knosys.2004.10.011 10.1023/A:1007506220214 10.1006/csla.1995.0013 10.1136/amiajnl-2011-000784 10.1016/j.specom.2005.03.006 10.2307/468554 10.21437/Eurospeech.2003-586 10.21437/Interspeech.2012-203 10.1007/s10579-011-9140-5 10.1207/s15327973rlsi2903_2 10.2307/2529310 10.1515/9783110316469.84 10.1515/ling.1967.5.37.12 10.1609/aaai.v24i1.7720 10.1007/978-3-642-30220-6_33 10.1007/978-3-642-04447-2_59 10.1162/coli.07-034-R2 10.1197/jamia.M1733  | 
    
| ContentType | Journal Article | 
    
| Copyright | Springer Science+Business Media 2015 Springer Science+Business Media Dordrecht 2015 Distributed under a Creative Commons Attribution 4.0 International License  | 
    
| Copyright_xml | – notice: Springer Science+Business Media 2015 – notice: Springer Science+Business Media Dordrecht 2015 – notice: Distributed under a Creative Commons Attribution 4.0 International License  | 
    
| DBID | AAYXX CITATION 3V. 7SC 7T9 7XB 8AL 8FD 8FE 8FG 8FK 8G5 ABUWG AFKRA AIMQZ ALSLI ARAPS AVQMV AZQEC BENPR BGLVJ CCPQU CPGLG CRLPW DWQXO GB0 GNUQQ GUQSH HCIFZ JQ2 K50 K7- L7M LIQON L~C L~D M0N M1D M2O MBDVC P5Z P62 PEJEM PHGZM PHGZT PKEHL PMKZF PQEST PQGLB PQQKQ PQUKI PRINS PRQQA Q9U 1XC BXJBU  | 
    
| DOI | 10.1007/s10579-015-9306-7 | 
    
| DatabaseName | CrossRef ProQuest Central (Corporate) Computer and Information Systems Abstracts Linguistics and Language Behavior Abstracts (LLBA) ProQuest Central (purchase pre-March 2016) Computing Database (Alumni Edition) Technology Research Database ProQuest SciTech Collection ProQuest Technology Collection ProQuest Central (Alumni) (purchase pre-March 2016) Research Library ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest One Literature Social Science Premium Collection Advanced Technologies & Computer Science Collection Arts Premium Collection ProQuest Central Essentials ProQuest Central Technology Collection ProQuest One Community College Linguistics Collection Linguistics Database ProQuest Central DELNET Social Sciences & Humanities Collection ProQuest Central Student ProQuest Research Library SciTech Premium Collection ProQuest Computer Science Collection Art, Design & Architecture Collection Computer Science Database Advanced Technologies Database with Aerospace ProQuest One Literature Computer and Information Systems Abstracts  Academic Computer and Information Systems Abstracts Professional Computing Database Arts & Humanities Database ProQuest Research Library Research Library (Corporate) ProQuest advanced technologies & aerospace journals ProQuest Advanced Technologies & Aerospace Collection ProQuest One Visual Arts & Design ProQuest Central Premium ProQuest One Academic ProQuest One Academic Middle East (New) ProQuest Digital Collections ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China ProQuest One Social Sciences ProQuest Central Basic Hyper Article en Ligne (HAL) HAL-SHS: Archive ouverte en Sciences de l'Homme et de la Société  | 
    
| DatabaseTitle | CrossRef ProQuest DELNET Social Sciences and Humanities Collection Research Library Prep Computer Science Database ProQuest Central Student Technology Collection Technology Research Database Computer and Information Systems Abstracts – Academic ProQuest One Academic Middle East (New) ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection Computer and Information Systems Abstracts ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College Research Library (Alumni Edition) ProQuest Central China ProQuest Central ProQuest One Applied & Life Sciences Linguistics Collection Arts Premium Collection ProQuest Central Korea ProQuest Research Library ProQuest Central (New) ProQuest Art, Design and Architecture Collection Advanced Technologies Database with Aerospace Advanced Technologies & Aerospace Collection Social Science Premium Collection ProQuest Computing ProQuest One Literature - U.S. Customers Only ProQuest One Social Sciences ProQuest Central Basic ProQuest One Literature ProQuest Computing (Alumni Edition) ProQuest One Academic Eastern Edition Linguistics and Language Behavior Abstracts (LLBA) ProQuest Technology Collection ProQuest SciTech Collection Computer and Information Systems Abstracts Professional ProQuest Digital Collections Advanced Technologies & Aerospace Database ProQuest One Academic UKI Edition Linguistics Database ProQuest One Visual Arts & Design Arts & Humanities Full Text ProQuest One Academic ProQuest One Academic (New) ProQuest Central (Alumni)  | 
    
| DatabaseTitleList | ProQuest DELNET Social Sciences and Humanities Collection Computer and Information Systems Abstracts  | 
    
| Database_xml | – sequence: 1 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Library & Information Science Computer Science  | 
    
| EISSN | 1572-8412 1574-0218  | 
    
| EndPage | 547 | 
    
| ExternalDocumentID | oai:HAL:halshs-01251140v1 3749763221 10_1007_s10579_015_9306_7 24710385  | 
    
| GeographicLocations | France | 
    
| GeographicLocations_xml | – name: France | 
    
| GrantInformation_xml | – fundername: Agence Nationale de la Recherche grantid: ANR-08-CORD-024 funderid: http://dx.doi.org/10.13039/501100001665  | 
    
| GroupedDBID | -DZ .4H .4S .86 .DC 06D 0R~ 0VY 199 203 29L 2J2 2JN 2JY 2KG 2LR 2~H 30V 4.4 406 408 409 40E 5GY 5VS 67Z 6NX 78A 8FE 8FG 8G5 8TC 8UJ 95- 95. 95~ 96X AAAVM AABHQ AACDK AAGAY AAHCP AAHNG AAIAL AAJBT AAJKR AANZL AAPKM AARTL AASML AATNV AATVU AAUYE AAWCG AAWJA AAYIU AAYQN AAYTO ABAKF ABBBX ABBHK ABBRH ABBXA ABDBE ABDZT ABECU ABECW ABFSG ABFTV ABHLI ABHQN ABJNI ABJOX ABKCH ABKTR ABLJU ABMNI ABMQK ABNWP ABQBU ABRTQ ABSXP ABTEG ABTHY ABTKH ABTMW ABUWG ABWNU ABXPI ABXSQ ACAOD ACDTI ACGFO ACGFS ACHSB ACHXU ACKNC ACMDZ ACMLO ACNXV ACOKC ACOMO ACPIV ACREN ACSTC ACZOJ ADHIR ADKNI ADKPE ADPTO ADRFC ADTPH ADULT ADURQ ADYFF ADYOE ADZKW AEBTG AEFQL AEGAL AEGNC AEJHL AEJRE AEKMD AEMSY AENEX AEOHA AEPYU AESKC AETLH AEUPB AEVLU AEXYK AEZWR AFBBN AFDZB AFFHD AFGCZ AFHIU AFKRA AFLOW AFOHR AFQWF AFWTZ AFYQB AFZKB AGAYW AGDGC AGJBK AGMZJ AGQEE AGQMX AGRTI AGWIL AGWZB AGYKE AHAVH AHBYD AHEXP AHPBZ AHSBF AHWEU AHYZX AIAKS AIGIU AIIXL AILAN AIMQZ AITGF AIXLP AJBLW AJRNO AJZVZ ALMA_UNASSIGNED_HOLDINGS ALSLI ALWAN AMKLP AMTXH AMXSW AMYLF AOCGG ARAPS ARCSS ARMRJ ATHPR AVQMV AXYYD AYFIA AYQZM AZFZN AZQEC B-. BA0 BDATZ BENPR BGLVJ BGNMA BPHCQ BSONS CCPQU CPGLG CRLPW CS3 CSCUP DDRTE DL5 DNIVK DPUIP DWQXO EBLON EBS EDO EHI EIOEI EJD ESBYG FEDTE FERAY FFXSO FIGPU FINBP FNLPD FRRFC FSGXE FWDCC GB0 GGCAI GGRSB GJIRD GNUQQ GNWQR GQ7 GQ8 GUQSH GXS H13 HCIFZ HF~ HG5 HG6 HLICF HMHOC HMJXF HQYDN HRMNR HVGLF I-F I09 IJ- IKXTQ IPSME ITM IWAJR IXC IZIGR IZQ I~X I~Z J-C J0Z JAAYA JAB JBMMH JBSCW JCJTX JENOY JHFFW JKQEH JLEZI JLXEF JPL JST JZLTJ K50 K6V K7- KDC KOV LIQON LLZTM M1D M2O M4Y MA- MQGED NB0 NF0 NPVJJ NQJWS NU0 O93 O9G O9I OAM P19 P62 P9Q PEJEM PF- PHGZM PHGZT PMKZF PQGLB PQQKQ PROAC PRQQA PT4 Q2X QF4 QN3 QN7 QOS R89 R9I RHV ROL RPX RSV S16 S27 S3B SA0 SAP SDA SDH SDM SHS SHX SISQX SJYHP SNE SNPRN SNX SOHCF SOJ SPISZ SRMVM SSLCW STPWE SZN T13 TN5 TSG TSK TSV TUC TUS U2A UG4 UOJIU UTJUX UZXMN VC2 VFIZW W23 W48 WK8 YLTOR Z45 ZMTXR ~EX -51 -5C -5G -BR -EM -Y2 -~C 07C 2.D 2P1 2VQ 3EH 3V. AANTL AARHV AAYOK AAYZH ABQSL ABULA ACBXY ADINQ AFEXP AFFNX AHKAY AZRUE BHNFS CAG COF GPZZG GQ6 HZ~ IHE JSODD M0N N2Q NDZJH O9- O9J P-O RIG S1Z S26 S28 SCLPG T16 VQA VXZ Z7X Z83 Z88 Z8R Z8W Z92 ZWUKE AAYXX ADHKG AGQPQ CITATION PUEGO 7SC 7T9 7XB 8AL 8FD 8FK JQ2 L7M L~C L~D MBDVC PKEHL PQEST PQUKI PRINS Q9U 1XC BXJBU  | 
    
| ID | FETCH-LOGICAL-c430t-c02774c64da59bad3f58005d170051f95db6148902f16d23e332826600993b223 | 
    
| IEDL.DBID | U2A | 
    
| ISSN | 1574-020X | 
    
| IngestDate | Tue Oct 14 20:11:37 EDT 2025 Thu Sep 04 20:15:38 EDT 2025 Sat Aug 23 13:28:26 EDT 2025 Wed Oct 01 02:41:54 EDT 2025 Fri Feb 21 02:30:23 EST 2025 Thu Oct 30 12:03:29 EDT 2025  | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| Issue | 3 | 
    
| Keywords | Intonation stylization Annotation scheme Prosody Direct quotations Fairy tale corpus Inter-annotator agreement Expressivity Text-to-speech  | 
    
| Language | English | 
    
| License | Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0 | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-c430t-c02774c64da59bad3f58005d170051f95db6148902f16d23e332826600993b223 | 
    
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23  | 
    
| ORCID | 0000-0001-6490-2386 0000-0002-2629-8752 0000-0003-2154-7438  | 
    
| PQID | 1697675703 | 
    
| PQPubID | 28740 | 
    
| PageCount | 27 | 
    
| ParticipantIDs | hal_primary_oai_HAL_halshs_01251140v1 proquest_miscellaneous_1730057970 proquest_journals_1697675703 crossref_primary_10_1007_s10579_015_9306_7 springer_journals_10_1007_s10579_015_9306_7 jstor_primary_24710385  | 
    
| ProviderPackageCode | CITATION AAYXX  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2015-09-01 | 
    
| PublicationDateYYYYMMDD | 2015-09-01 | 
    
| PublicationDate_xml | – month: 09 year: 2015 text: 2015-09-01 day: 01  | 
    
| PublicationDecade | 2010 | 
    
| PublicationPlace | Dordrecht | 
    
| PublicationPlace_xml | – name: Dordrecht – name: Dordrect  | 
    
| PublicationTitle | Language Resources and Evaluation | 
    
| PublicationTitleAbbrev | Lang Resources & Evaluation | 
    
| PublicationYear | 2015 | 
    
| Publisher | Springer Springer Netherlands Springer Nature B.V Springer Verlag  | 
    
| Publisher_xml | – name: Springer – name: Springer Netherlands – name: Springer Nature B.V – name: Springer Verlag  | 
    
| References | Adda, G., Adda-Decker, M., Gauvain, J. L., & Lamel, L. (1997). Text normalization and speech recognition in French. In Eurospeech (pp. 2711–2714). Klabbers, E., & van Santen, J. (2004). Clustering of foot-based pitch contours in expressive speech. In Proceedings of 5th ISCA speech synthesis workshop (pp. 73–78). Citeseer. Doukhan, D., Rilliard, A., Rosset, S., & D’Alessandro, C. (2012a). Modelling pause duration as a function of contextual length. In Interspeech. Portland, OR. LandisJRKochGGThe measurement of observer agreement for categorical dataBiometrics197733115917410.2307/2529310 Lendvai, P., Declerck, T., Darányi, S., Gervás, P., Hervás, R., Malec, S., & Peinado, F. (2010). Integration of linguistic markup into semantic models of folk narratives: The fairy tale use case. In LREC. Uzuner, O., Bodnari, A., Shen, S., Forbush, T., Pestian, J., & South, B. R. (2012). Evaluating the state of the art in coreference resolution for electronic medical records. Journal of the American Medical Informatics Association,19(5), 786–791. Elson, D. K., & McKeown, K. R. (2010). Automatic attribution of quoted speech in literary narrative. In Proceedings of AAAI. SluijterAMCTerkenJMBBeyond sentence prosody: Paragraph intonation in DutchPhonetica200950318018810.1159/000261938 BarbosaPBaillyGCharacterisation of rhythmic patterns for text-to-speech synthesisSpeech Communication199415112713710.1016/0167-6393(94)90047-7 Doukhan, D., Rosset, S. Rilliard, A., d’Alessandro, C., & Adda-Decker, M. (2012b). Designing french tale corpora for entertaining text to speech synthesis. In N. Calzolari, K. Choukri, T. Declerck, M. U. Doğan, B. Maegaard, J. Mariani, et al. (Eds.), Proceedings of the eighth international conference on language resources and evaluation (lrec’12). Istanbul, Turkey: European Language Resources Association (ELRA). Francisco, V., Hervás, R., Peinado, F., & Gervás, P. (2012). Emotales: Creating a corpus of folk tales with emotional annotations. Language Resources and Evaluation,46(3), 341–381. Mamede, N., & Chaleira, P. (2004). Character identification in children stories. In J. L. Vicedo, P. Martínez-Barco, R. Muńoz & M. Saiz-Noeda (Eds.), Advances in natural language processing Vol. 3230 of Lecture notes in computer science (pp. 82–90). Heidelberg: Springer. Propp, V. (1968). (orig 1928). Morphology of the folktale. Austin: University of Texas Press. Doukhan, D. (2013). Synthèse de parole expressive au delà du niveau de la phrase: le cas du conte pour enfant. PhD dissertation, Université Paris-Sud 11. Gervás, P. (2010). Corpus annotation for narrative generation research: A wish list. In Amicus workshop. AstesanoCBertrandRBrousseauMChafcouloffMDi CristoAGhioAHirstDLapierreSNicolasPRoméasPThe PACOMUST Project, a corpus of multistyle continue speech: Objectives and methodological choicesTravaux de l’institut de Phonétique d’Aix199516938 Malec, S. (2010). Autopropp: Toward the automatic markup, classification, and annotation of Russian magic tales. In First amicus workshop. ArtsteinRPoesioMInter-coder agreement for computational linguisticsComputational Linguistics200834455559610.1162/coli.07-034-R2 Gelin, R., d’Alessandro, C., Le, Q. A., Deroo, O., Doukhan, D., Martin, J. C., et al. (2010). Towards a storytelling humanoid robot. In AAAI fall symposium series on dialog with robots (pp. 137–138). BoersmaPPGPraat, a system for doing phonetics by computerGlot International200259/10341345 Grasbon, D., & Braun, N. (2001). A morphological approach to interactive storytelling. In Proceedings of cast01, living in mixed realities. special issue of netzspannung. org/journal, the magazine for media production and inter-media research (pp. 337–340). Citeseer. Galibert, O., Quintard, L., Rosset, S., Zweigenbaum, P., Nédellec, C., Aubin, S., et al. (2010). Named and specific entity detection in varied data: The quaero named entity baseline evaluation. In Proceedings of LREC, Valletta, Malta, May 2010. European Language Resources Association (ELRA). GoldenJMInterpreting a tale: Three perspectives on text constructionPoetics198514650352410.1016/0304-422X(85)90016-6 HripcsakGRothschildASAgreement, the f-measure, and reliability in information retrievalJournal of the American Medical Informatics Association200512329629810.1197/jamia.M1733 GreimasAJCourtèsJThe cognitive dimension of narrative discourseNew Literary History19767343344710.2307/468554 Fort, K., François, C., Galibert, O., & Ghribi, M. (2012). Analyzing the impact of prevalence on the evaluation of a manual annotation campaign. In Proceedings of the eight international conference on language resources and evaluation (LREC’12). Istanbul, Turquie. GreimasAJDescription and narrativity: “The piece of string”New Literary History198920361562610.2307/469357 Mani, I. (2014). Computational narratology. In P. Hühn, J. C. Meister, J. Pier & W. Schmid (Eds.), Handbook of narratology (pp. 84–92). Berlin/Boston: Walter de Gruyter GmbH. Goh, H.-N., Soon, L.-K., & Haw, S.-C. (2012). Automatic identification of protagonist in fairy tales using verb. In P.-N. Tan, S. Chawla, C. K. Ho, & J. Bailey (Eds.), Advances in knowledge discovery and data mining, Vol. 7302 of Lecture notes in computer science (pp. 395–406). Springer. Alm, C. O., Roth, D., & Sproat, R. (2005). Emotions from text: Machine learning for text-based emotion prediction. In Proceedings of the conference on human language technology and empirical methods in natural language processing (pp. 579–586). Association for Computational Linguistics. GervásPDíaz-AgudoBPeinadoFHervásRStory plot generation based on CBRKnowledge-based systems200518423524210.1016/j.knosys.2004.10.011 HearstMATexttiling: Segmenting text into multi-paragraph subtopic passagesComputational Linguistics19972313364 Rosset, S., Galibert, O., Bernard, G., Bilinski, E., & Adda, G. (2009). The LIMSI multilingual, multitask QAst system. In Proceedings of CLEF 2008 (pp. 480–487). Berlin: Springer. Widlöcher, A., & Mathet, Y. (2012). The glozz platform: A corpus annotation and mining tool. In Proceedings of the 2012 ACM symposium on document engineering. Doceng ’12 (pp. 171–180). New York, NY: ACM. doi:10.1145/2361354.2361394. van Dijk, T. A. (1982). Episodes as units of discourse analysis. In D. Tannen (Ed.), Analyzing discourse: Text and talk (pp. 177–195). Georgetown: Georgetown University Press. BeefermanDBergerALaffertyJStatistical models for text segmentationMachine Learning199934117721010.1023/A:1007506220214 d’AlessandroCMertensPAutomatic pitch contour stylization using a model of tonal perceptionComputer Speech & Language1995925728810.1006/csla.1995.0013 SteinASchmidHÉtiquetage morphologique de textes français avec un arbre de décisionsTraitement automatique des langues1995361–22335 Adda-DeckerMBoula de MareuilPAddaGLamelLInvestigating syllabic structures and their variation in spontaneous FrenchSpeech Communication200546211913910.1016/j.specom.2005.03.006 Grouin, C., Rosset, S., Zweigenbaum, P., Fort, K., Galibert, O., & Quintard, L. (2011). Proposal for an extension of traditional named entities: From guidelines to evaluation, an overview. In Proceedings of the 5th linguistic annotation workshop (pp. 92–100). Association for Computational Linguistics. Declerck, T., & Scheidel, A. (2010). An information extraction approach to the semantic annotation of folktales. In First international AMICUS workshop on automated motif discovery in cultural heritage and scientific communication texts, Vienna, Austria. Fackrell, J., Vereecken, H., Buhmann, J., Martens, J. P., & Van Coile, B. (2000). Prosodic variation with text type. In Proceedings of ICSLP. Stenetorp, P., Pyysalo, S., Topić, G., Ohta, T., Ananiadou, S., & Tsujii, J. (2012). BRAT: A web-based tool for nlp-assisted text annotation. In Proceedings of the demonstrations at the 13th conference of the European chapter of the Association for Computational Linguistics (pp. 102–107). Avignon: Association for Computational Linguistics. http://www.aclweb.org/anthology/E12-2021. BodoAZTodereanGBuzaOTTS experiments: Romanian prosodyActa Technica Napocensis2009502530 HoltEReporting on talk: The use of direct reported speech in conversationResearch on Language and Social Interaction199629321924510.1207/s15327973rlsi2903_2 TheuneMMeijsKHeylenDOrdelmanRGenerating expressive speech for storytelling applicationsIEEE Transactions on Audio, Speech, and Language Processing20061441137114410.1109/TASL.2006.876129 Weiser, S., & Watrin, P. (2012). Extraction of unmarked quotations in newspapers. In Proceedings of the eight international conference on language resources and evaluation (lrec’12). Istanbul: European Language Resources Association (ELRA). GreimasAJSémantique structurale: recherche et méthode1966ParisLarousse BettelheimBThe uses of enchantment1976New YorkAlfred A. Knopf10.1037/e309842005-008 LevinHSchafferCASnowCThe prosodic and paralinguistic features of reading and telling storiesLanguage and Speech198225143 Gauvain, J. L., Adda, G., Adda-Decker, M., Allauzen, A., Gendner, V., Lamel, L., et al. (2005). Where are we in transcribing French broadcast news? In Ninth European conference on speech communication and technology. ISCA. Ronfard, R., & Szilas, N. (2014). Where story and media meet: Computer generation of narrative discourse. In M. A. Finlayson, J. C. Meister & E. G. Bruneau (Eds.), 2014 Workshop on computational models of narrative (pp. 164–176). Dagstuhl: Schloss Dagstuhl—Leibniz-Zentrum fuer Informatik. Galibert, O. (2009). Approches et méthodologies pour la réponse automatique à des questions adaptées à un cadre interactif en domaine ouvert. Ph.D. dissertation. Orsay: Université Paris Sud. Mutlu, B., Forlizzi, J., & Hodgins, J. (2006). A storytelling robot: Modeling and evaluation of human-like gaze behavior. In 2006 6th iEEE-RAS international conference on humanoid robots (pp. 518–523). IEEE. Adda-DeckerMLamelLPronunciation variants across system configuration, language and speaking styleSpeech Communication1999292–4839810.1016/S0167-6393(99)00032-1 El Maarouf, I. 9306_CR6 PPG Boersma (9306_CR14) 2002; 5 9306_CR30 9306_CR5 9306_CR1 9306_CR33 9306_CR31 9306_CR35 9306_CR39 G Hripcsak (9306_CR43) 2005; 12 A Stein (9306_CR59) 1995; 36 M Adda-Decker (9306_CR3) 2005; 46 G Leech (9306_CR47) 1997 K Krippendorff (9306_CR45) 1980 9306_CR44 MA Hearst (9306_CR40) 1997; 23 A Cheveigné De (9306_CR16) 2002; 111 9306_CR48 WO Hendricks (9306_CR41) 1967; 5 B Bettelheim (9306_CR11) 1976 AJ Greimas (9306_CR36) 1966 9306_CR52 9306_CR51 9306_CR50 P Barbosa (9306_CR9) 1994; 15 9306_CR12 9306_CR56 9306_CR55 M Adda-Decker (9306_CR2) 1999; 29 9306_CR54 9306_CR53 D Beeferman (9306_CR10) 1999; 34 9306_CR57 9306_CR19 9306_CR18 9306_CR17 C d’Alessandro (9306_CR15) 1995; 9 JM Golden (9306_CR34) 1985; 14 J Adell (9306_CR4) 2005; 35 C Astesano (9306_CR8) 1995; 16 P Gervás (9306_CR32) 2005; 18 AJ Greimas (9306_CR38) 1976; 7 9306_CR63 9306_CR62 E Holt (9306_CR42) 1996; 29 9306_CR60 9306_CR23 9306_CR22 9306_CR66 9306_CR21 9306_CR65 9306_CR20 9306_CR64 9306_CR27 9306_CR26 9306_CR25 AMC Sluijter (9306_CR58) 2009; 50 9306_CR24 H Levin (9306_CR49) 1982; 25 R Artstein (9306_CR7) 2008; 34 AJ Greimas (9306_CR37) 1989; 20 9306_CR29 9306_CR28 JR Landis (9306_CR46) 1977; 33 AZ Bodo (9306_CR13) 2009; 50 M Theune (9306_CR61) 2006; 14  | 
    
| References_xml | – reference: Fort, K., François, C., Galibert, O., & Ghribi, M. (2012). Analyzing the impact of prevalence on the evaluation of a manual annotation campaign. In Proceedings of the eight international conference on language resources and evaluation (LREC’12). Istanbul, Turquie. – reference: Rosset, S., Galibert, O., Bernard, G., Bilinski, E., & Adda, G. (2009). The LIMSI multilingual, multitask QAst system. In Proceedings of CLEF 2008 (pp. 480–487). Berlin: Springer. – reference: BettelheimBThe uses of enchantment1976New YorkAlfred A. Knopf10.1037/e309842005-008 – reference: Zhang, J., Black, A., & Sproat, R. (2003). Identifying speakers in children’s stories for speech synthesis. In Proceedings of Eurospeech (pp. 2041–2044). – reference: Doukhan, D., Rosset, S. Rilliard, A., d’Alessandro, C., & Adda-Decker, M. (2012b). Designing french tale corpora for entertaining text to speech synthesis. In N. Calzolari, K. Choukri, T. Declerck, M. U. Doğan, B. Maegaard, J. Mariani, et al. (Eds.), Proceedings of the eighth international conference on language resources and evaluation (lrec’12). Istanbul, Turkey: European Language Resources Association (ELRA). – reference: Doukhan, D. (2013). Synthèse de parole expressive au delà du niveau de la phrase: le cas du conte pour enfant. PhD dissertation, Université Paris-Sud 11. – reference: Adda-DeckerMLamelLPronunciation variants across system configuration, language and speaking styleSpeech Communication1999292–4839810.1016/S0167-6393(99)00032-1 – reference: GreimasAJCourtèsJThe cognitive dimension of narrative discourseNew Literary History19767343344710.2307/468554 – reference: Weiser, S., & Watrin, P. (2012). Extraction of unmarked quotations in newspapers. In Proceedings of the eight international conference on language resources and evaluation (lrec’12). Istanbul: European Language Resources Association (ELRA). – reference: Grasbon, D., & Braun, N. (2001). A morphological approach to interactive storytelling. In Proceedings of cast01, living in mixed realities. special issue of netzspannung. org/journal, the magazine for media production and inter-media research (pp. 337–340). Citeseer. – reference: Stenetorp, P., Pyysalo, S., Topić, G., Ohta, T., Ananiadou, S., & Tsujii, J. (2012). BRAT: A web-based tool for nlp-assisted text annotation. In Proceedings of the demonstrations at the 13th conference of the European chapter of the Association for Computational Linguistics (pp. 102–107). Avignon: Association for Computational Linguistics. http://www.aclweb.org/anthology/E12-2021. – reference: BeefermanDBergerALaffertyJStatistical models for text segmentationMachine Learning199934117721010.1023/A:1007506220214 – reference: Mutlu, B., Forlizzi, J., & Hodgins, J. (2006). A storytelling robot: Modeling and evaluation of human-like gaze behavior. In 2006 6th iEEE-RAS international conference on humanoid robots (pp. 518–523). IEEE. – reference: AdellJBonafonteAEscuderoDAnalysis of prosodic features towards modelling of emotional and pragmatic attributes of speechProcesamiento de Lenguaje Natural200535277284 – reference: GoldenJMInterpreting a tale: Three perspectives on text constructionPoetics198514650352410.1016/0304-422X(85)90016-6 – reference: ArtsteinRPoesioMInter-coder agreement for computational linguisticsComputational Linguistics200834455559610.1162/coli.07-034-R2 – reference: Goh, H.-N., Soon, L.-K., & Haw, S.-C. (2012). Automatic identification of protagonist in fairy tales using verb. In P.-N. Tan, S. Chawla, C. K. Ho, & J. Bailey (Eds.), Advances in knowledge discovery and data mining, Vol. 7302 of Lecture notes in computer science (pp. 395–406). Springer. – reference: El Maarouf, I., & Villaneau, J. (2012). A french fairy tale corpus syntactically and semantically annotated. In Proceedings of the eight international conference on language resources and evaluation (lrec’12). Istanbul: European Language Resources Association (ELRA). – reference: BodoAZTodereanGBuzaOTTS experiments: Romanian prosodyActa Technica Napocensis2009502530 – reference: KrippendorffKContent analysis: An introduction to its methodology1980LondonSage – reference: Lendvai, P., Declerck, T., Darányi, S., Gervás, P., Hervás, R., Malec, S., & Peinado, F. (2010). Integration of linguistic markup into semantic models of folk narratives: The fairy tale use case. In LREC. – reference: Ronfard, R., & Szilas, N. (2014). Where story and media meet: Computer generation of narrative discourse. In M. A. Finlayson, J. C. Meister & E. G. Bruneau (Eds.), 2014 Workshop on computational models of narrative (pp. 164–176). Dagstuhl: Schloss Dagstuhl—Leibniz-Zentrum fuer Informatik. – reference: d’AlessandroCMertensPAutomatic pitch contour stylization using a model of tonal perceptionComputer Speech & Language1995925728810.1006/csla.1995.0013 – reference: Francisco, V., Hervás, R., Peinado, F., & Gervás, P. (2012). Emotales: Creating a corpus of folk tales with emotional annotations. Language Resources and Evaluation,46(3), 341–381. – reference: HripcsakGRothschildASAgreement, the f-measure, and reliability in information retrievalJournal of the American Medical Informatics Association200512329629810.1197/jamia.M1733 – reference: Klabbers, E., & van Santen, J. (2004). Clustering of foot-based pitch contours in expressive speech. In Proceedings of 5th ISCA speech synthesis workshop (pp. 73–78). Citeseer. – reference: Adda, G., Adda-Decker, M., Gauvain, J. L., & Lamel, L. (1997). Text normalization and speech recognition in French. In Eurospeech (pp. 2711–2714). – reference: HearstMATexttiling: Segmenting text into multi-paragraph subtopic passagesComputational Linguistics19972313364 – reference: Alm, C., & Sproat, R. (2005). Emotional sequencing and development in fairy tales. In J. Tao, T. Tan & R. W. Picard (Eds.), Affective computing and intelligent interaction Vol. 3784 of Lecture notes in computer science (pp. 668–674). Heidelberg: Springer. – reference: Propp, V. (1968). (orig 1928). Morphology of the folktale. Austin: University of Texas Press. – reference: Galibert, O., Quintard, L., Rosset, S., Zweigenbaum, P., Nédellec, C., Aubin, S., et al. (2010). Named and specific entity detection in varied data: The quaero named entity baseline evaluation. In Proceedings of LREC, Valletta, Malta, May 2010. European Language Resources Association (ELRA). – reference: Mani, I. (2014). Computational narratology. In P. Hühn, J. C. Meister, J. Pier & W. Schmid (Eds.), Handbook of narratology (pp. 84–92). Berlin/Boston: Walter de Gruyter GmbH. – reference: Elson, D. K., & McKeown, K. R. (2010). Automatic attribution of quoted speech in literary narrative. In Proceedings of AAAI. – reference: Galibert, O. (2009). Approches et méthodologies pour la réponse automatique à des questions adaptées à un cadre interactif en domaine ouvert. Ph.D. dissertation. Orsay: Université Paris Sud. – reference: GervásPDíaz-AgudoBPeinadoFHervásRStory plot generation based on CBRKnowledge-based systems200518423524210.1016/j.knosys.2004.10.011 – reference: van Dijk, T. A. (1982). Episodes as units of discourse analysis. In D. Tannen (Ed.), Analyzing discourse: Text and talk (pp. 177–195). Georgetown: Georgetown University Press. – reference: Doukhan, D., Rilliard, A., Rosset, S., & D’Alessandro, C. (2012a). Modelling pause duration as a function of contextual length. In Interspeech. Portland, OR. – reference: Doukhan, D., Rilliard, A., Rosset, S., Adda-Decker, M., & d’Alessandro, C. (2011). Prosodic analysis of a corpus of tales. In Interspeech (pp. 3129–3132). – reference: SluijterAMCTerkenJMBBeyond sentence prosody: Paragraph intonation in DutchPhonetica200950318018810.1159/000261938 – reference: Widlöcher, A., & Mathet, Y. (2012). The glozz platform: A corpus annotation and mining tool. In Proceedings of the 2012 ACM symposium on document engineering. Doceng ’12 (pp. 171–180). New York, NY: ACM. doi:10.1145/2361354.2361394. – reference: Adda-DeckerMBoula de MareuilPAddaGLamelLInvestigating syllabic structures and their variation in spontaneous FrenchSpeech Communication200546211913910.1016/j.specom.2005.03.006 – reference: LandisJRKochGGThe measurement of observer agreement for categorical dataBiometrics197733115917410.2307/2529310 – reference: Passonneau, R. J. (2004). Computing reliability for coreference annotation. In Proceedings of lrec (Vol. 4, pp. 1503–1506). – reference: De CheveignéAKawaharaHYin, a fundamental frequency estimator for speech and musicThe Journal of the Acoustical Society of America200211141917193010.1121/1.1458024 – reference: TheuneMMeijsKHeylenDOrdelmanRGenerating expressive speech for storytelling applicationsIEEE Transactions on Audio, Speech, and Language Processing20061441137114410.1109/TASL.2006.876129 – reference: HoltEReporting on talk: The use of direct reported speech in conversationResearch on Language and Social Interaction199629321924510.1207/s15327973rlsi2903_2 – reference: GreimasAJDescription and narrativity: “The piece of string”New Literary History198920361562610.2307/469357 – reference: Gelin, R., d’Alessandro, C., Le, Q. A., Deroo, O., Doukhan, D., Martin, J. C., et al. (2010). Towards a storytelling humanoid robot. In AAAI fall symposium series on dialog with robots (pp. 137–138). – reference: BarbosaPBaillyGCharacterisation of rhythmic patterns for text-to-speech synthesisSpeech Communication199415112713710.1016/0167-6393(94)90047-7 – reference: BoersmaPPGPraat, a system for doing phonetics by computerGlot International200259/10341345 – reference: Malec, S. (2010). Autopropp: Toward the automatic markup, classification, and annotation of Russian magic tales. In First amicus workshop. – reference: Bod, R., Fisseni, B., Kurji, A., & Löwe, B. (2012). Objectivity and reproducibility of Proppian narrative annotations. In Workshop on computational models of narrative. – reference: GreimasAJSémantique structurale: recherche et méthode1966ParisLarousse – reference: Grouin, C., Rosset, S., Zweigenbaum, P., Fort, K., Galibert, O., & Quintard, L. (2011). Proposal for an extension of traditional named entities: From guidelines to evaluation, an overview. In Proceedings of the 5th linguistic annotation workshop (pp. 92–100). Association for Computational Linguistics. – reference: Gervás, P. (2010). Corpus annotation for narrative generation research: A wish list. In Amicus workshop. – reference: Alm, C. O., Roth, D., & Sproat, R. (2005). Emotions from text: Machine learning for text-based emotion prediction. In Proceedings of the conference on human language technology and empirical methods in natural language processing (pp. 579–586). Association for Computational Linguistics. – reference: Uzuner, O., Bodnari, A., Shen, S., Forbush, T., Pestian, J., & South, B. R. (2012). Evaluating the state of the art in coreference resolution for electronic medical records. Journal of the American Medical Informatics Association,19(5), 786–791. – reference: LevinHSchafferCASnowCThe prosodic and paralinguistic features of reading and telling storiesLanguage and Speech198225143 – reference: Fackrell, J., Vereecken, H., Buhmann, J., Martens, J. P., & Van Coile, B. (2000). Prosodic variation with text type. In Proceedings of ICSLP. – reference: Declerck, T., & Scheidel, A. (2010). An information extraction approach to the semantic annotation of folktales. In First international AMICUS workshop on automated motif discovery in cultural heritage and scientific communication texts, Vienna, Austria. – reference: Gauvain, J. L., Adda, G., Adda-Decker, M., Allauzen, A., Gendner, V., Lamel, L., et al. (2005). Where are we in transcribing French broadcast news? In Ninth European conference on speech communication and technology. ISCA. – reference: AstesanoCBertrandRBrousseauMChafcouloffMDi CristoAGhioAHirstDLapierreSNicolasPRoméasPThe PACOMUST Project, a corpus of multistyle continue speech: Objectives and methodological choicesTravaux de l’institut de Phonétique d’Aix199516938 – reference: Mamede, N., & Chaleira, P. (2004). Character identification in children stories. In J. L. Vicedo, P. Martínez-Barco, R. Muńoz & M. Saiz-Noeda (Eds.), Advances in natural language processing Vol. 3230 of Lecture notes in computer science (pp. 82–90). Heidelberg: Springer. – reference: LeechGCorpus annotation: Linguistic information from computer text corpora1997LondonLongman – reference: HendricksWOOn the notion ‘beyond the sentence’Linguistics1967537125110.1515/ling.1967.5.37.12 – reference: SteinASchmidHÉtiquetage morphologique de textes français avec un arbre de décisionsTraitement automatique des langues1995361–22335 – ident: 9306_CR5 doi: 10.1007/11573548_86 – ident: 9306_CR17 – volume: 14 start-page: 1137 issue: 4 year: 2006 ident: 9306_CR61 publication-title: IEEE Transactions on Audio, Speech, and Language Processing doi: 10.1109/TASL.2006.876129 – volume: 29 start-page: 83 issue: 2–4 year: 1999 ident: 9306_CR2 publication-title: Speech Communication doi: 10.1016/S0167-6393(99)00032-1 – volume: 5 start-page: 341 issue: 9/10 year: 2002 ident: 9306_CR14 publication-title: Glot International – volume: 111 start-page: 1917 issue: 4 year: 2002 ident: 9306_CR16 publication-title: The Journal of the Acoustical Society of America doi: 10.1121/1.1458024 – volume: 14 start-page: 503 issue: 6 year: 1985 ident: 9306_CR34 publication-title: Poetics doi: 10.1016/0304-422X(85)90016-6 – ident: 9306_CR27 – ident: 9306_CR24 doi: 10.21437/ICSLP.2000-520 – volume: 50 start-page: 25 year: 2009 ident: 9306_CR13 publication-title: Acta Technica Napocensis – volume: 16 start-page: 9 year: 1995 ident: 9306_CR8 publication-title: Travaux de l’institut de Phonétique d’Aix – ident: 9306_CR51 doi: 10.1007/978-3-540-30228-5_8 – ident: 9306_CR65 doi: 10.1145/2361354.2361394 – volume-title: Corpus annotation: Linguistic information from computer text corpora year: 1997 ident: 9306_CR47 – ident: 9306_CR56 – volume: 50 start-page: 180 issue: 3 year: 2009 ident: 9306_CR58 publication-title: Phonetica doi: 10.1159/000261938 – ident: 9306_CR18 – volume: 36 start-page: 23 issue: 1–2 year: 1995 ident: 9306_CR59 publication-title: Traitement automatique des langues – volume: 35 start-page: 277 year: 2005 ident: 9306_CR4 publication-title: Procesamiento de Lenguaje Natural – ident: 9306_CR53 doi: 10.1109/ICHR.2006.321322 – ident: 9306_CR30 – ident: 9306_CR28 – volume: 15 start-page: 127 issue: 1 year: 1994 ident: 9306_CR9 publication-title: Speech Communication doi: 10.1016/0167-6393(94)90047-7 – volume: 23 start-page: 33 issue: 1 year: 1997 ident: 9306_CR40 publication-title: Computational Linguistics – ident: 9306_CR6 doi: 10.3115/1220575.1220648 – ident: 9306_CR19 doi: 10.21437/Interspeech.2011-783 – volume: 20 start-page: 615 issue: 3 year: 1989 ident: 9306_CR37 publication-title: New Literary History doi: 10.2307/469357 – ident: 9306_CR1 doi: 10.21437/Eurospeech.1997-684 – ident: 9306_CR55 – volume-title: The uses of enchantment year: 1976 ident: 9306_CR11 doi: 10.1037/e309842005-008 – volume: 25 start-page: 43 issue: 1 year: 1982 ident: 9306_CR49 publication-title: Language and Speech doi: 10.1177/002383098202500104 – ident: 9306_CR21 – volume: 18 start-page: 235 issue: 4 year: 2005 ident: 9306_CR32 publication-title: Knowledge-based systems doi: 10.1016/j.knosys.2004.10.011 – ident: 9306_CR25 – ident: 9306_CR63 – ident: 9306_CR44 – ident: 9306_CR48 – ident: 9306_CR29 – ident: 9306_CR54 – volume: 34 start-page: 177 issue: 1 year: 1999 ident: 9306_CR10 publication-title: Machine Learning doi: 10.1023/A:1007506220214 – ident: 9306_CR31 – volume: 9 start-page: 257 year: 1995 ident: 9306_CR15 publication-title: Computer Speech & Language doi: 10.1006/csla.1995.0013 – ident: 9306_CR62 doi: 10.1136/amiajnl-2011-000784 – ident: 9306_CR12 – ident: 9306_CR35 – volume: 46 start-page: 119 issue: 2 year: 2005 ident: 9306_CR3 publication-title: Speech Communication doi: 10.1016/j.specom.2005.03.006 – volume: 7 start-page: 433 issue: 3 year: 1976 ident: 9306_CR38 publication-title: New Literary History doi: 10.2307/468554 – ident: 9306_CR50 – ident: 9306_CR66 doi: 10.21437/Eurospeech.2003-586 – ident: 9306_CR20 doi: 10.21437/Interspeech.2012-203 – ident: 9306_CR39 – ident: 9306_CR26 doi: 10.1007/s10579-011-9140-5 – volume: 29 start-page: 219 issue: 3 year: 1996 ident: 9306_CR42 publication-title: Research on Language and Social Interaction doi: 10.1207/s15327973rlsi2903_2 – volume: 33 start-page: 159 issue: 1 year: 1977 ident: 9306_CR46 publication-title: Biometrics doi: 10.2307/2529310 – volume-title: Content analysis: An introduction to its methodology year: 1980 ident: 9306_CR45 – volume-title: Sémantique structurale: recherche et méthode year: 1966 ident: 9306_CR36 – ident: 9306_CR52 doi: 10.1515/9783110316469.84 – volume: 5 start-page: 12 issue: 37 year: 1967 ident: 9306_CR41 publication-title: Linguistics doi: 10.1515/ling.1967.5.37.12 – ident: 9306_CR60 – ident: 9306_CR22 – ident: 9306_CR23 doi: 10.1609/aaai.v24i1.7720 – ident: 9306_CR33 doi: 10.1007/978-3-642-30220-6_33 – ident: 9306_CR64 – ident: 9306_CR57 doi: 10.1007/978-3-642-04447-2_59 – volume: 34 start-page: 555 issue: 4 year: 2008 ident: 9306_CR7 publication-title: Computational Linguistics doi: 10.1162/coli.07-034-R2 – volume: 12 start-page: 296 issue: 3 year: 2005 ident: 9306_CR43 publication-title: Journal of the American Medical Informatics Association doi: 10.1197/jamia.M1733  | 
    
| SSID | ssj0042478 ssj0002228  | 
    
| Score | 2.0313702 | 
    
| Snippet | A corpus of French tales is presented. Its two parts, a text corpus and a speech corpus, were designed for studying the relationships between the textual... | 
    
| SourceID | hal proquest crossref springer jstor  | 
    
| SourceType | Open Access Repository Aggregation Database Index Database Publisher  | 
    
| StartPage | 521 | 
    
| SubjectTerms | Annotations Computational Linguistics Computer Science Corpus analysis Corpus linguistics Embedded systems Enrichment European languages Fairy tales French language Humanities and Social Sciences Humanoid Intonation Language and Literature Library and information sciences Linguistics Localization Mathematical models Original Paper Phonemes Prosody Robots Segmentation Social Sciences Speech Speech recognition Speech synthesis Statistical analysis Texts Valence Visual signals Voice recognition  | 
    
| SummonAdditionalLinks | – databaseName: ProQuest Central dbid: BENPR link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LT8MwDLZgXLjwRpSXggQcQBF9JA05IARoY0IwIQRot6pLWo1LN-iG-PnYXTMeElybqI1sx_lS258B9rW26POs5Xh6-FxIi3vOiJD7KYEJNJDTlP5D3nXi9pO46cruDHRcLQylVTqfWDlqOzD0j_wkiDXxjqCBng9fOXWNouiqa6GR1q0V7FlFMTYLcyExYzVg7rLZuX9wvlmEovLNgVSCI1DqujjnpJhOKsodklwjjubqx0k126c8yUnK4g8w-it-Wh1LrSVYqPEku5gYwDLMZMUKLLpeDazeuiuwUxcosENWVyCRRtz4KuyjvbDrZ37b_GB4IR2OSzbIGSLzrGQvBWtRUWB_DZ5azcerNq87KHAjIn_EDUVohYmFTaXupTbKJQJEaYmUTwa5lrZHRKDaD_MgtmGURRFeweKYcGPUQ-SwDo1iUGQbwNLIygoeKOqYrqiVX-anuTEqMtr4vgdHTlrJcEKUkXxRIpNoExRtQqJNlAcHKM_pPKK4bl_cJvis7Jc4j-49wn8PPFivJD6difojRnfpwbZTQVJvujL5MhEP9qbDuF0oBpIW2WCMcyp-fqUVLvjYqe7bK_5a8eb_H9yC-ZAsp0o-24bG6G2c7SBaGfV2axP8BHLL3mE priority: 102 providerName: ProQuest  | 
    
| Title | The GV-LEx corpus of tales in French: Text and speech corpora enriched with lexical, discourse, structural, phonemic and prosodic annotations | 
    
| URI | https://www.jstor.org/stable/24710385 https://link.springer.com/article/10.1007/s10579-015-9306-7 https://www.proquest.com/docview/1697675703 https://www.proquest.com/docview/1730057970 https://shs.hal.science/halshs-01251140  | 
    
| Volume | 49 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVLSH databaseName: SpringerLink Journals customDbUrl: mediaType: online eissn: 1572-8412 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0042478 issn: 1574-020X databaseCode: AFBBN dateStart: 19970101 isFulltext: true providerName: Library Specific Holdings – providerCode: PRVPQU databaseName: Arts & Humanities Database customDbUrl: eissn: 1572-8412 dateEnd: 20171231 omitProxy: false ssIdentifier: ssj0042478 issn: 1574-020X databaseCode: M1D dateStart: 20050201 isFulltext: true titleUrlDefault: https://search.proquest.com/artshumanities providerName: ProQuest – providerCode: PRVPQU databaseName: Linguistics Database customDbUrl: eissn: 1572-8412 dateEnd: 20171231 omitProxy: false ssIdentifier: ssj0042478 issn: 1574-020X databaseCode: CRLPW dateStart: 20050201 isFulltext: true titleUrlDefault: https://search.proquest.com/linguistics providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: http://www.proquest.com/pqcentral?accountid=15518 eissn: 1572-8412 dateEnd: 20241102 omitProxy: true ssIdentifier: ssj0042478 issn: 1574-020X databaseCode: BENPR dateStart: 20050201 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Technology Collection customDbUrl: eissn: 1572-8412 dateEnd: 20241102 omitProxy: true ssIdentifier: ssj0042478 issn: 1574-020X databaseCode: 8FG dateStart: 20050201 isFulltext: true titleUrlDefault: https://search.proquest.com/technologycollection1 providerName: ProQuest – providerCode: PRVAVX databaseName: SpringerLINK - Czech Republic Consortium customDbUrl: eissn: 1572-8412 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0042478 issn: 1574-020X databaseCode: AGYKE dateStart: 19970101 isFulltext: true titleUrlDefault: http://link.springer.com providerName: Springer Nature – providerCode: PRVAVX databaseName: SpringerLink Journals (ICM) customDbUrl: eissn: 1572-8412 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0042478 issn: 1574-020X databaseCode: U2A dateStart: 20050201 isFulltext: true titleUrlDefault: http://www.springerlink.com/journals/ providerName: Springer Nature  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LT8MwDLYYXLjwRpTHFCTgAIrUR9KQ44A9BAMhxGCcqq5pNS4dohvi52N3zXgIDpwiNVYUOY7zpba_ABxobdDnGcPx9HC5kAb3XCJ87sYEJtBATmP6D3l9E3Z64rIv-1Udd2Gz3W1IsvTUX4rdpKLcHsk14lyuarAgic0LjbjnN6z7Fb4o3a8nleCIhfo2lPnbEN8Oo9qQUiGnWYnf8OaPEGl58rRWYKmCjKwxXeNVmEvzNVi2zzGwaneuwV5Vg8COWFVkREq3_etwgCbB2g-823xneOd8mRRslDEE32nBnnPWorq_4Qb0Ws378w6vHkngiQjcMU8oCCuSUJhY6kFsgkwiBpSGePekl2lpBsT1qV0_80LjB2kQ4C0rDAkaBgMEB5swn4_ydAtYHBhZIgBFj6Ireq0vdeMsSVSQ6MR1HTi22opeplwY0SfrMak2QtVGpNpIOXCI-pzJEYt1p9GN8FsxLFCOrjbCffMc2Cw1PpPE9SPSdunArl2CqNpXReSFmthn0E05sD_rxh1BYY44T0cTlCkp-JVWOOETu3Rfhvhrxtv_kt6BRZ8MqUw324X58esk3UN8Mh7UoXbaatdhodF-umpie9a8ub3D9vyue_uIvdfeRb202Q9UuNsD | 
    
| linkProvider | Springer Nature | 
    
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwEB6V9gAXHoWK9AFGajmArCaxHdeHChXYZUu3K4RatDeTjRMtl-y22aXw5_htzGTtLUWCW6-xlVjjseebzMw3ALvGOLzznONoPWIulcMzV8iUxzmBCVSQg5z-Q54Ost65_DhUwxX4FWphKK0y3IntRe0mBf0j308yQ7wjqKBvphecukZRdDW00Mh9awV32FKM-cKOk_LnFbpwzeHxe9zvvTTtds7e9bjvMsALKeIZLyiKKYtMulyZUe5EpRBEKUfEdSqpjHIjIss0cVolmUtFKQS6KVlG2EqMUiI-QBOwJoU06Pytve0MPn0OtkCmsrUFidKSIzAbhrjqonhPacpVUtwgbuf6hmW8M6a8zEWK5A3w-1e8tjWD3Ydw3-NXdrRQuEewUtbr8CD0hmD-qliHHV8QwV4yX_FEGhDGH8Mu6if78IX3Oz8YOsDTecMmFUNPoGzYt5p1qQhx_ATOb0WWG7BaT-ryKbBcONXCEU0d2jW1DizjvCoKLQpTxHEEr4K07HRBzGGvKZhJtBZFa0m0Vkewh_JcziNK7d5R3-KzZtzgPPKzZPw9iWCjlfhyJu4fMcirCLbDFlh_yBt7rZIRvFgO4_GkmEtel5M5zmn7AWijccGvw9b98Yp_rXjz_x98Dnd7Z6d92z8enGzBvZS0qE1824bV2eW83EGkNBs98-rI4Ottn4DfKHMYCA | 
    
| linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1Lb9QwEB6VIiEuLRQqAi0YqeUAcpuHHeMDQhXbZUu3FQeK9mYSO9FWSNmF7PL6af0r_TPMJPGWVoJbD1wTK89vXp6ZbwC2tHao85zjaD1CLqRDmbMi5mFGzgQC5GVG-5BHx-ngRLwbydESnPleGCqr9DqxUdRuYmmPfDdKNfGOIEB3y64s4n2v_3r6hdMEKcq0-nEaLUQOi5_fMXyrXx308F9vx3F__8ObAe8mDHArknDGLWUwhU2Fy6TOM5eUEh0o6Yi0Tkalli4nokwdxmWUujgpkgRDlDQlvyrJYyI9QPV_UwkpSbqOop63AiIWjRWIpBIcXbKRz6i2bXtSUZWS5Bo9dq4u2cQbY6rIbIsjL7m9VzK1jQHsr8K5_3Rt3cvnnfks37G_rrBK_p_f9g6sdH4522sF6S4sFdUarPqZF6xTgWuw2TV6sGes6-QiZPvz92AL5Y69_ciH-z8YBvbTec0mJcMIp6jZacX61Fw5vg8n1_Iu67BcTariAbAscbJxsxRNnlc0ErEIs9JalVhtwzCA5x4LZtoSjpgLamkCjkHgGAKOUQFsI1oW64gqfLA3NHisHte4juJHEX6LAlhv8LRYiegkZnwZwIZHhOmUV20u4BDA08VpVDuUS8qqYjLHNc2cA6UVPvALD8w_LvG3J3747xs-gVuIPDM8OD58BLdjEpGmnm8Dlmdf58UmOoCz_HEjaQw-XTf8fgPg4Ftz | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+GV-LEx+corpus+of+tales+in+French%3A+Text+and+speech+corpora+enriched+with+lexical%2C+discourse%2C+structural%2C+phonemic+and+prosodic+annotations&rft.jtitle=Language+Resources+and+Evaluation&rft.au=Doukhan%2C+David&rft.au=Rosset%2C+Sophie&rft.au=Rilliard%2C+Albert&rft.au=d%27Alessandro%2C+Christophe&rft.date=2015-09-01&rft.pub=Springer&rft.issn=1574-020X&rft.eissn=1572-8412&rft.volume=49&rft.issue=3&rft.spage=521&rft.epage=547&rft_id=info:doi/10.1007%2Fs10579-015-9306-7&rft.externalDocID=24710385 | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1574-020X&client=summon | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1574-020X&client=summon | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1574-020X&client=summon |