{"id":8,"date":"2019-02-09T19:46:54","date_gmt":"2019-02-09T18:46:54","guid":{"rendered":"http:\/\/mlia.lip6.fr\/soulier\/?page_id=8"},"modified":"2023-08-01T10:02:51","modified_gmt":"2023-08-01T08:02:51","slug":"research","status":"publish","type":"page","link":"https:\/\/pages.isir.upmc.fr\/soulier\/research\/","title":{"rendered":"Research interests"},"content":{"rendered":"<div class=\"col-xs-12 col-sm-12\">&nbsp;<\/div>\n\n\n<h1 class=\"wp-block-heading alignwide\" id=\"we-re-a-studio-in-berlin-with-an-international-practice-in-architecture-urban-planning-and-interior-design-we-believe-in-sharing-knowledge-and-promoting-dialogue-to-increase-the-creative-potential-of-collaboration\" style=\"font-size:35px;line-height:1.1\">Deep learning in information retrieval (IR) and natural language processing  (NLP)<\/h1>\n\n\n\n<p><\/p>\n\n\n\n<p>My research is motivated by the proposal of new models based on deep learning for information retrieval and automatic natural language processing.<br> The common objective of these models is to process and access textual data. (Large) language models are at the core of my various projects, which have applications related to semantic learning, human-machine interaction (in information retrieval and robotics), and continual learning. Since 2018, I have been involved in two major research projects focusing on data-to-text generation and conversational information retrieval.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Data-to-text generation<\/h2>\n\n\n\n<p>The objective is to generate textual descriptions of structured inputs (tables\/graphs\/&#8230;). This task is particularly interesting in the financial domain, sport journalism, or health since it allows to synthetize and reason over large set of structured data which might be hardly readable for humans.<br>  <br><img loading=\"lazy\" decoding=\"async\" width=\"1828\" height=\"1158\" class=\"wp-image-4684\" style=\"width: 1500px\" src=\"https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.40.39.png\" alt=\"\" srcset=\"https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.40.39.png 1828w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.40.39-300x190.png 300w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.40.39-1024x649.png 1024w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.40.39-768x487.png 768w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.40.39-1536x973.png 1536w\" sizes=\"auto, (max-width: 1828px) 100vw, 1828px\" \/><br><br><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Conversational search<\/h2>\n\n\n\n<p>This work is supported by the <a href=\"https:\/\/sesams.isir.upmc.fr\/\">ANR JCJC SESAMS<\/a>.<br>The objective is to support users&rsquo; search through interactive and proactive systems, anticipating their needs and guiding users for solving their task. Information need refinement\/understanding, belief tracker, dialog systems, and language generation are examples of tasks that can be addressed in this topic.<br><br><img loading=\"lazy\" decoding=\"async\" width=\"1878\" height=\"1162\" class=\"wp-image-4686\" style=\"width: 1500px\" src=\"https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.42.08.png\" alt=\"\" srcset=\"https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.42.08.png 1878w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.42.08-300x186.png 300w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.42.08-1024x634.png 1024w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.42.08-768x475.png 768w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.42.08-1536x950.png 1536w\" sizes=\"auto, (max-width: 1878px) 100vw, 1878px\" \/><br><br><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Language models for robotics<\/h2>\n\n\n\n<p>One emerging hypothesis in robotics is that reinforcement learning algorithms aiming to predict robot actions might be enhanced by the semantics underlying language models. Our objective is to design hybrid models combining RL and LLM to generate instructions in natural language, aiming to guide the action prediction. <br><br><img loading=\"lazy\" decoding=\"async\" width=\"1860\" height=\"1138\" class=\"wp-image-4682\" style=\"width: 1500px\" src=\"https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.39.27.png\" alt=\"\" srcset=\"https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.39.27.png 1860w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.39.27-300x184.png 300w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.39.27-1024x627.png 1024w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.39.27-768x470.png 768w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.39.27-1536x940.png 1536w\" sizes=\"auto, (max-width: 1860px) 100vw, 1860px\" \/><br><br><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Continual learning and domain adaptation<\/h2>\n\n\n\n<p>Language models and, more generally neural models, might suffer from catastrophic forgetting while being fine-tuned on additional data. Our objective is to leverage this limitation while maintaining the knowledge learned on previous tasks.<br><br><img loading=\"lazy\" decoding=\"async\" width=\"1848\" height=\"1122\" class=\"wp-image-4688\" style=\"width: 1500px\" src=\"https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.44.46.png\" alt=\"\" srcset=\"https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.44.46.png 1848w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.44.46-300x182.png 300w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.44.46-1024x622.png 1024w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.44.46-768x466.png 768w, https:\/\/pages.isir.upmc.fr\/soulier\/wp-content\/uploads\/sites\/15\/2023\/08\/Capture-de\u0301cran-2023-08-01-a\u0300-08.44.46-1536x933.png 1536w\" sizes=\"auto, (max-width: 1848px) 100vw, 1848px\" \/><\/p>\n\n\n\n<ul class=\"wp-block-list\" id=\"block-f12d09d0-3d72-4461-aa3f-721762b0af63\">\n<li><\/li>\n<\/ul>\n\n\n<h1>\u00a0<\/h1>\n<h1>Invited talks<\/h1>\n<ul>\n<li>BNF (June 202&Prime;) &#8211; Panelist \u00ab\u00a0Quelles articulations entre les diff\u00e9rentes formes de recommandation, algorithmiques et humaines ?\u00a0\u00bb<\/li>\n<li>NormaSTIC &#8211; Universit\u00e9 de Caen (June 2022) \u00ab\u00a0Data-to-text generation: let your data speak fluently\u00a0\u00bb<\/li>\n<li>LISN &#8211; S\u00e9minaire TLC (February 2022) \u00ab\u00a0Data-to-text generation: let your data speak fluently\u00a0\u00bb<\/li>\n<li>DGA (September 2021) \u00ab\u00a0Recherche d\u2019information neuronale: Enjeux et perspectives\u00a0\u00bb<\/li>\n<li><a href=\"https:\/\/europe.naverlabs.com\/\">NaverlLabs<\/a> (July 2021) \u00ab\u00a0Data-to-text generation: let your data speak fluently\u00a0\u00bb<\/li>\n<li><a href=\"https:\/\/gdr-tal.ls2n.fr\/etal-2021\/\">Summer school ETAL<\/a> (June 2021): Lecturer &#8211; \u00ab\u00a0Information retrieval models\u00a0\u00bb and practical activities<\/li>\n<li><a href=\"https:\/\/www.google.com\/search?q=%22THL+et+multimodalit%C3%A9&amp;oq=%22THL+et+multimodalit%C3%A9&amp;aqs=chrome..69i57j33.359j0j4&amp;sourceid=chrome&amp;ie=UTF-8\">\u00ab\u00a0THL et multimodalit\u00e9\u00a0\u00bb<\/a> Days &#8211; THL\/AFIA (oct 2020): \u00ab\u00a0From multimodal representation learning to multimodal information access\u00a0\u00bb<\/li>\n<li>LIS seminar &#8211; Marseille (December 2020): \u00ab\u00a0From multimodal representation learning to multimodal information access\u00a0\u00bb<\/li>\n<li><a href=\"http:\/\/www.sphere.univ-paris-diderot.fr\/spip.php?article2385\" data-rich-text-format-boundary=\"true\">PhisIA seminar<\/a> &#8211; Univ Paris Diderot (nov 2019): \u00ab\u00a0<em>Le symbolique au service du connexionnisme et vice-versa\u00a0: apprentissage de repr\u00e9sentation augment\u00e9, extraction d\u2019information et bases de connaissances<\/em>\u00ab\u00a0<\/li>\n<li><a href=\"https:\/\/eric.msh-lse.fr\/seminaire-laure-soulier-lip6\/\">ERIC lab seminar<\/a> (oct 2019): \u00ab\u00a0<em>Apprentissage de repr\u00e9sentations textuelles augment\u00e9es bases de connaissances: application \u00e0 la Recherche d\u2019information<\/em>\u00ab\u00a0<\/li>\n<li>GDR IA &#8211; 2019: \u00ab\u00a0Ancrage visuel et conceptuel du texte pour l&rsquo;apprentissage de repr\u00e9sentation\u00a0\u00bb<\/li>\n<li>Panelist for the Pr\u00e9-GDR TAL (March 2019)<\/li>\n<li>Laboratoire ERIC \u00ab\u00a0De la Recherche d&rsquo;information collaborative \u00e0 la recherche d&rsquo;information socio-collaborative : fondements, mod\u00e8les et perspectives\u00a0\u00bb<\/li>\n<\/ul>\n<h2>\u00a0<\/h2>\n<h1>Projects<\/h1>\n<ul>\n<li>2022-2026: ANR PRCE ACDC. Data-to-text generation<br \/>Consortium: MLIA@Sorbonne, LAMSADE@ParisDauphine\/PSL MHNH@Sorbonne, Recital<\/li>\n<li>2019-2024:\u00a0 <a href=\"http:\/\/mlia.lip6.fr\/sesams\">ANR JCJC SESAMS<\/a>. Search-oriented Conversational systems\u00a0 \u00a0 \u00a0 &#8212; <strong>Coordinator\u00a0<\/strong><br \/>Consortium: Vincent Guigue (MLIA-LIP6), Ludovic Denoyer (FAIR Paris), Jian-Yun Nie (Univ. Montr\u00e9al Canada), Philippe Preux (Univ. Lille)<\/li>\n<li>2014-2019: <a href=\"http:\/\/muster.lip6.fr\">CHIST-ERA MUSTER<\/a>. Ground language in perception (visual inputs) and extract representations of meaning tied to the physical world.<br \/><em>Consortium:<\/em> KU Leuven, Belgium; ETH Zurich, Switzerland; LIP6 UPMC, France; University of the Basque Country, Spain<\/li>\n<li>2014-2015: PEPS CNRS. EXPloration sur l&rsquo;usage des m\u00e9dias sociaux pour un Acc\u00e8s Collaboratif \u00e0 l&rsquo;information.<br \/>Consortium: IRIT- SIG ; CNRS March Bloch, Berlin ; Maths, SMMA, Universit\u00e9 Paris Sorbonne<\/li>\n<\/ul>\n<ul>\n<li style=\"list-style-type: none\">\u00a0<\/li>\n<\/ul>","protected":false},"excerpt":{"rendered":"<p>&nbsp; Deep learning in information retrieval (IR) and natural language processing (NLP) My research is motivated by the proposal of new models based on deep learning for information retrieval and automatic natural language processing. The common objective of these models is to process and access textual data. (Large) language models are at the core of [&hellip;]<\/p>\n","protected":false},"author":260,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-8","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/pages.isir.upmc.fr\/soulier\/wp-json\/wp\/v2\/pages\/8","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pages.isir.upmc.fr\/soulier\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/pages.isir.upmc.fr\/soulier\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/pages.isir.upmc.fr\/soulier\/wp-json\/wp\/v2\/users\/260"}],"replies":[{"embeddable":true,"href":"https:\/\/pages.isir.upmc.fr\/soulier\/wp-json\/wp\/v2\/comments?post=8"}],"version-history":[{"count":12,"href":"https:\/\/pages.isir.upmc.fr\/soulier\/wp-json\/wp\/v2\/pages\/8\/revisions"}],"predecessor-version":[{"id":4710,"href":"https:\/\/pages.isir.upmc.fr\/soulier\/wp-json\/wp\/v2\/pages\/8\/revisions\/4710"}],"wp:attachment":[{"href":"https:\/\/pages.isir.upmc.fr\/soulier\/wp-json\/wp\/v2\/media?parent=8"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}