Course Introduction
Feature representation of different modalities is the main focus of current cross-modal information retrieval research. Existing models typically project texts and images into the same embedding space. In this talk, we will introduce some basic ideas of text and image modeling and how we can build cross-modal relations using deep learning models. In detail, we will discuss a joint model that uses metric learning to minimize the distance between representations of the same content from different modalities. We will also introduce some recent research developments in image captioning and visual question answering (VQA).
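To make the metric-learning idea above concrete: two encoders map images and text into a shared embedding space, and a hinge-based ranking loss pushes matching image-text pairs closer together than mismatched in-batch pairs. The following is a minimal PyTorch sketch, not the course's actual model; the encoder architectures, the feature dimensions (2048-d image features, 300-d text features), and the `triplet_ranking_loss` helper are all illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Stand-in encoders: in practice the image branch would be a CNN and the
# text branch an RNN/Transformer over word embeddings; simple linear
# projections keep this sketch self-contained and runnable.
class ImageEncoder(nn.Module):
    def __init__(self, in_dim=2048, embed_dim=256):
        super().__init__()
        self.fc = nn.Linear(in_dim, embed_dim)

    def forward(self, x):
        # L2-normalize so cosine similarity reduces to a dot product.
        return F.normalize(self.fc(x), dim=-1)

class TextEncoder(nn.Module):
    def __init__(self, in_dim=300, embed_dim=256):
        super().__init__()
        self.fc = nn.Linear(in_dim, embed_dim)

    def forward(self, x):
        return F.normalize(self.fc(x), dim=-1)

def triplet_ranking_loss(img_emb, txt_emb, margin=0.2):
    """Hinge-based ranking loss over in-batch negatives: a true
    image-text pair must score at least `margin` higher than any
    mismatched pair in the batch."""
    scores = img_emb @ txt_emb.t()        # (B, B) cosine similarities
    pos = scores.diag().unsqueeze(1)      # similarities of true pairs
    # Penalize negative captions scoring within `margin` of the positive...
    cost_txt = (margin + scores - pos).clamp(min=0)
    # ...and likewise negative images.
    cost_img = (margin + scores - pos.t()).clamp(min=0)
    mask = torch.eye(scores.size(0), dtype=torch.bool)
    cost_txt = cost_txt.masked_fill(mask, 0)
    cost_img = cost_img.masked_fill(mask, 0)
    return cost_txt.mean() + cost_img.mean()

# Toy usage: random tensors stand in for CNN image features and
# averaged word vectors.
img_enc, txt_enc = ImageEncoder(), TextEncoder()
imgs, txts = torch.randn(8, 2048), torch.randn(8, 300)
loss = triplet_ranking_loss(img_enc(imgs), txt_enc(txts))
loss.backward()
print(loss.item())
```

After training with such a loss, cross-modal retrieval is just nearest-neighbor search: embed a text query and rank all image embeddings by cosine similarity.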
【Workshop Outline】
1. The semantic gap
2. Image modeling and CNNs
3. Text modeling and word embeddings
4. Joint models
5. Automatic annotation
6. Text generation
7. Visual question answering
Learning Outcomes
Learn about frontier research in deep learning: how to use deep learning to jointly model image and text information, and how to build cross-modal semantic search and visual question answering systems.
Target Audience