CN

Admission for M.A.
Time: Mar 16.2022

The 2022 Master's Enrollment of Language Data Science and Applications of Institute of Corpus Studies and Applications, Shanghai International Studies University

Welcome to apply for the 2022 Master's Degree in Language Data Science and Applications of Institute of Corpus Studies and Applications at Shanghai International Studies University!

01 Introduction to the ICSA 

The Institute of Corpus Studies and Applications is an independent research institute affiliated to Shanghai International Studies University. As the first institute of its like established in China in November 2019, the Institute has persisted to conduct frontier research for further market applications. Its top priority includes the notions of internationalization, interdisciplinarity, and convergence of research and applications, which promote dialogue between technology and humanities, basic research, and its applications. Drawing on the self-established databases and corpus with a specialization in multiple languages parallel corpus, the Institute conducts language-data-based research informing such fields as linguistics, translation studies, wisdom education and language intelligence. Besides, it pioneers in establishing a new MA program of Language Data Science and Applications, with an aim to educate expert talents to meet the increasing needs from the market and the academia.

02  Discipline Introduction

The Institute of Corpus Studies and Applications of Shanghai International Studies University has recently established a new discipline of Language Data Science and Applications in 2020 with 20 master students and 5 doctoral students. Responding to the New Liberal Arts Development Initiative by the Ministry of Education in China and embracing interdisciplinarity, the new discipline aims to educate talents in the field of language intelligence. 

The discipline of Language Data Science and Applications involves multiple disciplines like Information Science, Statistics, Linguistics and Translation. It aims to study various types, states, attributes of language data, so as to reveal the laws behind human language and language behavior and explore the applications of language data in the fields of intelligence education and artificial intelligence. Based on the applications of corpus and database, this discipline carries out language-data-driven researches in language, translation, intelligence education and other related fields of artificial intelligence, so as to realize the organic combination of data science and research in the fields of linguistics, translatology, intelligence education and language intelligence, and reveal and explain the essence of language and translation and promote the applications of language data in the fields of intelligence education and language intelligence. The main research directions are language data and language research, language data and translation research, language data and intelligence education, and language data and artificial intelligence.

Language Data and Language Studies 

On the basis of quantitative research on language, this direction combines multivariate statistics and visualization methods to study semantics, morphology, phonetics, lexicology, syntax and discourse analysis, and formally describe language laws. The specific research fields are corpus linguistics, statistical linguistics, econometric linguistics, and computational linguistics. 

Language Data and Translation Studies 

This direction mainly focuses on corpus-based translation studies, digital humanities and translation studies, as well as the construction of language database or corpus.  

Language Data and Intelligence Education 

As the main carrier of knowledge, language data plays an important role in the input, digestion, processing, output and evaluation of knowledge. This direction combines the mining and analysis technology of big educational data to explore the research and applications of language data in AI enabled education, support scientific decision-making and implement intelligent education. 

Language data and AI 

This direction focuses on language intelligence, machine translation, deep learning and other fields. It is based on massive corpus data, uses the information processing mechanism of artificial intelligence, and promotes the industry-and-research cooperation of language intelligence research through the organic combination of linguistics and artificial intelligence.

03 Subject Goal 

Cultivate high-end talents in the field of language data science and applications 

Cultivate high-end talents in the fields of language data and translation studies, language data and language studies, language data and intelligence education, language data and AI, and language intelligence, which are connected with the big data era and the development of language intelligence industry, so as to alleviate the current shortage of talents in language data science and applications. 

Promote interdisciplinary research in foreign languages and digital humanities 

Give full play to the advantages of foreign language disciplines and data science, and realize the organic integration of linguistics, translation studies, intelligence education, artificial intelligence and data science, so as to promote interdisciplinary research in foreign language disciplines and digital humanities research.

Promote the development of language intelligence industry 

Based on massive corpus data, construct artificial intelligence model algorithms and develop language intelligent products with completely independent intellectual property rights by using the information processing mechanism of artificial intelligence and the organic combination of linguistics and artificial intelligence, so as to rank in the forefront of artificial intelligence development.

04 Object of talent cultivation

◆Master the professional knowledge and skills of language database construction and application, statistical methods, data mining and natural language processing, and have the ability to apply these to study linguistic problems. 

◆Master linguistic theories, and have the ability to apply them to solve tasks of natural language processing, intelligence education and language intelligence.

05 Introduction to representative supervisor

Hu Kaibao, The professor, doctoral supervisor, postdoctoral cooperative supervisor, dean of ICSA, Shanghai International Studies University, full-time scientific researcher, and distinguished professor of the National Major Talent Program.

Han Ziman, The professor, doctoral supervisor, post-doctoral cooperative supervisor, deputy dean of ICSA, Shanghai International Studies University, and full-time scientific researcher.

Hong Huaqing, The professor, doctoral supervisor, post-doctoral cooperative supervisor, and full-time scientific researcher of ICSA, Shanghai International Studies University.

Xu Hongzhi, The Ph.D., assistant researcher, full-time scientific researcher of ICSA, Shanghai International Studies University, and Zhiyuan Young Scholar.

06 Enrollment and research direction

Professional code (name): 0502J2 Language Data Science and Applications

Research directions: 

1. Language data and language studies

2. Language data and translation studies

3. Language data and intelligence education

4. Langauge data and AI

Number of students to be enrolled (the specific enrollment quota depends on the source of students and the development of the university, and there will be an appropriate increase or decrease): 14

07 Registration Information

Time of online pre-registration: September 24 to September 27, 2021, 9:00-22:00 every day. 

Time of online registration: October 5 to October 25, 2021, 9:00-22:00 every day. 

Registration site: https://yz.chsi.cn 

Specialization: Language Data Science and Applications (Master of Academic Degrees, professional code 0502J2) 

Examination date: The preliminary examination is from December 25 to December 26, 2021; The secondary examination is to be determined.

08 If you have any questions, welcome to consult 

Office of Institute of Corpus Studies and Applications of Shanghai International Studies University 

Contact: Director Liu 

Tel: 021-67705180 

Email: 2020215@shisu.edu.cn 

Official website: http://corpus.shisu.edu.cn 

Official account: 上外语料库研究院

ICSA of SISU

September 2022


上海外国语大学语料库研究院2022年语言数据科学与应用专业硕士研究生统考招生启事


欢迎报考上海外国语大学语料库研究院语言数据科学与应用专业2022年硕士研究生!

01

研究院简介

上海外国语大学语料库研究院系上海外国语大学校级跨学科实体研究机构。研究院积极对接国际学术研究前沿和国家发展战略,实施“国际化”、“学科交叉”和“产学研相结合”等发展战略,致力于技术与人文之间的交叉与融合,基础研究和应用研究并重。研究院以语料库和数据库的建设与应用为基础,以多语种平行语料库的建设为重要建设内容,开展语言数据与语言研究、语言数据与翻译研究、语言数据与智慧教育以及语言数据与人工智能等领域的研究,培养对接国家发展重大需求和国际学术研究前沿的语言数据科学与应用领域的高端人才。

02

学科简介

上海外国语大学语料库研究院于2020年新设语言数据科学与应用学科,现有在校硕士研究生20名,博士研究生5名。本学科一方面对接教育部新文科发展战略,顺应当代学术研究交叉与融合的趋势,另一方面积极响应国家人工智能重大发展战略,培养语言智能领域的高端人才。

语言数据科学与应用学科是基于信息科学、统计学、语言学和翻译学的新兴交叉学科,旨在以语料库和数据库的应用为基础,研究语言数据的各种类型、状态、属性及其变化规律,开展语言数据驱动的语言研究、翻译研究、智慧教育以及人工智能相关领域的研究,最大程度地揭示和解释语言和翻译的本质,推进语言数据在智慧教育和人工智能等领域中的应用。语言数据科学与应用学科的研究方向主要为:语言数据与语言研究、语言数据与翻译研究、语言数据与智慧教育以及语言数据与人工智能。

语言数据与语言研究

在对语言进行定量研究的基础之上,本方向结合多元统计和可视化方法,研究语义学、形态学、语音学、词汇学、句法学和话语分析等,对语言规律进行形式化描述,具体研究领域为语料库语言学、计量语言学和计算语言学等。

语言数据与翻译研究

本方向主要关注语料库翻译学、数字人文与翻译研究以及语言数据库或语料库的建设等。

语言数据与智慧教育

语言数据是知识的主要载体,在知识的输入、消化、加工、输出、评价等各阶段至关重要。本方向结合教育大数据的挖掘分析技术,探索语言数据在AI赋能教育上的研究应用,支持科学决策和实施智慧教育。

语言数据与人工智能

专注语音处理、机器翻译和深度学习等领域的研究。本方向基于海量语料数据,利用人工智能的信息加工机制,通过语言学与人工智能的有机结合,推动语言智能研究的产学研合作。

03

学科目标

培养语言数据科学与应用领域高端人才

培养对接大数据时代和语言智能产业发展的语言数据与翻译研究、语言数据与语言研究、语言数据与智慧教育、语言数据与人工智能和语言智能领域的高端人才,以缓解目前语言数据科学与应用人才供不应求的矛盾。

推进外语学科跨学科研究和数字人文研究 

发挥外语学科和数据科学的优势,实现语言学、翻译学、智慧教育、人工智能与数据科学的有机融合,以推进外语学科跨学科研究和数字人文研究。 

推动语言智能产业发展

基于海量语料数据,利用人工智能的信息加工机制,通过语言学与人工智能的有机结合,构建人工智能模型算法,研发具有完全自主知识产权的语言智能产品,跻身人工智能发展的前沿队列。 

04

人才培养目标

◆掌握语言数据库建设与应用、统计方法、数据挖掘和自然语言处理等专业知识和技术,并具有应用这些知识和技术解决语言学问题的能力。

◆掌握语言学理论和知识,并具备将语言学知识应用于解决自然语言处理、智慧教育和语言智能任务的能力。

05

导师代表简介

胡开宝

胡开宝,教授,博士生导师,博士后合作导师,上海外国语大学语料库研究院院长、专职科研人员,入选国家重大人才计划特聘教授。

韩子满

韩子满,教授,博士生导师,博士后合作导师,上海外国语大学语料库研究院副院长、专职科研人员。

洪化清

洪化清,教授,博士生导师,博士后合作导师,上海外国语大学语料库研究院专职科研人员。

许洪志

许洪志,博士,助理研究员,上海外国语大学语料库研究院专职科研人员,志远青年学者。


06

招生人数与研究方向


专业代码名称

研究方向

拟招生人数(具体招生名额视生源情况和学校发展需要确定,会有适量增减)

0502J2
语言数据科学与应用

1.语言数据与语言研究
2.语言数据与翻译研究

3.语言数据与智慧教育
4.语言数据与人工智能

14

07

报考信息

网上预报名时间:2021 年 9 月 24 日至 9 月 27 日,每天 9:00—22:00。

网上报名时间:2021 年 10 月 5 日至 10 月 25 日,每天 9:00—22:00。

报名网址:https://yz.chsi.cn

专业方向:语言数据科学与应用(学术学位硕士,专业代码0502J2)

考试时间:初试2021年12月25日至12月26日;复试待定。

08

如有疑问,欢迎咨询

上海外国语大学语料库研究院办公室

联系人:刘老师

联系电话:021-67705180

电子邮箱:2020215@shisu.edu.cn

研究院官网:http://corpus.shisu.edu.cn

官方微信公众号:上外语料库研究院


上海外国语大学语料库研究院

2021年9月