基于情感词典和深度学习的乳腺癌患者细粒度文本情感分类模型的构建

注册号:

Registration number:

ChiCTR2400092433 

最近更新日期:

Date of last update:

2024-11-15 16:44:07 

注册时间:

Date of Registration:

2024-11-15 00:00:00 

注册号状态:

补注册

Registration Status:

Retrospective registration

注册题目:

基于情感词典和深度学习的乳腺癌患者细粒度文本情感分类模型的构建

Public title:

The construction of fine-grained text sentiment classification model for breast cancer patients based on emotional lexicon and deep learning

注册题目简写:

English Acronym:

研究课题的正式科学名称:

基于情感词典和深度学习的乳腺癌患者细粒度文本情感分类模型的构建

Scientific title:

The construction of fine-grained text sentiment classification model for breast cancer patients based on emotional lexicon and deep learning

研究课题代号(代码):

Study subject ID:

在二级注册机构或其它机构的注册号:

The registration number of the Partner Registry or other register:

申请注册联系人:

邓师思 

研究负责人:

吴艳妮 

Applicant:

Shisi Deng 

Study leader:

Yanni Wu 

申请注册联系人电话:

Applicant telephone:

+86 156 7517 2337

研究负责人电话:

Study leader's telephone:

+86 188 1886 0076

申请注册联系人传真 :

Applicant Fax:

研究负责人传真:

Study leader's fax:

申请注册联系人电子邮件:

Applicant E-mail:

2609357139@qq.com

研究负责人电子邮件:

Study leader's E-mail:

yanniwuSMU@126.com

申请单位网址(自愿提供):

Applicant website(voluntary supply):

研究负责人网址(自愿提供):

Study leader's website(voluntary supply):

申请注册联系人通讯地址:

广州市白云区广州大道北1838号

研究负责人通讯地址:

广州市白云区广州大道北1838号

Applicant address:

No. 1838, Guangzhou Avenue North, Baiyun District, Guangzhou

Study leader's address:

No. 1838, Guangzhou Avenue North, Baiyun District, Guangzhou

申请注册联系人邮政编码:

Applicant postcode:

研究负责人邮政编码:

Study leader's postcode:

申请人所在单位:

南方医科大学南方医院

Applicant's institution:

Nanfang Hospital, Southern Medical University

研究负责人所在单位:

南方医科大学南方医院

Affiliation of the Leader:

Nanfang Hospital, Southern Medical University

是否获伦理委员会批准:

Approved by ethic committee:

Yes

伦理委员会批件文号:

Approved No. of ethic committee:

NFEC-2023-585

伦理委员会批件附件:

Approved file of Ethical Committee:


批准本研究的伦理委员会名称:

南方医科大学南方医院医学伦理委员会

Name of the ethic committee:

Medical Ethics Committee of Nanfang Hospital, Southern Medical University

伦理委员会批准日期:

Date of approved by ethic committee:

2023-12-26 00:00:00

伦理委员会联系人:

胡兴媛

Contact Name of the ethic committee:

Xingyuan Hu

伦理委员会联系地址:

广州市广州大道北 1838 号

Contact Address of the ethic committee:

No. 1838, Guangzhou Avenue North, Guangzhou

伦理委员会联系人电话:

Contact phone of the ethic committee:

+86 20 6278 7238

伦理委员会联系人邮箱:

Contact email of the ethic committee:

研究实施负责(组长)单位:

南方医科大学南方医院

Primary sponsor:

Nanfang Hospital, Southern Medical University

研究实施负责(组长)单位地址:

广州市白云区广州大道北1838号

Primary sponsor's address:

No. 1838, Guangzhou Avenue North, Baiyun District, Guangzhou

试验主办单位(项目批准或申办者):

Secondary sponsor:

国家:

中国

省(直辖市):

广东

市(区县):

广州

Country:

China

Province:

Guangdong

City:

Guangzhou

单位(医院):

南方医科大学南方医院

具体地址:

广州大道北1838 号

Institution (hospital):

Nanfang Hospital, Southern Medical University

Address:

No. 1838, Guangzhou Avenue North, Guangzhou

经费或物资来源:

国家自然科学基金青年基金(no. 72304131)

Source(s) of funding:

National Natural Science Foundation of China, Young Scientists Fund (no. 72304131)

研究疾病:

乳腺癌  

Target disease:

Breast cancer

研究疾病代码:

Target disease code:

研究类型:

观察性研究

Study type:

Observational study

研究所处阶段:

探索性研究/预试验 

Study phase:

0 (exploratory research / pilot study)

研究设计:

横断面 

Study design:

Cross-sectional 

研究目的:

通过分布式爬虫方法获取乳腺癌患者在国内最大社交网络平台之一的新浪微博发布的文本语料,基于课题组前期构建的乳腺癌患者领域情感词典,结合深度学习模型对所获取语料进行情感分类,以构建乳腺癌患者细粒度情感分类模型。  

Objectives of Study:

Text posted by breast cancer patients on Sina Weibo, one of the largest social networking platforms in China, is collected with a distributed web crawler. The collected corpus is then classified by sentiment using a deep learning model combined with the breast cancer domain sentiment lexicon that the research group built in earlier work. The aim is to construct a fine-grained sentiment classification model for breast cancer patients.
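As a rough illustration of how a sentiment lexicon can feed a classifier, the sketch below turns a tokenized post into per-category lexicon features. It is only a minimal sketch under stated assumptions: the toy lexicon, the category names, and the function `lexicon_features` are hypothetical and are not taken from this record, and the study's actual lexicon and deep learning model are not described here.

```python
from collections import Counter

# Hypothetical toy lexicon: word -> emotion category. The real domain lexicon
# built by the study group is not reproduced in this record.
TOY_LEXICON = {
    "afraid": "fear",
    "worried": "fear",
    "grateful": "joy",
    "hopeful": "joy",
    "tired": "sadness",
}
CATEGORIES = ["fear", "joy", "sadness"]


def lexicon_features(tokens, lexicon=TOY_LEXICON, categories=CATEGORIES):
    """Return the share of tokens that match each emotion category."""
    counts = Counter(lexicon[t] for t in tokens if t in lexicon)
    total = len(tokens) or 1  # avoid division by zero on empty posts
    return [counts[c] / total for c in categories]


# One of four tokens is a "fear" word and one is a "joy" word.
features = lexicon_features(["worried", "but", "hopeful", "today"])
print(features)
```

In a hybrid setup, a feature vector like this could be concatenated with the text representation produced by a neural encoder before the final classification layer. That is one common design, not necessarily the one used in this study.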

药物成份或治疗方案详述:

 

Description for medicine or protocol of treatment in detail:

 

纳入标准:

Inclusion criteria:

排除标准:

① 内容有权限或无法访问的帖子; ② 重复的帖子。

Exclusion criteria:

1. Posts whose content is access-restricted or otherwise inaccessible; 2. Duplicate posts.
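The two exclusion rules above could be applied to crawled posts roughly as follows. This is a sketch only: the post structure (dictionaries with `text` and `accessible` keys) and the function name `apply_exclusions` are assumptions made for illustration, and treating posts as duplicates when their whitespace-normalized text is identical is one possible definition that the record does not specify.

```python
def apply_exclusions(posts):
    """Drop inaccessible posts and duplicates, keeping the first occurrence."""
    seen = set()
    kept = []
    for post in posts:
        text = post.get("text")
        # Rule 1: skip posts that are access-restricted or have no retrievable text.
        if not post.get("accessible", True) or not text:
            continue
        # Rule 2: skip posts whose whitespace-normalized text was already seen.
        key = " ".join(text.split())
        if key in seen:
            continue
        seen.add(key)
        kept.append(post)
    return kept


sample = [
    {"text": "Chemo day three, feeling tired", "accessible": True},
    {"text": "Chemo day three,  feeling tired", "accessible": True},  # duplicate
    {"text": None, "accessible": False},                               # inaccessible
    {"text": "Results came back clear", "accessible": True},
]
print(len(apply_exclusions(sample)))
```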

研究实施时间:

Study execute time:

From 2023-06-01 00:00:00 To 2025-06-01 00:00:00  

征募观察对象时间:

Recruiting time:

From 2023-12-30 00:00:00 To 2024-06-01 00:00:00

干预措施:

Interventions:

组别:

训练组

样本量:

10500

Group:

training group

Sample size:

干预措施:

干预措施代码:

0

Intervention:

No intervention

Intervention code:

组别:

测试组

样本量:

4500

Group:

test group

Sample size:

干预措施:

干预措施代码:

Intervention:

No intervention

Intervention code:

研究实施地点:

Countries of recruitment and research settings:

国家:

中国

省(直辖市):

广东 

市(区县):

广州 

Country:

China

Province:

Guangdong

City:

Guangzhou

单位(医院):

南方医科大学南方医院 

单位级别:

三甲 

Institution (hospital):

Nanfang Hospital, Southern Medical University

Level of the institution:

Tertiary A

测量指标:

Outcomes:

指标中文名:

情感值

指标类型:

主要指标

Outcome:

emotion value

Type:

Primary indicator

测量时间点:

测量方法:

Measure time point of outcome:

Measure method:

采集人体标本:

Collecting sample(s) from participants:

标本中文名:

组织:

Sample Name:

none

Tissue:

人体标本去向

其它  

说明

Fate of sample:

Others

Note:

征募研究对象情况:

Recruiting status:

结束/Completed

年龄范围:

Participant age:

最小 Min age years
最大 Max age years

性别:

男女均可

Gender:

Both

随机方法(请说明由何人用什么方法产生随机序列):

Randomization Procedure (please state who generates the random number sequence and by what method):

None

是否公开试验完成后的统计结果:

Calculated Results after the Study Completed public access:

公开/Public

盲法:

Blinding:

None

试验完成后的统计结果(上传文件):

Calculated Results after the Study Completed (upload file):

是否共享原始数据:

IPD sharing

是/Yes

共享原始数据的方式(说明:请填入公开原始数据日期和方式,如采用网络平台,需填该网络平台名称和网址):

2025年1月 ResMan, http://www.medresman.org.cn/login.aspx

Method of sharing IPD (include metadata and protocol; if a web-based public database is used, please provide the URL):

January 2025 ResMan, http://www.medresman.org.cn/login.aspx

数据采集和管理(说明:数据采集和管理由两部分组成,一为病例记录表(Case Record Form, CRF),二为电子采集和管理系统(Electronic Data Capture, EDC),如ResMan即为一种基于互联网的EDC:

病例报告表上有关受试者数据应以受试者编码方式记录,受试者只能通过受试者编码或其姓名首字母缩写识别,确定数据后进行数据锁定,并上交主要研究者,采用专人进行数据管理。

Data collection and management (a standard data collection and management system includes a CRF and an electronic data capture system):

Subject data on the case report form should be recorded under a subject code. Subjects can be identified only by their subject code or name initials. Once confirmed, the data are locked and submitted to the principal investigator, and dedicated personnel manage the data.

数据与安全监察委员会:

Data and Safety Monitoring Committee:

暂未确定/Not yet

注册人:

Name of Registration:

 2024-11-15 16:43:44