Evaluation of Representative Papers: Based on Nobel Prize Papers
This paper analyzes Nobel Prize-winning papers in Physiology or Medicine from a bibliometric perspective to explore the characteristics of representative achievements in international medicine, and introduces citing-literature indicators as a supplementary evaluation to inform the selection of representative works. Using the Web of Science and InCites databases, we collected and analyzed indicator data for the prize-winning papers. For prize-winning papers that did not reach the expected citation count, we introduced the TOPCM indicator, which defines them as under-cited but influential publications. Most Nobel Prize papers were published in high-impact-factor journals and had long cited half-lives and high citation counts, and the great majority had high CNCI values. Nobel Prize papers with insufficient citations had relatively high TOPCM values, and the CNCI values of their citing literature did not differ significantly from those of highly cited Nobel Prize papers. We suggest that the selection of representative works take the source journal into account, distinguishing top journals from ordinary SCI journals, and that CNCI can serve as an important selection indicator. Adding citation indicators for the citing literature can effectively offset the limitation of relying only on the citation indicators of the paper itself. In addition, given factors such as the publication cycle, the evaluation of representative works should combine quantitative and qualitative methods.
李芙蓉, 丁佐奇.
Li Furong, Ding Zuoqi.
1 Introduction
2 Data Sources and Research Methods
2.1 Data Sources
2.2 Research Methods
2.2.1 Indicator Selection
2.2.2 Selection of the TOPCM Indicator
$\operatorname{TOPCM}_{2}(A)=\mu_{1}$
$\operatorname{TOPCM}_{3}(A)=\frac{2}{3}\left(\mu_{1}+\frac{1}{2}\mu_{2}\right)=\frac{2}{3}\mu_{1}+\frac{1}{3}\mu_{2}$
$\operatorname{TOPCM}_{n+1}(A)=\sum_{i=1}^{n}\frac{1}{i!}\mu_{i} \Big/ \sum_{i=1}^{n}\frac{1}{i!}$
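The general formula is a weighted average of the per-generation values μ1, μ2, ... with weights 1/i!. A minimal Python sketch, assuming the μ values are supplied as a plain list (the function name `topcm` and the list interface are illustrative and not taken from the source):

```python
from math import factorial


def topcm(mus):
    """Weighted mean of mus[0], mus[1], ... with weights 1/1!, 1/2!, ...

    `mus` holds mu_1 .. mu_n, so the result corresponds to TOPCM(n+1)
    in the notation above. The name and interface are illustrative.
    """
    if not mus:
        raise ValueError("at least one mu value is required")
    weights = [1 / factorial(i) for i in range(1, len(mus) + 1)]
    weighted_sum = sum(w * mu for w, mu in zip(weights, mus))
    return weighted_sum / sum(weights)


# mu_1 and mu_2 for the three papers reported in Section 3.1.6.
print(round(topcm([783, 728.5]), 1))  # Allison
print(round(topcm([866, 533]), 1))    # Honjo
print(round(topcm([619, 952]), 1))    # Young
```

With two generations the weights reduce to 2/3 and 1/3, matching the second formula, and the three calls give 764.8, 755.0 and 730.0, in line with the TOPCM column reported for those papers.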
Figure 2
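3 Results
3.1 Bibliometric Characteristics
3.1.1 Nobel Prize papers generally have high total citation counts
Figure 3 shows the distribution of citation counts for the Nobel Prize papers. Citation counts span a wide range, from 21 to 14,996; the most populated interval is 1,000 to 5,000, and three quarters of the papers have been cited at least 500 times. Three Nobel Prize papers have more than 10,000 citations, and three have fewer than 100. Of the three papers with fewer than 100 citations, one is an editorial on in vitro fertilization published in 2001 by Edwards, the 2010 laureate. His key breakthrough papers were concentrated in the 1970s, and this editorial is a later summary, which may explain why it did not attract more citations. The other two are papers on nuclear magnetic resonance by the 2003 laureates, who made seminal discoveries in using magnetic resonance to visualize different structures. These discoveries led to the development of modern magnetic resonance imaging (MRI), a breakthrough for medical diagnostics and research. When the papers were published, however, medical applications of magnetic resonance were still immature, which may be why they were not cited more. Papers cited between 100 and 300 times did not receive citation counts commensurate with their high impact, and we analyze them in detail later using a new indicator. Overall, Nobel Prize papers generally have high citation counts.
We define the interval between a Nobel Prize paper's publication and the award as its test period. As Figure 4 shows, the test period is roughly 10 to 30 years. Among papers with a test period of less than 10 years, two laureates' papers were cited more than 10,000 times, so a small number of laureates with extremely high citation counts received the Nobel Prize within a very short time. Shinya Yamanaka, for example, took only 6 years. Citation count and test period show an inverse trend.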
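3.1.2 Most Nobel Prize papers have long cited half-lives
Cited half-life reflects how quickly a publication ages: the longer the cited half-life, the more lasting its influence. For journals, for example, research journals generally have relatively long cited half-lives, whereas journals focused on timely content have relatively short ones. We computed the cited half-life of each Nobel Prize paper and its ratio to the total number of years since publication. As Figure 6 shows, most Nobel Prize papers have long cited half-lives: 55 papers have a cited half-life of 10 years or more, and 26 of 20 years or more. Because the Nobel Prize papers were published relatively long ago, we also calculated cited half-life as a share of the total years since publication: for 28 papers it exceeds 50%, and for 60 papers it exceeds 30%.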
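The ratio described above can be sketched in Python. The source does not spell out how the cited half-life of a single paper is computed, so this sketch assumes one reading: the number of years after publication needed to accumulate half of the citations received to date. The function names and the yearly counts are made up for illustration.

```python
def cited_half_life(yearly_citations):
    """Years needed to reach half of all citations received so far.

    yearly_citations[0] is the publication year, [1] the following year, etc.
    This definition is an assumption; the source does not state its formula.
    Returns None when the paper has no citations.
    """
    total = sum(yearly_citations)
    if total == 0:
        return None
    running = 0
    for years_elapsed, count in enumerate(yearly_citations, start=1):
        running += count
        if 2 * running >= total:
            return years_elapsed


def half_life_share(yearly_citations):
    """Cited half-life as a fraction of the years since publication."""
    half_life = cited_half_life(yearly_citations)
    if half_life is None:
        return None
    return half_life / len(yearly_citations)


# Hypothetical paper tracked for 10 years (not data from the study).
counts = [2, 5, 8, 10, 12, 12, 11, 10, 9, 9]
print(cited_half_life(counts), half_life_share(counts))
```

Under this reading, a paper whose citations keep arriving late in its life gets both a longer half-life and a larger share, which is the pattern the paragraph treats as a sign of lasting influence.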
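3.1.3 Most Nobel Prize papers were published in high-impact journals
3.1.4 Nobel Prize papers generally have very high CNCI values
3.1.5 The citing literature of Nobel Prize papers generally has high CNCI values
Because a small number of high-quality papers are not highly cited, we consider the quality of the citing literature to compensate for the shortcomings of citation indicators. We collected the CNCI values of 28 Nobel Prize papers together with the CNCI values and mean citations per paper of their citing literature; the results are shown in Table 1. Apart from paper 28, an editorial, the citing literature of the other 27 Nobel Prize papers generally has high CNCI values, with an average of 2.04. We divided the 28 Nobel Prize papers into two groups by citation count and found no significant difference between the groups in either the CNCI values or the mean citations per paper of the citing literature.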
Table 1 Citing-literature indicators for Nobel Prize papers
Citation group | No. | Award year | Publication year | CNCI (Nobel Prize paper) | CNCI (citing literature) | Mean citations per paper (citing literature) |
Group 1 | 1 | 2012 | 2006 | 210.50 | 1.32 | 40.50 |
Group 1 | 2 | 2006 | 1998 | 114.42 | 1.42 | 60.20 |
Group 1 | 3 | 2011 | 1998 | 107.77 | 2.02 | 93.60 |
Group 1 | 4 | 2019 | 1995 | 49.97 | 1.90 | 81.93 |
Group 1 | 5 | 2019 | 2001 | 72.08 | 1.75 | 77.73 |
Group 1 | 6 | 2019 | 1999 | 71.08 | 2.01 | 95.25 |
Group 1 | 7 | 2019 | 2001 | 41.62 | 1.81 | 81.37 |
Group 1 | 8 | 2018 | 2000 | 49.95 | 2.97 | 68.53 |
Group 1 | 9 | 2018 | 1996 | 39.98 | 3.28 | 76.89 |
Group 1 | 10 | 2011 | 1996 | 32.89 | 2.33 | 119.58 |
Group 1 | 11 | 2013 | 1993 | 27.45 | 1.34 | 75.56 |
Group 1 | 12 | 2018 | 1999 | 29.85 | 3.26 | 93.24 |
Group 1 | 13 | 2014 | 2005 | 33.47 | 1.71 | 43.21 |
Group 1 | 14 | 2018 | 1992 | 17.91 | 2.93 | 80.34 |
Group 2 | 15 | 2018 | 2003 | 12.49 | 2.55 | 80.93 |
Group 2 | 16 | 2016 | 2000 | 15.33 | 2.19 | 88.70 |
Group 2 | 17 | 2016 | 1993 | 25.53 | 2.53 | 118.96 |
Group 2 | 18 | 2016 | 1998 | 12.65 | 2.49 | 114.00 |
Group 2 | 19 | 2016 | 1992 | 9.53 | 2.01 | 103.37 |
Group 2 | 20 | 2014 | 2006 | 15.07 | 1.89 | 48.74 |
Group 2 | 21 | 2014 | 2004 | 12.95 | 1.73 | 58.19 |
Group 2 | 22 | 2017 | 1998 | 7.17 | 1.44 | 84.70 |
Group 2 | 23 | 2013 | 1993 | 6.01 | 1.60 | 97.04 |
Group 2 | 24 | 2018 | 1997 | 5.60 | 2.10 | 78.59 |
Group 2 | 25 | 2017 | 1994 | 3.04 | 1.40 | 86.72 |
Group 2 | 26 | 2018 | 2005 | 4.73 | 2.82 | 81.89 |
Group 2 | 27 | 2017 | 1992 | 2.03 | 1.78 | 94.99 |
Group 2 | 28 | 2010 | 2001 | 0.70 | 0.59 | 10.58 |
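The citing-literature CNCI column of Table 1 can be summarized with a few lines of Python. This is a sketch rather than the authors' procedure: the source does not say which significance test was used, so Welch's t statistic is swapped in here purely as an illustration, and the variable and function names are made up.

```python
from math import sqrt
from statistics import mean, variance

# Citing-literature CNCI values transcribed from Table 1.
group1 = [1.32, 1.42, 2.02, 1.90, 1.75, 2.01, 1.81,
          2.97, 3.28, 2.33, 1.34, 3.26, 1.71, 2.93]  # papers 1-14
group2 = [2.55, 2.19, 2.53, 2.49, 2.01, 1.89, 1.73,
          1.44, 1.60, 2.10, 1.40, 2.82, 1.78, 0.59]  # papers 15-28


def welch_t(a, b):
    """Welch's t statistic for two independent samples (illustrative choice)."""
    standard_error = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / standard_error


all_papers = group1 + group2
print(round(mean(all_papers), 2))       # mean over all 28 papers
print(round(mean(all_papers[:-1]), 2))  # mean without paper 28 (the editorial)
print(round(mean(group1), 2), round(mean(group2), 2))
print(round(welch_t(group1, group2), 2))
```

On these values the mean over all 28 rows rounds to 2.04, the figure quoted in the text, while leaving out paper 28 gives about 2.10. The group means are close (about 2.15 and 1.94) and the Welch statistic is below 1 in absolute value, which is consistent with the reported lack of a significant difference.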
3.1.6 Certain Nobel Prize papers have high TOPCM values
Author | Publication | Publication year | Citations | μ1 | μ2 | TOPCM |
James P. Allison | Proceedings of the National Academy of Sciences of the United States of America | 1997 | 291 | 783 | 728.5 | 764.8 |
Tasuku Honjo | International Immunology | 2005 | 268 | 866 | 533 | 755 |
Michael W. Young | Science | 1994 | 276 | 619 | 952 | 730 |
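For more than 100 years, scientists have tried to enlist the immune system in the fight against cancer. In 2018, James P. Allison and Tasuku Honjo shared the Nobel Prize in Physiology or Medicine for their discovery of cancer therapy by inhibition of negative immune regulation. In the 1990s, James P. Allison used mouse experiments to show that blocking CTLA-4 releases the brake on T cells so that they can attack cancer cells at full strength [12]. Tasuku Honjo, having studied how PD-L1 can suppress the immune response by interacting with PD-1, went on in 2005 to show that hematogenous spread of poorly immunogenic B16 melanoma cells to the liver was inhibited in PD-1-deficient mice [13]. Although both papers were important to their Nobel Prizes, neither was highly cited. We analyzed their citations; the results are shown in Figures 10 and 11. Of the three first-generation citing papers of James P. Allison's 1997 paper (Figure 10), two are clinical trials of ipilimumab, a monoclonal antibody against CTLA-4, and one is a further study of CTLA-4-related mechanisms [14,15,16]. Among the second-generation citing papers, most of those citing the first two are related follow-up clinical trials. Notably, one paper citing the Wolchok paper has been cited more than 8,000 times; it reports that ipilimumab improves survival in patients with metastatic melanoma [17]. Likewise, for Tasuku Honjo's 2005 paper (Figure 11), the three first-generation citing papers concern immunosuppressive mechanisms [18,19,20], while most of the second-generation citing papers are clinical trials and further studies of related antibodies.
The 1994 paper [21] by Michael W. Young, a 2017 Nobel laureate, has been cited 267 times, whereas his 1998 paper [22], which cites the 1994 paper and was likewise recognized as a prize-winning paper, has received 634 citations. We examined the content of the two papers in detail. In 1994, Michael Young discovered a second clock gene, timeless, which encodes the TIM protein, a key protein required for normal circadian rhythm. The researchers found that when TIM binds to PER, the two proteins enter the cell nucleus, where they block the activity of the period gene and close the inhibitory feedback loop. This regulatory feedback mechanism explains how protein levels in the cell oscillate, but it left open the question of what controls the frequency of the oscillations. In 1998, Michael Young identified another key gene, doubletime, which encodes the DBT protein. DBT slows the accumulation of PER protein, which helps explain how the circadian oscillation is tuned to match the 24-hour daily cycle closely. In other words, the 1998 paper cited the 1994 paper and built on it to further explain the principle of circadian oscillation.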
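4 Discussion
4.1 Recognize the positive signaling role of authoritative journals within a discipline
4.2 Introduce citing-literature indicators to assist in selecting representative works
4.3 Adapt to context and combine quantitative and qualitative evaluation
Our study found that Nobel Prize papers generally have high citation counts. Citation count is also inversely related to the test period, and this pattern is more pronounced when the citation count is extremely high. Clarivate's Citation Laureates are selected through bibliometric analysis of highly cited papers worldwide and have so far successfully predicted many Nobel laureates, while the award of a Nobel Prize is undoubtedly the outcome of review by the world's leading peers. Although a paper's quality does not depend on its citation count or even its length, citation count does reflect academic influence [30]. Dr. Eugene Garfield found that citation analysis can complement peer review, helping to ensure the objectivity of evaluation or giving experts more information. Publication and citation data should not be used as a substitute for reading a researcher's work when evaluating it, and thereby for the judgment of peers; instead, a range of standardized indicators should be used for comprehensive analysis. When we quantitatively assess a researcher through papers whose citation counts are in the top 1%, top 0.1%, or even top 0.01%, this strongly indicates that the researcher has contributed something very valuable, even significant; when the researcher has published several such results, we can judge more accurately that the research has important academic influence [31]. Citation count therefore remains an important indicator.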
[EB/OL]. (
Peer reviews and bibliometric indicators: a comparative study at a Norwegian university
[J]. ,
Correlations between bibliometrics and peer evaluation for all disciplines: the evaluation of Brazilian scientists
[J]. ,
'Peer review' for scientific manuscripts: Emerging issues, potential threats, and possible remedies
[J]. ,
Understanding Noble Prizes winning articles: A bibliometric analysis
[J]. ,
InCites: a research evaluation tool based on authoritative Web of Science data
[DB/OL]. (
Scientific influence is not always visible: The phenomenon of under-cited influential publications
[J]. ,
Bibliometric analysis of Nobelists' awards and landmark papers in physiology or medicine during 1983-2012
[J]. ,
Manipulation of T cell costimulatory and inhibitory signals for immunotherapy of prostate cancer
[J]. ,
PD-1 blockade inhibits hematogenous spread of poorly immunogenic tumor cells by enhanced recruitment of effector T cells
[J]. ,
Ipilimumab monotherapy in patients with pretreated advanced melanoma: a randomised, double-blind, multicentre, phase 2, dose-ranging study
[J]. ,
Ipilimumab versus placebo after radiotherapy in patients with metastatic castration-resistant prostate cancer that had progressed after docetaxel chemotherapy (CA184-043): a multicentre, randomised, double-blind, phase 3 trial
[J]. ,
Combination immunotherapy of B16 melanoma using anti-cytotoxic T lymphocyte-associated antigen 4 (CTLA-4) and granulocyte/macrophage colony-stimulating factor (GM-CSF)-producing vaccines induces rejection of subcutaneous and metastatic tumors accompanied by autoimmune depigmentation
[J]. ,
Improved survival with ipilimumab in patients with metastatic melanoma
[J]. ,
Programmed cell death 1 ligand 1 and tumor-infiltrating CD8(+) T lymphocytes are prognostic factors of human ovarian cancer
[J]. ,
Loss of tumor suppressor PTEN function increases B7-H1 expression and immunoresistance in glioma
[J]. ,
Immune inhibitory molecules LAG-3 and PD-1 synergistically regulate T-cell function to promote tumoral immune escape
[J]. ,
Block in nuclear-localization of period protein by a 2nd clock mutation timeless
[J]. ,
Double-time is a novel Drosophila clock gene that regulates PERIOD protein accumulation
[J]. ,
[J]. ,
[EB/OL]. (
[EB/OL]. (
Finding scientific gems with Google’s PageRank algorithm
[J]. ,
How do NIHR peer review panels use bibliometric information to support their decisions?
[J]. ,