第7篇英语翻译

重点单词:

determine  v.确定,查明,决定

align    v.平行 ,使一致 ,排成一条线 ,调整

score   v.评分, 计分

matter  n.事情,问题 , 情况 

as a matter of fact  n. 事实上

optimal adj.最佳的,最优的。

match  v.匹配 ,符合 ,使一致 ,比得上, 满足 n. 比赛 , 敌手 火柴

minus n . 减号 , 负号 adj. 不利的,小于零的, 负面的。

出处:https://acs.jxnu.edu.cn/contest/22/board/challenge/F

Human Gene Functions

描述:

It is well known that a human gene can be considered as a sequence, consisting of four nucleotides, which are simply denoted by four letters, A, C, G, and T. Biologists have been interested in identifying human genes and determining their functions, because these can be used to diagnose human diseases and to design new drugs for them.   众所周知,人类基因可以被视为一个序列,由四个核苷酸组成,它们简单地由四个字母a、C、G和T表示。生物学家一直对识别人类基因和确定其功能感兴趣,因为这些基因可以用于诊断人类疾病和为其设计新药。

A human gene can be identified through a series of time-consuming biological experiments, often with the help of computer programs. Once a sequence of a gene is obtained, the next job is to determine its function.   人类基因可以通过一系列耗时的生物学实验来识别,通常需要计算机程序的帮助。一旦获得了基因序列,下一步的工作就是确定它的功能。
One of the methods for biologists to use in determining the function of a new gene sequence that they have just identified is to search a database with the new gene as a query. The database to be searched stores many gene sequences and their functions – many researchers have been submitting their genes and functions to the database and the database is freely accessible through the Internet.
生物学家在确定刚刚确定的新基因序列的功能时使用的一种方法是搜索带有新基因的数据库作为查询。要搜索的数据库存储了许多基因序列及其功能——许多研究人员一直在向数据库提交他们的基因和功能,数据库可以通过互联网*访问。
A database search will return a list of gene sequences from the database that are similar to the query gene.
Biologists assume that sequence similarity often implies functional similarity. So, the function of the new gene might be one of the functions that the genes from the list have. To exactly determine which one is the right one another series of biological experiments will be needed.

数据库搜索将返回数据库中与查询基因相似的基因序列列表。

生物学家认为序列相似性通常意味着功能相似性。因此,新基因的功能可能是列表中的基因所具有的功能之一。为了准确地确定哪一个是正确的,还需要一系列的生物学实验。

 

Your job is to make a program that compares two genes and determines their similarity as explained below. Your program may be used as a part of the database search if you can provide an efficient one. 你的工作是制作一个程序来比较两个基因并确定它们的相似性,如下所述。如果您能提供一个高效的程序,您的程序可以用作数据库搜索的一部分。
Given two genes AGTGATG and GTTAG, how similar are they? One of the methods to measure the similarity
of two genes is called alignment. In an alignment, spaces are inserted, if necessary, in appropriate positions of
the genes to make them equally long and score the resulting genes according to a scoring matrix.

给定两个基因AGTGATG和GTTAG,它们有多相似?相似性度量方法之一

两个基因的结合称为对齐。在路线中,如有必要,将在路线的适当位置插入空间

这些基因使它们同样长,并根据评分矩阵对结果基因进行评分。



For example, one space is inserted into AGTGATG to result in AGTGAT-G, and three spaces are inserted into GTTAG to result in –GT--TAG. A space is denoted by a minus sign (-). The two genes are now of equal
length. These two strings are aligned:

AGTGAT-G
-GT--TAG

例如,将一个空格插入AGTGATG以生成AGTGAT-G,将三个空格插入GTTAG以生成–GT--TAG。空格由减号(-)表示。这两个基因现在具有相同的功能

长这两个字符串是对齐的:

 

AGTGAT-G

-GT--TAG


In this alignment, there are four matches, namely, G in the second position, T in the third, T in the sixth, and G in the eighth. Each pair of aligned characters is assigned a score according to the following scoring matrix.
第7篇英语翻译

denotes that a space-space match is not allowed. The score of the alignment above is (-3)+5+5+(-2)+(-3)+5+(-3)+5=9.

Of course, many other alignments are possible. One is shown below (a different number of spaces are inserted into different positions):

在这个排列中,有四个匹配,即G在第二个位置,T在第三个位置,T在第六个位置,G在第八个位置。每对对齐字符根据以下评分矩阵分配一个分数。

 


表示不允许空间匹配。以上排列的得分为(-3)+5+5+(-2)+-3)+5+(-3)+5=9。

 

当然,许多其他路线也是可能的。下面显示了一个(在不同位置插入不同数量的空格):

 

AGTGATG

-GTTA-G


This alignment gives a score of (-3)+5+5+(-2)+5+(-1) +5=14. So, this one is better than the previous one. As a matter of fact, this one is optimal since no other alignment can have a higher score. So, it is said that the
similarity of the two genes is 14.

此对齐方式的得分为(-3)+5+5+(-2)+5+(-1)+5=14。所以,这个比前一个好。事实上,这一条是最优的,因为没有其他路线可以有更高的分数。因此,据说

这两个基因的相似性为14。

输入:

The input consists of T test cases. The number of test cases ) (T is given in the first line of the input file. Each test case consists of two lines: each line contains an integer, the length of a gene, followed by a gene sequence. The length of each gene sequence is at least one and does not exceed 100. 输入由T个测试用例组成。测试用例数)(T在输入文件的第一行中给出。每个测试用例由两行组成:每行包含一个整数,即基因长度,后跟一个基因序列。每个基因序列的长度至少为一个,且不超过100。

输出:

The output should print the similarity of each test case, one per line.

样例输入:

2 
7 AGTGATG 
5 GTTAG 
7 AGCTATT 
9 AGCTTTAAA 

样例输出:

14
21 
上一篇:配置文件的使用


下一篇:### Error querying database. Cause: org.apache.ibatis.reflection.ReflectionException: There is no g