201:). What is unique about NCBI-curated domains is that they use 3D-structure information to explicitly define domain boundaries, align blocks, amend alignment details, and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. To provide a non-redundant view of the data, CDD clusters similar domain models from various sources into superfamilies.
74:
25:
185:
sequence/structure/function relationships. To do this, CDD Curators include the following types of information in order to supplement and enrich the traditional multiple sequence alignments that form the foundation of domain models: 3-dimensional structures and conserved core motifs, conserved features/sites, phylogenetic organization, links to electronic literature resources.
213:
query and retrieval system, crosslinked to numerous other resources. CDD provides annotation of domain footprints and conserved functional sites on protein sequences. Precalculated domain annotation can be retrieved for protein sequences tracked in NCBI's Entrez system, and CDD's collection of models
175:
Domains can be thought of as distinct functional and/or structural units of a protein. These two classifications coincide rather often, as a matter of fact, and what is found as an independently folding unit of a polypeptide chain also carries specific function. Domains are often identified as
184:
The goal of the NCBI conserved domain curation project is to provide database users with insights into how patterns of residue conservation and divergence in a family relate to functional properties, and to provide useful links to more detailed information that may help to understand those
180:
such domains may have been utilized as building blocks, and may have been recombined in different arrangements to modulate protein function. CDD defines conserved domains as recurring units in molecular evolution, the extents of which can be determined by sequence and structure analysis.
248:
Marchler-Bauer, A.; Zheng, C.; Chitsaz, F.; Derbyshire, M. K.; Geer, L. Y.; Geer, R. C.; Gonzales, N. R.; Gwadz, M.; Hurwitz, D. I.; Lanczycki, C. J.; Lu, F.; Lu, S.; Marchler, G. H.; Song, J. S.; Thanki, N.; Yamashita, R. A.; Zhang, D.; Bryant, S. H. (2012).
46:
104:
33:
193:
CDD content includes NCBI manually curated domain models and domain models imported from a number of external source databases (
319:
160:
324:
329:
38:
228:, that allows the computation and download of annotation for large sets of protein queries.
223:
8:
177:
302:
275:
250:
216:
143:
280:
270:
262:
176:
recurring (sequence or structure) units, which may exist in various contexts. In
99:
164:
313:
284:
266:
247:
89:
Conserved Domain
Database for the functional annotation of proteins.
198:
73:
251:"CDD: Conserved domains and protein three-dimensional structure"
24:
210:
305:. United States National Center for Biotechnology Information.
226:. United States National Center for Biotechnology Information.
219:. United States National Center for Biotechnology Information.
194:
163:
models and derived database search models, for ancient
303:"Conserved Domains Database (CDD) and Resource Group"
144:
https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml
214:can be queried with novel protein sequences via *
311:
243:
241:
105:National Center for Biotechnology Information
238:
274:
204:
49:of all important aspects of the article.
312:
209:The collection is also part of NCBI's
45:Please consider expanding the lead to
18:
13:
159:) is a database of well-annotated
14:
341:
295:
122:Marchler-Bauer & al. (2013)
72:
23:
37:may be too short to adequately
47:provide an accessible overview
1:
261:(Database issue): D348–D352.
231:
170:
7:
161:multiple sequence alignment
10:
346:
188:
167:and full-length proteins.
153:Conserved Domain Database
139:
134:
126:
118:
110:
98:
93:
85:
80:
71:
217:"the CD-Search service"
255:Nucleic Acids Research
205:Searching the database
224:"the Batch CD-Search"
320:Biological databases
267:10.1093/nar/gks1243
197:, SMART, COG, PRK,
178:molecular evolution
114:Aron Marchler-Bauer
68:
16:Biological database
66:
325:Protein structure
149:
148:
64:
63:
337:
306:
289:
288:
278:
245:
227:
220:
119:Primary citation
76:
69:
65:
59:
56:
50:
27:
19:
345:
344:
340:
339:
338:
336:
335:
334:
330:Protein domains
310:
309:
301:
298:
293:
292:
246:
239:
234:
222:
215:
207:
191:
173:
100:Research center
60:
54:
51:
44:
32:This article's
28:
17:
12:
11:
5:
343:
333:
332:
327:
322:
308:
307:
297:
296:External links
294:
291:
290:
236:
235:
233:
230:
206:
203:
190:
187:
172:
169:
147:
146:
141:
137:
136:
132:
131:
128:
124:
123:
120:
116:
115:
112:
108:
107:
102:
96:
95:
91:
90:
87:
83:
82:
78:
77:
62:
61:
41:the key points
31:
29:
22:
15:
9:
6:
4:
3:
2:
342:
331:
328:
326:
323:
321:
318:
317:
315:
304:
300:
299:
286:
282:
277:
272:
268:
264:
260:
256:
252:
244:
242:
237:
229:
225:
218:
212:
202:
200:
196:
186:
182:
179:
168:
166:
162:
158:
154:
145:
142:
138:
133:
129:
125:
121:
117:
113:
109:
106:
103:
101:
97:
92:
88:
84:
79:
75:
70:
58:
55:February 2012
48:
42:
40:
35:
30:
26:
21:
20:
258:
254:
208:
192:
183:
174:
156:
152:
150:
127:Release date
52:
36:
34:lead section
86:Description
314:Categories
232:References
171:Philosophy
221:, or at*
39:summarize
285:23197659
199:TIGRFAMs
276:3531192
189:Content
165:domains
140:Website
111:Authors
94:Contact
81:Content
283:
273:
211:Entrez
135:Access
281:PMID
195:Pfam
151:The
130:2003
271:PMC
263:doi
157:CDD
67:CDD
316::
279:.
269:.
259:41
257:.
253:.
240:^
287:.
265::
155:(
57:)
53:(
43:.
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.