Alex L Mitchell 1 , Teresa K Attwood 2 , Patricia C Babbitt 3 , Matthias Blum 1 , Peer Bork 4 , Alan Bridge 5 , Shoshana D Brown 3 , Hsin-Yu Chang 1 , Sara El-Gebali 1 , Matthew I Fraser 1 , Julian Gough 6 , David R Haft 7 , Hongzhan Huang 8 , Ivica Letunic 9 , Rodrigo Lopez 1 , Aurélien Luciani 1 , Fabio Madeira 1 , Aron Marchler-Bauer 10 , Huaiyu Mi 11 , Darren A Natale 12 , Marco Necci 13 , 14 , 15 , Gift Nuka 1 , Christine Orengo 16 , Arun P Pandurangan 6 , Typhaine Paysan-Lafosse 1 , Sebastien Pesseat 1 , Simon C Potter 1 , Matloob A Qureshi 1 , Neil D Rawlings 1 , Nicole Redaschi 5 , Lorna J Richardson 1 , Catherine Rivoire 5 , Gustavo A Salazar 1 , Amaia Sangrador-Vegas 1 , Christian J A Sigrist 5 , Ian Sillitoe 16 , Granger G Sutton 7 , Narmada Thanki 10 , Paul D Thomas 11 , Silvio C E Tosatto 13 , Siew-Yit Yong 1 , Robert D Finn 1
06 November 2018
The InterPro database ( http://www.ebi.ac.uk/interpro/) classifies protein sequences into families and predicts the presence of functionally important domains and sites. Here, we report recent developments with InterPro (version 70.0) and its associated software, including an 18% growth in the size of the database in terms on new InterPro entries, updates to content, the inclusion of an additional entry type, refined modelling of discontinuous domains, and the development of a new programmatic interface and website. These developments extend and enrich the information provided by InterPro, and provide greater flexibility in terms of data access. We also show that InterPro's sequence coverage has kept pace with the growth of UniProtKB, and discuss how our evaluation of residue coverage may help guide future curation activities.