The program searches for the two protein profiles of the integron-integrase using hmmsearch with default parameters from HMMER suite version 3.1b1 and for the attC sites with the default mode of cmsearch from INFERNAL 1.1 (Figure
The results of the searches for the elements of the integron are put together to class the loci in three categories (Figure
To obtain a better compromise between accuracy and running time, IntegronFinder can re-run INFERNAL to search for attC sites with more precision using the Inside algorithm (‘- - max’ option in INFERNAL), but only around previously identified elements (‘- - local_max’ option in IntegronFinder). More precisely, if a locus contains an integron-integrase and attC sites (complete integron), the search is constrained to the strand encoding attC sites between the end of the integron-integrase and 4 kb after its most distant attC. If other attC sites are found after this one, the search is extended by 4 kb in that direction until no more new sites are found. If the element contains only attC sites (CALIN), the search is performed on the same strand on both directions. If the integron is In0, the search for attC sites is done on both strands in the 4 kb flanking the integron-integrase on each side. The program then searches for promoters and attI sites near the integron-integrase. Finally, it can annotate the integron genes’ cassettes (defined in the program as the CDS found between intI and 200 bp after the last attC site, or 200 bp before the first and 200 bp after the last attC site if there is no integron-integrase) using a database of protein profiles (option ‘- - func_annot’). For example, in the present study we used the ResFams database to search for antibiotic resistance genes. One can use any hmmer-compatible profile databases with the program.
The program outputs tabular and GenBank files listing all the identified genetic elements associated with an integron. The program also produces a figure in pdf format representing each complete integron. For an interactive view of all the hits, one can use the GenBank file as input in specific programs such as Geneious (48 (link)).
The user can change the profiles of the integrases and the covariance model of the attC site. Thus, if novel models of attC sites were to be built in the future, e.g., for novel types of attC sites, they could easily be plugged in IntegronFinder.