| Literature DB >> 19909804 |
Charles A Steward1, Sean Humphray, Bob Plumb, Matthew C Jones, Michael A Quail, Stephen Rice, Tony Cox, Rob Davies, James Bonfield, Thomas M Keane, Michael Nefedov, Pieter J de Jong, Paul Lyons, Linda Wicker, John Todd, Yoshihide Hayashizaki, Omid Gulban, Jayne Danska, Jen Harrow, Tim Hubbard, Jane Rogers, David J Adams.
Abstract
Non-obese diabetic (NOD) mice spontaneously develop type 1 diabetes (T1D) due to the progressive loss of insulin-secreting beta-cells by an autoimmune driven process. NOD mice represent a valuable tool for studying the genetics of T1D and for evaluating therapeutic interventions. Here we describe the development and characterization by end-sequencing of bacterial artificial chromosome (BAC) libraries derived from NOD/MrkTac (DIL NOD) and NOD/ShiLtJ (CHORI-29), two commonly used NOD substrains. The DIL NOD library is composed of 196,032 BACs and the CHORI-29 library is composed of 110,976 BACs. The average depth of genome coverage of the DIL NOD library, estimated from mapping the BAC end-sequences to the reference mouse genome sequence, was 7.1-fold across the autosomes and 6.6-fold across the X chromosome. Clones from this library have an average insert size of 150 kb and map to over 95.6% of the reference mouse genome assembly (NCBIm37), covering 98.8% of Ensembl mouse genes. By the same metric, the CHORI-29 library has an average depth over the autosomes of 5.0-fold and 2.8-fold coverage of the X chromosome, the reduced X chromosome coverage being due to the use of a male donor for this library. Clones from this library have an average insert size of 205 kb and map to 93.9% of the reference mouse genome assembly, covering 95.7% of Ensembl genes. We have identified and validated 191,841 single nucleotide polymorphisms (SNPs) for DIL NOD and 114,380 SNPs for CHORI-29. In total we generated 229,736,133 bp of sequence for the DIL NOD and 121,963,211 bp for the CHORI-29. These BAC libraries represent a powerful resource for functional studies, such as gene targeting in NOD embryonic stem (ES) cell lines, and for sequencing and mapping experiments. 2009 Elsevier Inc. All rights reserved.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19909804 PMCID: PMC2824108 DOI: 10.1016/j.ygeno.2009.10.004
Source DB: PubMed Journal: Genomics ISSN: 0888-7543 Impact factor: 5.736
| BAC clones in library | 196,032 |
|---|---|
| Attempted BAC clone end-reads | 378,896 |
| BAC clone end-reads sequenced successfully | 332,535 |
| Passed BAC clones sequenced on both strands | 150,878 |
| BAC clone end-reads with an alignment to the genome | 318,065 |
| Aligned BAC clone end-reads with a unique match | 170,029 |
| Aligned BAC clone end-reads with multiple matches | 148,036 |
| BAC clones positioned using unique end-read pairs | 41,468 |
| BAC clones positioned uniquely | 125,266 |
Sequencing and alignment summary for the DIL NOD BAC library.
| BAC clones in library | 110,976 |
|---|---|
| Attempted BAC clone end-reads | 207,321 |
| BAC clone end-reads sequenced successfully | 170,159 |
| Passed BAC clones sequenced on both strands | 75,046 |
| BAC clone end-reads with an alignment to the genome | 159,574 |
| Aligned BAC clone end-reads with a unique match | 80,710 |
| Aligned BAC clone end-reads with multiple matches | 78,864 |
| BAC clones positioned using unique end-read pairs | 18,257 |
| BAC clones positioned uniquely | 62,162 |
Sequencing and alignment summary for the CHORI-29 BAC library.
Fig. 1DIL NOD clones have an average insert size of 149,809 bp and CHORI-29 clones have an average insert size of 205,413 bp. (A) Frequency distribution of the insert size of the 125,266 DIL NOD clones (green) and the 62,162 CHORI-29 clones (blue). (B) CHEF gel of NotI digested clones showing the difference in size between the NOD libraries. The two different vector bands are illustrated with white arrows. Marker (M) is a DNA marker of 24–300 kb size range.
| Chr. name | Total chr. length bp | Non-redundant clone-length bp | Clone-length total bp | Passed aligned sequence bp | Clone depth | % total coverage |
|---|---|---|---|---|---|---|
| 1 | 197,195,432 | 189,902,383 | 1,385,443,533 | 13,593,208 | 7.03 | 96.3 |
| 2 | 181,748,087 | 177,813,774 | 1,348,761,746 | 13,129,801 | 7.42 | 97.84 |
| 3 | 159,599,783 | 155,666,906 | 1,151,145,108 | 11,396,911 | 7.21 | 97.54 |
| 4 | 155,630,120 | 150,659,365 | 1,134,909,770 | 10,944,727 | 7.29 | 96.81 |
| 5 | 152,537,259 | 146,223,145 | 1,009,782,448 | 9,773,915 | 6.62 | 95.86 |
| 6 | 149,517,037 | 145,684,306 | 1,049,822,257 | 10,281,087 | 7.02 | 97.44 |
| 7 | 152,524,553 | 140,866,414 | 1,007,473,869 | 9,637,732 | 6.61 | 92.36 |
| 8 | 131,738,871 | 123,985,369 | 907,535,329 | 8,826,036 | 6.89 | 94.11 |
| 9 | 124,076,172 | 120,302,728 | 916,247,038 | 8,800,500 | 7.38 | 96.96 |
| 10 | 129,993,255 | 125,971,520 | 897,618,520 | 8,764,981 | 6.91 | 96.91 |
| 11 | 121,843,856 | 118,333,935 | 924,494,065 | 8,842,955 | 7.59 | 97.12 |
| 12 | 121,257,530 | 117,098,461 | 854,759,223 | 8,258,226 | 7.05 | 96.57 |
| 13 | 120,284,312 | 116,142,787 | 888,982,416 | 8,609,816 | 7.39 | 96.56 |
| 14 | 125,194,864 | 119,290,632 | 846,844,974 | 8,247,811 | 6.76 | 95.28 |
| 15 | 103,494,974 | 100,431,693 | 735,484,668 | 7,116,922 | 7.11 | 97.04 |
| 16 | 98,319,150 | 94,516,190 | 692,930,172 | 6,811,564 | 7.05 | 96.13 |
| 17 | 95,272,651 | 91,304,604 | 690,760,336 | 6,680,800 | 7.25 | 95.84 |
| 18 | 90,772,031 | 86,713,096 | 655,698,203 | 6,460,483 | 7.22 | 95.53 |
| 19 | 61,342,430 | 57,670,646 | 421,571,236 | 4,064,957 | 6.87 | 94.01 |
| X | 166,650,296 | 160,833,108 | 1,095,695,031 | 10,991,837 | 6.57 | 96.51 |
| Y | 15,902,555 | 0 | 0 | 0 | 0 | 0 |
All data in Table 1 and Table 2 are derived from the NCBIm37 mouse assembly. The DIL NOD library covers over 95.6% of the mouse genome at an average depth of 7.1-fold across the autosomes and 6.6-fold across the X chromosome. Column 2 is the C57BL/6J chromosome length. The non-redundant clone-length field (column 3) is the non-redundant BAC clone sequence estimated for that chromosome, using both read-pairs. This may count each base several times, the number of times being the number of BACs overlapping that position. The clone-length total field (column 4) is the total sequence estimated for all the BACs for that chromosome, using both read-pairs. The passed aligned sequence column (5) is the total number of sequenced bases per chromosome that have successfully passed quality control and been mapped. The clone depth field (column 6) is the clone-length total (column 4) divided by the total chromosome length (column 2). The % total coverage (column 7) is the non-redundant clone-length total (counting each base covered only once) as a percentage of the total chromosome length.
| Chr. Name | Total chr. length bp | Non-redundant clone-length bp | Clone-length total bp | Passed aligned sequence bp | Clone depth | % total coverage |
|---|---|---|---|---|---|---|
| 1 | 197,195,432 | 188,433,001 | 1,046,993,130 | 7,724,484 | 5.31 | 95.56 |
| 2 | 181,748,087 | 175,429,538 | 942,386,790 | 6,916,061 | 5.18 | 96.52 |
| 3 | 159,599,783 | 154,711,914 | 875,284,125 | 6,490,661 | 5.48 | 96.94 |
| 4 | 155,630,120 | 148,498,769 | 827,737,353 | 6,127,905 | 5.32 | 95.42 |
| 5 | 152,537,259 | 144,530,268 | 733,517,218 | 5,404,820 | 4.81 | 94.75 |
| 6 | 149,517,037 | 143,599,805 | 775,532,299 | 5,768,104 | 5.19 | 96.04 |
| 7 | 152,524,553 | 135,008,655 | 644,735,047 | 4,788,996 | 4.23 | 88.52 |
| 8 | 131,738,871 | 121,214,563 | 606,581,664 | 4,522,109 | 4.60 | 92.01 |
| 9 | 124,076,172 | 118,654,230 | 598,157,862 | 4,362,992 | 4.82 | 95.63 |
| 10 | 129,993,255 | 124,749,578 | 675,601,360 | 4,987,613 | 5.20 | 95.97 |
| 11 | 121,843,856 | 115,930,403 | 605,273,345 | 4,474,962 | 4.97 | 95.15 |
| 12 | 121,257,530 | 115,274,597 | 604,649,354 | 4,482,487 | 4.98 | 95.07 |
| 13 | 120,284,312 | 114,926,992 | 609,116,213 | 4,517,988 | 5.06 | 95.55 |
| 14 | 125,194,864 | 116,442,268 | 607,970,623 | 4,531,884 | 4.85 | 93.01 |
| 15 | 103,494,974 | 99,239,967 | 543,851,152 | 3,998,763 | 5.25 | 95.89 |
| 16 | 98,319,150 | 92,840,807 | 534,474,134 | 3,928,889 | 5.44 | 94.43 |
| 17 | 95,272,651 | 89,677,039 | 468,395,835 | 3,456,144 | 4.92 | 94.13 |
| 18 | 90,772,031 | 85,715,767 | 456,115,712 | 3,373,974 | 5.02 | 94.43 |
| 19 | 61,342,430 | 56,286,798 | 291,811,698 | 2,148,157 | 4.75 | 91.76 |
| X | 166,650,296 | 148,955,095 | 469,399,660 | 3,543,869 | 2.82 | 89.38 |
| Y | 15,902,555 | 2,427,223 | 9,762,866 | 80,060 | 0.62 | 15.26 |
The CHORI-29 library covers 93.9% of the mouse genome at an average depth of 5.0-fold across the autosomes and 2.8-fold across the X chromosome. For further information regarding the data, please see legend for Table 2A.
Fig. 2NOD BAC clones are displayed on the Ensembl genome browser under the DAS sources, http://www.ebi.ac.uk/das-srv/genomicdas/das/nod_clones_m37 and http://www.ebi.ac.uk/das-srv/genomicdas/das/chori29_clones_m37. The NOD/MrkTac mouse strain is designated DIL NOD (Sanger/Ensembl prefix bQ) and the NOD/ShiLtJ mouse strain is designated CHORI-29 (Sanger/Ensembl prefix bCN). The DIL NOD clones are displayed as black and red bars and the CHORI-29 clones are displayed as blue and green bars, the colours indicating the orientation of the DNA insert in the vector. BAC clones on the forward strand are drawn above the DNA contig while clones on the reverse strand are drawn below. The clone end-reads are shown as small arrows in the corresponding color to the relevant BAC clone. Links to the end-read sequences in the Ensembl trace archive can be found by clicking on the clone end of interest.