The strategy for map construction (described in Methods),
which made extensive use of the program SEGMAP, was utilized to
localize 2150 chromosome 7 STSs relative to 3892 YACs, resulting in the
construction of 36 contigs (Table 1). A representative sample of a
SEGMAP-constructed contig map is provided in Figure 4. Overall, the
YAC-based STS-content analysis allowed 90% of the mapped STSs to be
uniquely ordered within each contig. Furthermore, >95% of adjacent STS
pairs are connected by two or more YACs (i.e., double-linked), thereby
providing strong evidence for their close proximity on chromosome 7.
For each of the 36 YAC contigs, an electronic summary table listing the
STSs in their deduced map order and containing relevant information
about each STS is available on the World Wide Web (Table 2), with a
representative example shown in Figure 5.
Of the 36 contigs, 22 are anchored in place relative to the Genethon
genetic, RH, and/or cytogenetic maps (Table 1,
Figures 2 and 3)
accounting for >98% of the mapped STSs. For the subset of STSs
corresponding to markers on the Genethon genetic and chromosome 7 RH
maps, the STS order deduced by YAC-based mapping is remarkably
consistent with that determined by these alternate methods. However, a
small number of inconsistencies remain between the YAC contig map and
the Genethon genetic (e.g., sWSS3069, sWSS3046, sWSS1221) and
chromosome 7 RH (e.g., sWSS2745, sWSS1090, sWSS1713) maps.
Specifically, 17 of 260 (7%) Genethon genetic markers mapped to YAC
contigs and 12 of 259 (5%) RH markers mapped to YAC contigs appear to
be at discrepant locations relative to their positions established by
genetic and RH mapping (at >1000:1 odds), respectively. Four of the RH
mapping cases correspond to markers mapping near the centromere, which
is not unexpected given the difficulties inherent to the analysis of
regions residing near centromeres by RH mapping methods (Cox et al.
1990; James et al. 1994; Walter et al. 1994; Hudson et al. 1995;
Schuler et al. 1996; Stewart et al. 1997). All discrepancies with the
Genethon genetic and RH maps are indicated in the electronic summary
tables (Table 2, Figures 2 and
3).
Each of the 22 anchored YAC contigs has been assigned two names: one
given by SEGMAP that reflects the lowest-numbered STS in the contig and
one indicating its long-range position relative to the other anchored
contigs (A through V, respectively). Within the first and last anchored
contigs, A (sWSS1361) and V (sWSS25), are 7p and 7q telomere-containing
YACs, respectively. Neither of the telomeric contigs contains STSs
corresponding to markers present on the Genethon genetic or RH maps;
however, their terminal locations and relative orientations are known
by the positions of the telomere-containing YACs. Of the remaining
contigs, 18 are anchored on both the Genethon genetic and RH maps,
while contig B (sWSS457) is only anchored on the Genethon genetic map
and contig S (sWSS806) is only anchored on the RH map. Despite the
occasional minor discrepancies with the Genethon genetic or RH maps
(e.g., contigs P and R relative to the Genethon genetic map, contigs G,
R, and S relative to the RH map), the predicted relative order of the
anchored contigs seems reasonable in light of the available long-range
mapping information. In addition to establishing the relative order for
most of the anchored contigs, the Genethon genetic and RH maps allow
determination of the relative orientation for all but a few contigs
(Table 1). In total, 19 of the anchored YAC contigs, containing just
under 98% of the mapped STSs, are both ordered and oriented. Figure 6
provides a global overview of the predicted relative positioning of the
22 anchored YAC contigs across the chromosome.
The 22 anchored YAC contigs are highly variable in size and complexity
(Table 1). The number of STSs in each contig ranges from 2 to 634 (with
an average of 96), while the number of YACs range from 5 to 1260 (with
an average of 175). In routine usage, SEGMAP depicts contigs so that
the STSs are evenly spaced and the size of each YAC, in general,
reflects the number of STSs it contains (see Figure 4). However, SEGMAP
is capable of accounting for the measured insert size and STS content
of each YAC to estimate the relative STS spacing and clone overlaps.
The resulting computed maps thus provide estimates of total contig
sizes, with the 22 anchored contigs averaging 7.95 Mb in size (Table
1). The largest YAC contig [E (sWSS9)] contains 634 STSs and 1260 YACs,
spans an estimated 49.6 Mb, represents most of the short arm of the
chromosome, and, to our knowledge, is the largest single YAC contig
assembled to date.
While the computed maps provide some insight about STS spacing and
overall contig lengths, they are largely based on the measured size of
each YAC. This measurement is limited by a number of factors, including
the frequent presence of large chimeric segments in YACs [particularly
clones from total human genomic libraries (Green et al. 1991b)] that
make the cloned insert inappropriately large, the common occurrence of
internal deletions in YACs that make the cloned insert inappropriately
small, and the inherent inaccuracies associated with the
high-throughput sizing of 3892 YACs by pulsed-field gel
electrophoresis. Note that the cumulative size of the 36 YAC contigs
exceeds 186 Mb, whereas chromosome 7 is estimated to contain ~170 Mb of
DNA (Trask et al. 1989; Morton 1991). This discrepancy could reflect
the above problems with calculating YAC contig sizes and/or an
underestimate for the total size of the chromosome.