Since 1989, human immunodeficiency virus type 1 (HIV-1) has spread explosively through the heterosexual population in Thailand. This epidemic is caused primarily by viruses classified as "subtype E", which, on the basis of limited sequence comparisons, appear to represent hybrids of subtypes A (gag) and E (env). However, the true evolutionary origins of "subtype E" viruses are still obscure since no complete genomes have been analyzed, and only one full-length subtype A sequence has been available for phylogenetic comparison. In this study, we determined full-length proviral sequences for "subtype E" viruses from Thailand (93TH253) and the Central African Republic (90CR402) and for a subtype A virus from Uganda (92UG037). We also sequenced the long terminal repeat (LTR) regions from 16 virus strains representing clades A, C, E, F, and G. Detailed phylogenetic analyses of these sequences indicated that "subtype E" viruses do indeed represent A/E recombinants with multiple points of crossover along their genomes. The extracellular portion of env, parts of vif and vpr, as well as most of the LTR are of subtype E origin, whereas the remainder of the genome is of subtype A origin. The possibility that the discordant phylogenetic positions of "subtype E" viruses in gag- and env-derived trees are the result of unusual rates or patterns of evolution was also considered but was ruled out on the basis of two lines of evidence: (i) phylogenetic trees constructed for synonymous and nonsynonymous substitutions yielded the same discordant branching orders for "subtype E" gag and env gene sequences, thus excluding selection-driven evolution, and (ii) multiple crossovers in the viral genome are most consistent with the copy choice model of recombination and have been observed in other documented examples of HIV-1 intersubtype recombination. Thai and CAR "subtype E" viruses exhibited the same pattern of A/E mosaicism, indicating that the recombination event occurred in Africa prior to the spread of virus to Asia. Finally, all "subtype E" viruses were found to contain a distinctive two-nucleotide bulge in their transactivation response (TAR) elements. This feature was present only in viruses which also contained a subtype A 5' pol region (i.e., subtype A viruses or A/D and A/E recombinants), raising the possibility of a functional linkage between the TAR region and the polymerase. The implications of epidemic spread of a recombinant HIV-1 strain to viral natural history and vaccine development are discussed.