The development of an effective human immunodeficiency virus type 1 (HIV-1) vaccine is likely to depend on knowledge of circulating variants of genes other than the commonly sequenced gag and env genes. In addition, full-genome data are particularly limited for HIV-1 subtype C, currently the most commonly transmitted subtype in India and worldwide. Likewise, little is known about sequence variation of HIV-1 in India, the country facing the largest burden of HIV worldwide. Therefore, the objective of this study was to clone and characterize the complete genome of HIV-1 from seroconverters infected with subtype C variants in India. Cocultured HIV-1 isolates were obtained from six seroincident individuals from Pune, India, and virtually full-length HIV-1 genomes were amplified, cloned, and sequenced from each. Sequence analysis revealed that five of the six genomes were of subtype C, while one was a mosaic of subtypes A and C, with multiple breakpoints in env, nef, and the 3' long terminal repeat as determined by both maximal chi2 analysis and phylogenetic bootstrapping. Sequences were compared for preservation of known cytotoxic T lymphocyte (CTL) epitopes. Compared with those of the HIV-1LAI sequence, 38% of well-defined CTL epitopes were identical. The proportion of nonconservative substitutions for Env, at 61%, was higher (P < 0.001) than those for Gag (24%), Pol (18%), and Nef (32%). Therefore, characterized CTL epitopes demonstrated substantial differences from subtype B laboratory strains, which were most pronounced in Env. Because these clones were obtained from Indian seroconverters, they are likely to facilitate vaccine-related efforts in India by providing potential antigens for vaccine candidates as well as for assays of vaccine responsiveness.