Linkage Newsletter

Vol. 6 No. 2 July 1992

Published by Jürg Ott, Columbia University, New York.


Editorial Assistant: Katherine Montague
Fax: +1-212-568-2750
Tel. 212-960-2507
e-mail: OTT@NYSPI.BITNET
Postal address:
Columbia University, Unit 58
722 West 168th Street, New York, NY 10032


EDITORIAL

This summer issue of the newsletter is rather short. It is again being distributed via e- mail (to the GENETIC-LINKAGE newsgroup of the BIOSCI network and to individual subscribers) as well as by postal mail. Hope everyone has a nice summer!


LINKAGE COURSES

The following linkage courses are scheduled: Zurich (Advanced course): October 19-23, 1992, in the Computer Center at the University of Zurich, Irchel campus (see full-page course announcement attached). New York (Advanced course): January 11-15, 1993, in the microcomputer classroom of the Health Sciences library, Columbia University. Course fee $100 (supported by National Center for Human Genome Research; only a small number of participants from outside the U.S. can be admitted). Registration is open; topics covered are as in the Zurich course. A formal course announcement will be made later. In the spring of 1993, there will also be introductory courses in New York and Zurich but dates have not yet been fixed. For information and application forms for any course, please write (preferably by fax) to Katherine Montague, course coordinator, at the address given above.


SOFTWARE NOTES

Bug in Vax version of MLINK?

Joseph Terwilliger recently noticed the following discrepancy between the Vax version and the PC version of the MLINK program. Consider two parents and their child who are typed for two 2-allelic markers. Both parents are heterozygous, 1/2, at each locus and the child is homozygous, 1/1, at each locus. There is complete linkage disequilibrium, P(1-1) = P(2-2) = 0.5, and P(1-2) = P(2-1) = 0. Without allelic association, of course, this family is uninformative for linkage. Under complete disequilibrium, however, given the offspring's phenotype, parental phases are known and lead to two nonrecombinations. Thus, the lod score is given by Z(é) = log[4(1-é)2], which has a maximum of 0.60 at é=0. This is also the result furnished by the PC version of MLINK, but the Vax version reports zero lod scores at any é. We are investigating this matter and will report results as soon as possible.


Undeclared loops

In the LINKAGE programs, when loops (marriage or consanguinity) are present in a pedigree, this must be specified in the pedigree file (the safest method is to use the MAKEPED program). If a loop remains undeclared, depending on the type of loop, the analysis programs may terminate with an error or, worse, they may appear to terminate normally but give incorrect results. Xiaoli Xie has now written a program to detect loops (Xie X, Ott J [1992] Am J Hum Genet, abstr, in press). It is best run after MAKEPED in which case it catches any loops the user failed to declare. This LOOP program is based on the depth-first search algorithm in graph theory and is freely available.


Simulating under heterogeneity in SLINK

Both the SIMLINK (M. Boehnke) and SLINK (D. Weeks) programs allow for simulating pedigree data under heterogeneity, that is, with a given proportion of families simulated without linkage between trait locus and marker loci. In the analysis, however, there is a major difference between the two programs: The SIMLINK program analyses the data under heterogeneity while the analysis programs of the SLINK package do not. Some users have previously been unaware of this and have analyzed the data under homogeneity even though they had been generated by SLINK under heterogeneity (the resulting expected lod scores are too small). The program ELODHET was developed to allow analysis under heterogeneity for data generated by the SLINK program. It is part of the current package of SLINK.


Change in the LINKLODS program

The LINKLODS program, which comes with the PC version of LINKAGE, calculates lod scores for individual families from the output of MLINK or LINKMAP. Dr. Chantal M‚rette recently pointed out that occasionally the LINKLODS program gives incorrect "total" lod scores. This occurred when a large number of families was analyzed and the total log likelihood was smaller than the program constant lowlod, which previously was set to -500. This constant has now been changed to a value -10000 so that the error should no longer occur. The current program version is 1.70 (6 July 1992). Linkage analysis with highly polymorphic markers

Large numbers of alleles can pose problems in linkage analysis. Various exact and approximate ways of overcoming these problems have been proposed. Before discussing an overview of these possibilities, I would like to ask the readers if anyone has experience with the URP program (Michael S. Braverman: "An algorithm to improve the computational efficiency of genetic linkage analysis," Comp Biomed Res 18, 24-36, 1985). Please let me know Ä I'm sure many researchers will be interested in this topic.


CORRECTIONS IN ANALYSIS OF HUMAN GENETIC LINKAGE

Below, the currently known corrections to this book (J. Ott, 1991, Johns Hopkins University Press, Baltimore) are listed.

       Page 14, line 4 up:  Assumption (2) is sufficient for that
statement;  (2) implies (1).

       Page 18, line 8:  Replace (1.3) by (1.2).

       Page 38, Problem 2.2:  Replace 200 cM by 100 cM.

       Page 44, line 8 below table 3.1 should read:  "Generally,
for phase known data, if T=k/n is the value of...".  Also, line 12
should read:  "Since T is unbiased, ..."

       Page 47, lines 5-8:  These two sentences are clearer when
worded as follows:  "Consider now our previous hypothetical example
of one recombinant and four nonrecombinants and test H0:é=« against
H1:é=0.1.  For these data, the likelihood ratio is calculated as
Tobs = [0.1*(0.9)^4]/(0.5)^5."

       Page 48, line 16:  Replace A÷(1-á) by A÷(1-á)/à.

       Page 59, line 3 from the bottom: ..., P(e) = 1/22, ...

       Page 60, lines 6 and 7 should read:  The ith segment (i =
1..s), of length bi, then contains the likelihood ratio, L*(éi),
where b1 = «(é2+é1), bi = «(éi+1+éi)-«(éi+éi-1) = «(éi+1-éi-1), bs
= 0.5 - «(és+és-1); äbi = 0.5.
       Line 17 should read:  52.672, resulting in a value of 0.71
for Smith's (1959) posterior...
       Table 4.1:  The values of bi for i=1 (now 0.025) and i=2
(now 0.050) should be 0.030 and 0.045, respectively.  This way,
they are consistent with the definition of the bi's further up on
page 60.

       Page 34, lines 17 and 18 up are clearer when formulated as
follows: "... often used before linkage analysis as a preliminary
test of paternity."

       Page 45, lines 12 and 13 should be phrased more exactly as
follows: "..., which allows the calculation of approximate
confidence intervals from asymptotic variances... ."

       Page 68, last line before section 4.5:  Replace 11.7 by
11.6.

       Page 74, line 3:  Replace Z() and Z(f) by Z1(m) and Z2(f).

       Page 75, line 5:  Replace (1-à1)n by (1-à1)g.

       Page 92, line 3:  Replace A1 by A2.

       Page 93, table 5.3, line i=4:  Replace AB-22 by AB-11.

       Page 101, after equation (5.15):  Replace 1/[n x i(r)] by
1/[n x i(r)]«.

       Page 101, line 6 in section 5.9 should read: "type 1 is a
recombinant under one of the parental phases (phase I, say) but a
nonrecombinant under the other, ..."

       Page 117, lines 21-23:  The last sentence in this paragraph
should read:  The second child has genotype 121/222 or 122/221,
each of which requires at least one recombination in the father or
the mother.

       Page 137, first line, should read: ...between the loci C and
D.

       Page 139, Table 6.10, line R:  Replace "444éBC" by "444éAB".

       Page 148, line 11:  Replace fdd by fDD.

       Page 149, table 7.1, line d1/d1:  replace « by «r for P(g;r)
(as on the line above it). 
       Page 216, Problem 9.2, line 2:  Replace "table 9.6" by
"table 9.7".

       Page 250:  The last sentence of the top paragraph contains
a typo: -2 should be 2, and Z(à,x) was not defined.  For better
clarity, the last two sentences in that paragraph should read:  "In
practice, this means that one evaluates Z(,x) at each map position,
x, where Z(à,x) is analogous to (9.9) with é1 replaced by x, and 
is determined by the maximum of Z(à,x) at the given x value.  Only
those points x are then excluded for which Z(,x)<2 and Z(x)<-2,
where Z(x) is the lod score under homogeneity."

       Page 268, Solution 9.2, line 2:  Replace "table 9.6" by
"table 9.7".

       Page 270, line 1:  Replace 1/3 by 2/3.  Line 3:  Replace
"with that mutation" by "without that mutation".

       Page 279, ref. Hall et al. (1990):  Replace "Anserson" by
"Anderson".

       Page 294, line 2 up should read: "...tetraploid..."

       Page 302, Support interval:  Replace 110 by 55.