Testing ExtSym

ExtSym Home

Summary: The program ExtSym is tested against published data, where for each dataset the most probable extinction symbol as calculated by ExtSym is compared with the published extinction symbol. Many of the results presented here are also published in Ref [23].

This web page is structured as follows:

Acknowledgement

I am very grateful to those who have sent me data to test the ExtSym program. Please see the references to these excellent works at the bottom of this page. These works cover lab-X-ray, time-of-flight neutron, constant wavelength neutron and synchrotron X-ray data collected on a broad spectrum of compounds.

How the tests are presented on this web page

The test results are displayed in tables, see for instance Table 1, with one ExtSym test per row. The columns of these tables contain:

Some of the numbers in each row (that is each ExtSym test) link various pictures and output/input files related to that ExtSym test. In addition where permission has been given the raw data can be downloaded by either right clicking (and choose save as from the right click menu) on the number in the "Refs" column or for the datasets published in Ref. [3] these can be downloaded from www.powderdata.info. These links may for instance be used by anyone interested in double checking the test results presented on this web page.

Finally, light grey coloured rows show tests for which the top ranked extinction symbol did not match the published symbol. These tests are discussed further in the section entitled "Datasets for which the published symbol does not equal the most probable symbol".

Results from using output from DASH

For some information about the program DASH click here.

How the parameter A was determined for DASH

It was found by Kenneth Shankland / Bill David in collaboration with CCDC that the Pawley implementation used in DASH worked well with ExtSym with A=3. This is also confirmed from the results in Table 1.

Notice that on 25/05/2005 the following bug was found in the script that prepares the DASH output for the ExtSym program. For versions prior to 25/5/2005 whenever GOF is larger than 10 this script mistakenly reassign it to one! Which, for reasons that can be understood by reading this, renders the corresponding output from ExtSym useless.

Some results

Prior to the Pawley refinements performed to populate Table 1, the background was first subtracted using the background removing algorithm in DASH with the following settings: 'the Monte Carlo' tick box not ticked and with ' NoI=20,Window=40'. Many of the Pawley refinements used for the tests in this table were done by Kenneth Shankland.

Table 1:

Dataset Ref. GOF 3*GOF Best ExS Publ ExS
Lab_C7H8ClN3O4S2
[3]
1.542
4.626
P 1 21 1
P 1 21 1
Lab_C7H6O2S
[3]
1.890
5.670
P 1 21/c 1
P 1 21/c 1
Lab_C12H10N4
[3]
1.947
5.841
P 1 21/c 1
P 1 21/c 1
Lab_C15H12N2O
[3]
1.782
5.346
P 1 21/n 1
P 1 21/n 1
Lab_C12H12N2O2S
[3]
1.908
5.722
P 21 21 21
P 21 21 21
Lab_C8H8F3N3O4S2
[3]
2.718
8.154
P 1 21 1
P 1 21 1
Lab_C8H9NO2 (form I)
[3]
2.672
8.017
P 1 21/n 1
P 1 21/n 1
Lab_C8H9NO2 (form II)
[3]
1.668
5.005
P b c a
P b c a
Lab_C36H30Cl2NiP2
[3]
1.824
5.471
P 1 c 1
P 1 c 1
Lab_C9H11N2O2S Cl
[3]
1.519
4.557
P 1 21/a 1
P 1 21/a 1
Lab_C8H12NO2 Br
[3]
5.123
15.370
P b c -
P b c -
Lab_C18H17NO2
[3]
2.745
8.236
P n a b
P n a -
Lab_C15H12N2O 2H2O
[3]
3.445
10.334
C - c (ab)
C - c (ab)
Lab_C12H9N2O4S Na 2H2O
[3]
3.760
11.278
P b c n
P b c n
BM16_C7-H11-N5-O2
[4]
1.513
4.538
P 1 21/n 1
P 1 21/n 1
BM16_C6-H6-N6-O4_H2O
[4]
3.821
11.462
P 1 21/n 1
P 1 21/n 1
Daresbury_zeolite_NU-3_QUIN
[5]
2.861
8.583
R - - -
R - - -
SNBL_zeolite_VPI-9
[6]
1.518
4.554
P n c c
P 41 21 -

 

Results from using output from Topas

For information about how to transform Topas output into ExtSym input, see here.

Tom Griffin provided a C# batch script for automating the testing of ExtSym on many datasets. This script was used on the 53 datasets listed in Table 2 to find an optimised value for the 'rescale data error' parameter in advanced.asc. This parameter assumes the form A * GOF (see here), where GOF is the Goodness Of Fit and the prefactor A is the parameter to be determined. Figure 1 plots the parameter A as a function of how many of the 53 datasets result in the "Best ExS" (most probable symbol) being not equal to the "Publ ExS" (published symbol) . The lower this number the better ExtSym is said to have performed. From Figure 1 it is seen that the best region for the parameter A is 0.8 to 1.5 with 0.9 being the best value. In an ideal world A should be close to 1, which is satisfied for the Topas outputs studied in Figure 1.


Figure 1: Shows the performance of ExtSym as a function of the parameter A as described in the text. For each value of A, the y-axis shows the number of datasets for which the "Best ExS" is different from the "Publ ExS" in Table 2.

 

Comment on one specific dataset for which the Topas output is not quite understood

For this dataset (unpublished data, Richard Ibberson) the correlation matrix returned by Topas is not quite right (or [less likely I believe] the script for translating the topas output into a .hkl somehow scrabble the Topas correlation matrix). The Topas .inp file used to fit these data, the .hkl file produced from this .inp file and the ExtSym output can be downloaded from here, here and here. Also, the same data was Pawley fitted with the TF12LS program (Richard), and the .hkl and ExtSym output can be downloaded from here and here. The ExtSym output using the TF12LS .hkl file is plausible but the ExtSym output from the Topas .hkl file is definitely not reasonable. If someone manage to fix this problem please let me know. Notice for all the other data tested in Table 2 no such problem was observed.

Some results

Table 2:

Dataset Ref. GOF 0.9*GOF Best ExS Publ ExS
Lab_C7H8ClN3O4S2
[3]
1.637 1.473 P 1 21 1 P 1 21 1
Lab_C7H6O2S
[3]
2.039 1.835 P 1 21/c 1 P 1 21/c 1
Lab_C12H10N4
[3]
2.33 2.097 P 1 21/c 1 P 1 21/c 1
Lab_C15H12N2O
[3]
2.209 1.988 P 1 21/n 1 P 1 21/n 1
Lab_C12H12N2O2S
[3]
1.964 1.768 P 21 21 21 P 21 21 21
Lab_C8H8F3N3O4S2
[3]
2.178 1.960 P 1 21 1 P 1 21 1
Lab_C8H9NO2 (form I)
[3]
2.158 1.942 P 1 21/n 1 P 1 21/n 1
Lab_C8H9NO2 (form II)
[3]
1.741 1.567 P b c a P b c a
Lab_C8H8O2
[3]
3.123 2.811 P 1 21/a 1 P 1 21/a 1
Lab_C8H9NO3S
[3]
2.238 2.014 P 1 21/c 1 P 1 21/c 1
Lab_C9H15NO3S
[3]
2.002 1.809 P 21 21 21 P 21 21 21
Lab_C16H13NO2
[3]
1.649 1.484 P 1 21 1 P 1 21 1
Lab_C36H30Cl2NiP2
[3]
2.081 1.873 P 1 c 1 P 1 c 1
Lab_C9H11N2O2S Cl
[3]
1.369 1.232 P 1 21/a 1 P 1 21/a 1
Lab_C13H21NO3
[3]
1.768 1.591 P b c a P b c a
Lab_C8H12NO2 Br
[3]
2.154 1.939 P b c - P b c -
Lab_C23H29N3O2S2
[3]
1.979 1.781 P 1 21 1 P 1 21 1
Lab_C10H13ClN2O3S
[3]
3.23 2.907 P 21 21 21 P 21 21 21
Lab_C4H9N3O2 H2O
[3]
2.22 1.998 P 1 21/c 1 P 1 21/c 1
Lab_C24H34N2O6S2
[3]
1.779 1.601 P 1 21/a 1 P 1 21/a 1
Lab_C19H24ClN2 Cl
[3]
2.053 1.848 P 1 21/c 1 P 1 21/c 1
Lab_C12H22O11 H2O
[3]
1.643 1.479 P 1 21 1 P 1 21 1
Lab_C17H21N2S Cl
[3]
1.606 1.445 P 1 21/c 1 P 1 21/c 1
Lab_C12H18N2O3S
[3]
3.237 2.913 P n a - P n a -
Lab_C15H12N2O 2H2O
[3]
3.362 3.026 C - c (ab) C - c (ab)
Lab_C8H15N7O2S3
[3]
1.611 1.450 P 1 21/c 1 P 1 21/c 1
Lab_C22H27N2O4S Cl
[3]
1.545 1.391 P 21 21 21 P 21 21 21
Lab_C17H17ClN6O3 2H2O
[3]
1.839 1.655 P 1 21/c 1 P 1 21/c 1
Lab_C18H27NO3
[3]
3.124 2.812 P 1 21/c 1 P 1 21/c 1
Lab_C12H9N2O4S Na 2H2O
[3]
1.897 1.707 P b c n P b c n
Lab_C20H24O5S
[3]
2.028 1.825 P 1 21/n 1 P 1 21/n 1
Lab_C13H18O2
[3]
2.318 2.086 P 1 21 1 P 1 21 1
hasylab_zeolite_EMC-2
[18]
1.877 1.689 P - - c P - - c
NSLS_zeolite_SGT
[19]
1.419 1.277 I a - d I a - d
Daresbury_zeolite_VPI-5
[20]
2.131 1.918 P - c - P 63 - -
BM16_C7-H11-N5-O2
[4]
1.943 1.749 P 1 21/n 1 P 1 21/n 1
BM16_C6-H6-N6-O4_H2O
[4]
3.167 2.850 P 1 21/n 1 P 1 21/n 1
Daresbury_C6-H6_C6-F6_30K
[8]
5.104 4.594 P 1 21/a 1 P 1 21/a 1
ILL_C-Br-F3_2K
[9]
5.378 4.840 P 1 21/a 1 P 1 21/a 1
ILL_C-F3-I_130K
[10]
2.414 2.173 C - c (ab) C - c (ab)
ILL_K-U-O2-PO4_3D2O
[11]
3.309 2.978 P n c c P n c c
Daresbury_K-U-O2-PO4_3D2O
[11]
1.724 1.552 P n c c P n c c
Daresbury_Na-CD3
[12]
1.619 1.457 I - - - I - - -
Daresbury_Nb3-2NbO-7PO4
[13]
1.915 1.724 C 1 c 1 C 1 c 1
Daresbury_0_5Ni-Zr2-3PO4
[14]
1.622 1.460 P 1 21/n 1 P 1 21/n 1
Daresbury_Sb2-3PO4
[15]
2.826 2.543 P 1 21/n 1 P 1 21/n 1
ILL_D2_S_2K
[16]
1.772 1.595 P b c - P b c -
ILL_C-F2-Cl2_110K
[17]
3.986 3.587 F d d - F d d -
HRPD_2CH3O_SO2_5K
[21]
2.08 1.872 I 1 a 1 I 1 a 1
BM16_C10-H16-O_100K
[7]
12.77 11.493 P 21 21 21 P 21 21 21
Lab_C18H17NO2
[3]
1.176 1.058 P n a b P n a -
SNBL_zeolite_VPI-9
[6]
1.593 1.434 P - c - P 41 21 -
HRPD_2CH3O_SO2_225K
[21]
1.166 1.049 F d d - F d d -
Daresbury_zeolite_NU-3_QUIN
[5]
2.42 2.18
R - - -
R - - -

 

Results from using output from PRODD

For some information about the program PRODD click here.

Some results

GOF is set equal to the square root of the Chi^2 value printed to screen by the PRODD program.

Only 3 datasets have been tested with PRODD so far and no attempt was made to find a best value of the 'A' parameter; the first choice simply happened to be A=2. The Pawley refinements used for the ExtSym tests in table 3 were all provided by Richard Ibberson and the input (.ccl) PRODD files can be downloaded by clicking on the dataset names in the first column.

Table 3:

Dataset Ref. GOF 2*GOF Best ExS Publ ExS
HRPD_2CH3O_SO2_5K
[21]
1.3583 2.716 I 1 a 1 I 1 a 1
HRPD_2CH3O_SO2_225K
[21]
2.418 4.836 F d d - F d d -
HRPD_CD3_2_SO2
[22]
1.749
3.498
C - c -
C - c -

Results from using output from FullProf

For info about the program FullProf and the script used to generate the result in the table below click here.

Some results

Only 1 dataset has been tested with FullProf and no attempt has been made to find a best value of the 'A' parameter; the first choice simply happened to be A=2. The FullProf input parameter file (.pcr) used for each test can be downloaded by clicking on the dataset name in the first column.

Table 4:

Dataset Ref. GOF 2*GOF Best ExS Publ ExS
Lab_C7H8ClN3O4S2
[3]
2.598 5.195 P 1 21 1 P 1 21 1

Results from using output from GSAS

For info about the program GSAS and the script used to generate the result in the table below click here.

Some results

Only 1 dataset has been tested with GSAS output and no attempt has been made to find a best value of the 'A' parameter; the first choice was simply to use A≈2. Note the ExtSym output (downloadable by clicking on the link in the Best ExS column) appears to give the extinction symbols "I 1 - 1" and "I 1 a 1" the same probability.

Table 4:

Dataset Ref. GOF ~2*GOF Best ExS Publ ExS
HRPD_2CH3O_SO2_5K
[21]
2.42 5.0 I 1 - 1 / I 1 a 1 I 1 a 1

 

Datasets for which the published symbol does not equal the most probable symbol

Dataset
Explanation
Lab_C18H17NO2 "P n a -" and "P n a b" have exactly the same systematic absences, except that "P n a b" also predicts that all [h k 0] reflections with h≠0 and k odd are systematically absent. Figure 2 shows a Pawley fit of this dataset in space group "P n a 21"; "P n a 21" has extinction symbol "P n a -". The three reflections highlighted below the diffraction pattern in this figure are predicted to be absent in "P n a b" but present in "P n a -". Figures 3 (a-c) show blown up pictures of the pattern around each of these Bragg positions. As can be seen there is basically zero evidence in the data that these reflections are present and hence the reason for ExtSym to prefer "P n a b" to "P n a -". From email discussions with John Evens, it just so happens that for this system the crystal structure is such that all [h k 0] reflections with h≠0 and k odd are nearly zero.
SNBL_zeolite_VPI-9 An example where the naked eye do better than ExtSym. The program thinks the best extinction symbol is 'P - c -' whereas the correct space group is 'P 42 21 2'. A space group that has extinction group 'P - c -' is for instance 'P 42 c m'. For Pawley refinements in space groups 'P 4 2 2', 'P 42 41 2' and 'P 42 c m' see here, here and here. Although the discrepancy is not large it is clear that by studying the region around 2theta=6 that the space group 'P 42 21 2' provides a better choice than 'P 42 c m'. This highlight the importance of considering the output of ExtSym as a guide and not as a final answer.
Daresbury_zeolite_VPI-5 The published extinction symbol "P 63 - -" is ranked second and "P - c -" ranked first. The latter symbol predicts all [h -h l], [0 h l], [-h 0 l] with l odd to be absence, and looking at the pattern, or the .hkl file (or .inp file) that can be downloaded with this test, there seem to be very little evidence of the [0 h 1] reflections with h=1,2,3,4 (did not check further up the pattern), hence this must be another example where these reflections are present but just very tiny.

 


Figure 2: Topas fit of the C18H17NO2 dataset in space group Pna21.


Figure 3(a): Zoomed in region of Figure 2 around the [1 1 0] Bragg position.


Figure 3(b): Zoomed in region of Figure 2 around the [2 1 0] Bragg position.


Figure 3(c): Zoomed in region of Figure 2 around the [3 1 0] Bragg position.

History or this document

References

  1. A. J. Markvardsen, W. I. F. David, J. Johnston and K. Shankland, Acta Cryst. A57, 47 (2001)
  2. David, W.I.F., Ibberson, R.M. and Matthewman, J.C. (1992). Rutherford Appleton Laboratory Report: RAL-92-032
  3. A. J. Florence, N. Shankland, K. Shankland, W. I. F. David, E. Pidcock, X. Xu, A. Johnston, A. R. Kennedy, P. J. Cox, J. S. O. Evans, G. Steele, S. D. Cosgrove and C. S. Frampton, J. Appl. Cryst. 38, 249 (2005)
  4. V. V. Chernyshev, A. N. Fitch, E. J. Sonneveld, A. I. Kurbakov, V. A. Makarov and V. A. Tafeenko, Acta Cryst. B55, 554 (1999)
  5. McCusker, L. B. (1993). Mater. Sci. Forum, 133–136, 423–434.
  6. McCusker, L. B., Grosse-Kunstleve, R. W., Baerlocher, C., Yoshikawa, M. & Davis, M. E. (1996). Microporous Mater. 6, 295–309.
  7. M. Brunelli, A. N. Fitch, A. J. Mora, J. Solid State Chem. 163, 253-258 (2002)
  8. Williams, J. H., Cockcroft, J. K. & Fitch, A. N. (1992). Angew. Chem.
    Int. Ed. Engl. 31, 1655–1657.
  9. Jouanneaux, A., Fitch, A. N. & Cockcroft, J. K. (1992). Mol. Phys. 77, 45–50.
  10. Clarke, S., Cockcroft, J. K. & Fitch, A. N. (1993). Z. Kristallogr. 206,
    87–95.
  11. Fitch, A. N. & Cole, M. (1991). Mater. Res. Bull. 26, 407–414.
  12. Weiss, E., Corbelin, S., Cockcroft, J. K. & Fitch, A. N. (1990). Chem. Ber. 123, 1629–1634.
  13. Zah-Letho, J. J., Jouanneaux, A., Fitch, A. N., Verbaere, A. &
    Tournoux, M. (1992). Eur. J. Solid State Inorg. Chem. 29, 1309–
    1320.
  14. Jouanneaux, A., Verbaere, A., Piffard, Y., Fitch, A. N. & Kinoshita,
    M. (1991). Eur. J. Solid State Inorg. Chem. 28, 683–699.
  15. jouanneaux, A., Verbaere, A., Guyomard, D., Piffard, Y., Oyetola, S.
    & Fitch, A. N. (1991). Eur. J. Solid State Inorg. Chem. 28, 755–765.
  16. Cockcroft, J. K. & Fitch, A. N. (1990). Z. Kristallogr. 193, 1–19.
  17. Cockcroft, J. K. & Fitch, A. N. (1991). Z. Kristallogr. 197, 121–130.
  18. Baerlocher, C., McCusker, L. B. & Chiappetta, R. (1994). Microporous Mater. 2, 269–280.
  19. McCusker, L. (1988). J. Appl. Cryst. 21, 305–310.
  20. McCusker, L. B., Baerlocher, Ch., Jahn, E. & Buelow, M. (1991).
    Zeolites, pp. 308–313.
  21. Ibberson, R. M., M. T. F. Telling, et al. (2006). Acta Cryst. B62: 280-286.
  22. Ibberson, R. M. Acta Cryst (2007) C63. o292-o294.
  23. A. J. Markvardsen, K. Shankland, W. I. F. David, J. Johnston, R. M. Ibberson, M. Tucker, H. Nowell and T. Griffin, J. Appl. Cryst. 41, 1177 (2008).