[smlnj] Diff of /sml/trunk/HISTORY

revision 643 (Fri May 12 08:29:15 2000 UTC) to revision 1127 (Fri Mar 8 01:35:33 2002 UTC)
# Line 8
8  The form of an entry should be:
9
10  Name:
11  Date: yyyy/mm/dd
12  Tag: <post-commit CVS tag>
13  Description:
14    
15    ----------------------------------------------------------------------
16    Name: Allen Leung
17    Date: 2002/03/07 20:45:00 EST
18    Tag: leunga-20020307-x86-cmov
19    Description:
20    
21       Bug fixes for CMOVcc on x86.
22    
23       1. Added machine code generation for CMOVcc
24       2. CMOVcc is now generated in preference to SETcc on PentiumPro or above.
25       3. CMOVcc cannot have an immediate operand as argument.
26    
27    ----------------------------------------------------------------------
28    Name: Matthias Blume
29    Date: 2002/03/07 16:15:00 EST
30    Tag: blume-20020307-controls
31    Description:
32    
33    This is a very large but mostly boring patch which makes (almost)
34    every tuneable compiler knob (i.e., pretty much everything under
35    Control.* plus a few other things) configurable via both the command
36    line and environment variables in the style CM did its configuration
37    until now.
38    
39    Try starting sml with '-h' (or, if you are brave, '-H').
40    
41    To this end, I added a structure Controls : CONTROLS to smlnj-lib.cm which
42    implements the underlying generic mechanism.
43    
44    The interface to some of the existing such facilities has changed somewhat.
45    For example, the MLRiscControl module now provides mkFoo instead of getFoo.
46    (The getFoo interface is still there for backward-compatibility, but its
47    use is deprecated.)
48    
49    The ml-build script passes -Cxxx=yyy command-line arguments through so
50    that one can now twiddle the compiler settings when using this "batch"
51    compiler.
52    
53    TODO items:
54    
55    We should go through and throw out all controls that are no longer
56    connected to anything.  Moreover, we should go through and provide
57    meaningful (and correct!) documentation strings for those controls
58    that still are connected.
59    
60    Currently, multiple calls to Controls.new are accepted (only the first
61    has any effect).  Eventually we should make sure that every control
62    is being made (via Controls.new) exactly once.  Future access can then
63    be done using Controls.acc.
64    
65    Finally, it would probably be a good idea to use the getter-setter
66    interface to controls rather than ref cells.  For the time being, both
67    styles are provided by the Controls module, but getter-setter pairs are
68    better if thread-safety is of any concern because they can be wrapped.
69    
70    *****************************************
71    
72    One bug fix: The function blockPlacement in three of the MLRISC
73    backpatch files used to be hard-wired to one of two possibilities at
74    link time (according to the value of the placementFlag).  But (I
75    think) it should rather sense the flag every time.
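
The difference between the two behaviours can be sketched as below. Only `placementFlag` and `blockPlacement` are names from the entry above; the two placement functions and `blockPlacementFixed` are hypothetical stand-ins, and the real MLRISC code is structured differently.

```sml
val placementFlag = ref false
fun defaultPlacement () = "default"     (* hypothetical strategy *)
fun weightedPlacement () = "weighted"   (* hypothetical strategy *)

(* Old behaviour: the flag is read once, when this binding is evaluated. *)
val blockPlacementFixed : unit -> string =
    if !placementFlag then weightedPlacement else defaultPlacement

(* New behaviour: the flag is read on every call. *)
fun blockPlacement () =
    if !placementFlag then weightedPlacement () else defaultPlacement ()
```

Setting `placementFlag := true` afterwards changes what `blockPlacement` does but has no effect on `blockPlacementFixed`.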
76    
77    *****************************************
78    
79    Other assorted changes (by other people who did not supply a HISTORY entry):
80    
81    1. the cross-module inliner now works much better (Monnier)
82    2. representation of weights, frequencies, and probabilities in MLRISC
83       changed in preparation for using those for weighted block placement
84       (Reppy, George)
85    
86    ----------------------------------------------------------------------
87    Name: Lal George
88    Date: 2002/03/07 14:44:24 EST 2002
89    Tag: george-20020307-weighted-block-placement
90    
91    Tested the weighted block placement optimization on all architectures
92    (except the hppa) using AMPL to generate the block and edge frequencies.
93    Changes were required in the machine properties to correctly
94    categorize trap instructions. There is an MLRISC flag
95    "weighted-block-placement" that can be used to enable weighted block
96    placement, but this will be ineffective without block/edge
97    frequencies (coming soon).
98    
99    
100    ----------------------------------------------------------------------
101    Name: Lal George
102    Date: 2002/03/05 17:24:48 EST
103    Tag: george-20020305-linkage-cluster
104    
105    In order to support the block placement optimization, a new cluster
106    is generated as the very first cluster (called the linkage cluster).
107    It contains a single jump to the 'real' entry point for the compilation
108    unit. Block placement has no effect on the linkage cluster itself, but
109    all the other clusters have full freedom in the manner in which they
110    reorder blocks or functions.
111    
112    On the x86 the typical linkage code that is generated is:
113       ----------------------
114            .align 2
115       L0:
116            addl    $L1-L0, 72(%esp)
117            jmp     L1
118    
119    
120            .align  2
121       L1:
122       ----------------------
123    
124    72(%esp) is the memory location for the stdlink register. This
125    must contain the address of the CPS function being called. In the
126    above example, it contains the address of L0; before
127    calling L1 (the real entry point for the compilation unit), it
128    must contain the address of L1, hence the instruction
129    
130            addl $L1-L0, 72(%esp)
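
The arithmetic behind that instruction, worked through with made-up addresses (the real values are fixed by the assembler and linker; only the difference L1-L0 matters):

```sml
(* Illustrative addresses only. *)
val addrL0 : word = 0wx1000
val addrL1 : word = 0wx1008

(* On entry the caller has set stdlink to the address it jumped to, L0. *)
val stdlinkOnEntry = addrL0

(* Effect of "addl $L1-L0, 72(%esp)": stdlink now holds the address of L1. *)
val stdlinkAtL1 = stdlinkOnEntry + (addrL1 - addrL0)
```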
131    
132    I have tested this on all architectures except the hppa. The increase
133    in code size is of course negligible.
134    
135    ----------------------------------------------------------------------
136    Name: Allen Leung
137    Date: 2002/03/03 13:20:00 EST
138    Tag: leunga-20020303-mlrisc-tools
139    
140      Added #[ ... ] expressions to mlrisc tools
141    
142    ----------------------------------------------------------------------
143    Name: Matthias Blume
144    Date: 2002/02/27 12:29:00 EST
145    Tag: blume-20020227-cdebug
146    Description:
147    
148    - made the types in structures C and C_Debug equal
149    - got rid of code duplication (c-int.sml vs. c-int-debug.sml)
150    - there no longer is a C_Int_Debug (C_Debug is directly derived from C)
151    
152    ----------------------------------------------------------------------
153    Name: Matthias Blume
154    Date: 2002/02/26 12:00:00 EST
155    Tag: blume-20020226-ffi
156    Description:
157    
158    1. Fixed a minor bug in CM's "noweb" tool:
159       If numbering is turned off, then truly don't number (i.e., do not
160       supply the -L option to noweb).  The previous behavior was to supply
161       -L'' -- which caused noweb to use the "default" line numbering scheme.
162       Thanks to Chris Richards for pointing this out (and supplying the fix).
163    
164    2. Once again, I reworked some aspects of the FFI:
165    
166       A. The incomplete/complete type business:
167    
168       - Signatures POINTER_TO_INCOMPLETE_TYPE and accompanying functors are
169         gone!
170       - ML types representing an incomplete type are now *equal* to
171         ML types representing their corresponding complete types (just like
172         in C).  This is still safe because ml-nlffigen will not generate
173         RTTI for incomplete types, nor will it generate functions that
174         require access to such RTTI.   But when ML code generated from both
175         incomplete and complete versions of the C type meet, the ML types
176         are trivially interoperable.
177    
178         NOTE:  These changes restore the full generality of the translation
179         (which was previously lost when I eliminated functorization)!
180    
181       B. Enum types:
182    
183       - Structure C now has a type constructor "enum" that is similar to
184         how the "su" constructor works.  However, "enum" is not a phantom
185         type because each "T enum" has values (and is isomorphic to
186         MLRep.Signed.int).
187       - There are generic access operations for enum objects (using
188         MLRep.Signed.int).
189       - ml-nlffigen will generate a structure E_foo for each "enum foo".
190         * The structure contains the definition of type "mlrep" (the ML-side
191         representation type of the enum).  Normally, mlrep is the same
192         as "MLRep.Signed.int", but if ml-nlffigen was invoked with "-ec",
193         then mlrep will be defined as a datatype -- thus facilitating
194         pattern matching on mlrep values.
195         ("-ec" will be suppressed if there are duplicate values in an
196          enumeration.)
197         * Constructors ("-ec") or values (no "-ec") e_xxx of type mlrep
198         will be generated for each C enum constant xxx.
199         * Conversion functions m2i and i2m convert between mlrep and
200         MLRep.Signed.int.  (Without "-ec", these functions are identities.)
201         * Conversion functions c and ml convert between mlrep and "tag enum".
202         * Access functions (get/set) fetch and store mlrep values.
203       - By default (unless ml-nlffigen was invoked with "-nocollect"), unnamed
204         enumerations are merged into one single enumeration represented by
205         structure E_'.
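
To make the "-ec" case concrete, here is a hand-written sketch of what an mlrep datatype with its m2i/i2m conversions could look like for a C declaration such as `enum color { RED, GREEN = 5 }`. It is not actual ml-nlffigen output: plain `int` stands in for MLRep.Signed.int, and the behaviour of `i2m` on an unknown integer (raising Domain) is an assumption.

```sml
(* Hypothetical shape of the generated definitions. *)
datatype mlrep = e_RED | e_GREEN

fun m2i e_RED = 0
  | m2i e_GREEN = 5

fun i2m 0 = e_RED
  | i2m 5 = e_GREEN
  | i2m _ = raise Domain   (* assumption: no constructor for other values *)

(* With duplicate values i2m could not be a well-defined inverse of m2i,
   which is one way to see why "-ec" is suppressed in that case. *)
fun hasDuplicates ([] : int list) = false
  | hasDuplicates (x :: rest) =
      List.exists (fn y => y = x) rest orelse hasDuplicates rest
```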
206    
207    ----------------------------------------------------------------------
208    Name: Allen Leung
209    Date: 2002/02/25 04:45:00 EST
210    Tag: leunga-20020225-cps-spill
211    
212    This is a new implementation of the CPS spill phase.
213    The new phase is in the new file compiler/CodeGen/cpscompile/spill-new.sml.
214    In case of problems, replace it with the old file spill.sml.
215    
216    The current compiler runs into some serious performance problems when
217    constructing a large record.  This can happen when we try to compile a
218    structure with many items.  Even a very simple structure like the following
219    makes the compiler slow down.
220    
221        structure Foo = struct
222           val x_1 = 0w1 : Word32.word
223           val x_2 = 0w2 : Word32.word
224           val x_3 = 0w3 : Word32.word
225           ...
226           val x_N = 0wN : Word32.word
227        end
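
A small generator for test inputs of this shape, for anyone who wants to reproduce the measurements. The function name `genFoo` is made up here, and the generated text uses `Word32.word`, the Basis Library's name for the 32-bit word type.

```sml
(* Produce the source text of structure Foo with n value bindings. *)
fun genFoo (n : int) : string =
    let
        fun line i =
            let val s = Int.toString i
            in "   val x_" ^ s ^ " = 0w" ^ s ^ " : Word32.word\n" end
    in
        "structure Foo = struct\n"
        ^ String.concat (List.tabulate (n, fn i => line (i + 1)))
        ^ "end\n"
    end
```

For example, `print (genFoo 4000)` emits the largest case listed in the table that follows.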
228    
229    The following table shows the compile time, from N=1000 to N=4000,
230    with the old compiler:
231    
232    N
233    1000   CPS 100 spill                           0.04u  0.00s  0.00g
234           MLRISC ra                               0.06u  0.00s  0.05g
235              (spills = 0 reloads = 0)
236           TOTAL                                   0.63u  0.07s  0.21g
237    
238    1100   CPS 100 spill                           8.25u  0.32s  0.64g
239           MLRISC ra                               5.68u  0.59s  3.93g
240              (spills = 0 reloads = 0)
241           TOTAL                                   14.71u  0.99s  4.81g
242    
243    1500   CPS 100 spill                           58.55u  2.34s  1.74g
244           MLRISC ra                               5.54u  0.65s  3.91g
245              (spills = 543 reloads = 1082)
246           TOTAL                                   65.40u  3.13s  6.00g
247    
248    2000   CPS 100 spill                           126.69u  4.84s  3.08g
249           MLRISC ra                               0.80u  0.10s  0.55g
250              (spills = 42 reloads = 84)
251           TOTAL                                   129.42u  5.10s  4.13g
252    
253    3000   CPS 100 spill                           675.59u  19.03s  11.64g
254           MLRISC ra                               2.69u  0.27s  1.38g
255              (spills = 62 reloads = 124)
256           TOTAL                                   682.48u  19.61s  13.99g
257    
258    4000   CPS 100 spill                           2362.82u  56.28s  43.60g
259           MLRISC ra                               4.96u  0.27s  2.72g
260              (spills = 85 reloads = 170)
261           TOTAL                                   2375.26u  57.21s  48.00g
262    
263    As you can see, the old cps spill module suffers from some serious
264    performance problems.  But since I cannot decipher the old code fully,
265    instead of patching the problems up, I'm reimplementing it
266    with a different algorithm.  The new code is more modular,
267    smaller when compiled, and substantially faster
268    (O(n log n) time and O(n) space).  Timing of the new spill module:
269    
270    4000  CPS 100 spill                           0.02u  0.00s  0.00g
271          MLRISC ra                               0.25u  0.02s  0.15g
272             (spills=1 reloads=3)
273          TOTAL                                   7.74u  0.34s  1.62g
274    
275    Implementation details:
276    
277    As far as I can tell, the purpose of the CPS spill module is to make sure the
278    number of live variables at any program point (the bandwidth)
279    does not exceed a certain limit, which is determined by the
280    size of the spill area.
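
As a rough illustration of "bandwidth", the sketch below computes the maximum number of simultaneously live variables for straight-line code, where each variable is described by its definition point and last use. The `interval` type and both function names are assumptions made for this example; the real module works on CPS trees, not on a flat list.

```sml
(* A variable is live from its definition point through its last use. *)
type interval = { def : int, lastUse : int }

(* Number of variables live at program point p. *)
fun liveAt (ivs : interval list) (p : int) : int =
    List.length
        (List.filter
             (fn (iv : interval) => #def iv <= p andalso p <= #lastUse iv)
             ivs)

(* The maximum is always reached at some definition point. *)
fun bandwidth (ivs : interval list) : int =
    List.foldl (fn (iv : interval, m) => Int.max (m, liveAt ivs (#def iv)))
               0 ivs
```

Spilling is needed exactly when `bandwidth` exceeds the limit imposed by the spill area.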
281    
282    When the bandwidth is too large, we decrease the register pressure by
283    packing live variables into spill records.  How we achieve this is
284    completely different from what we did in the old code.
285    
286    First, there is something about the MLRiscGen code generator
287    that we should be aware of:
288    
289    o MLRiscGen performs code motion!
290    
291       In particular, it will move floating point computations and
292       address computations involving only the heap pointer to
293       their use sites (if there is only a single use).
294       What this means is that if we have a CPS record construction
295       statement
296    
297           RECORD(k,vl,w,e)
298    
299       we should never count the new record address w as live if w
300       has only one use (which is often the case).
301    
302       We should do something similar for floating point, but the transformation
303       there is much more complex, so I won't deal with that.
304    
305    Secondly, there are now two new cps primops at our disposal:
306    
307     1. rawrecord of record_kind option
308        This pure operator allocates some uninitialized storage from the heap.
309        There are two forms:
310    
311         rawrecord NONE [INT n]  allocates a tagless record of length n
312         rawrecord (SOME rk) [INT n] allocates a tagged record of length n
313                                     and initializes the tag.
314    
315     2. rawupdate of cty
316          rawupdate cty (v,i,x)
317          Assigns x to the ith component of record v.
318          The storelist is not updated.
319    
320    We use these new primops for both spilling and incremental record construction.
321    
322     1. Spilling.
323    
324        This is implemented with a linear scan algorithm (but generalized
325        to trees).  The algorithm will create a single spill record at the
326        beginning of the cps function and use rawupdate to spill to it,
327        and SELECT or SELp to reload from it.  So both spills and reloads
328        are fine-grain operations.  In contrast, in the old algorithm
329        "spills" have to be bundled together in records.
330    
331        Ideally, we should sink the spill record construction to where
332        it is needed.  We can even split the spill record into multiple ones
333        at the places where they are needed.  But CPS is not a good
334        representation for global code motion, so I'll keep it simple and
335        am not attempting this.
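
For readers unfamiliar with linear scan, here is a textbook version on straight-line code. It is a simplification and not the algorithm in spill-new.sml, which is generalized to trees; the `interval` type, the function name, and the "spill the variable whose last use is furthest away" heuristic are all assumptions of this sketch. The input must already be sorted by definition point.

```sml
type interval = { name : string, def : int, lastUse : int }

(* Returns the names of the variables chosen for spilling, in order. *)
fun linearScan (limit : int) (sortedByDef : interval list) : string list =
    let
        (* drop intervals that ended before point p *)
        fun expire (p, active : interval list) =
            List.filter (fn iv => #lastUse iv >= p) active
        (* the active interval whose last use is furthest away *)
        fun furthest ([] : interval list) = raise Empty
          | furthest (iv :: rest) =
              List.foldl (fn (a : interval, b : interval) =>
                             if #lastUse a > #lastUse b then a else b)
                         iv rest
        fun loop ([], _, spilled) = List.rev spilled
          | loop ((iv : interval) :: rest, active, spilled) =
              let val active' = iv :: expire (#def iv, active)
              in
                  if List.length active' <= limit
                  then loop (rest, active', spilled)
                  else
                      let val victim = furthest active'
                          val kept =
                              List.filter (fn a => #name a <> #name victim)
                                          active'
                      in loop (rest, kept, #name victim :: spilled) end
              end
    in
        loop (sortedByDef, [], [])
    end
```

Each spilled name would correspond to one rawupdate into the spill record, with a SELECT at every later use.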
336    
337     2. Incremental record construction (aka record splitting).
338    
339        Long records with many component values which are simultaneously live
340        (recall that single use record addresses are not considered to
341         be live) are constructed with rawrecord and rawupdate.
342        We allocate space on the heap with rawrecord first, then gradually
343        fill it in with rawupdate.  This is the technique suggested to me
344        by Matthias.
345    
346        Some restrictions on when this is applicable:
347        1. It is not a VECTOR record.  The code generator currently does not handle
348           this case.  A VECTOR record uses double indirection, like arrays.
349        2. All the record component values are defined in the same "basic block"
350           as the record constructor.  This is to prevent speculative
351           record construction.
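
The idea of incremental construction can be modelled at the source level as below. This is only an analogy: rawrecord and rawupdate are CPS primops and are not available to SML programs, so `Array.array` stands in for rawrecord and `Array.update` for rawupdate. The function name is made up.

```sml
(* Model only: reserve the storage first, then fill it slot by slot. *)
fun buildIncrementally (n : int, field : int -> int) : int vector =
    let
        val r = Array.array (n, 0)            (* plays the role of rawrecord *)
        fun fill i =
            if i >= n then ()
            else (Array.update (r, i, field i);   (* plays the role of rawupdate *)
                  fill (i + 1))
    in
        fill 0;
        Array.vector r
    end
```

Because each component is stored as soon as it is computed, only one component value needs to be live at a time, instead of all n as with a single RECORD construction.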
352    
353    ----------------------------------------------------------------------
354    Name: Allen Leung
355    Date: 2002/02/22 01:02:00 EST
356    Tag: leunga-20020222-mlrisc-tools
357    
358    Minor bug fixes in the parser and rewriter
359    
360    ----------------------------------------------------------------------
361    Name: Allen Leung
362    Date: 2002/02/21 20:20:00 EST
363    Tag: leunga-20020221-peephole
364    
365    Regenerated the peephole files.  Some contained typos in the specification
366    and some didn't compile because of pretty printing bugs in the old version
367    of 'nowhere'.
368    
369    ----------------------------------------------------------------------
370    Name: Allen Leung
371    Date: 2002/02/19 20:20:00 EST
372    Tag: leunga-20020219-mlrisc-tools
373    Description:
374    
375       Minor bug fixes to the mlrisc-tools library:
376    
377       1.  Fixed up parsing of colon-suffixed keywords
378       2.  Added the ability to shut the error messages up
379       3.  Reimplemented the pretty printer and fixed up/improved
380           the pretty printing of handle and -> types.
381       4.  Fixed up generation of literal symbols in the nowhere tool.
382       5.  Added some SML keywords to sml.sty
383    
384    ----------------------------------------------------------------------
385    Name: Matthias Blume
386    Date: 2002/02/19 16:20:00 EST
387    Tag: blume-20020219-cmffi
388    Description:
389    
390    A wild mix of changes, some minor, some major:
391    
392    * All C FFI-related libraries are now anchored under $c:
393        $/c.cm      --> $c/c.cm
394        $/c-int.cm  --> $c/internals/c-int.cm
395        $/memory.cm --> $c/memory/memory.cm
396    
397    * "make" tool (in CM) now treats its argument pathname slightly
398      differently:
399        1. If the native expansion is an absolute name, then before invoking
400           the "make" command on it, CM will apply OS.Path.mkRelative
401           (with relativeTo = OS.FileSys.getDir()) to it.
402        2. The argument will be passed through to subsequent phases of CM
403           processing without "going native".  In particular, if the argument
404           was an anchored path, then "make" will not lose track of that anchor.
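
Step 1 of the "make" item can be sketched with the Basis Library calls it names. The helper names `relativizeTo` and `relativize` are assumptions of this example, and the sample paths in the usage note are Unix-style; CM's own code is not shown here.

```sml
(* Make an absolute path relative to dir; leave relative paths alone. *)
fun relativizeTo (dir : string) (p : string) : string =
    if OS.Path.isAbsolute p
    then OS.Path.mkRelative { path = p, relativeTo = dir }
    else p

(* As described above: relative to the current working directory. *)
fun relativize (p : string) : string =
    relativizeTo (OS.FileSys.getDir ()) p
```

For instance, `relativizeTo "/usr/src" "/usr/src/lib/Makefile"` yields `"lib/Makefile"`.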
405    
406    * Compiler backends now "know" their respective C calling conventions
407      instead of having to be told about it by ml-nlffigen.  This relieves
408      ml-nlffigen from one of its burdens.
409    
410    * The X86Backend has been split into X86CCallBackend and X86StdCallBackend.
411    
412    * Export C_DEBUG and C_Debug from $c/c.cm.
413    
414    * C type encoding in ml-nlffi-lib has been improved to model the conceptual
415      subtyping relationship between incomplete pointers and their complete
416      counterparts.  For this, ('t, 'c) ptr has been changed to 'o ptr --
417      with the convention of instantiating 'o with ('t, 'c) obj whenever
418      the pointer target type is complete.  In the incomplete case, 'o
419      will be instantiated with some "'c iobj" -- a type obtained by
420      using one of the functors PointerToIncompleteType or PointerToCompleteType.
421    
422      Operations that work on both incomplete and complete pointer types are
423      typed as taking an 'o ptr while operations that require the target to
424      be known are typed as taking some ('t, 'c) obj ptr.
425    
426      voidptr is now a bit "more concrete", namely "type voidptr = void ptr'"
427      where void is an eqtype without any values.  This makes it possible
428      to work on voidptr values using functions meant to operate on light
429      incomplete pointers.
430    
431    * As a result of the above, signature POINTER_TO_INCOMPLETE_TYPE has
432      been vastly simplified.
433    
434    ----------------------------------------------------------------------
435    Name: Matthias Blume
436    Date: 2002/02/19 10:48:00 EST
437    Tag: blume-20020219-pqfix
438    Description:
439    
440    Applied Chris Okasaki's bug fix for priority queues.
441    
442    ----------------------------------------------------------------------
443    Name: Matthias Blume
444    Date: 2002/02/15 17:05:00
445    Tag: Release_110_39
446    Description:
447    
448    Last-minute retagging is becoming a tradition... :-(
449    
450    This is the working release 110.39.
451    
452    ----------------------------------------------------------------------
453    Name: Matthias Blume
454    Date: 2002/02/15 16:00:00 EST
455    Tag: Release_110_39-orig
456    Description:
457    
458    Working release 110.39.  New bootfiles.
459    
460    (Update: There was a small bug in the installer so it wouldn't work
461    with all shells.  So I retagged. -Matthias)
462    
463    ----------------------------------------------------------------------
464    Name: Matthias Blume
465    Date: 2002/02/15 14:17:00 EST
466    Tag: blume-20020215-showbindings
467    Description:
468    
469    Added EnvRef.listBoundSymbols and CM.State.showBindings.  Especially
470    the latter can be useful for exploring what bindings are available at
471    the interactive prompt.  (The first function returns only the list
472    of symbols that are really bound, the second prints those but also the
473    ones that CM's autoloading mechanism knows about.)
474    
475    ----------------------------------------------------------------------
476    Name: Matthias Blume
477    Date: 2002/02/15 12:08:00 EST
478    Tag: blume-20020215-iptrs
479    Description:
480    
481    Two improvements to ml-nlffigen:
482    
483      1. Write files only if they do not exist or if their current contents
484         do not coincide with what's being written.  (That is, avoid messing
485         with the time stamps unless absolutely necessary.)
486    
487      2. Implement a "repository" mechanism for generated files related
488         to "incomplete pointer types".   See the README file for details.
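
The first improvement (touch a file only when its contents would change) can be sketched like this. The function names are made up for the example and this is not the ml-nlffigen source.

```sml
(* Contents of a file, or NONE if it cannot be opened or read. *)
fun readAll (path : string) : string option =
    let val ins = TextIO.openIn path
    in SOME (TextIO.inputAll ins) before TextIO.closeIn ins end
    handle IO.Io _ => NONE

(* Write contents to path only if the file is missing or differs.
   Returns true if the file was written. *)
fun writeIfChanged (path : string, contents : string) : bool =
    if readAll path = SOME contents then false
    else
        let val outs = TextIO.openOut path
        in
            TextIO.output (outs, contents);
            TextIO.closeOut outs;
            true
        end
```

When nothing changed, the file is never opened for writing, so its time stamp stays as it was and tools that compare time stamps see no reason to rebuild.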
489    
490    ----------------------------------------------------------------------
491    Name: Matthias Blume
492    Date: 2002/02/14 11:50:00 EST
493    Tag: blume-20020214-quote
494    Description:
495    
496    Added a type 't t_' to tag.sml (in ml-nlffi-lib.cm).  This is required
497    because of the new and improved tag generation scheme.  (Thanks to Allen
498    Leung for pointing it out.)
499    
500    ----------------------------------------------------------------------
501    Name: Lal George
502    Date: 2002/02/14 09:55:27 EST 2002
503    Tag: george-20020214-isabelle-bug
504    Description:
505    
506    Fixed the MLRISC bug sent by Markus Wenzel regarding the compilation
507    of Isabelle on the x86.
508    
509    From Allen:
510    -----------
511     I've found the problem:
512    
513         in ra-core.sml, I use the counter "blocked" to keep track of the
514         true number of elements in the freeze queue.  When the counter goes
515         to zero, I skip examining the queue.  But I've messed up the
516         bookkeeping in combine():
517    
518             else ();
519             case !ucol of
520               PSEUDO => (if !cntv > 0 then
521                     (if !cntu > 0 then blocked := !blocked - 1 else ();
522                                        ^^^^^^^^^^^^^^^^^^^^^^^
523                      moveu := mergeMoveList(!movev, !moveu)
524                     )
525                  else ();
526    
527         combine() is called to coalesce two nodes u and v.
528         I think I was thinking that if the move counts of u and v are both
529         greater than zero, then after they are coalesced one node is
530         removed from the freeze queue.  Apparently I was thinking that
531         both u and v are of low degree, but that's clearly not necessarily true.
532    
533    
534    02/12/2002:
535        Here's the patch.  HOL now compiles.
536    
537        I don't know how this impacts performance (compile
538        time or runtime).  This bug caused the RA (especially on the x86)
539        to go through the potential spill phase when there are still nodes on the
540        freeze queue.
541    
542    
543    
544    
545    ----------------------------------------------------------------------
546    Name: Matthias Blume
547    Date: 2002/02/13 22:40:00 EST
548    Tag: blume-20020213-fptr-rtti
549    Description:
550    
551    Fixed a bug in ml-nlffigen that was introduced with one of the previous
552    updates.
553    
554    ----------------------------------------------------------------------
555    Name: Matthias Blume
556    Date: 2002/02/13 16:41:00 EST
557    Tag: blume-20020213-cmlpq
558    Description:
559    
560    Added new priority queue export symbols (which have just been added to
561    smlnj-lib.cm) to CML's version of smlnj-lib.cm.  (Otherwise CML would
562    not compile and the installer would choke.)
563    
564    ----------------------------------------------------------------------
565    Name: Matthias Blume
566    Date: 2002/02/13 16:15:00 EST
567    Tag: blume-20020213-various
568    Description:
569    
570    1. More tweaks to ml-nlffigen:
571    
572       - better internal datastructures (resulting in slight speedup)
573       - "-match" option requires exact match
574       - "localized" gensym counters (untagged structs/unions nested within
575         other structs/unions or within typedefs get a fresh counter; their
576         tag will be prefixed by a concatenation of their parents' tags)
577       - bug fixes (related to calculation of transitive closure of types
578         to be included in the output)
579    
580    2. Minor Basis updates:
581    
582       - added implementations for List.collate and Option.app
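
For reference, the intended behaviour of these two functions can be written out as follows. The definitions are written from the Basis Library specification, not copied from the SML/NJ sources, and are given different names (`listCollate`, `optionApp`) to avoid shadowing the real ones.

```sml
(* Lexicographic comparison of two lists under an element ordering. *)
fun listCollate (cmp : 'a * 'a -> order) (xs : 'a list, ys : 'a list) : order =
    case (xs, ys) of
        ([], []) => EQUAL
      | ([], _ :: _) => LESS
      | (_ :: _, []) => GREATER
      | (x :: xs', y :: ys') =>
          (case cmp (x, y) of
               EQUAL => listCollate cmp (xs', ys')
             | other => other)

(* Apply f to the carried value, if there is one. *)
fun optionApp (f : 'a -> unit) (opt : 'a option) : unit =
    case opt of
        SOME v => f v
      | NONE => ()
```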
583    
584    ----------------------------------------------------------------------
585    Name: Matthias Blume
586    Date: 2002/02/11 15:55:00 EST
587    Tag: blume-20020211-gensym
588    Description:
589    
590    Added a "-gensym" option to command line of ml-nlffigen.  This can be
591    used to specify a "stem" -- a string that is inserted in all "gensym'd"
592    names (ML structure names that correspond to unnamed C structs, unions,
593    and enums), so that separate runs of ml-nlffigen do not clash.
594    
595    ----------------------------------------------------------------------
596    Name: Matthias Blume
597    Date: 2002/02/11 12:05:00 EST
598    Tag: blume-20020211-gensml
599    Description:
600    
601    A quick fix for a problem with GenSML (in the pgraph-util library):
602    Make generation of toplevel "local" optional.  (Strictly speaking,
603    signature definitions within "local" are not legal SML.)
604    
605    Other than that: updates to INSTALL and cm/TODO.
606    
607    ----------------------------------------------------------------------
608    Name: Matthias Blume
609    Date: 2002/02/08 15:00:00 EST
610    Tag: blume-20020208-uniquepid
611    Description:
612    
613    0. Version number has been bumped to 110.38.1.  NEW BOOTFILES!!!
614    
615    1. The installer (config/install.sh) has gotten smarter:
616    
617         - Configuration options are a bit easier to specify now
618           (in config/targets).
619         - Bug in recognizing .tar.bz2 files fixed.
620         - Installer automatically resolves dependencies between
621           configuration options (e.g., if you ask for eXene, you will
622           also get cml -- regardless of whether you asked for it or not).
623         - Installer can run in "quieter mode" by setting the environment
624           variable INSTALL_QUIETLY to "true".  "Quieter" does not mean
625           "completely silent", though.
626         - Build HashCons library as part of smlnj-lib.
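
Resolving dependencies between configuration options amounts to taking a transitive closure. The sketch below shows the idea in SML; the installer itself is a shell script, and the table contents here are an assumption apart from "eXene needs cml", which comes from the entry above.

```sml
type table = (string * string list) list

(* Direct dependencies of target t (none if t is not listed). *)
fun depsOf (tbl : table) (t : string) : string list =
    case List.find (fn (n, _) => n = t) tbl of
        SOME (_, ds) => ds
      | NONE => []

(* All requested targets plus everything they depend on, without repeats. *)
fun closure (tbl : table) (requested : string list) : string list =
    let
        fun visit (t, seen) =
            if List.exists (fn s => s = t) seen then seen
            else List.foldl visit (t :: seen) (depsOf tbl t)
    in
        List.rev (List.foldl visit [] requested)
    end

(* Hypothetical table for illustration. *)
val exampleTable : table = [("eXene", ["cml"]), ("cml", [])]
```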
627    
628    2. A new scheme for assigning persistent identifiers to compilation
629       units (and, by extension, to types etc.) has been put into place.
630       This fixes a long-standing bug where types and even dynamic values
631       can get internally confused, thereby compromising type safety
632       (abstraction) and dynamic correctness.  See
633    
634         http://cm.bell-labs.com/cm/cs/who/blume/pid-confusion.tgz
635    
636       for an example of how things could go wrong until now.
637    
638       The downside of the new scheme is that pids are not quite as
639       persistent as they used to be: CM will generate a fresh pid
640       for every compilation unit that it thinks it sees for the first
641       time.  That means that if you compile starting from a clean, fresh
642       source tree at two different times, you end up with different
643       binaries.
644    
645       Cutoff recompilation, however, has not been compromised because
646       CM keeps pid information in special caches between runs.
647    
648    ----------------------------------------------------------------------
649    Name: Lal George
650    Date: 2002/02/07 15:34:13 EST 2002
651    Tag: <none>
652    Description:
653    
654    Compilers that generate assembly code may produce global labels
655    whose value is resolved at link time. The various peephole optimization
656    modules did not take this into account.
657    
658    TODO. The Labels.addrOf function should really return an option
659    type so that clients are forced to deal with this issue, rather
660    than an exception being raised.
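
A sketch of what the TODO is asking for. The `label` datatype and every function name here are hypothetical; the real label representation in MLRISC is different. The point is only the contrast between raising an exception and returning an option.

```sml
(* Hypothetical label representation. *)
datatype label = Local of int | Global of string

(* Exception style: nothing in the type warns the caller. *)
exception Unresolved
fun addrOfExn (Local a) = a
  | addrOfExn (Global _) = raise Unresolved

(* Option style: the caller must consider the unresolved case. *)
fun addrOf (Local a) = SOME a
  | addrOf (Global _) = NONE

(* A peephole-style client: compute a displacement only when both
   addresses are known before link time. *)
fun distance (l1 : label, l2 : label) : int option =
    case (addrOf l1, addrOf l2) of
        (SOME a, SOME b) => SOME (b - a)
      | _ => NONE
```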
661    
662    ----------------------------------------------------------------------
663    Name: Lal George
664    Date: 2002/02/06 13:55:02 EST
665    Tag: george-20020206-ra-breakup
666    Description:
667    
668    1. A bug fix from Allen.
669    
670        A typo caused extra fstp %st(0)'s to be generated at compensation
671        edges, which might cause stack underflow traps at runtime.  This
672        occurs in fft where there are extraneous fstps right before the 'into'
673        trap instruction (in this case they are harmless since none of the
674        integers overflow.)
675    
676    2. Pulled out various utility modules that were embedded in the modules
677       of the register allocator. I need these modules for other purposes, but
678       they are not complete enough to put into a library (just yet).
679    ----------------------------------------------------------------------
680    Name: Matthias Blume
681    Date: 2002/01/31 16:05:00 EST
682    Tag: blume-20020131-sparc-ccalls
683    Description:
684    
685    1. C-calls on Sparc needlessly allocated a huge chunk (96 bytes)
686       of extra stack space by mistake.  Fixed.
687    
688    2. Bug in logic of handling of command-line options in ml-nlffigen fixed.
689    
690    ----------------------------------------------------------------------
691    Name: Allen Leung
692    Date: 2002/01/30
693    Tag: leunga-20020130-nowhere-bug-fix
694    Description:
695    
696       MLRISC bug fixes:
697       1. Fixed a bindings computation bug in the 'nowhere' program generator tool.
698       2. MachineInt.fromString was negating its value.
699    
700    ----------------------------------------------------------------------
701    Name: Matthias Blume
702    Date: 2002/01/29
703    Tag: blume-20020129-INSTALL
704    Description:
705    
706    - Added somewhat detailed installation instructions (file INSTALL).
707    - Fixed curl-detection bug in config/install.sh.
708    - It is now possible to select the URL getter using the URLGETTER
709      environment variable:
710    
711          not set / "unknown"      --> automatic detection (script tries wget,
712                                       curl, and lynx)
713          "wget" / "curl" / "lynx" --> use the specified program (script "knows"
714                                       how to properly invoke them)
715          other                    --> use $URLGETTER directly, it must take
716                                       precisely two command-line arguments
717                                       (source URL and destination file name)
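
The three cases above can be modelled as a small classification function.
This is only an illustration in SML of the decision table; the real logic
lives in the shell script config/install.sh, and the names getter, classify
and commandLine are made up for this sketch.

```sml
datatype getter =
    AutoDetect            (* script tries wget, curl, and lynx *)
  | Known of string       (* "wget", "curl", or "lynx" *)
  | Custom of string      (* used directly with two arguments *)

(* The argument models the URLGETTER variable: NONE means "not set".
   A real program could obtain it with OS.Process.getEnv "URLGETTER". *)
fun classify NONE = AutoDetect
  | classify (SOME "unknown") = AutoDetect
  | classify (SOME s) =
      if List.exists (fn k => k = s) ["wget", "curl", "lynx"]
      then Known s
      else Custom s

(* Only the custom case has a documented calling convention here:
   source URL first, destination file name second. How the script
   invokes the known programs is not modelled. *)
fun commandLine (Custom cmd, url, file) =
      SOME (String.concatWith " " [cmd, url, file])
  | commandLine _ = NONE
```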
718    
719    ----------------------------------------------------------------------
720    Name: Matthias Blume
721    Date: 2002/01/28
722    Tag: blume-20020128-sparc-ccalls
723    Description:
724    
725    - Fixed problem with calculation of "used" registers in sparc-c-calls.
726    - Make use of the allocParam argument in sparc-c-calls.
727    
728    ----------------------------------------------------------------------
729    Name: Matthias Blume
730    Date: 2002/01/28
731    Tag: blume-20020128-allocParam
732    Description:
733    
734    John Reppy:  Changes c-calls API to accept client-callback for
735    allocating extra stack space.
736    me: Corresponding changes to mlriscGen (using a dummy argument that
737        does not change the current behavior).
738    
739    ----------------------------------------------------------------------
740    Name: Matthias Blume
741    Date: 2002/01/28 12:00:00
742    Tag: Release_110_38
743    Description:
744    
745    This time for real!!!
746    
747    ----------------------------------------------------------------------
748    Name: Matthias Blume
749    Date: 2002/01/28 10:56:00 EST
750    Tag: blume-20020128-retraction
751    Description:
752    
753    0. Retracted earlier 110.38.  (The Release_110_38 tag has been replaced
754       with blume-Release_110_38-retracted.)
755    
756    1. Fixed a problem with incorrect rounding modes in real64.sml.
757       (Thanks to Andrew Mccreight <andrew.mccreight@yale.edu>.)
758    
759    2. A bug in ml-nlffigen related to the handling of unnamed structs, unions,
760       and enums fixed.  The naming of corresponding ML identifiers should
761       now be consistent again.
762    
763    ----------------------------------------------------------------------
764    Name: Allen Leung
765    Date: 2002/01/27
766    Tag: leunga-20020127-nowhere
767    Description:
768    
769       Added a target called nowhere in the configuration scripts.
770       Enabling this will build the MLRISC 'nowhere' tool (for translating
771       programs with where-clauses into legal SML code) during installation.
772    
773    ----------------------------------------------------------------------
774    Name: Matthias Blume
775    Date: 2002/01/25 21:27:00 EST
776    Tag: blume-Release_110_38-retracted
777    Description:
778    
779    Call it a (working) release!  Version is 110.38. Bootfiles are ready.
780    
781    README will be added later.
782    
783    !!! NOTE:  Re-tagged as blume-Release_110_38-retracted. Original tag
784    (Release_110_38) removed.  Reason: Last-minute bug fixes.
785    
786    ----------------------------------------------------------------------
787    Name: Matthias Blume
788    Date: 2002/01/25
789    Tag: blume-20020125-ffi
790    Description:
791    
792    A large number of tweaks and improvements to ml-nlffi-lib and
793    ml-nlffigen:
794    
795       - ML representation types have been streamlined
796       - getter and setter functions work with concrete values, not abstract
797         ones where possible
798       - ml-nlffigen command line more flexible (see README file there)
799       - some bugs have been fixed (hopefully)
800    
801    ----------------------------------------------------------------------
802    Name: Lal George
803    Date: 2002/01/24
804    Tag: george-20020124-risc-ra-interface
805    Description:
806    
807       There is a dramatic simplification in the interface to the
808       register allocator for RISC architectures as a result of making
809       parallel copy instructions explicit.
810    
811    ----------------------------------------------------------------------
812    Name: Matthias Blume
813    Date: 2002/01/22
814    Tag: blume-20020122-x86-ccalls
815    Description:
816    
817    Bug fix for c-calls on x86 (having to do with how char- and
818    short-arguments are being handled).
819    
820    ----------------------------------------------------------------------
821    Name: Matthias Blume
822    Date: 2002/01/21
823    Tag: blume-20020121-ff
824    Description:
825    
826    Another day of fiddling with the FFI...
827    
828    1. Bug fix/workaround:  CKIT does not complain about negative array
829       dimensions, so ml-nlffigen has to guard itself against this possibility.
830       (Otherwise a negative dimension would send it into an infinite loop.)
831    
832    2. Some of the abstract types (light objects, light pointers, most "base"
833       types) in structure C are now eqtypes.
834    
835    3. Added constructors and test functions for NULL function pointers.
836    
837    ----------------------------------------------------------------------
838    Name: Matthias Blume
839    Date: 2002/01/18
840    Tag: blume-20020118-ready-for-new-release
841    Description:
842    
843    Made config/srcarchiveurl point to a new place.  (Will provide boot
844    files shortly.)
845    
846    Maybe we christen this to be 110.38?
847    
848    ----------------------------------------------------------------------
849    Name: Matthias Blume
850    Date: 2002/01/18
851    Tag: blume-20020118-more-ffifiddle
852    Description:
853    
854    Today's FFI fiddling:
855    
856      - Provided a structure CGetSet with "convenient" versions of C.Get.* and
857        C.Set.* that use concrete (MLRep.*) arguments and results instead
858        of abstract ones.
859    
860      - Provided word-style bit operations etc. for "int" representation
861        types in MLRep.S<Foo>Bitops where <Foo> ranges over Char, Int, Short,
862        and Long.
863    
864    ----------------------------------------------------------------------
865    Name: Matthias Blume
866    Date: 2002/01/18
867    Tag: blume-20020118-use-x86-fp
868    Description:
869    
870    Now that x86-fast-fp seems to be working, I turned it back on again
871    by default.  (Seems to work fine now, even with the FFI.)
872    
873    Other than that, I added some documentation about the FFI to
874    src/ml-nlffigen/README and updated the FFI test examples in
875    src/ml-nlffi-lib/Tests/*.
876    
877    ----------------------------------------------------------------------
878    Name: Allen Leung
879    Date: 2002/01/17
880    Tag: leunga-20020117-x86-fast-fp-call
881    Description:
882    
883       1. Fixed a problem with handling return fp values when x86's fast fp
884          mode is turned on.
885    
886       2. Minor pretty printing fix for cellset.  Print %st(0) as %st(0) instead
887          of %f32.
888    
889       3. Added a constructor INT32lit to the ast of MLRISC tools.
890    
891    ----------------------------------------------------------------------
892    Name: Matthias Blume
893    Date: 2002/01/16
894    Tag: blume-20020116-ffifiddle
895    Description:
896    
897    More fiddling with the FFI interface:
898    
899     - Make constness 'c instead of rw wherever possible.  This eliminates
900       the need for certain explicit coercions.  (However, due to ML's
901       value polymorphism, there will still be many cases where explicit
902       coercions are necessary.  Phantom types are not the whole answer
903       to modeling a subtyping relationship in ML.)
904    
905     - ro/rw coercions for pointers added.  (Avoids the detour through */&.)
906    
907     - "printf" test example added to src/ml-nlffi-lib/Tests.  (Demonstrates
908       clumsy workaround for varargs problem.)
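
A toy illustration of the phantom-type idea behind the constness change
above. The signature OBJ and structure Obj are invented for this sketch and
are much simpler than the real structure C in ml-nlffi-lib; they only show
why a polymorphic tag 'c avoids a coercion that a fixed rw tag would need.

```sml
signature OBJ = sig
  type ro
  type rw
  type 'c obj                      (* 'c is the phantom constness tag *)
  val new : int -> rw obj
  val get : 'c obj -> int          (* reading works for any constness *)
  val set : rw obj * int -> unit   (* writing demands rw *)
  val ro  : 'c obj -> ro obj       (* explicit coercion to read-only *)
end

structure Obj :> OBJ = struct
  datatype ro = RO
  datatype rw = RW
  type 'c obj = int ref            (* the tag does not occur in the representation *)
  val new = ref
  fun get r = !r
  fun set (r, v) = r := v
  fun ro r = r
end

val x = Obj.new 1
val () = Obj.set (x, 42)
val y = Obj.ro x
val n = Obj.get y                  (* get accepts both rw obj and ro obj *)
(* Obj.set (y, 0) would be rejected by the type checker. *)
```

Had get been declared as rw obj -> int, reading through y would not
typecheck without converting it back first.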
909    
910    ----------------------------------------------------------------------
911    Name: Lal George
912    Date: 2002/01/15
913    Tag: <none>
914    Description:
915    
916    1. Since COPY instructions are no longer native to the architecture,
917       a generic functor can be used to implement the expandCopies function.
918    
919    2. Allowed EXPORT and IMPORT pseudo-op declarations to appear inside a
920       TEXT segment.
921    
922    ----------------------------------------------------------------------
923    Name: Matthias Blume
924    Date: 2002/01/15
925    Tag: blume-20020115-ffiupdates
926    Description:
927    
928    1. Fix for bug resulting in single-precision float values being returned
929       incorrectly from FFI calls.
930    
931    2. Small modifications to C FFI API:
932    
933        - memory-allocation routines return straight objects (no options)
934          and raise an exception in out-of-memory situations
935        - unsafe extensions to cast between function pointers and pointers
936          from/to ints
937        - added structure C_Debug as an alternative to structure C where
938          pointer-dereferencing (|*| and |*!) always checks for null-pointers
939        - added open_lib' to DynLinkage;  open_lib' works like open_lib
940          but also takes a (possibly empty) list of existing library handles
941          that the current library depends on
942    
943    ----------------------------------------------------------------------
944    Name: Matthias Blume
945    Date: 2002/01/10
946    Tag: blume-20020110-newffigen
947    Description:
948    
949    1. Updates to portable graph code.
950    
951    2. Major update to ml-nlffigen and ml-nlffi-lib.  Things are much
952       more scalable now so that even huge interfaces such as the one
953       for GTK compile in finite time and space. :-)
954       See src/ml-nlffigen/README for details on what's new.
955    
956    ----------------------------------------------------------------------
957    Name: Lal George
958    Date: 2002/01/09 14:31:35 EST
959    Tag: george-20011206-rm-native-copy
960    Description:
961    
962            Removed the native COPY and FCOPY instructions
963            from all the architectures and replaced them with the
964            explicit COPY instruction from the previous commit.
965    
966            It is now possible to simplify many of the optimization
967            modules that manipulate copies. This has not been
968            done in this change.
969    
970    ----------------------------------------------------------------------
971    Name: Lal George
972    Date: 2001/12/06 16:50:13 EST 2001
973    Tag: george-20011206-mlrisc-instruction
974    Description:
975    
976    Changed the representation of instructions from being fully abstract
977    to being partially concrete. That is to say:
978    
979      from
980            type instruction
981    
982      to
983            type instr                              (* machine instruction *)
984    
985            datatype instruction =
986                LIVE of {regs: C.cellset, spilled: C.cellset}
987              | KILL of {regs: C.cellset, spilled: C.cellset}
988              | COPYXXX of {k: CB.cellkind, dst: CB.cell list, src: CB.cell list}
989              | ANNOTATION of {i: instruction, a: Annotations.annotation}
990              | INSTR of instr
991    
992    This makes the handling of certain special instructions that appear on
993    all architectures easier and uniform.
994    
995    LIVE and KILL say that a list of registers is live or killed at the
996    program point where they appear. No spill code is generated when an
997    element of the 'regs' field is spilled, but the register is moved to
998    the 'spilled' field (which is present more for debugging than anything else).
999    
1000    LIVE replaces the (now deprecated) DEFFREG instruction on the alpha.
1001    We used to generate:
1002    
1003            DEFFREG f1
1004            f1 := f2 + f3
1005            trapb
1006    
1007    but now generate:
1008    
1009            f1 := f2 + f3
1010            trapb
1011            LIVE {regs=[f1,f2,f3], spilled=[]}
1012    
1013    Furthermore, the DEFFREG (hack) required that all floating point instructions
1014    use all registers mentioned in the instruction. Therefore f1 := f2 + f3
1015    defines f1 and uses [f1,f2,f3]! This hack is no longer required, resulting
1016    in a cleaner alpha implementation. (Hopefully, Intel will not get rid of
1017    this architecture).
1018    
1019    COPYXXX is intended to replace the parallel COPY and FCOPY available on
1020    all the architectures. This will result in further simplification of the
1021    register allocator that must be aware of them for coalescing purposes, and
1022    will also simplify certain aspects of the machine description that provides
1023    callbacks related to parallel copies.
1024    
1025    ANNOTATION should be obvious, and now INSTR represents the honest to God
1026    machine instruction set!
1027    
1028    The <arch>/instructions/<arch>Instr.sml files define certain utility
1029    functions for making porting easier -- essentially converting upper case
1030    to lower case. All machine instructions (of type instr) are in upper case,
1031    and the lower case form generates an MLRISC instruction. For example on
1032    the alpha we have:
1033    
1034      datatype instr =
1035         LDA of {r:cell, b:cell, d:operand}
1036       | ...
1037    
1038      val lda : {r:cell, b:cell, d:operand} -> instruction
1039        ...
1040    
1041    where lda is just (INSTR o LDA), etc.
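
A self-contained sketch of the upper-case/lower-case convention described
above. The cell, operand and annotation types are simplified stand-ins
(plain ints and strings), COPYXXX is left out, and the helper strip is
invented for the example; only the LDA constructor and lda = INSTR o LDA
are taken from the text.

```sml
(* Simplified stand-ins for the MLRISC cell and operand types. *)
type cell = int
datatype operand = IMMop of int | REGop of cell

(* Architecture-specific machine instructions (upper case). *)
datatype instr =
    LDA of {r : cell, b : cell, d : operand}

(* Architecture-independent wrapper around machine instructions. *)
datatype instruction =
    LIVE of {regs : cell list, spilled : cell list}
  | KILL of {regs : cell list, spilled : cell list}
  | ANNOTATION of {i : instruction, a : string}
  | INSTR of instr

(* Lower-case form: builds an MLRISC instruction directly. *)
val lda : {r : cell, b : cell, d : operand} -> instruction = INSTR o LDA

(* Hypothetical helper: peel annotations off to reach the underlying instruction. *)
fun strip (ANNOTATION {i, ...}) = strip i
  | strip i = i

val example = ANNOTATION {i = lda {r = 1, b = 2, d = IMMop 8}, a = "comment"}
```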
1042    
1043    ----------------------------------------------------------------------
1044    Name: Matthias Blume
1045    Date: 2001/11/22 21:40:00 EST
1046    Tag: Release_110_37
1047    Description:
1048    
1049    Release 110.37.  This time for real.
1050    
1051    ----------------------------------------------------------------------
1052    Name: Matthias Blume
1053    Date: 2001/11/21 16:35:00 EST
1054    Tag: blume-20011121-foot-in-mouth
1055    Description:
1056    
1057    Removed the "Release_110_37" tag because of a serious bug.
1058    This will be re-tagged once the bug is fixed.
1059    
1060    ----------------------------------------------------------------------
1061    Name: Matthias Blume
1062    Date: 2001/11/21 16:14:00 EST
1063    Tag: blume-20011121-forgottenfile
1064    Description:
1065    
1066    Forgot to add a file.  (Just a .tex-file -- part of
1067    the CM manual source.)
1068    
1069    ----------------------------------------------------------------------
1070    Name: Matthias Blume
1071    Date: 2001/11/21 16:10:00 EST
1072    Tag: blume-20011121-invalid_110_37
1073    Description:
1074    
1075    Note: I removed the original tag "Release_110_37" from this commit
1076          because we found a serious bug in all non-x86 backends.
1077          - Matthias
1078    
1079    1. Modifications to the SML/NJ code generator and to the runtime system
1080       so that code object name strings are directly inserted into code
1081       objects at code generation time.  The only business the runtime system
1082       has with this is now to read the name strings on occasion.
1083       (The encoding of the name string has also changed somewhat.)
1084    
1085    2. CM now implements a simple "set calculus" for specifying export lists.
1086       In particular, it is now possible to refer to the export lists of
1087       other libraries/groups/sources and form unions as well as differences.
1088       See the latest CM manual for details.
1089    
1090    3. A separate notion of "proxy" libraries has again been eliminated from
1091       CM's model.  (Proxy libraries are now simply a special case of using
1092       the export list calculus.)
1093    
1094    4. Some of the existing libraries now take advantage of the new set
1095       calculus.
1096       (Notice that not all libraries have been converted because some
1097       of the existing .cm-files are supposed to be backward compatible
1098       with 110.0.x.)
1099    
1100    5. Some cleanup in stand-alone programs.  (Don't use "exnMessage" -- use
1101       "General.exnMessage"!  The former relies on a certain hook to be
1102       initialized, and that often does not happen in the stand-alone case.)
1103    
1104    ----------------------------------------------------------------------
1105    Name: Lal George
1106    Date: 2001/11/21  13:56:18 EST
1107    Tag: george-2001121-pseudo-ops
1108    Description:
1109    
1110      Implemented a complete redesign of MLRISC pseudo-ops. Now there
1111      ought to never be any question of incompatibilities with
1112      pseudo-op syntax expected by host assemblers.
1113    
1114      For now, only modules supporting GAS syntax are implemented
1115      but more should follow, such as MASM, and vendor assembler
1116      syntax, e.g. IBM as, Sun as, etc.
1117    
1118    ----------------------------------------------------------------------
1119    Name: Matthias Blume
1120    Date: 2001/11/14 11:52:00 EST
1121    Tag: blume-20011114-srcname
1122    Description:
1123    
1124    1. Routed the name of the current source file to mlriscgen where it
1125       should be directly emitted into the code object.  (This last part
1126       is yet to be done.)
1127    
1128    2. Some cleanup of the pgraph code to make it match the proposal that
1129       I put out the other day.  (The proposal notwithstanding, things are
1130       still in flux here.)
1131    
1132    ----------------------------------------------------------------------
1133    Name: Lal George
1134    Date: 2001/11/14 09:44:04 EST
1135    Tag:
1136    Description:
1137    
1138      Fix for a backpatching bug reported by Allen.
1139    
1140      Because the boundary between short and long span-dependent
1141      instructions is +/- 128, there are an astounding number of
1142      span-dependent instructions whose size is overestimated.
1143    
1144      Allen came up with the idea of letting the size of span
1145      dependent instructions be non-monotonic, for a maxIter
1146      number of times, after which the size must be monotonically
1147      increasing.
1148    
1149      This table shows the number of span-dependent instructions
1150      whose size was overestimated as a function of maxIter, for the
1151      file Parse/parse/ml.grm.sml:
1152    
1153         maxIter            # of instructions:
1154            10                      687
1155            20                      438
1156            30                      198
1157            40                        0
1158    
1159      In compiling the compiler, there is no significant difference in
1160      compilation speed between maxIter=10 and maxIter=40. Actually,
1161      my measurements showed that maxIter=40 was a tad faster than
1162      maxIter=10! Also 96% of the files in the compiler reach a fixed
1163      point within 13 iterations, so fixing maxIter at 40, while high,
1164      is okay.
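
The maxIter idea can be sketched on a toy program. Everything below is an
assumption made for illustration (the ins type, the 2- and 5-byte encodings,
measuring the span from the start of the jump, starting from the short
form); it is not the MLRISC backpatching code. Sizes may shrink or grow
for the first maxIter rounds and may only grow afterwards, which guarantees
termination.

```sml
(* Toy program: fixed-size instructions and jumps to an instruction index. *)
datatype ins = Fixed of int | Jump of int

val short = 2 and long = 5    (* hypothetical encoding sizes in bytes *)

(* Start offset of every instruction under the current size assignment. *)
fun offsets sizes =
    rev (#2 (foldl (fn (s, (pos, acc)) => (pos + s, pos :: acc)) (0, []) sizes))

(* Size an instruction needs given the current offsets. *)
fun sizeOf _ (_, Fixed n) = n
  | sizeOf offs (i, Jump t) =
      let val d = List.nth (offs, t) - List.nth (offs, i)
      in if d >= ~128 andalso d < 128 then short else long end

fun layout maxIter prog =
    let
      val indexed = ListPair.zip (List.tabulate (length prog, fn i => i), prog)
      fun step (iter, sizes) =
          let
            val fresh = map (sizeOf (offsets sizes)) indexed
            val next =
                if iter < maxIter
                then fresh                                (* non-monotonic phase *)
                else ListPair.map Int.max (sizes, fresh)  (* sizes may only grow *)
          in
            if next = sizes then sizes else step (iter + 1, next)
          end
      val init = map (fn Fixed n => n | Jump _ => short) prog
    in
      step (0, init)
    end
```

For instance, layout 40 [Jump 2, Fixed 200, Fixed 1] gives the jump the long
form because its target lies more than 127 bytes away.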
1165    
1166    ----------------------------------------------------------------------
1167    Name: Matthias Blume
1168    Date: 2001/10/31 15:25:00 EST
1169    Tag: blume-20011031-pgraph
1170    Description:
1171    
1172    CKIT:
1173    * Changed the "Function" constructor of type Ast.ctype to carry optional
1174      argument identifiers.
1175    * Changed the return type of TypeUtil.getFunction accordingly.
1176    * Type equality ignores the argument names.
1177    * TypeUtil.composite tries to preserve argument names but gives up quickly
1178      if there is a mismatch.
1179    
1180    installation script:
1181    * attempts to use "curl" if available (unless "wget" is available as well)
1182    
1183    CM:
1184    * has an experimental implementation of "portable graphs" which I will
1185      soon propose as an implementation-independent library format
1186    * there are also new libraries $/pgraph.cm and $/pgraph-util.cm
1187    
1188    NLFFI-LIB:
1189    * some cleanup (all cosmetic)
1190    
1191    NLFFIGEN:
1192    * temporarily disabled the mechanism that suppresses ML output for
1193      C definitions whose identifiers start with an underscore character
1194    * generate val bindings for enum constants
1195    * user can request that only one style (light or heavy) be used;
1196      default is to use both (command-line arguments: -heavy and -light)
1197    * fixed bug in handling of function types involving incomplete pointers
1198    * generate ML entry points that take record arguments (i.e., using
1199      named arguments) for C functions that have a prototype with named
1200      arguments
1201      (see changes to CKIT)
1202    
1203    ----------------------------------------------------------------------
1204    Name: Allen Leung
1205    Date: 2001/10/27 20:34:00 EDT
1206    Tag: leunga-20011027-x86-fast-fp-call
1207    Description:
1208    
1209       Fixed the bug described in blume-20010920-slowfp.
1210    
1211       The fix involves
1212          1. generating FCOPYs in FSTP in ia32-svid
1213          2. marking a CALL with the appropriate annotation
1214    
1215    ----------------------------------------------------------------------
1216    Name: Matthias Blume
1217    Date: 2001/10/16 11:32:00 EDT
1218    Tag: blume-20011016-netbsd
1219    Description:
1220    
1221    Underscore patch from Chris Richards (fixing problem with compiling
1222    runtime system under recent NetBSD).
1223    
1224    ----------------------------------------------------------------------
1225    Name: Allen Leung
1226    Date: 2001/10/12 17:18:32 EDT 2001
1227    Tag: leung-20011012-x86-printflowgraph
1228    Description:
1229    
1230    X86RA now uses a valid (instead of dummy) PrintFlowgraph module.
1231    
1232    ----------------------------------------------------------------------
1233    Name: Lal George
1234    Date: 2001/10/11 23:51:34 EDT
1235    Tag: george-20011011-too-many-instrs
1236    Description:
1237    
1238    The representation of a program point never expected to see more
1239    than 65536 instructions in a basic block!
1240    
1241    ----------------------------------------------------------------------
1242    Name: Lal George
1243    Date: 2001/10/09 09:41:37 EDT
1244    Tag: george-20011008-mlrisc-labels
1245    Description:
1246    
1247    Changed the machine description files to support printing of
1248    local and global labels in assembly code, based on host assembler
1249    conventions.
1250    
1251    ----------------------------------------------------------------------
1252    Name: Matthias Blume
1253    Date: 2001/09/25 15:25:00 EDT
1254    Tag: blume-20010925-exninfo
1255    Description:
1256    
1257    I provided a non-hook implementation of exnName (at the toplevel) and
1258    made the "dummy" implementation of exnMessage (at the toplevel) more
1259    useful: if nothing gets "hooked in", then at least you are going to
1260    see the exception name and a message indicating why you don't see more.
1261    
1262    [For the time being, programs that need exnMessage and want to use
1263    ml-build should either use General.exnMessage (strongly recommended) or
1264    refer to structure General at some other point so that CM sees a
1265    static dependency.]
1266    
1267    [Similar remarks go for "print" and "use":  If you want to use their
1268    functionality in stand-alone programs generated by ml-build, then use
1269    TextIO.output and Backend.Interact.useFile (from $smlnj/compiler.cm).]
1270    
1271    ----------------------------------------------------------------------
1272    Name: Matthias Blume
1273    Date: 2001/09/20 17:28:00 EDT
1274    Tag: blume-20010920-slowfp
1275    Description:
1276    
1277    Allen says that x86-fast-fp is not safe yet, so I turned it off again...
1278    
1279    ----------------------------------------------------------------------
1280    Name: Matthias Blume
1281    Date: 2001/09/20 17:20:00 EDT
1282    Tag: blume-20010920-canonicalpaths
1283    Description:
1284    
1285    0. Updated the BOOT file (something that I forgot to do earlier).
1286    
1287    1. Small internal change to CM so that it avoids "/../" in filenames
1288       as much as possible (but only where it is safe).
1289    
1290    2. Changed config/_run-sml (resulting in a changed bin/.run-sml) so
1291       that arguments that contain delimiters are passed through correctly.
1292       This change also means that all "special" arguments of the form
1293       @SMLxxx... must come first.
1294    
1295    3. Changed install script to put relative anchor names for tool commands
1296       into pathconfig.
1297    
1298    ----------------------------------------------------------------------
1299    Name: Matthias Blume
1300    Date: 2001/09/18 15:35:00 EDT
1301    Tag: blume-20010918-readme11036
1302    Description:
1303    
1304    Added README files.
1305    
1306    ----------------------------------------------------------------------
1307    Name: Matthias Blume
1308    Date: 2001/09/18 11:45:00 EDT
1309    Tag: Release_110_36 (retag)
1310    Description:
1311    
1312    Fixed mistake in config/preloads. Retagged as 110.36.
1313    
1314    ----------------------------------------------------------------------
1315    Name: Matthias Blume
1316    Date: 2001/09/18 09:40:00 EDT
1317    Tag: Release_110_36_orig (tag changed)
1318    Description:
1319    
1320    New version (110.36).  New bootfiles.
1321    
1322    ----------------------------------------------------------------------
1323    Name: Matthias Blume
1324    Date: 2001/09/14 16:15:00 EDT
1325    Tag: blume-20010914-x86fastfp
1326    Description:
1327    
1328    John committed some changes that Allen made, in particular a (hopefully)
1329    correctly working version of the x86-fp module.
1330    
1331    I changed the default setting of the Control.MLRISC.getFlag "x86-fast-fp"
1332    flag to "true".  Everything seems to compile to a fixpoint ok, and
1333    "mandelbrot" speeds up by about 15%.
1334    
1335    ----------------------------------------------------------------------
1336    Name: Matthias Blume
1337    Date: 2001/09/13 11:20:00 EDT
1338    Tag: blume-20010913-minimal
1339    Description:
1340    
1341    1. Stefan Monnier's patch to fix a miscompilation problem that
1342       was brought to light by John Reppy's work on Moby.
1343    
1344    2. Implemented a minimal "structure Compiler" that contains just
1345       "version" and "architecture".  The minimal version will be
1346       available when the full version is not.  This is for backward-
1347       compatibility with code that wants to test Compiler.version.
1348    
1349    ----------------------------------------------------------------------
1350    Name: Matthias Blume
1351    Date: 2001/08/28 14:03:00 EDT
1352    Tag: blume-20010828-ml-lex
1353    Description:
1354    
1355    Fix for bug 1581, received from Neophytos Michael.
1356    
1357    ----------------------------------------------------------------------
1358    Name: Matthias Blume
1359    Date: 2001/08/27 11:20:00 EDT
1360    Tag: blume-20010827-readme11035
1361    Description:
1362    
1363    Fleshed out the README file for 110.35.
1364    
1365    ----------------------------------------------------------------------
1366    Name: Matthias Blume
1367    Date: 2001/08/24 17:10:00 EDT
1368    Tag: Release_110_35
1369    Description:
1370    
1371    New version number (110.35).  New bootfiles.
1372    
1373    ----------------------------------------------------------------------
1374    Name: Lal George
1375    Date: 2001/08/24 13:47:18 EDT 2001
1376    Tag: george-20010824-MLRISC-graphs
1377    Description:
1378    
1379     Removed clusters from MLRISC completely and replaced them with graphs.
1380    
1381    ----------------------------------------------------------------------
1382    Name: Matthias Blume
1383    Date: 2001/08/23 17:50:00 EDT
1384    Tag: blume-20010823-toplevel
1385    Description:
1386    
1387    - some reorganization of the code that implements various kinds of
1388      environments in the compiler (static, dynamic, symbolic, combined)
1389    - re-implemented the EnvRef module so that evalStream works properly
1390      (if the stream contains references to "use", "CM.make", etc.)
1391    - cleaned up evalloop.sml and interact.sml (but they need more cleaning)
1392    
1393    ----------------------------------------------------------------------
1394    Name: Matthias Blume
1395    Date: 2001/08/20 15:50 EDT
1396    Tag: blume20010820-slipup
1397    Description:
1398    
1399    I forgot to commit a few files.  Here they are...
1400    
1401    ----------------------------------------------------------------------
1402    Name: Matthias Blume
1403    Date: 2001/08/20 15:35:00 EDT
1404    Tag: blume-20010820-debugprof
1405    Description:
1406    
1407    !!!! NEW BOOTFILES !!!!
1408    
1409    This is another round of reorganizing the compiler sources.  This
1410    time the main goal was to factor out all the "instrumentation"
1411    passes (for profiling and backtracing) into their own library.
1412    The difficulty was to do it in such a way that it does not depend
1413    on elaborate.cm but only on elabdata.cm.
1414    
1415    Therefore there have been further changes to both elaborate.cm and
1416    elabdata.cm -- more "generic" things have been moved from the former
1417    to the latter.  As a result, I was forced to split the assignment
1418    of numbers indicating "primtyc"s into two portions: SML-generic and
1419    SML/NJ-specific.  Since it would have been awkward to maintain,
1420    I bit the bullet and actually _changed_ the mapping between these
1421    numbers and primtycs.  The bottom line of this is that you need
1422    a new set of bin- and bootfiles.
1423    
1424    I have built new bootfiles for all architectures, so doing a fresh
1425    checkout and config/install.sh should be all you need.
1426    
1427    The newly created library's name is
1428    
1429        $smlnj/viscomp/debugprof.cm
1430    
1431    and its sources live under
1432    
1433        src/compiler/DebugProf
1434    
1435    ----------------------------------------------------------------------
1436    Name: Matthias Blume
1437    Date: 2001/08/15 17:15:00 EDT
1438    Tag: blume-20010815-compreorg
1439    Description:
1440    
1441    This is a first cut at reorganizing the CM libraries that make up the
1442    core of the compiler.  The idea is to separate out pieces that could
1443    be used independently by tools, e.g., the parser, the typechecker, etc.
1444    
1445    The current status is a step in this direction, but it is not quite
1446    satisfactory yet.  Expect more changes in the future.
1447    
1448    Here is the current (new) organization...
1449    
1450        What used to be $smlnj/viscomp/core.cm is now divided into
1451        six CM libraries:
1452    
1453             $smlnj/viscomp/basics.cm
1454                           /parser.cm
1455                           /elabdata.cm
1456                           /elaborate.cm
1457                           /execute.cm
1458                           /core.cm
1459    
1460        The CM files for these libraries live under src/system/smlnj/viscomp.
1461        All these libraries are proxy libraries that contain precisely
1462        one CM library component.  Here are the locations of the components
1463        (all within the src/compiler tree):
1464    
1465             Basics/basics.cm
1466             Parse/parser.cm
1467             ElabData/elabdata.cm
1468             Elaborator/elaborate.cm
1469             Execution/execute.cm
1470             core.cm
1471    
1472         [This organization is the same that has been used already
1473         for a while for the architecture-specific parts of the visible
1474         compiler and for the old version of core.cm.]
1475    
1476         As you will notice, many source files have been moved from their
1477         respective original locations to a new home in one of the above
1478         subtrees.
1479    
1480         The division of labor between the new libraries is the following:
1481    
1482             basics.cm:
1483                - Simple, basic definitions that pertain to many (or all) of
1484                  the other libraries.
1485             parser.cm:
1486                - The SML parser, producing output of type Ast.dec.
1487                - The type family for Ast is also defined and exported here.
1488             elabdata.cm:
1489                - The datatypes that describe input and output of the elaborator.
1490                  This includes types, absyn, and static environments.
1491            elaborate.cm:
1492                - The SML/NJ type checker and elaborator.
1493                  This maps an Ast.dec (with a given static environment) to
1494                  an Absyn.dec (with a new static environment).
1495               - This library implements certain modules that used to be
1496                  structures as functors (to remove dependencies on FLINT).
1497             execute.cm:
1498                - Everything having to do with executing binary code objects.
1499                - Dynamic environments.
1500             core.cm:
1501                - SML/NJ-specific instantiations of the elaborator and MLRISC.
1502                - Top-level modules.
1503                - FLINT (this should eventually become its own library)
1504    
1505    Notes:
1506    
1507    I am not 100% happy with the way I separated the elaborator (and its
1508    data structures) from FLINT.  Two instances of the same problem:
1509    
1510        1. Data structures contain certain fields that carry FLINT-specific
1511           information.  I hacked around this using exn and the property list
1512           module from smlnj-lib.  But the fact that there are middle-end
1513           specific fields around at all is a bit annoying.
1514    
1515        2. The elaborator calculates certain FLINT-related information.  I tried
1516           to make this as abstract as I could using functorization, but, again,
1517           the fact that the elaborator has to perform calculations on behalf
1518           of the middle-end at all is not nice.
1519    
1520        3. Having to use exn and property lists is unfortunate because it
1521           weakens type checking.  The other alternative (parameterizing
1522           nearly *everything*) is not appealing, though.
1523    
1524    I removed the "rebinding =" warning hack because the new organization
1525    made it awkward to maintain.  As a result, the compiler now issues some of
1526    these warnings when compiling init.cmi during bootstrap compilation. On
1527    the plus side, you also get a warning when you do, for example:
1528       val op = = Int32.+
1529    which was not the case up to now.
1530    
1531    I placed "assign" and "deref" into the _Core structure so that the
1532    code that deals with the "lazy" keyword can find them there.  This
1533    removes the need for having access to the primitive environment
1534    during elaboration.
1535    
1536    ----------------------------------------------------------------------
1537    Name: Matthias Blume
1538    Date: 2001/08/13
1539    Tag: blume-20010813-closures
1540    Description:
1541    
1542    This fix was sent to us by Zhong Shao.  It is supposed to improve the
1543    performance of certain loops by avoiding needless closure allocation.
1544    
1545    ----------------------------------------------------------------------
1546    Name: Lal George
1547    Date: 2001/07/31 10:03:23 EDT 2001
1548    Tag: george-20010731-x86-fmalloc
1549    Description: Fixed bug in x86 calls
1550    
1551        There was a bug where call instructions would mysteriously
1552        vanish. The call instruction had to be one that returned
1553        a floating point value.
1554    
1555    ----------------------------------------------------------------------
1556    Name: Lal George
1557    Date: 2001/07/19 16:36:29 EDT 2001
1558    Tag: george-20010719-simple-cells
1559    Description:
1560    
1561    I have dramatically simplified the interface for CELLS in MLRISC.
1562    
1563    In summary, the cells interface is broken up into three parts:
1564    
1565      1. CellsBasis : CELLS_BASIS
1566    
1567            CellsBasis is a top level structure and common for all
1568            architectures.  It contains the definitions of basic datatypes
1569            and utility functions over these types.
1570    
1571      2. functor Cells() : CELLS
1572    
1573            Cells generates an interface for CELLS that incorporates the
1574            specific resources on the target architecture, such as the
1575            presence of special register classes, their number and size,
1576            and various useful substructures.
1577    
1578      3. <ARCH>CELLS
1579    
1580            e.g. SparcCells: SPARCCELLS
1581    
1582            <ARCH>CELLS usually contains additional bindings for special
1583            registers on the architecture, such as:
1584    
1585                    val r0 : cell           (* register zero *)
1586                    val y : cell            (* Y register *)
1587                    val psr : cell          (* processor status register *)
1588                    ...
1589    
1590            The structure returned by applying the Cells functor is opened
1591            in this interface.
1592    
1593    The main implication of all this is that the datatypes for cells are
1594    split between CellsBasis and CELLS -- a fairly simple change for user
1595    code.
1596    
1597    In the old scheme the CELLS interface had a definitional binding of
1598    the form:
1599    
1600            signature CELLS = sig
1601    
1602               structure CellsBasis = CellsBasis
1603    
1604               ...
1605    
1606            end
1607    
1608    With all the sharing constraints that go on in MLRISC, this old
1609    design quickly leads to errors such as:
1610    
1611            "structure definition spec inside of sharing ... "
1612    
1613    
1614    and appears to require an unacceptable amount of sharing and where
1615    constraint hackery.
1616    
1617    I think this error message (the interaction of definitional specs and
1618    sharing) requires more explanation on our web page.
1619    
1620    ----------------------------------------------------------------------
1621    Name: Matthias Blume
1622    Date: 2001/07/19 15:00:00 EDT
1623    Tag: blume-20010719-libreorg
1624    Description:
1625    
1626    This update puts together a fairly extensive but straightforward change
1627    to the way the libraries that implement the interactive system are
1628    organized:
1629    
1630       The biggest change is the elimination of structure Compiler.  As a
1631       replacement for this structure, there is now a CM library
1632       (known as $smlnj/compiler.cm or $smlnj/compiler/current.cm)
1633       that exports all the substructures of the original structure Compiler
1634       directly.  So instead of saying Compiler.Foo.bar one now simply
1635       says Foo.bar.  (The CM libraries actually export a collection of
1636       structures that is richer than the collection of substructures of
1637       structure Compiler.)
1638    
1639       To make the transition smooth, there is a separate library called
1640       $smlnj/compiler/compiler.cm which puts together and exports the
1641       original structure Compiler (or at least something very close to it).
1642    
1643       There are five members of the original structure Compiler
1644       that are not exported directly but which instead became members
1645       of a new structure Backend (described by signature BACKEND).  These are:
1646       structure Profile (: PROFILE), structure Compile (: COMPILE), structure
1647       Interact (: INTERACT), structure Machine (: MACHINE), and val
1648       architecture (: string).
1649    
1650       Structure Compiler.Version has become structure CompilerVersion.
1651    
1652       Cross-compilers for alpha32, hppa, ppc, sparc, and x86 are provided
1653       by $smlnj/compiler/<arch>.cm where <arch> is alpha32, hppa, ppc, sparc,
1654       or x86, respectively.
1655       Each of these exports the same frontend structures that
1656       $smlnj/compiler.cm exports.  But they do not have a structure Backend
1657       and instead export some structure <Arch>Backend where <Arch> is Alpha32,
1658       Hppa, PPC, Sparc, or X86, respectively.
1659    
1660       Library $smlnj/compiler/all.cm exports the union of the exports of
1661       $smlnj/compiler/<arch>.cm
1662    
1663       There are no structures <Arch>Compiler anymore, use
1664       $smlnj/compiler/<arch>.cm instead.
1665    
1666       Library host-compiler-0.cm is gone.  Instead, the internal library
1667       that instantiates CM is now called cm0.cm.  Selection of the host
1668       compiler (backend) is no longer done here.  (Responsibility for it
1669       now lies with $smlnj/compiler/current.cm.  This seems to be more
1670       logical.)
1671    
1672       Many individual files have been moved or renamed.  Some files have
1673       been split into multiple files, and some "dead" files have been deleted.
1674    
1675    Aside from these changes to library organization, there are also changes
1676    to the way the code itself is organized:
1677    
1678       Structure Binfile has been re-implemented in such a way that it no
1679       longer needs any knowledge of the compiler.  It exclusively deals
1680       with the details of binfile layout.  It no longer invokes the
1681       compiler (for the purpose of creating new prospective binfile
1682       content), and it no longer has any knowledge of how to interpret
1683       pickles.
1684    
1685       Structure Compile (: COMPILE) has been stripped down to the bare
1686       essentials of compilation.  It no longer deals with linking/execution.
1687       The interface has been cleaned up considerably.
1688    
1689       Utility routines for dealing with linking and execution have been
1690       moved into their own substructures.
1691    
1692       (The ultimate goal of these changes is to provide a light-weight
1693       binfile loader/linker (at least for, e.g., stable libraries) that
1694       does not require CM or the compiler to be present.)
1695    
1696    CM documentation has been updated to reflect the changes to library
1697    organization.
1698    
1699    ----------------------------------------------------------------------
1700    Name: Matthias Blume
1701    Date: 2001/07/10 17:30:00 EDT
1702    Tag: Release_110_34
1703    Description:
1704    
1705    Minor tweak to 110.34 (re-tagged):
1706    
1707      - README.html file added to CVS repository
1708      - runtime compiles properly under FreeBSD 3.X and 4.X
1709    
1710    ----------------------------------------------------------------------
1711    Name: Matthias Blume
1712    Date: 2001/07/10 17:30:00 EDT
1713    Tag: Release_110_34
1714    Description:
1715    
1716    New version number (110.34). New bootfiles.
1717    
1718    ----------------------------------------------------------------------
1719    Name: Matthias Blume
1720    Date: 2001/07/09 16:00:00 EDT
1721    Tag: blume-20010709-more-varargs
1722    Description:
1723    
1724    I changed the handling of varargs in ml-nlffigen again:
1725    The ellipsis ... will now simply be ignored (with an accompanying warning).
1726    
1727    The immediate effect is that you can actually call a varargs function
1728    from ML -- but you can't actually supply any arguments beyond the ones
1729    specified explicitly.  (For example, you can call printf with its format
1730    string, but you cannot pass additional arguments.)
1731    
1732    This behavior is only marginally more useful than the one before, but
1733    it has the advantage that a function or, more importantly, a function
1734    type never gets dropped on the floor, thus avoiding follow-up problems with
1735    other types that refer to the offending one.
1736    
1737    ----------------------------------------------------------------------
1738    Name: Matthias Blume
1739    Date: 2001/07/09 11:25:00 EDT
1740    Tag: blume-20010709-varargs
1741    Description:
1742    
1743    1. ckit-lib.cm now exports structure Error
1744    2. ml-nlffigen reports occurrences of "..." (i.e., varargs function types)
1745       with a warning accompanied by a source location.  Moreover, it
1746       merely skips the offending function or type and proceeds with the
1747       rest of its work.  As a result, one can safely feed C code containing
1748       "..." to ml-nlffigen.
1749    3. There are some internal improvements to CM, providing slightly
1750       more general string substitutions in the tools subsystem.
1751    
1752    ----------------------------------------------------------------------
1753    Name: Matthias Blume
1754    Date: 2001/06/27 15:10:00 EDT
1755    Tag: blume-20010627-concur
1756    Description:
1757    
1758    Fixed a small bug in CM's handling of parallel compilation.
1759    (You could observe the bug by Control-C-interrupting an ordinary
1760    CMB.make or CM.stabilize and then attaching some compile servers.
1761    The result was that all of a sudden the previously interrupted
1762    compilation would continue on its own.  This was because of
1763    an over-optimization: CM did not bother to clean out certain queues
1764    when no servers were attached "anyway", which caused the contents
1765    of these queues to grab control when new servers did get attached.)
1766    
1767    There is also another minor update to the CM manual.
1768    
1769    ----------------------------------------------------------------------
1770    Name: Matthias Blume
1771    Date: 2001/06/26 16:15:00 EDT
1772    Tag: blume-20010626-cmdoc
1773    Description:
1774    
1775    Minor typo fixed in CM manual (syntax diagram for libraries).
1776    
1777    ----------------------------------------------------------------------
1778    Name: Matthias Blume
1779    Date: 2001/06/25 22:55:00 EDT
1780    Tag: blume-20010625-x86pc
1781    Description:
1782    
1783    Fixed a nasty bug in the X86 assembly code that caused signal
1784    handlers to fail (crash) randomly.
1785    
1786    ----------------------------------------------------------------------
1787    Name: Matthias Blume
1788    Date: 2001/06/25 12:05:00 EDT
1789    Tag: blume-20010625-nlffigen
1790    Description:
1791    
1792    This update fixes a number of minor bugs in ml-nlffigen as reported by
1793    Nick Carter <nbc@andrew.cmu.edu>.
1794    
1795      1. Silly but ok typedefs of the form "typedef void myvoid;" are now accepted.
1796      2. Default names for generated files are now derived from the name of
1797         the C file *without its directory*.  In particular, this causes generated
1798         files to be placed locally even if the C file is in some system directory.
1799      3. Default names for generated signatures and structures are also derived
1800         from the C file name without its directory.  This avoids silly things
1801         like "structure GL/GL".
1802         (Other silly names are still possible because ml-nlffigen does not do
1803          a thorough check of whether generated names are legal ML identifiers.
1804          When in doubt, use command line arguments to force particular names.)
1805    
1806    ----------------------------------------------------------------------
1807    Name: Matthias Blume
1808    Date: 2001/06/21 12:25:00 EDT
1809    Tag: blume-20010621-eXene
1810    Description:
1811    
1812    eXene now compiles and (sort of) works again.
1813    
1814    The library name (for version > 110.33) is $/eXene.cm.
1815    
1816    I also added a new example in src/eXene/examples/nbody.  See the
1817    README file there for details.
1818    
1819    ----------------------------------------------------------------------
1820    Name: Matthias Blume
1821    Date: 2001/06/20 16:40:00 EDT
1822    Tag: blume-20010620-cml
1823    Description:
1824    
1825    CML now compiles and works again.
1826    
1827    Libraries (for version > 110.33):
1828    
1829      $cml/cml.cm            Main CML library.
1830      $cml/basis.cm          CML's version of $/basis.cm.
1831      $cml/cml-internal.cm   Internal helper library.
1832      $cml/core-cml.cm       Internal helper library.
1833      $cml-lib/trace-cml.cm  Tracing facility.
1834      $cml-lib/smlnj-lib.cm  CML's version of $/smlnj-lib.cm
1835    
1836    The installer (config/install.sh) has been taught how to properly
1837    install this stuff.
1838    
1839    ----------------------------------------------------------------------
1840    Name: Matthias Blume
1841    Date: 2001/06/19 17:55:00 EDT
1842    Tag: blume-20010619-instantiate
1843    Description:
1844    
1845    This un-breaks the fix for bug 1432.
1846    (The bug was originally fixed in 110.9 but I broke it again some
1847    time after that.)
1848    
1849    ----------------------------------------------------------------------
1850    Name: Matthias Blume
1851    Date: 2001/06/19 17:25:00 EDT
1852    Tag: blume-20010619-signals
1853    Description:
1854    
1855    This should (hopefully) fix the long-standing signal handling bug.
1856    (The runtime system was constructing a continuation record with an
1857    incorrect descriptor which would cause the GC to drop data on the floor...)
1858    
1859    ----------------------------------------------------------------------
1860    Name: Matthias Blume
1861    Date: 2001/06/15 15:05:00 EDT
1862    Tag: blume-20010615-moresparc
1863    Description:
1864    
1865    Here is a short late-hour update related to Sparc c-calls:
1866    
1867     -- made handling of double-word arguments a bit smarter
1868    
1869     -- instruction selection phase tries to collapse certain clumsily
1870        constructed ML-Trees; typical example:
1871    
1872            ADD(ty,ADD(_,e,LI d1),LI d2)  ->  ADD(ty,e,LI(d1+d2))
1873    
1874        This currently has no further impact on SML/NJ since mlriscGen does
1875        not seem to generate such patterns in the first place, and c-calls
1876        (which did generate them in the beginning) has meanwhile been fixed
1877        so as to avoid them as well.
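The rewrite rule above can be illustrated outside of MLRISC.  The following
is a minimal Python sketch, not the actual instruction-selection code: the
classes LI, REG and ADD are hypothetical stand-ins for ML-Tree constructors,
and the sketch ignores machine-word overflow of d1+d2.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class LI:
    """Integer literal (stand-in for the ML-Tree LI constructor)."""
    value: int


@dataclass(frozen=True)
class REG:
    """A register operand; hypothetical, used only to have a leaf node."""
    name: str


@dataclass(frozen=True)
class ADD:
    """Addition at width ty (stand-in for the ML-Tree ADD constructor)."""
    ty: int
    left: "Expr"
    right: "Expr"


Expr = Union[LI, REG, ADD]


def collapse(e: Expr) -> Expr:
    """Rewrite ADD(ty, ADD(_, e, LI d1), LI d2) into ADD(ty, e, LI(d1+d2)).

    Subtrees are collapsed first, so chains of more than two constant
    additions fold into a single one.
    """
    if not isinstance(e, ADD):
        return e
    left = collapse(e.left)
    right = collapse(e.right)
    if (isinstance(right, LI)
            and isinstance(left, ADD)
            and isinstance(left.right, LI)):
        # The inner type is ignored, as the "_" in the pattern suggests.
        return ADD(e.ty, left.left, LI(left.right.value + right.value))
    return ADD(e.ty, left, right)
```

For example, collapse(ADD(32, ADD(32, REG("r1"), LI(4)), LI(8))) yields
ADD(32, REG("r1"), LI(12)).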
1878    
1879    ----------------------------------------------------------------------
1880    Name: Matthias Blume
1881    Date: 2001/06/15 15:05:00 EDT
1882    Tag: blume-20010615-sparc
1883    Description:
1884    
1885    The purpose of this update is to provide an implementation of NLFFI
1886    on Sparc machines.
1887    
1888    Here are the changes in detail:
1889    
1890       * src/MLRISC/sparc/c-calls/sparc-c-calls.sml is a new file containing
1891       the Sparc implementation of the c-calls API.
1892       * The Sparc backend of SML/NJ has been modified to uniformly use %fp
1893       for accessing the ML frame.  Thus, we have a real frame pointer and
1894       can freely modify %sp without need for an omit-frame-ptr phase.
1895       The vfp logic in src/compiler/CodeGen/* has been changed to accommodate
1896       this case.
1897       * ml-nlffigen has been taught to produce code for different architectures
1898       and calling conventions.
1899       * In a way similar to what was done in the x86 case, the Sparc
1900       backend uses its own specific extension to mltree.  (For example,
1901       it needs to be able to generate UNIMP instructions which are part
1902       of the calling convention.)
1903       * ml-nlffi-lib was reorganized to make it more modular (in particular,
1904       to make it easier to plug in new machine- and os-dependent parts).
1905    
1906    There are some other fairly unrelated bug fixes and cleanups as well:
1907    
1908       * I further hacked the .cm files for MLRISC tools (like MDLGen) so
1909       that they properly share their libraries with existing SML/NJ libraries.
1910       * I fixed a minor cosmetic bug in CM, suppressing certain spurious
1911       follow-up error messages.
1912       * Updates to CM/CMB documentation.
1913    
1914    TODO items:
1915    
1916       * MLRISC should use a different register as its asmTemp on the Sparc.
1917         (The current %o2 is a really bad choice because it is part of the
1918         calling conventions, so things might interfere in unexpected ways.)
1919    
1920    ----------------------------------------------------------------------
1921    Name: Matthias Blume
1922    Date: 2001/06/07
1923    Tag: blume-20010607-calls
1924    Description:
1925    
1926    A number of internal changes related to C calls and calling conventions:
1927    
1928    1. ML-Tree CALL statements now carry a "pops" field.  It indicates the
1929       number of bytes popped implicitly (by the callee).  In most cases
1930       this field is 0 but on x86/win32 it is some non-zero value.  This
1931       is information provided for the benefit of the "omit-frameptr" pass.
1932    2. The CALL instruction on the x86 carries a similar "pops" field.
1933       The instruction selection phase copies its value from the ML-Tree
1934       CALL statement.
1935    3. On all other architectures, the instruction selection phase checks
1936       whether "pops=0" and complains if not.
1937    4. The c-calls implementation for x86 now accepts two calling conventions:
1938       "ccall" and "stdcall".  When "ccall" is selected, the caller cleans
1939       up after the call and pops is set to 0.  For "stdcall", the caller
1940       does nothing, leaving the cleanup to the callee; pops is set to
1941       the number of bytes that were pushed onto the stack.
1942    5. The cproto decoder (compiler/Semant/types/cproto.sml) now can
1943       distinguish between "ccall" and "stdcall".
1944    6. The UNIMP instruction has been added to the supported Sparc instruction
1945       set. (This is needed for implementing the official C calling convention
1946       on this architecture.)
1947    7. I fixed some of the .cm files under src/MLRISC/Tools to make them
1948       work with the latest CM.
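The relationship between the calling convention and the "pops" field
(items 3 and 4 above) can be summarized in a small Python sketch.  The
function names are hypothetical and do not correspond to MLRISC
identifiers; only the rule itself is taken from the description above.

```python
def pops_for_call(convention: str, pushed_bytes: int) -> int:
    """Return the value of the "pops" field for an x86 C call.

    "ccall":   the caller cleans up the stack, so the callee pops nothing.
    "stdcall": the callee pops everything that was pushed.
    """
    if convention == "ccall":
        return 0
    if convention == "stdcall":
        return pushed_bytes
    raise ValueError("unknown calling convention: " + convention)


def check_pops(arch: str, pops: int) -> None:
    """Mimic the check done on non-x86 targets: pops must be 0 there."""
    if arch != "x86" and pops != 0:
        raise ValueError("non-zero pops (%d) not supported on %s" % (pops, arch))
```

With 12 bytes of arguments, pops_for_call("stdcall", 12) gives 12 while
pops_for_call("ccall", 12) gives 0.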
1949    
1950    ----------------------------------------------------------------------
1951    Name: Matthias Blume
1952    Date: 2001/06/05 15:10:00 EDT
1953    Tag: blume-20010605-cm-index
1954    Description:
1955    
1956    0. The "lambdasplit" parameter for class "sml" in CM has been documented.
1957    
1958    1. CM can now generate "index files".  These are human-readable files
1959       that list on a per-.cm-file basis each toplevel symbol defined or
1960       imported.  The location of the index file for
1961       <p>/<d>.cm is <p>/CM/INDEX/<d>.cm.
1962       To enable index-file generation, set CM.Control.generate_index to true
1963       or export an environment-symbol: export CM_GENERATE_INDEX=true.
1964    
1965       The CM manual has been updated accordingly.
1966    
1967    2. I made some slight modifications to the c-calls API in MLRISC.
1968    
1969         a) There is now a callback to support saving/restoring of
1970            dedicated but caller-save registers around the actual call
1971            instruction.
1972         b) One can optionally specify a comment-annotation for the
1973            call instruction.
1974    
1975    3. SML/NJ (mlriscGen.sml) uses this new API for the rawccall primop.
1976       (For example, the comment annotation shows the C prototype of
1977        the function being called.)
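As an illustration of the index-file naming rule in item 1 (the index for
<p>/<d>.cm is <p>/CM/INDEX/<d>.cm), here is a minimal Python sketch.  The
function name is hypothetical, and the sketch assumes Unix-style paths; it
is not how CM itself computes the location.

```python
import posixpath


def index_file_for(cm_file: str) -> str:
    """Map <p>/<d>.cm to <p>/CM/INDEX/<d>.cm (Unix-style paths assumed)."""
    directory, name = posixpath.split(cm_file)
    return posixpath.join(directory, "CM", "INDEX", name)
```

For example, index_file_for("src/lib/foo.cm") gives
"src/lib/CM/INDEX/foo.cm".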
1978    
1979    ----------------------------------------------------------------------
1980    Name: Matthias Blume
1981    Date: 2001/06/01 13:30:00 EDT
1982    Tag: blume-20010601-nlffi-cleanup
1983    Description:
1984    
1985    This is mostly a cleanup of NLFFI stuff:
1986    
1987       - some signature files have been put into a more exposed place
1988       - the ugly 'f type parameter is gone (simplifies types tremendously!)
1989       - ml-nlffigen changed accordingly
1990       - tutorial updated
1991    
1992    Other changes:
1993    
1994       - author's affiliation in CM manual(s) updated
1995       - some more recognized keywords added to Allen's sml.sty
1996    
1997    ----------------------------------------------------------------------
1998    Name: Matthias Blume
1999    Date: 2001/05/25 15:30:00 EDT
2000    Tag: blume-20010525-iptr
2001    Description:
2002    
2003      - put the official 110.33-README (as it appears on the ftp server) under
2004        CVS
2005      - fixed a small bug related to incomplete pointer types in
2006        ml-nlffigen
2007      - small cosmetic change to the ml-nlffi-lib's "arr" type constructor
2008        (it does not need the 'f type parameter)
2009    
2010    ----------------------------------------------------------------------
2011    Name: Matthias Blume
2012    Date: 2001/05/23 14:30:00 EDT
2013    Tag: Release_110_33
2014    Description:
2015    
2016    New version number (110.33).  New bootfiles.
2017    
2018    ----------------------------------------------------------------------
2019    Name: Matthias Blume
2020    Date: 2001/05/22 18:06:00 EDT
2021    Tag: blume-20010522-targets
2022    Description:
2023    
2024    Made install.sh use file config/targets.customized if it exists, falling
2025    back to config/targets if it doesn't.  This way one can have a customized
2026    version of the targets file without touching the "real thing", thus
2027    eliminating the constant fear of accidentally checking something bogus
2028    back into the CVS repository...  (File config/targets.customized must
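The fallback rule can be sketched as follows.  install.sh is a shell
script; this Python version only mirrors the selection logic described
above, and the function name is hypothetical.

```python
import os


def pick_targets_file(config_dir: str) -> str:
    """Prefer config/targets.customized if it exists, else config/targets."""
    customized = os.path.join(config_dir, "targets.customized")
    if os.path.exists(customized):
        return customized
    return os.path.join(config_dir, "targets")
```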
2029    not be added to the repository!)
2030    
2031    ----------------------------------------------------------------------
2032    Name: Matthias Blume
2033    Date: 2001/05/22 16:30:00 EDT
2034    Tag: blume-20010522-minitut
2035    Description:
2036    
2037    1. Bug fix in ml-nlffigen; now (hopefully) correctly handling
2038       struct returns.
2039    2. Added src/ml-nlffi-lib/Doc/mini-tutorial.txt.  This is some very
2040       incomplete, preliminary documentation for NLFFI.
2041    
2042    ----------------------------------------------------------------------
2043    Name: Matthias Blume
2044    Date: 2001/05/14 11:30:00 EDT
2045    Tag: blume-20010514-script
2046    Description:
2047    
2048    Some bugs in install script fixed.
2049    
2050    In addition to that I also made a slight change to the NLFFI API:
2051    Functors generated by ml-nlffigen now take the dynamic library as a
2052    straight functor argument, not as a suspended one.  (The original
2053    functor code used to force the suspension right away anyway, so there
2054    was nothing gained by this complication of the interface.)
2055    
2056    ----------------------------------------------------------------------
2057    Name: Matthias Blume
2058    Date: 2001/05/11 14:35:00 EDT
2059    Tag: blume-20010511-ml-nlffi
2060    Description:
2061    
2062    I finally took the plunge and added my new FFI code to the main
2063    repository.  For x86-linux it is now ready for prime-time.
2064    
2065    There are two new subdirectories of "src":
2066    
2067      - ml-nlffi-lib:
2068           The utility library for programs using the FFI interface.
2069           Here is the implementation of $/c.cm and its associated low-level
2070           partners $/c-int.cm and $/memory.cm.
2071      - ml-nlffigen:
2072           A stand-alone program for generating ML glue code from C source
2073           code.
2074    
2075    Building ml-nlffigen requires $/ckit-lib.cm.
2076    
2077    The config/install.sh script has been updated to do the Right Thing
2078    (hopefully).
2079    
2080    Notice that the source tree for the C-Kit will not be put under "src"
2081    but directly under the installation root directory.  (This is the
2082    structure that currently exists on the CVS server when you check out
2083    module "sml".)  Fortunately, config/install.sh knows about this oddity.
2084    
2085    Bugs: No documentation yet.
2086    
2087    ----------------------------------------------------------------------
2088    Name: Matthias Blume
2089    Date: 2001/05/09 16:35:00 EDT
2090    Tag: blume-20010509-cpscontract
2091    Description:
2092    
2093    Fixed a bug in the accounting code in cpsopt/contract.sml.  (The
2094    wrapper/unwrapper elimination did not decrement usage counts and some
2095    dead variables got overlooked by the dead-up logic.)
2096    
2097    ----------------------------------------------------------------------
2098    Name: Lal George
2099    Date: 2001/05/08  17:26:09 EDT
2100    Tag: george-20010508-omit-frameptr
2101    Description:
2102    
2103    Changes to implement the omit-frame-pointer optimization to support
2104    raw C calls. For now, there is only support on the Intel x86, but
2105    other architectures will follow as more experience is gained with this.
2106    
2107    
2108    ----------------------------------------------------------------------
2109    Name: Matthias Blume
2110    Date: 2001/05/07 14:40:00 EDT
2111    Tag: blume-20010507-proxies
2112    Description:
2113    
2114    I made into "proxy libraries" all libraries that qualify for such a
2115    change.  (A qualifying library is a library that has another library or
2116    group as its sole member and repeats that member's export list
2117    verbatim.  A proxy library avoids this repetition by omitting its export
2118    list, effectively inheriting the list that its (only) member exports.
2119    See the CM manual for more explanation.)
2120    The main effect is that explicit export lists for these libraries
2121    do not have to be kept in sync, making maintenance a bit easier.
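The qualification rule and the inheritance of export lists can be modelled
in a few lines of Python.  This is only a sketch of the idea under
simplifying assumptions: the Library record, the lookup table, and all
names in it are hypothetical and do not reflect CM's internal data
structures.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, Optional, Tuple


@dataclass(frozen=True)
class Library:
    name: str
    members: Tuple[str, ...]           # names of member libraries/groups
    exports: Optional[FrozenSet[str]]  # None means "no explicit export list"


def effective_exports(lib: Library, table: Dict[str, Library]) -> FrozenSet[str]:
    """Explicit export list, or the sole member's list for a proxy library."""
    if lib.exports is not None:
        return lib.exports
    if len(lib.members) != 1:
        raise ValueError(lib.name + ": only a single-member library may omit exports")
    return effective_exports(table[lib.members[0]], table)


def qualifies_as_proxy(lib: Library, table: Dict[str, Library]) -> bool:
    """True if lib has one member and repeats that member's exports verbatim."""
    if len(lib.members) != 1 or lib.exports is None:
        return False
    member = table.get(lib.members[0])
    return member is not None and effective_exports(member, table) == lib.exports
```

Turning a qualifying library into a proxy amounts to replacing its export
list with None; effective_exports then returns the same set as before.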
2122    
2123    I also added copyright notices to many .cm-files.
2124    
2125    Last but not least, I made a new set of bootfiles.
2126    
2127    ----------------------------------------------------------------------
2128    Name: Matthias Blume
2129    Date: 2001/05/04 17:00:00 EDT
2130    Tag: blume-20010504-cm-lsplit
2131    Description:
2132    
2133    0. John merged pending changes to $/smlnj-lib.cm
2134    
2135    1. Allen's previous change accidentally backed out of one of Lal's
2136       earlier changes.  I undid this mistake (re-introducing Lal's change).
2137    
2138    2. I used the new topOrder' function from graph-scc.sml (from $/smlnj-lib.cm)
2139       within the compiler where applicable.  There is some code simplification
2140       because of that.
2141    
2142    3. The "split" phase (in FLINT) is now part of the default list of phases.
2143       Compiler.Control.LambdaSplitting.* can be used to globally control the
2144       lambda-splitting (cross-module-inlining) engine.  In addition to that,
2145       it can now also be controlled on a per-source basis: CM has been taught
2146       a new tool parameter applicable to ML source files.
2147    
2148       - To turn lambda-splitting off completely:
2149            local open Compiler.Control.LambdaSplitting in
2150                val _ = set Off
2151            end
2152       - To make "no lambda-splitting" the global default (but allow per-source
2153         overriding); this is the initial setting:
2154            local open Compiler.Control.LambdaSplitting in
2155                val _ = set (Default NONE)
2156            end
2157       - To make "lambda-splitting with aggressiveness a" the global default
2158         (and allow per-source overriding):
2159            local open Compiler.Control.LambdaSplitting in
2160                val _ = set (Default (SOME a))
2161            end
2162    
2163       - To turn lambda-splitting off for a given ML source file (say: a.sml)
2164         write (in the respective .cm-file):
2165            a.sml (lambdasplitting:off)
2166       - To turn lambda-splitting for a.sml on with minimal aggressiveness:
2167            a.sml (lambdasplitting:on)
2168       - To turn lambda-splitting for a.sml on with aggressiveness <a> (where
2169         <a> is a decimal non-negative integer):
2170            a.sml (lambdasplitting:<a>)
2171       - To turn lambda-splitting for a.sml on with maximal aggressiveness:
2172            a.sml (lambdasplitting:infinity)
2173       - To use the global default for a.sml:
2174            a.sml (lambdasplitting:default)
2175         or simply
2176            a.sml
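
The per-source values listed above can be summarized as a small decision
table. The following Python sketch is only an illustration of that table; the
function name and the tuple encoding are hypothetical and are not CM's
implementation. It also assumes that "minimal aggressiveness" corresponds to
the value 0, which the entry does not state explicitly.

```python
import math

def parse_lambdasplitting(value=None):
    """Map a per-source 'lambdasplitting' value to (mode, aggressiveness).

    Hypothetical helper for illustration only; not part of CM.
    """
    if value is None or value == "default":
        return ("default", None)       # defer to the global setting
    if value == "off":
        return ("off", None)           # no lambda-splitting for this file
    if value == "on":
        return ("on", 0)               # assumption: minimal aggressiveness is 0
    if value == "infinity":
        return ("on", math.inf)        # maximal aggressiveness
    if value.isascii() and value.isdigit():
        return ("on", int(value))      # decimal non-negative integer <a>
    raise ValueError(f"unrecognized lambdasplitting value: {value!r}")
```

A plain "a.sml" entry corresponds to calling the helper with no argument, so it
falls back to the global default.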
2177    
2178    ----------------------------------------------------------------------
2179    Name: Allen Leung
2180    Date: 2001/05/04 01:57:00 EDT
2181    Tag: leunga-20010504-sync
2182    Description:
2183    
2184      MLRISC features.
2185    
2186      1. Fix to CMPXCHG instructions.
2187      2. Changed RA interface to allow annotations in callbacks.
2188      3. Added a new method to the stream interface to allow annotation updates.
2189    
2190    ----------------------------------------------------------------------
2191    Name: Matthias Blume
2192    Date: 2001/05/01 11:45:00 EDT
2193    Tag: blume-20010501-pcedittmp
2194    Description:
2195    
2196    Changed install.sh to use the current working directory instead of
2197    /usr/tmp for a temporary file (pcedittmp).  The previous choice
2198    of /usr/tmp caused trouble with MacOS X because of file permission
2199    problems.
2200    
2201    ----------------------------------------------------------------------
2202    Name: Matthias Blume
2203    Date: 2001/04/20 11:10:00 EDT
2204    Tag: blume-20010420-inMLflag
2205    Description:
2206    
2207     - added vp_limitPtrMask to vproc-state.h
2208       (for use by the raw-C-calls mechanism to implement proper interrupt
2209        handling)
2210     - made the ML compiler aware of various data-structure offsets so it
2211       can generate code for accessing the vp_inML flag and vp_limitPtrMask
2212     - tweaked mlriscGen.sml to have it emit interrupt-handling code for
2213       raw C-calls
2214    
2215    ----------------------------------------------------------------------
2216    Name: Lal George
2217    Date: 2001/04/20 09:15:28 EDT
2218    Tag: george-20010420-macosX
2219    Description:
2220    
2221     - Changes to port to Mac OS X; Darwin.
2222    
2223     - In the process I found that sqrt was broken on the PPC, because the
2224       fsqrt instruction is not implemented.
2225    
2226    ----------------------------------------------------------------------
2227    Name: Matthias Blume
2228    Date: 2001/04/18 12:45:00 EDT
2229    Tag: blume-20010418-ccalls
2230    Description:
2231    
2232     - fixed two off-by-4 errors in the x86-specific c-calls implementation
2233       (this bug prevented structure arguments containing pointers from being
2234        passed correctly)
2235     - changed the raw-C-call code in mlriscGen.sml in such a way that
2236       structure arguments are represented as a pointer to the beginning
2237       of the structure (instead of having a series of synthesized arguments,
2238       one for each structure member)
2239    
2240     - made makeml script's verbosity level configurable via environment
2241       variable (MAKEML_VERBOSITY)
2242    
2243     - eliminated placeholder implementations for f32l, w16s, i16s, and f32s
2244       in rawmem-x86.sml; we are now using the real thing
2245    
2246    ----------------------------------------------------------------------
2247    Name: Matthias Blume
2248    Date: 2001/03/22 16:25:00 EST
2249    Tag: blume-20010322-bootfiles
2250    Description:
2251    
2252    Created a new set of bootfiles (for your automatic installation convenience).
2253    
2254    ----------------------------------------------------------------------
2255    Name: Matthias Blume
2256    Date: 2001/03/22 15:10:00 EST
2257    Tag: blume-20010322-rawmem-parcm
2258    Description:
2259    
2260    1. All "raw memory access" primitives for the new FFI are implemented now
2261       (at least on the x86).
2262    2. Some further cleanup of CM's parallel make mechanism.
2263    
2264    ----------------------------------------------------------------------
2265    Name: Matthias Blume
2266    Date: 2001/03/19 17:53:00 EST
2267    Tag: blume-20010319-parallel
2268    Description:
2269    
2270    Parallel make (using compile servers) now works again.
2271    
2272    To this end, CM.stabilize and CMB.make have been modified to work in
2273    two passes when compile servers are attached:
2274       1. Compile everything, do not perform stabilization; this pass
2275          uses compile servers
2276       2. Stabilize everything; this pass does not use compile servers
2277    If there are no compile servers, the two passes are combined into one
2278    (as before).  Splitting the passes increases the inherent parallelism
2279    in the dependency graph because the entire graph including all
2280    libraries is available at the same time.  This, in turn, improves
2281    server utilization.  The downside is that the master process will
2282    have to do some extra work after compilation is done (because for
2283    technical reasons it must re-read all the binfiles during stabilization).
2284    
2285    ----------------------------------------------------------------------
2286    Name: Matthias Blume
2287    Date: 2001/03/16 12:22:00 EST
2288    Tag: blume-20010316-bootfiles
2289    Description:
2290    
2291    Created a new set of bootfiles (for your automatic installation convenience).
2292    
2293    ----------------------------------------------------------------------
2294    Name: Matthias Blume
2295    Date: 2001/03/16 11:00:00 EST
2296    Tag: blume-20010316-MLTREE-fixup
2297    Description:
2298    
2299    This is a minor fixup for an (untagged) earlier commit by Allen.
2300    (A file was missing).
2301    
2302    ----------------------------------------------------------------------
2303    Name: Allen Leung
2304    Date: Mon Mar  5 18:54:57 EST 2001
2305    Tag: leunga-20010305-cut-support
2306    
2307    1. New support for alternative control-flow in MLTREE.
2308       Currently we support
2309    
2310          FLOW_TO(CALL ...., [k1,...,kn])
2311    
2312       This is needed for 'cuts to' in C-- and try/handle-like constructs
2313       in Moby
2314    
2315       New assembler flag "asm-show-cutsto" to turn on control-flow debugging.
2316    
2317    2. Register Allocator
2318    
2319       Changes in interface [from Fermin, John]
2320    
2321    3. Alpha 8-bit SLL support [Fermin]
2322    
2323    4. All architectures
2324    
2325       A new module (ClusterExpandCopies) for expanding parallel copies.
2326    
2327    ----------------------------------------------------------------------
2328    Name: Allen Leung
2329    Date: 2001/02/27 23:07:00 EST
2330    Tag: leunga-20010227-minor-stuff
2331    
2332    1. Alpha bug fix for CMOVNE
2333    2. Handle mltree COND(..,FCMP ...,...)
2334    3. Bug fix in simplifier
2335    
2336    ----------------------------------------------------------------------
2337    Name: Matthias Blume
2338    Date: 2001/01/30 17:50:00 EST
2339    Tag: blume-20010130-sync
2340    Description:
2341    
2342    This is just a minor update to sync my devel branch with the main branch.
2343    The only visible change is the addition of some README files.
2344    
2345    ----------------------------------------------------------------------
2346    Name: Matthias Blume
2347    Date: 2001/01/12 23:30:00 JST
2348    Tag: blume-20010112-bootfiles
2349    Description:
2350    
2351    Made a new set of bootfiles that goes with the current state of the
2352    repository.
2353    
2354    ----------------------------------------------------------------------
2355    Name: Matthias Blume
2356    Date: 2001/01/12 21:20:00 JST
2357    Tag: blume-20010112-sync
2358    Description:
2359    
2360    I am just flushing out some minor changes that had accumulated in
2361    my private branch in order to sync with the main tree.  (This is
2362    mainly because I had CVS trouble when trying to merge _into_ my
2363    private branch.)
2364    
2365    Most people should be completely unaffected by this.
2366    
2367    ----------------------------------------------------------------------
2368    Name: Allen Leung
2369    Date: Thu Jan 11 21:03:00 EST 2001
2370    Tag: leunga-20010111-labexp=mltree
2371    Description:
2372    
2373    1.  Removed the type LabelExp and replaced it with MLTree.
2374    2.  Rewrote mltree-simplify with the pattern matcher tool.
2375    3.  There were some bugs in alpha code generator which would break
2376        64-bit code generation.
2377    4.  Redo the tools to generate code with the
2378    5.  The CM files in MLRISC (and in src/system/smlnj/MLRISC)
2379        are now generated by perl scripts.
2380    
2381    ----------------------------------------------------------------------
2382    Name: Matthias Blume
2383    Date: 2001/01/10 21:55:00 JST
2384    Tag: blume-20010110-rcc
2385    Description:
2386    
2387    The RCC stuff now seems to work (but only on the x86).
2388    This required hacking of the c-calls interface (and -implementation) in
2389    MLRISC.
2390    
2391    Normal compiler users should be unaffected.
2392    
2393    ----------------------------------------------------------------------
2394    Name: Matthias Blume
2395    Date: 2001/01/09 01:20:00 JST
2396    Tag: blume-20010109-rcc
2397    Description:
2398    
2399    This is a fairly big patch, flushing out a large number of pending
2400    changes that I made to my development copy over the last couple of days.
2401    
2402    Of practical relevance at this moment is a workaround for a pickling
2403    bug that Allen ran into the other day.  The cause of the bug itself is
2404    still unknown and it might be hard to fix it properly, but the
2405    workaround has some merits of its own (namely somewhat reducing pickling
2406    overhead for certain libraries).  Therefore, I think this solution should
2407    be satisfactory at this time.
2408    
2409    The rest of the changes (i.e., the vast majority) have to do with my
2410    ongoing efforts of providing direct support for C function calls from
2411    ML.  At the moment there is a new primop "RAW_CCALL", typing magic
2412    in types/cproto.sml (invoked from FLINT/trans/translate.sml), a new
2413    case in the FLINT CPS datatype (RCC), changes to cps/convert.sml to
2414    translate uses of RAW_CCALL into RCC, and changes to mlriscGen.sml to
2415    handle RCC.
2416    
2417    The last part (the changes to mlriscGen.sml) is still known to be
2418    wrong on the x86 and not implemented on all other architectures.  But
2419    the infrastructure is in place. I had to change a few functor
2420    signatures in the backend to be able to route the CCalls interface
2421    from MLRISC there, and I had to specialize the mltree type (on the
2422    x86) to include the necessary extensions. (The extensions themselves
2423    were already there and ready to go in MLRISC/x86).
2424    
2425    Everything should be very happy as soon as someone helps me with
2426    mlriscGen.sml...
2427    
2428    In any case, none of this should matter to anyone as long as the
2429    new primop is not being used (which is going to be the case unless you
2430    find it where I hid it :). The rest of the compiler is completely
2431    unaffected.
2432    
2433    ----------------------------------------------------------------------
2434    Name: Matthias Blume
2435    Date: 2001/01/05 00:30:00 JST
2436    Tag: blume-20010105-primops
2437    Description:
2438    
2439    Added some experimental support for work that I am doing right now.
2440    These changes mostly concern added primops, but there is also a new
2441    experimental C library in the runtime system (but currently not enabled
2442    anywhere except on Linux/X86).
2443    
2444    In the course of adding primops (and playing with them), I discovered that
2445    Zhong's INL_PRIM hack (no type info for certain primops) was, in fact, badly
2446    broken.  (Zhong was quite right when he labeled this stuff as "major gross hack".)
2447    To recover, I made type information in INL_PRIM mandatory and changed
2448    prim.sml as well as built-in.sml accordingly.  The InLine structure now
2449    has complete, correct type information (i.e., no bottom types).
2450    
2451    Since all these changes mean that we need new binfiles, I also bumped the
2452    version number to 110.32.1.
2453    
2454    ----------------------------------------------------------------------
2455    Name: Matthias Blume
2456    Date: 2000/12/30 22:10:00 JST
2457    Tag: blume-20001230-various
2458    Description:
2459    
2460    Added proxy libraries for MLRISC and let MLRISC libraries refer
2461    to each other using path anchors.  (See CM manual for explanation.)
2462    
2463    Updated CM documentation.
2464    
2465    Fixed some bugs in CM.
2466    
2467    Implemented "proxy" libraries (= syntactic sugar for CM).
2468    
2469    Added "-quiet" option to makeml and changed runtime system accordingly.
2470    
2471    Added cleanup handler for exportML to reset timers and compiler stats.
2472    
2473    ----------------------------------------------------------------------
2474    Name: Lal George
2475    Date: 2000/12/22 22:22:58 EST 2000
2476    Tag: Release_110_32
2477    Description:
2478    
2479            Infinite precision used throughout MLRISC.
2480            see MLRISC/mltree/machine-int.sig
2481    
2482    ----------------------------------------------------------------------
2483    Name: Matthias Blume
2484    Date: 2000/12/22 23:16:00 JST
2485    Tag: blume-20001222-warn
2486    Description:
2487    
2488    Corrected wording and formatting of some CM warning messages which I
2489    broke in my previous patch.
2490    
2491    ----------------------------------------------------------------------
2492    Name: Matthias Blume
2493    Date: 2000/12/22 21:20:00 JST
2494    Tag: blume-20001222-anchorenv
2495    Description:
2496    
2497    Fixed CM's handling of anchor environments in connection with CMB.make.
2498    
2499    ----------------------------------------------------------------------
2500    Name: Matthias Blume
2501    Date: 2000/12/22 13:15:00 JST
2502    Tag: blume-20001222-cleanup
2503    Description:
2504    
2505    Removed src/cm/ffi which does not (and did not) belong here.
2506    
2507    ----------------------------------------------------------------------
2508    Name: Matthias Blume
2509    Date: 2000/12/21 23:55:00 JST
2510    Tag: blume-20001221-exn
2511    Description:
2512    
2513    Probably most important: CM no longer silently swallows all exceptions
2514    in the compiler.
2515    Plus: some other minor CM changes.  For example, CM now reports some
2516    sizes for generated binfiles (code, data, envpickle, lambdapickle).
2517    
2518    ----------------------------------------------------------------------
2519    Name: Matthias Blume
2520    Date: 2000/12/15 00:01:05 JST
2521    Tag: blume-20001215-dirtool
2522    Description:
2523    
2524    - "dir" tool added.
2525    - improvements and cleanup to Tools structure
2526    - documentation updates
2527    
2528    ----------------------------------------------------------------------
2529    Name: Allen Leung
2530    Date: Thu Dec 14 03:45:24 EST 2000
2532    Tag:  leunga-20001214-int-inf
2533    Description:
2534    
2535       In IntInf, added these standard functions, which were missing from our
2536    implementation:
2537    
2538        andb : int * int -> int
2539        xorb : int * int -> int
2540        orb  : int * int -> int
2541        notb : int -> int
2542        <<   : int * word -> int
2543        ~>>  : int * word -> int
2544    
2545       Not tested, I hope they are correct.
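
Since the entry notes that these functions were not tested, a few reference
values may help. Python's built-in integers are arbitrary-precision and its
bitwise operators act on an infinite two's-complement representation, which is
the behavior the Basis Library specifies for IntInf. The sketch below uses
Python as a stand-in oracle; the Python function names shl and ashr are
hypothetical labels for << and ~>>.

```python
# Reference semantics for the IntInf bit operations, using Python ints
# (arbitrary precision, infinite two's complement) as an oracle.

def andb(i, j):
    return i & j

def orb(i, j):
    return i | j

def xorb(i, j):
    return i ^ j

def notb(i):
    return ~i              # equals -(i + 1)

def shl(i, n):             # stands in for IntInf.<<
    return i << n

def ashr(i, n):            # stands in for IntInf.~>> (sign-preserving)
    return i >> n

if __name__ == "__main__":
    print(andb(-1, 0xFF), orb(-8, 3), xorb(-1, 5), notb(0))
    print(shl(1, 100) == 2 ** 100, ashr(-5, 1))
```

Negative operands are the interesting cases, because they exercise the
infinite sign extension.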
2546    
2547    ----------------------------------------------------------------------
2548    Name: Allen Leung
2549    Date: Fri Dec  8 19:23:26 EST 2000
2551    Tag:  leunga-20001208-nowhere
2552    Description:
2553    
2554      Slight improvements to the 'nowhere' tool to handle OR-patterns,
2555    to generate better error messages etc.  Plus a brief manual.
2556    
2557    ----------------------------------------------------------------------
2558    Name: Lal George
2559    Date: 2000/12/08 09:54:02 EST 2000
2560    Tag: Release_110_31
2561    Description:
2562    
2563    - Version 110.31
2564    ----------------------------------------------------------------------
2565    Name: Allen Leung
2566    Date: Thu Dec  7 22:01:04 EST 2000
2567    Tag:  leunga-20001207-cell-monster-hack
2568    Description:
2569    
2570    Major MLRISC internal changes.  Affects all clients.
2571    Summary:
2572    
2573    1.  Type CELLS.cell = int is now replaced by a datatype.
2574        As a result, the old regmap is now gone.  Almost all interfaces
2575        in MLRISC change as a consequence.
2576    
2577    2.  A brand new version of the machine description tool (v3.0) that generates
2578        modules expecting the new interface.  The old version is removed.
2579    
2580    3.  The RA interface has been further abstracted into two new functors.
2581        RISC_RA and X86RA.  These functors have much simpler interfaces.
2582        [See also directory MLRISC/demo.]
2583    
2584    4.  Some other new source->source code generation tools are available:
2585    
2586        a. MLRISC/Tools/RewriteGen -- generate rewriters from rules.
2587        b. MLRISC/Tools/WhereGen -- expands conditional pattern matching rules.
2588           I use this tool to generate the peephole optimizers---with the new
2589           cell type changes, peephole rules are becoming difficult to write
2590           without conditional pattern matching.
2591    
2592    5.  More Intmap -> IntHashTable changes.  Previous changes by Matthias didn't
2593        cover the entire MLRISC source tree so many things broke.
2594    
2595    6.  CM files have been moved to the subdirectory MLRISC/cm.
2596        They are moved because there are a lot of them and they clutter up the
2597        root dir.
2598    
2599    7.  More detailed documentation to come...
2600    
2601        NOTE: To rebuild from 110.30 (ftp distribution), you'll have to do
2602        a makeml -rebuild first.  This is because of other
2603        changes that Matthias has made (see below).
2604    
2605    
2606    ----------------------------------------------------------------------
2607    Name: Matthias Blume
2608    Date: 2000/11/30 23:12:00 JST
2609    Tag: blume-20001130-filereorg
2610    Description:
2611    
2612    Some manual updates and some file reorganizations in CM.
2613    
2614    ----------------------------------------------------------------------
2615    Name: Matthias Blume
2616    Date: 2000/11/24 17:45:00 JST
2617    Tag: blume-20001124-link
2618    Description:
2619    
2620    Drastically improved link traversal code for the case that the dynamic
2621    value was already loaded at bootstrap time.  As a result, CM and CMB
2622    now both load blazingly fast -- even on a very slow machine.  Also,
2623    memory consumption has been further reduced by this.
2624    
2625    Warning: The format of the PIDMAP file has changed.  Therefore, to
2626    bootstrap you have to do this:
2627    
2628    1. Run CMB.make
2629    2. Make a symbolic link for the boot directory:
2630         ln -s sml.boot.ARCH-OS xxx
2631    3. "Rebuild" the boot directory:
2632         ./makeml -boot xxx -rebuild sml ; rm xxx
2633    4. Boot normally:
2634          ./makeml
2635    
2636    ----------------------------------------------------------------------
2637    Name: Matthias Blume
2638    Date: 2000/11/21 21:20:00 JST
2639    Tag: blume-20001121-tools
2640    Description:
2641    
2642    Continued hacking on autoloading problem -- with success this time.
2643    Also changed tool-plugin mechanism.  See new CM manual.
2644    
2645    ----------------------------------------------------------------------
2646    Name: Matthias Blume
2647    Date: 2000/11/19 14:30:00 JST
2648    Tag:  blume-20001119-autoload
2649    Description:
2650    
2651    Some hacking to make autoloading faster.  Success for CMB, no success
2652    so far for CM.  There is a reduced structure CM' that autoloads faster.
2653    (This is a temporary, non-documented hack to be eliminated again when
2654    the general problem is solved.)
2655    
2656    ----------------------------------------------------------------------
2657    Name: Matthias Blume
2658    Date: 2000/11/17 14:10:00 JST
2659    Tag: blume-20001117-pickle-lib
2660    Description:
2661    
2662    1. Eliminated comp-lib.cm
2663    2. Made pickle-lib.cm
2664    3. Eliminated all uses of intset.sml (from comp-lib.cm)
2665    4. Replaced all uses of intmap.{sig,sml} (from comp-lib.cm) with
2666       equivalent constructs from smlnj-lib.cm (IntHashTable).
2667    5. Point 4. also goes for those uses of intmap.* in MLRISC.
2668       Duplicated intmap modules thrown out.
2669    6. Hunted down all duplicated SCC code and replaced it with
2670       equivalent stuff (GraphSCCFn from smlnj-lib.cm).
2671    7. Rewrote Feedback module.
2672    8. Moved sortedlist.sml into viscomp-lib.cm.  Eventually it
2673       should be thrown out and equivalent modules from smlnj-lib.cm
2674       should be used (IntRedBlackSet, IntListSet, ...).
2675    
2676    Confirmed that compiler compiles to fixpoint.
2677    
2678    ----------------------------------------------------------------------
2679    Name: Allen Leung
2680    Date: 2000/11/10 18:00:00
2681    Tag: leunga-20001110-new-x86-fp
2682    
2683    A new x86 floating point code generator has been added.
2684    By default this is turned off.  To turn this on, do:
2685    
2686        CM.autoload "$smlnj/compiler.cm";
2687        Compiler.Control.MLRISC.getFlag "x86-fast-fp" := true;
2688    
2689    Changes:
2690    
2691    1.  Changed FTAN to FPTAN so that the assembly output is correct.
2692    2.  Changed the extension callback for FTANGENT to generate:
2693    
2694              fptan
2695              fstp  %st(0)
2696        instead of
2697              fptan
2698              fstpl ftempmem
2699    
2700    3.  Numerous assembly fixes for x86.
2701    
2702    5.  Cleaned up the machine code output module x86/x86MC.sml and added
2703        support for a whole bunch of instructions and addressing modes:
2704    
2705          fadd/fsub/fsubr/fmul/fdiv/fdivr  %st, %st(n)
2706          faddp/fsubp/fsubrp/fmulp/fdivp/fdivrp  %st, %st(n)
2707          fadd/fsub/fsubr/fmul/fdiv/fdivr  %st(n), %st
2708          fiadd/fisub/fisubr/fimul/fidiv/fidivr mem
2709          fxch %st(n)
2710          fld %st(n)
2711          fst %st(n)
2712          fst mem
2713          fstp %st(n)
2714          fucom %st(n)
2715          fucomp %st(n)
2716    
2717        All these are now generated when the fast fp mode is turned on.
2718    
2719    6.  Removed the dedicated registers %st(0), ..., %st(7) from X86CpsRegs
2720    
2721    ----------------------------------------------------------------------
2722    Name: Matthias Blume
2723    Date: 2000/11/09 11:20:00 JST
2724    Tag: blume-20001109-scc
2725    Description:
2726    
2727    Eliminated some code duplication:
2728    
2729    1. Added "where" clause to GraphSCCFn in SML/NJ Library.
2730       (Otherwise the functor is useless.)
2731    2. Used GraphSCCFn where SCCUtilFun was used previously.
2732    3. Got rid of SCCUtilFun (in comp-lib.cm).
2733    
2734    ----------------------------------------------------------------------
2735    Name: Lal George
2736    Date: 2000/11/06 09:02:21 EST 2000
2737    Tag: Release_110_30
2738    Description:
2739    
2740    - Version 110.30
2741    ----------------------------------------------------------------------
2742    Name: Matthias Blume
2743    Date: 2000/11/04 14:45:00
2744    Tag: blume-20001104-mlbuild
2745    Description:
2746    
2747    - Made ml-build faster on startup.
2748    - Documentation fixes.
2749    
2750    ----------------------------------------------------------------------
2751    Name: Matthias Blume
2752    Date: 2000/11/02 17:00:00 JST
2753    Tag: blume-20001102-condcomp
2754    Description:
2755    
2756    - Small tweaks to pickler -- new BOOTFILES!
2757    - Version bumped to 110.29.2.
2758    - Added conditional compilation facility to init.cmi (see comment there).
2759    ----------------------------------------------------------------------
2760    Name: Allen Leung
2761    Date: 2000/10/23 19:31:00
2762    Tag: leunga-20001023-demo-ra
2763    
2764    1. Minor RA changes that improve spilling on x86 (affects Moby and C-- only)
2765    2. Test programs for the graph library updated
2766    3. Some new MLRISC demo programs added
2767    
2768    ----------------------------------------------------------------------
2769    Name: Matthias Blume
2770    Date: 2000/08/31 22:15:00 JST
2771    Tag: blume-20001017-errmsg
2772    Description:
2773    
2774    More error message grief: Where there used to be no messages, there
2775    now were some that had bogus error regions.  Fixed.
2776    
2777    ----------------------------------------------------------------------
2778    Name: Matthias Blume
2779    Date: 2000/08/31 17:30:00 JST
2780    Tag: blume-20001017-v110p29p1
2781    Description:
2782    
2783    I made a version 110.29.1 with new bootfiles.
2784    
2785    Changes:  Modified pickler/unpickler for faster and leaner unpickling.
2786              CM documentation changes and a small bugfix in CM's error reporting.
2787    
2788    ----------------------------------------------------------------------
2789    Name: Lal George
2790    Date: 2000/09/27 14:42:35 EDT
2791    Tag: george-20000927-nodestatus
2792    Description:
2793    
2794    Changed the type of the nodestatus, so that:
2795    
2796            SPILLED(~1)             is now SPILLED
2797            SPILLED(m) where m>=0   is now MEMREG(m)
2798            SPILLED(s) where s<~1   is now SPILL_LOC(~s)
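
The three cases above partition the old integer payload. As a rough
illustration, the Python sketch below translates an old SPILLED(n) payload into
the new constructor, encoded here as a tuple. The function name and the tuple
encoding are hypothetical; the actual type lives in the MLRISC register
allocator.

```python
def convert_nodestatus(n):
    """Translate the payload of an old SPILLED(n) into the new constructor.

    Illustration only: constructors are modeled as plain tuples.
    """
    if n == -1:
        return ("SPILLED",)        # SPILLED(~1)       is now SPILLED
    if n >= 0:
        return ("MEMREG", n)       # SPILLED(m), m>=0  is now MEMREG(m)
    return ("SPILL_LOC", -n)       # SPILLED(s), s<~1  is now SPILL_LOC(~s)
```

For example, an old SPILLED(~5) maps to SPILL_LOC(5), and SPILLED(0) maps to
MEMREG(0).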
2799    
2800    ----------------------------------------------------------------------
2801    Name: Matthias Blume
2802    Date: 2000/09/07 14:45:00 JST
2803    Tag: blume-20000907-cmerrmsg
2804    Description:
2805    
2806    Small tweak to CM to avoid getting ML syntax error messages twice.
2807    
2808    ----------------------------------------------------------------------
2809    Name: Matthias Blume
2810    Date: 2000/08/31 18:00:00 JST
2811    Tag: blume-20000831-cvsbootfiles
2812    Description:
2813    
2814    New URL for boot files (because the 110.29 files on the BL server do
2815    now work correctly with my updated install scripts for yacc and lex).
2816    
2817    ----------------------------------------------------------------------
2818    Name: Matthias Blume
2819    Date: 2000/08/08 12:33:00 JST
2820    Tag: blume-20000808-manual
2821    Description:
2822    
2823    Tiny update to CM manual.
2824    
2825    ----------------------------------------------------------------------
2826    Name: Allen Leung
2827    Date: 2000/08/7 19:31:00
2828    Tag: leunga-20000807-a-whole-bunch-of-stuff
2829    
2830      Moby, C--, SSA, x86, machine descriptions etc.  Should only affect C--
2831    and Moby.
2832    
2833    1.  x86
2834    
2835       a.  Fixes to peephole module by John and Dan.
2836       b.  Assembly fix to SETcc by Allen.
2837       c.  Fix to c-call by John.
2838       d.  Fix to spilling by John.  (This one deals with the missing FSTPT case)
2839       e.  Instruction selection optimization to SETcc as suggested by John.
2840    
2841           For example,
2842    
2843            MV(32, x, COND(32, CMP(32, LT, a, b), LI 1, LI 0))
2844    
2845           should generate:
2846    
2847            MOVL a, x
2848            SUBL b, x
2849            SHRL 31, x
2850    
2851    2.  IR stuff
2852    
2853         A bunch of new DJ-graph related algorithms added.  These
2854         speed up SSA construction.
2855    
2856    3.  SSA + Scheduling
2857    
2858         Added code for SSA and scheduling to the repository
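
The SETcc optimization in item 1e relies on the sign bit of a - b. The Python
sketch below models the three-instruction sequence on 32-bit values to show the
arithmetic; the helper names are hypothetical and the model is not the MLRISC
instruction selector. It assumes SHRL is a logical shift and that the operands
are signed 32-bit integers.

```python
MASK32 = 0xFFFFFFFF

def to_u32(v):
    """Wrap a Python int to its 32-bit two's-complement bit pattern."""
    return v & MASK32

def lt_via_sub_shift(a, b):
    """Model of: MOVL a, x ; SUBL b, x ; SHRL 31, x"""
    x = to_u32(a)                  # MOVL a, x
    x = to_u32(x - to_u32(b))      # SUBL b, x  (wraps modulo 2**32)
    x = x >> 31                    # SHRL 31, x (logical shift: keeps sign bit)
    return x

if __name__ == "__main__":
    for a, b in [(3, 5), (5, 3), (4, 4), (-7, 2), (-2, -3)]:
        print(a, b, lt_via_sub_shift(a, b), int(a < b))
```

The result equals the signed comparison a < b whenever a - b fits in 32 bits.
If the subtraction overflows (for example a = -2**31 and b = 1), the sign bit
of the wrapped difference no longer reflects the comparison.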
2859    
2860    ----------------------------------------------------------------------
2861    Name: Lal George
2862    Date: 2000/07/27 11:53:14 EDT
2863    
2864    Tag: lal-20000727-linux-ppc
2865    Description:
2866    
2867     Made changes to support Linux PPC.
2868     p.s. I have confirmation that the 110.29 boot files work fine.
2869    
2870    ----------------------------------------------------------------------
2871    Name: Matthias Blume
2872    Date: 2000/07/27 17:40:00 JST
2873    Tag: blume-20000727-scripts
2874    Description:
2875    
2876    !!!! WARNING !!!!
2877    You must recompile the runtime system!
2878    !!!! WARNING !!!!
2879    
2880    This is basically another round of script-enhancements:
2881    
2882    1. sml, ml-build, and ml-makedepend accept options -D and -U to define
2883       and undefine CM preprocessor symbols.
2884    
2885    2. ml-build avoids generating a new heap image if it finds that the
2886       existing one is still ok.  (The condition is that no ML file had to
2887       be recompiled and all ML files are found to be older than the heap
2888       file.)
2889    
2890       To make this work smoothly, I also hacked the runtime system as
2891       well as SMLofNJ.SysInfo to get access to the heap image suffix
2892       (.sparc-solaris, ...) that is currently being used.
2893    
2894       Moreover, the signature of CM.mk_standalone has changed.  See the
2895       CM manual.
2896    
2897    3. ml-makedepend accepts additional options -n, -a, and -o.  (See the
2898       CM manual for details.)
2899    
2900    4. More CM manual updates:
2901        - all of the above has been documented.
2902        - there is now a section describing the (CM-related) command line
2903          arguments that are accepted by the "sml" command
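
The condition in item 2 (nothing recompiled, and every ML file older than the
heap file) can be expressed as a timestamp check. The Python sketch below is
one way to state it; the function and parameter names are hypothetical and this
is not how ml-build itself is implemented.

```python
import os

def heap_image_is_current(heap_path, ml_sources, recompiled):
    """Return True if an existing heap image can be reused.

    heap_path  -- path of the heap image, including its ARCH-OS suffix
    ml_sources -- paths of all ML source files of the program
    recompiled -- True if any ML file had to be recompiled in this run
    """
    if recompiled or not os.path.exists(heap_path):
        return False
    heap_mtime = os.path.getmtime(heap_path)
    # Every source must be strictly older than the heap image.
    return all(os.path.getmtime(src) < heap_mtime for src in ml_sources)
```

A caller would regenerate the heap image only when this check returns False.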
2904    
2905    ----------------------------------------------------------------------
2906    Name: Matthias Blume
2907    Date: 2000/07/25 16:20:00 JST
2908    Tag: blume-20000725-makedepend
2909    Description:
2910    
2911    Added a script called ml-makedepend.  This can be used in makefiles
2912    for Unix' make in a way very similar to the "makedepend" command for
2913    C.
2914    
2915    The script internally uses function CM.sources.
2916    
2917    Synopsis:
2918    
2919        ml-makedepend [-f makefile] cmfile targetname
2920    
2921    The default for the makefile is "makefile" (or "Makefile" should
2922    "makefile" not exist).
2923    
2924    ml-makedepend adds a cmfile/targetname-specific section to this
2925    makefile (after removing the previous version of this section).  The
2926    section contains a single dependency specification with targetname on
2927    the LHS (targetname is an arbitrary name), and a list of files derived
2928    from the cmfile on the RHS.  Some of the files on the RHS are
2929    ARCH/OPSYS-specific.  Therefore, ml-makedepend inserts references to
2930    "make" variables $(ARCH) and $(OPSYS) in place of the corresponding
2931    path names.  The makefile writer is responsible for making sure that
2932    these variables have correct values at the time "make" is invoked.
2933    
2934    ----------------------------------------------------------------------
2935    Name: Matthias Blume
2936    Date: 2000/07/22 23:30:00 JST
2937    Tag: blume-20000722-urlupdate
2938    Description:
2939    
2940    Changed BOOT and config/srcarchiveurl to point to BL server:
2941    
2942        ftp://ftp.research.bell-labs.com/dist/smlnj/working/110.29/
2943    
2944    ----------------------------------------------------------------------
2945    Name: Matthias Blume
2946    Date: 2000/07/18 18:00:00 JST
2947    Tag: blume-20000718-Version_110_29
2948    Description:
2949    
2950    1. Updated src/compiler/TopLevel/main/version.sml to version 110.29
2951    
2952    2. Updated config/version to 110.29
2953    
2954    3. Updated config/srcarchiveurl
2955    
2956    4. New boot files!
2957       ftp://ftp.cs.princeton.edu/pub/people/blume/sml/110.29-autofetch
2958    
2959    ----------------------------------------------------------------------
2960    Name: Matthias Blume
2961    Date: 2000/07/11 13:58:00 JST
2962    Tag: blume-20000711-doctypo
2963    Description:
2964    
2965    Fixed a few typos in CM manual.
2966    
2967    ----------------------------------------------------------------------
2968    Name: Allen Leung
2969    Date: 2000/06/15 00:38:00
2970    Tag: leunga-20000704-sparc-x86
2971    
2972    1. x86 peephole improvement sp += k; sp -= k => nop  [from John]
2973    2. fix to x86 RET bug [found by Dan Grossman]
2974    3. sparc assembly bug fix for ticc instructions [found by Fermin]
2975    
2976       Affects c-- and moby only
2977    
2978    ----------------------------------------------------------------------
2979    Name: Matthias Blume
2980    Date: 2000/07/04 15:26:00
2981    Tag: blume-20000704-trigger
2982    Description:
2983    
2984    1. Improvements to CM manual.
2985    2. SMLofNJ.Internals.BTrace.trigger reinstated as an alternative way
2986       of getting a back-trace.  The function, when called, raises an
2987       internal exception which explicitly carries the full back-trace history,
2988       so it is unaffected by any intervening handle-raise pairs ("trivial"
2989       or not).  The interactive loop will print that history once it arrives
2990       at top level.
2991       Short of having all exceptions implicitly carry the full history, the
2992       recommended way of using this facility is:
2993         - compile your program with instrumentation "on"
2994         - run it; when it raises an exception, look at the history
2995         - if the history is "cut off" because of some handler, go and modify
2996           your program so that it explicitly calls BTrace.trigger
2997         - recompile (still instrumented), and rerun; look at the full history
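
As an illustration of the third step, here is a minimal sketch of a
handler rewritten to call BTrace.trigger.  The names NotFound, lookup,
and find are hypothetical; only SMLofNJ.Internals.BTrace.trigger comes
from this entry.  The code must be compiled with instrumentation "on"
for the back-trace to contain anything useful.

```sml
(* Illustrative only: NotFound, lookup, and find are hypothetical names. *)
exception NotFound

fun lookup (_, []) = raise NotFound
  | lookup (key : string, (k, v) :: rest) =
      if k = key then v else lookup (key, rest)

(* A handler such as
 *     handle NotFound => (print "lookup failed\n"; raise Fail key)
 * re-raises from inside a sequence, so the history seen at top level
 * would presumably be cut off here.  Calling trigger instead raises the
 * internal exception that carries the full back-trace history. *)
fun find table key =
    lookup (key, table)
    handle NotFound =>
        (print "lookup failed\n"; SMLofNJ.Internals.BTrace.trigger ())
```

When the key is present, find behaves like lookup; the trigger call is
reached only on the failure path.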
2998    
2999    ----------------------------------------------------------------------
3000    Name: Matthias Blume
3001    Date: 2000/07/03 15:36:00 JST
3002    Tag: blume-20000702-manual
3003    Description:
3004    
3005    Small corrections and updates to CM manual.
3006    
3007    ----------------------------------------------------------------------
3008    Name: Matthias Blume
3009    Date: 2000/06/29 16:04:00 JST
3010    Tag: blume-20000629-yacctool
3011    Description:
3012    
3013    Changes:
3014    
3015    1. Class "mlyacc" now takes separate arguments to pass options to
3016       generated .sml- and .sig-files independently.
3017    2. Corresponding CM manual updates.
3018    3. BTrace module now also reports call sites.  (However, for loop clusters
3019       it only shows from where the cluster was entered.)  There are associated
3020       modifications to core.sml, internals.{sig,sml}, btrace.sml, and btimp.sml.
3021    
3022    ----------------------------------------------------------------------
3023    Name: Matthias Blume
3024    Date: 2000/06/27 16:51:00 JST
3025    Tag: blume-20000627-noweb
3026    Description:
3027    
3028    Changes:
3029    
3030     1. Implemented "subdir" and "witness" options for noweb tool.
3031        This caused some slight internal changes in CM's tool implementation.
3032     2. Fixed bug in "tool plugin" mechanism.  This is essentially cleaning up
3033        some remaining issues from earlier path anchor changes.
3034     3. Updated CM manual accordingly.
3035    
3036     4. Changed implementation of back-tracing so that I now consider it
3037        ready for prime-time.
3038    
3039        In particular, you don't have to explicitly trigger the back-trace
3040        anymore.  Instead, if you are running BTrace-instrumented code and
3041        there is an uncaught exception (regardless of whether or not it was
3042        raised in instrumented code), the top-level evalloop will print
3043        the back-trace.
3044    
3045        Features:
3046    
3047          - Instrumented and uninstrumented code work together seamlessly.
3048            (Of course, uninstrumented code is never mentioned in actual
3049             back-traces.)
3050    
3051          - Asymptotic time- and space-complexity of instrumented code is
3052            equal to that of uninstrumented code.  (This means that
3053            tail-recursion is preserved by the instrumentation phase.)
3054    
3055          - Modules whose code has been instrumented in different sessions
3056            work together without problem.
3057    
3058          - There is no penalty whatsoever on uninstrumented code.
3059    
3060          - There is no penalty on "raise" expressions, even in
3061            instrumented code.
3062    
3063        A potential bug (or perhaps it is a feature, too):
3064    
3065          A back-trace reaches no further than the outermost instrumented
3066          non-trivial "raise".  Here, a "trivial" raise is one that is the
3067          sole RHS of a "handle" rule.  Thus, back-traces reach through
3068    
3069               <exp> handle e => raise e
3070    
3071          and even
3072    
3073               <exp> handle Foo => raise Bar
3074    
3075          and, of course, through
3076    
3077               <exp> handle Foo => ...
3078    
3079         if the exception was not Foo.
3080    
3081         Back-traces always reach right through any un-instrumented code
3082         including any of its "handle" expressions, trivial or not.
3083    
3084       To try this out, do the following:
3085    
3086         - Erase all existing binfiles for your program.
3087           (You may keep binfiles for those modules where you think you
3088            definitely don't need back-tracing.)
3089         - Turn on back-trace instrumentation:
3090              SMLofNJ.Internals.BTrace.mode (SOME true);
3091         - Recompile your program.  (I.e., run "CM.make" or "use".)
3092         - You may now turn instrumentation off again (if you want):
3093              SMLofNJ.Internals.BTrace.mode (SOME false);
3094         - Run your program as usual.  If it raises an exception that
3095           reaches the interactive toplevel, then a back-trace will
3096           automatically be printed.  After that, the toplevel loop
3097           will print the exception history as usual.
3098    
3099    ----------------------------------------------------------------------
3100    Name: Matthias Blume
3101    Date: 2000/06/26 09:56:46 JST
3102    Tag: blume-20000626-setup
3103    Description:
3104    
3105    CM: - setup-parameter to "sml" added; this can be used to run arbitrary
3106          ML code before and after compiling a file (e.g., to set compiler
3107          flags)
3108    
3109    Compiler: - improved btrace API (in core.sml, internals.{sig,sml})
3110              - associated changes to btrace.sml (BTrace instrumentation pass)
3111              - cleaner implementation of btimp.sml (BTrace tracing and report
3112                module)
3113    
3114    CM manual: * new path encoding documented
3115               * description of setup-parameter to "sml" added
3116    
3117    The biggest user-visible change to back-tracing is that it is no
3118    longer necessary to compile all traced modules within the same
3119    session.  (This was a real limitation.)
3120    
3121    ----------------------------------------------------------------------
3122    Name: Matthias Blume
3123    Date: 2000/06/24 12:40:00 JST
3124    Tag: blume-20000624-startup
3125    Description:
3126    
3127    Fixes startup slowdown problem.  (I was calling SrcPath.sync a _tad_
3128    bit too often -- to put it mildly. :)
3129    
3130    ----------------------------------------------------------------------
3131    Name: Matthias Blume
3132    Date: 2000/06/23 18:20:00 JST
3133    Tag: blume-20000623-btrace
3134    Description:
3135    
3136    This update adds a backtrace facility to aid programmers in debugging
3137    their programs.  This involves the following changes:
3138    
3139    1. Module system/smlnj/init/core.sml (structure _Core) now has hooks for
3140       keeping track of the current call stack.  When programs are compiled
3141       in a special mode, the compiler will insert calls to these hooks
3142       into the user program.
3143       "Hook" means that it is possible for different implementations of
3144       back-tracing to register themselves (at different times).
3145    
3146    2. compiler/MiscUtil/profile/btrace.sml implements the annotation phase
3147       as an Absyn.dec->Absyn.dec rewrite.  Normally this phase is turned off.
3148       It can be turned on using this call:
3149         SMLofNJ.Internals.BTrace.mode (SOME true);
3150       Turning it off again:
3151         SMLofNJ.Internals.BTrace.mode (SOME false);
3152       Querying the current status:
3153         SMLofNJ.Internals.BTrace.mode NONE;
3154       Annotated programs are about twice as big as normal ones, and they
3155       run a factor of 2 to 4 slower with a dummy back-trace plugin (one
3156       where all hooks do nothing).  The slowdown with a plugin that is
3157       actually useful (such as the one supplied by default) is even greater,
3158       but in the case of the default plugin it is still only a constant
3159       factor (amortized).
3160    
3161    3. system/Basis/Implementation/NJ/internals.{sig,sml} have been augmented
3162       with a sub-structure BTrace for controlling back-tracing.  In particular,
3163       the above-mentioned function "mode" controls whether the annotation
3164       phase is invoked by the compiler.  Another important function is
3165       "trigger": when called it aborts the current execution and causes
3166       the top-level loop to print a full back-trace.
3167    
3168    4. compiler/MiscUtil/profile/btimp.sml is the current default plugin
3169       for back-tracing.  It keeps track of the dynamic call stack and in
3170       addition to that it keeps a partial history at each "level" of that
3171       stack.  For example, if a tail-calls b, b tail-calls c, and c tail-calls
3172       d and b (at separate times, dynamically), then the report will show:
3173    
3174       GOTO   d
3175             /c
3176       GOTO  \b
3177       CALL   a
3178    
3179       This shows that there was an initial non-tail call of a, then a
3180       tail-call to b or c, looping behavior in a cluster of functions that
3181       consists of b and c, and then a goto from that cluster (i.e., either from
3182       b or from c) to d.
3183    
3184       Note that (depending on the user program) the amount of information
3185       that the back-trace module has to keep track of at each level is bounded
3186       by a constant.  Thus, the whole implementation has the same asymptotic
3187       complexity as the original program (both in space and in time).
3188    
3189    5. compiler/TopLevel/interact/evalloop.sml has been modified to
3190       handle the special exception SMLofNJ.Internals.BTrace.BTrace
3191       which is raised by the "trigger" function mentioned above.
3192    
3193    Notes on usage:
3194    
3195    - Annotated code works well together with unannotated code:
3196    Unannotated calls simply do not show up at all in the backtrace.
3197    
3198    - It is not a good idea to let modules that were annotated during
3199    different sessions run at the same time.  This is because the compiler
3200    chooses small integers to identify individual functions, and there
3201    will be clashes if different modules were compiled in separate sessions.
3202    (Nothing will crash, and you will even be told about the clashes, but
3203    back-trace information will in general not be useful.)
3204    
3205    - Back-tracing can be confused by callcc and capture.
3206    
3207    - The only way of getting a back-trace right now is to explicitly
3208    invoke the "trigger" function from your user program.  Eventually, we
3209    should make every exception carry back-trace information (if
3210    available).  But since this creates more overhead at "raise"-time
3211    (similar to the current exnHistory overhead), I have not yet
3212    implemented this.  (The implementation will be rather easy.)  With
3213    exceptions carrying back-trace information, this facility will be even
3214    more useful because users don't need to modify their programs...
3215    
3216    - While it is possible to compile the compiler with back-trace
3217    annotations turned on (I did it to get some confidence in
3218    correctness), you must make absolutely sure that core.sml and
3219    btimp.sml are compiled WITHOUT annotation!  (core.sml cannot actually
3220    be compiled with annotation because there is no core access yet, but
3221    if you compile btimp.sml with annotation, then the system will go into
3222    an infinite recursion and crash.)
3223    Since CM currently does not know about BTrace, the only way to turn
3224    annotations on and off for different modules of the compiler is to
3225    interrupt CMB.make, change the settings, and re-invoke it.  Of course,
3226    this is awkward and clumsy.
3227    
3228    Sample sessions:
3229    
3230    Standard ML of New Jersey v110.28.1 [FLINT v1.5], June 5, 2000
3231    - SMLofNJ.Internals.BTrace.mode (SOME true);
3232    [autoloading]
3233    [autoloading done]
3234    val it = false : bool
3235    - structure X = struct
3236    -     fun main n = let
3237    -         fun a (x, 0) = d x
3238    -           | a (x, n) = b (x, n - 1)
3239    -         and b (x, n) = c (x, n)
3240    -         and c (x, n) = a (x, n)
3241    -         and d x = e (x, 3)
3242    -         and e (x, 0) = f x
3243    -           | e (x, n) = e (x, n - 1)
3244    -         and f 0 = SMLofNJ.Internals.BTrace.trigger ()
3245    -           | f n = n * g (n - 1)
3246    -         and g n = a (n, 3)
3247    -     in
3248    -         f n
3249    -     end
3250    - end;
3251    structure X : sig val main : int -> int end
3252    - X.main 3;
3253    *** BACK-TRACE ***
3254    GOTO   stdIn:4.2-13.20: X.main[2].f
3255    GOTO-( stdIn:4.2-13.20: X.main[2].e
3256    GOTO   stdIn:4.2-13.20: X.main[2].d
3257         / stdIn:4.2-13.20: X.main[2].a
3258         | stdIn:4.2-13.20: X.main[2].b
3259    GOTO-\ stdIn:4.2-13.20: X.main[2].c
3260    CALL   stdIn:4.2-13.20: X.main[2].g
3261    GOTO   stdIn:4.2-13.20: X.main[2].f
3262    GOTO-( stdIn:4.2-13.20: X.main[2].e
3263    GOTO   stdIn:4.2-13.20: X.main[2].d
3264         / stdIn:4.2-13.20: X.main[2].a
3265         | stdIn:4.2-13.20: X.main[2].b
3266    GOTO-\ stdIn:4.2-13.20: X.main[2].c
3267    CALL   stdIn:4.2-13.20: X.main[2].g
3268    GOTO   stdIn:4.2-13.20: X.main[2].f
3269    GOTO-( stdIn:4.2-13.20: X.main[2].e
3270    GOTO   stdIn:4.2-13.20: X.main[2].d
3271         / stdIn:4.2-13.20: X.main[2].a
3272         | stdIn:4.2-13.20: X.main[2].b
3273    GOTO-\ stdIn:4.2-13.20: X.main[2].c
3274    CALL   stdIn:4.2-13.20: X.main[2].g
3275    GOTO   stdIn:4.2-13.20: X.main[2].f
3276    CALL   stdIn:2.15-17.4: X.main[2]
3277    -
3278    
3279    (Note that because of a FLINT bug the above code currently does not
3280    compile without BTrace turned on.)
3281    
3282    Here is another example, using my modified Tiger compiler:
3283    
3284    Standard ML of New Jersey v110.28.1 [FLINT v1.5], June 5, 2000
3285    - SMLofNJ.Internals.BTrace.mode (SOME true);
3286    [autoloading]
3287    [autoloading done]
3288    val it = false : bool
3289    - CM.make "sources.cm";
3290    [autoloading]
3291    ...
3292    [autoloading done]
3293    [scanning sources.cm]
3294    [parsing (sources.cm):parse.sml]
3295    [creating directory CM/SKEL ...]
3296    [parsing (sources.cm):tiger.lex.sml]
3297    ...
3298    [wrote CM/sparc-unix/semant.sml]
3299    [compiling (sources.cm):main.sml]
3300    [wrote CM/sparc-unix/main.sml]
3301    [New bindings added.]
3302    val it = true : bool
3303    - Main.compile ("../testcases/merge.tig", "foo.out");
3304    *** BACK-TRACE ***
3305    CALL   lib/semant.sml:99.2-396.21: SemantFun[2].transExp.trvar
3306    CALL   lib/semant.sml:99.2-396.21: SemantFun[2].transExp.trexp
3307    CALL   lib/semant.sml:289.3-295.22: SemantFun[2].transExp.trexp.check[2]
3308    GOTO   lib/semant.sml:289.3-295.22: SemantFun[2].transExp.trexp.check[2]
3309    CALL   lib/semant.sml:99.2-396.21: SemantFun[2].transExp.trexp
3310    CALL   lib/semant.sml:99.2-396.21: SemantFun[2].transExp.trexp
3311    CALL   lib/semant.sml:488.3-505.6: SemantFun[2].transDec.trdec[2].transBody[2]
3312         / lib/semant.sml:411.65-543.8: SemantFun[2].transDec
3313    CALL-\ lib/semant.sml:413.2-540.9: SemantFun[2].transDec.trdec[2]
3314    CALL   lib/semant.sml:99.2-396.21: SemantFun[2].transExp.trexp
3315    CALL   lib/semant.sml:8.52-558.4: SemantFun[2].transProg[2]
3316    CALL   main.sml:1.18-118.4: Main.compile[2]
3317    -
3318    
3319    ----------------------------------------------------------------------
3320    Name: Matthias Blume
3321    Date: 2000/06/21 18:00:00 JST
3322    Tag: blume-20000621-manual
3323    Description:
3324    
3325    CM manual update: Path environments documented.
3326    
3327    ----------------------------------------------------------------------
3328    Name: Matthias Blume
3329    Date: 2000/06/19 13:40:00
3330    Tag: blume-20000619-manual
3331    Description:
3332    
3333    CM manual and system/README update.  This only covers the fact that
3334    there are no more implicit anchors.  (Path environments and the "bind"
3335    option to "cm" have yet to be documented.)
3336    
3337    ----------------------------------------------------------------------
3338    Name: Matthias Blume
3339    Date: 2000/06/19 11:05:00 JST
3340    Tag: blume-20000619-chdir-bugfix
3341    Description:
3342    
3343    Fixed a bug in new SrcPath module that sometimes led to a bad chDir call.
3344    
3345    ----------------------------------------------------------------------
3346    Name: Matthias Blume
3347    Date: 2000/06/18 22:00:10 JST
3348    Tag: blume-20000618-implicit-anchors-really-gone
3349    Description:
3350    
3351    I updated the previous HISTORY entry where I forgot to mention that
3352    implicit anchors are no longer with us.
3353    
3354    The current update also gets rid of the (now useless) controller
3355    CM.Control.implicit_anchors.
3356    
3357    ----------------------------------------------------------------------
3358    Name: Matthias Blume
3359    Date: 2000/06/16 17:30:00 JST
3360    Tag: blume-20000616-anchorenv
3361    Description:
3362    
3363    This patch implements the long anticipated (just kidding :) "anchor
3364    environment" mechanism.  In the course of doing this, I also
3365    re-implemented CM's internal "SrcPath" module from scratch.  The new
3366    one should be more robust in certain boundary cases.  In any case, it
3367    is a lot cleaner than its predecessor (IMHO).
3368    
3369    This time, although there is yet another boot file format change, I
3370    kept the unpickler backward-compatible.  As a result, no new bootfiles
3371    are necessary and bootstrapping is straightforward.  (You cannot read
3372    new bootfiles into an old system, but the other way around is no
3373    problem.)
3374    
3375    Visible changes:
3376    
3377    ** 0. Implicit path anchors (without the leading $-symbol) are no
3378    longer recognized at all. This means that such path names are not
3379    illegal either.  For example, the name basis.cm simply refers to a
3380    local file called "basis.cm" (i.e., the name is an ordinary path
3381    relative to the .cm-file's directory).  Or, to put it differently, only
3382    names that start with $ are anchored paths.
3383    
3384    ** 1. The $<singlearc> abbreviation for $/<singlearc> has finally
3385    vanished.
3386    
3387    John (Reppy) had criticized this as soon as I originally proposed and
3388    implemented it, but at that time I did not really deeply believe
3389    him. :) Now I came full-circle because I need the $<singlearc> syntax
3390    in another place where it cannot be seen as an abbreviation for
3391    $/<singlearc>.  To avoid the confusion, $<singlearc> now means what it
3392    seems to mean (i.e., it "expands" into the corresponding anchor
3393    value).
3394    
3395    However, when paths are used as members in CM description files, it
3396    continues to be true that there must be at least another arc after the
3397    anchor.  This is now enforced separately during semantic analysis
3398    (i.e., from a lexical/syntactical point of view, the notation is ok.)
3399    
3400    ** 2. The "cm" class now accepts an option "bind".  The option's value
3401    is a sub-option list of precisely two items -- one labeled "anchor"
3402    and the other one labeled "value".  As you might expect, "anchor" is
3403    used to specify an anchor name to be bound, and "value" specifies what
3404    the anchor is being bound to.
3405    
3406    The value must be a directory name and can be given in either standard
3407    syntax (including the possibility that it is itself an anchored path)
3408    or native syntax.
3409    
3410    Examples:
3411    
3412       foo.cm (bind:(anchor:bar value:$mystuff/bar))
3413       lib.cm (bind:(anchor:a value:"H:\\x\\y\\z"))  (* only works under windows *)
3414    
3415    and so on.
3416    
3417    The meaning of this is that the .cm-file will be processed with an
3418    augmented anchor environment where the given anchor(s) is/are bound to
3419    the given value(s).
3420    
3421    The rationale for having this feature is this: Suppose you are trying
3422    to use two different (already stable) libraries a.cm and b.cm (that
3423    you perhaps didn't write yourself).  Further, suppose each of these
3424    two libraries internally uses its own auxiliary library $lib/aux.cm.
3425    Normally you would now have a problem because the anchor "lib" cannot
3426    be bound to more than one value globally.  Therefore, the project that
3427    uses both a.cm and b.cm must locally redirect the anchor to some other
3428    place:
3429    
3430       a.cm (bind:(anchor:lib value:/usr/lib/smlnj/a-stuff))
3431       b.cm (bind:(anchor:lib value:/usr/lib/smlnj/b-stuff))
3432    
3433    This hard-wires $lib/aux.cm to /usr/lib/smlnj/a-stuff/aux.cm or
3434    /usr/lib/smlnj/b-stuff/aux.cm, respectively.
3435    
3436    Hard-wiring path names is a bit inflexible (and CM will verbosely warn
3437    you when you do so at the time of CM.stabilize).  Therefore, you can
3438    also use an anchored path as the value:
3439    
3440      a.cm (bind:(anchor:lib value:$a-lib))
3441      b.cm (bind:(anchor:lib value:$b-lib))
3442    
3443    Now you can globally configure (using the usual CM.Anchor.anchor or
3444    pathconfig machinery) bindings for "a-lib" and "b-lib".  Since "lib"
3445    itself is always locally bound, setting it globally is no longer
3446    meaningful or necessary (but it does not hurt either).  In fact, "lib"
3447    can still be used as a global anchor for separate purposes.  As a
3448    matter of fact, one can locally define "lib" in terms of a global
3449    "lib":
3450    
3451      a.cm (bind:(anchor:lib value:$lib/a))
3452      b.cm (bind:(anchor:lib value:$lib/b))
3453    
3454    ** 3. The encoding of path names has changed.  This affects the way
3455    path names are shown in CM's progress report and also the internal
3456    protocol encoding used for parallel make.
3457    
3458    The encoding now uses one or more ':'-separated segments.  Each
3459    segment corresponds to a file that has been specified relative to the
3460    file given by its preceding segment.  The first segment is either
3461    relative to the CWD, absolute, or anchored.  Each segment itself is
3462    basically a Unix pathname; all segments but the first are relative.
3463    
3464    Example:
3465    
3466       $foo/bar/baz.cm:a/b/c.sml
3467    
3468    This path denotes the file bar/a/b/c.sml relative to the directory
3469    denoted by anchor "foo".  Notice that the encoding also includes
3470    baz.cm which is the .cm-file that listed a/b/c.sml.  As usual, such
3471    paths are resolved relative to the .cm-file's directory, so baz.cm must
3472    be ignored to get the "real" pathname.
3473    
3474    To make this fact more obvious, CM puts the names of such "virtual
3475    arcs" into parentheses when they appear in progress reports. (No
3476    parentheses will appear in the internal protocol encoding.)  Thus,
3477    what you really see is:
3478    
3479      $foo/bar/(baz.cm):a/b/c.sml
3480    
3481    I find this notation to be much more informative than before.
3482    
3483    Another new feature of the encoding is that special characters
3484    including parentheses, colons, (back)slashes, and white space are
3485    written as \ddd (where ddd is the decimal encoding of the character).
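
To make the encoding concrete, here is a small sketch in SML of how an
encoded path maps to the "real" pathname and how a special character
might be escaped.  The helper names (siblingOf, realPath, escapeArc) are
hypothetical and do not come from CM; the exact set of escaped characters
and the zero-padding to three digits are assumptions read off the
description above.

```sml
(* Hypothetical helpers; none of these names come from CM itself. *)

(* Replace the last arc of path with seg, i.e. interpret seg relative
 * to the directory that contains path. *)
fun siblingOf (path, seg) =
    let
        val (dir, _) =
            Substring.splitr (fn c => c <> #"/") (Substring.full path)
    in
        Substring.string dir ^ seg
    end

(* Resolve each ':'-separated segment relative to the directory of the
 * preceding one.  Assumes the encoded path uses the internal protocol
 * form (no parentheses) and contains no escaped ':'. *)
fun realPath encoded =
    case String.fields (fn c => c = #":") encoded of
        [] => encoded
      | first :: rest =>
          List.foldl (fn (seg, acc) => siblingOf (acc, seg)) first rest

(* Escape one arc as \ddd.  The character set and the three-digit
 * zero-padding are assumptions, not taken from CM's source. *)
fun escapeArc arc =
    let
        fun isSpecial c = Char.isSpace c orelse Char.contains "():/\\" c
        fun esc c =
            if isSpecial c
            then "\\" ^ StringCvt.padLeft #"0" 3 (Int.toString (Char.ord c))
            else String.str c
    in
        String.translate esc arc
    end
```

With these definitions, realPath "$foo/bar/baz.cm:a/b/c.sml" evaluates to
"$foo/bar/a/b/c.sml", which matches the example above: baz.cm is dropped
and a/b/c.sml is taken relative to $foo/bar.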
3486    
3487    *** The CM manual still needs to be updated.
3488    
3489    ----------------------------------------------------------------------
3490    Name: Allen Leung
3491    Date: 2000/06/15 00:38:00
3492    Tag: leunga-20000615-x86-peephole
3493    
3494    x86 Peephole fix by Fermin.  Affects c-- and moby only.
3495    
3496    ----------------------------------------------------------------------
3497    Name: Matthias Blume
3498    Date: 2000/06/12 11:40:00
3499    Tag: blume-20000612-parmakefix
3500    Description:
3501    
3502    More cleanup after changing the file naming scheme: This time I
3503    repaired the parallel make mechanism for CMB.make which I broke earlier.
3504    
3505    ----------------------------------------------------------------------
3506    Name: Allen Leung
3507    Date: 2000/06/09 01:25:00
3508    Tag: leunga-20000609-various
3509    
3510    None of these things should affect normal SML/NJ operations
3511    
3512    1. Peephole improvements provided by Fermin (c--)
3513    2. New annotation DEFUSE for adding extra dependence (moby)
3514    3. New X86 LOCK instructions (moby)
3515    4. New machine description language for reservation tables (scheduling)
3516    5. Fixes to various optimization/analysis modules (branch chaining, dominator
3517       trees etc.)
3518    6. I've changed the CM files so that they can work with versions
3519       110.0.6, 110.25 and 110.28
3520    
3521    ----------------------------------------------------------------------
3522    Name: Matthias Blume
3523    Date: 2000/06/09 12:40:00
3524    Tag: blume-20000609-log
3525    Description:
3526    
3527    - Removed all(?) remaining RCS Log entries from sources.
3528    
3529    - Fixed bug in ml-yacc and ml-lex sources (use explicit anchors for
3530      anchored paths).
3531    
3532    ----------------------------------------------------------------------
3533    Name: Matthias Blume
3534    Date: 2000/06/07 17:00:00 JST
3535    Tag: blume-20000607-no-implicit-anchors
3536    Description:
3537    
3538    1. This update changes the default setting for
3539    CM.Control.implicit_anchors from true to false.  This means that
3540    implicit anchors are no longer permitted by default.  I also tried to
3541    make sure that nothing else still relies on implicit anchors.
3542    (This is the next step on the schedule towards a CM that does not even
3543    have the notion of implicit anchors anymore.)
3544    
3545    2. More CM manual updates.
3546    
3547    3. I managed to track down and fix the pickling bug I mentioned last
3548    time.  Because of the previously existing workaround, this entails no
3549    immediate practical changes.
3550    
3551    ----------------------------------------------------------------------
3552    Name: Matthias Blume
3553    Date: 2000/06/06 11:15:00 JST
3554    Tag: blume-20000606-lazierpickle
3555    Description:
3556    
3557    !!!! NEW BOOT FILES !!!!
3558    
3559    * The main purpose of this update is to make library pickles lazier in
3560    order to reduce the initial space penalty for autoloading a library.
3561    As a result, it is now possible to have $smlnj/compiler.cm
3562    pre-registered.  This should take care of the many complaints or
3563    inquiries about missing structure Compiler.  This required changes to
3564    CM's internal data structures and small tweaks to some algorithms.
3565    
3566    As a neat additional effect, it is no longer necessary (for the sake
3567    of lean heap image files) to distinguish between a "minimal" CM and a
3568    "full" CM.  Now, there is only one CM (i.e., the "full" version:
3569    $smlnj/cm.cm aka $smlnj/cm/full.cm), and it is always available at the
3570    interactive top level. ($smlnj/cm/minimal.cm is gone.)
3571    
3572    To make the life of compiler-hackers easier, "makeml" now also
3573    pre-registers $smlnj/cmb.cm (aka $smlnj/cmb/current.cm).  In other
3574    words, after you bootstrap a new sml for the first time, you will not
3575    have to autoload $smlnj/cmb.cm again afterwards.  (The first time
3576    around you will still have to do it, though.)
3577    
3578    * A second change consists of major updates to the CM manual.  There
3579    are now several appendices with summary information and also a full
3580    specification of the CM description file syntax.
3581    
3582    * In directory src/system I added the script "allcross".  This script
3583    invokes sml and cross-compiles the compiler for all supported
3584    architectures.  (Useful when providing a new set of boot files.)
3585    
3586    * There seems to be a latent bug in my "lazy pickles" mechanism.  I
3587    added a small tweak to pickle-util.sml to work around this problem,
3588    but it is not a proper fix yet.  I will investigate further.  (The
3589    effect of the bug was an inflation of library pickle size.)
3590    
3591    * Version number increased to 110.28.1 (to avoid compatibility problems).
3592    
3593    ----------------------------------------------------------------------
3594    Name: Allen Leung
3595    Date: 2000/05/25 17:28 EDT
3596    Tag: leunga-20000525-ra
3597    Description:
3598    
3599      Fixed a bug in freezing phase of the register allocator.
3600    
3601    ----------------------------------------------------------------------
3602    Name: Allen Leung
3603    Date: 2000/05/15 22:53 EDT
3604    Tag: leunga-20000515-alpha-x86-ra
3605    Description:
3606    
3607      1. Alpha
3608    
3609          Slight cleanup.  Removed the instruction SGNXL
3610    
3611      2. X86
3612    
3613          Added the following instructions to the instruction set:
3614    
3615