Home My Page Projects Code Snippets Project Openings SML/NJ
Summary Activity Forums Tracker Lists Tasks Docs Surveys News SCM Files

SCM Repository

[smlnj] Diff of /sml/trunk/HISTORY
ViewVC logotype

Diff of /sml/trunk/HISTORY

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 577, Fri Mar 10 08:07:18 2000 UTC revision 1124, Thu Mar 7 19:49:22 2002 UTC
# Line 8  Line 8 
8  The form of an entry should be:  The form of an entry should be:
9    
10  Name:  Name:
11  Date:  Date: yyyy/mm/dd
12  Tag: <post-commit CVS tag>  Tag: <post-commit CVS tag>
13  Description:  Description:
14    
15    ----------------------------------------------------------------------
16    Name: Lal George
17    Date: 2002/03/07 14:44:24 EST 2002
18    Tag: george-20020307-weighted-block-placement
19    
20    Tested the weighted block placement optimization on all architectures
21    (except the hppa) using AMPL to generate the block and edge frequencies.
22    Changes were required in the machine properties to correctly
23    categorize trap instructions. There is an MLRISC flag
24    "weighted-block-placement" that can be used to enable weighted block
25    placement, but this will be ineffective without block/edge
26    frequencies (coming soon).
27    
28    
29    ----------------------------------------------------------------------
30    Name: Lal George
31    Date: 2002/03/05 17:24:48 EST
32    Tag: george-20020305-linkage-cluster
33    
34    In order to support the block placement optimization, a new cluster
35    is generated as the very first cluster (called the linkage cluster).
36    It contains a single jump to the 'real' entry point for the compilation
37    unit. Block placement has no effect on the linkage cluster itself, but
38    all the other clusters  have full freedom in the manner in which they
39    reorder blocks or functions.
40    
41    On the x86 the typical linkage code that is generated is:
42       ----------------------
43            .align 2
44       L0:
45            addl    $L1-L0, 72(%esp)
46            jmp     L1
47    
48    
49            .align  2
50       L1:
51       ----------------------
52    
53    72(%esp) is the memory location for the stdlink register. This
54    must contain the address of the CPS function being called. In the
55    above example, it contains the address of  L0; before
56    calling L1 (the real entry point for the compilation unit), it
57    must contain the address for L1, and hence
58    
59            addl $L1-L0, 72(%esp)
60    
61    I have tested this on all architectures except the hppa.The increase
62    in code size is of course negligible
63    
64    ----------------------------------------------------------------------
65    Name: Allen Leung
66    Date: 2002/03/03 13:20:00 EST
67    Tag: leunga-20020303-mlrisc-tools
68    
69      Added #[ ... ] expressions to mlrisc tools
70    
71    ----------------------------------------------------------------------
72    Name: Matthias Blume
73    Date: 2002/02/27 12:29:00 EST
74    Tag: blume-20020227-cdebug
75    Description:
76    
77    - made types in structure C and C_Debug to be equal
78    - got rid of code duplication (c-int.sml vs. c-int-debug.sml)
79    - there no longer is a C_Int_Debug (C_Debug is directly derived from C)
80    
81    ----------------------------------------------------------------------
82    Name: Matthias Blume
83    Date: 2002/02/26 12:00:00 EST
84    Tag: blume-20020226-ffi
85    Description:
86    
87    1. Fixed a minor bug in CM's "noweb" tool:
88       If numbering is turned off, then truly don't number (i.e., do not
89       supply the -L option to noweb).  The previous behavior was to supply
90       -L'' -- which caused noweb to use the "default" line numbering scheme.
91       Thanks to Chris Richards for pointing this out (and supplying the fix).
92    
93    2. Once again, I reworked some aspects of the FFI:
94    
95       A. The incomplete/complete type business:
96    
97       - Signatures POINTER_TO_INCOMPLETE_TYPE and accompanying functors are
98         gone!
99       - ML types representing an incomplete type are now *equal* to
100         ML types representing their corresponding complete types (just like
101         in C).  This is still safe because ml-nlffigen will not generate
102         RTTI for incomplete types, nor will it generate functions that
103         require access to such RTTI.   But when ML code generated from both
104         incomplete and complete versions of the C type meet, the ML types
105         are trivially interoperable.
106    
107         NOTE:  These changes restore the full generality of the translation
108         (which was previously lost when I eliminated functorization)!
109    
110       B. Enum types:
111    
112       - Structure C now has a type constructor "enum" that is similar to
113         how the "su" constructor works.  However, "enum" is not a phantom
114         type because each "T enum" has values (and is isomorphic to
115         MLRep.Signed.int).
116       - There are generic access operations for enum objects (using
117         MLRep.Signed.int).
118       - ml-nlffigen will generate a structure E_foo for each "enum foo".
119         * The structure contains the definition of type "mlrep" (the ML-side
120         representation type of the enum).  Normally, mlrep is the same
121         as "MLRep.Signed.int", but if ml-nlffigen was invoked with "-ec",
122         then mlrep will be defined as a datatype -- thus facilitating
123         pattern matching on mlrep values.
124         ("-ec" will be suppressed if there are duplicate values in an
125          enumeration.)
126         * Constructors ("-ec") or values (no "-ec") e_xxx of type mlrep
127         will be generated for each C enum constant xxx.
128         * Conversion functions m2i and i2m convert between mlrep and
129         MLRep.Signed.int.  (Without "-ec", these functions are identities.)
130         * Coversion functions c and ml convert between mlrep and "tag enum".
131         * Access functions (get/set) fetch and store mlrep values.
132       - By default (unless ml-nlffigen was invoked with "-nocollect"), unnamed
133         enumerations are merged into one single enumeration represented by
134         structure E_'.
135    
136    ----------------------------------------------------------------------
137    Name: Allen Leung
138    Date: 2002/02/25 04:45:00 EST
139    Tag: leunga-20020225-cps-spill
140    
141    This is a new implementation of the CPS spill phase.
142    The new phase is in the new file compiler/CodeGen/cpscompile/spill-new.sml
143    In case of problems, replace it with the old file spill.sml
144    
145    The current compiler runs into some serious performance problems when
146    constructing a large record.  This can happen when we try to compile a
147    structure with many items.  Even a very simple structure like the following
148    makes the compiler slow down.
149    
150        structure Foo = struct
151           val x_1 = 0w1 : Word32.int
152           val x_2 = 0w2 : Word32.int
153           val x_3 = 0w3 : Word32.int
154           ...
155           val x_N = 0wN : Word32.int
156        end
157    
158    The following table shows the compile time, from N=1000 to N=4000,
159    with the old compiler:
160    
161    N
162    1000   CPS 100 spill                           0.04u  0.00s  0.00g
163           MLRISC ra                               0.06u  0.00s  0.05g
164              (spills = 0 reloads = 0)
165           TOTAL                                   0.63u  0.07s  0.21g
166    
167    1100   CPS 100 spill                           8.25u  0.32s  0.64g
168           MLRISC ra                               5.68u  0.59s  3.93g
169              (spills = 0 reloads = 0)
170           TOTAL                                   14.71u  0.99s  4.81g
171    
172    1500   CPS 100 spill                           58.55u  2.34s  1.74g
173           MLRISC ra                               5.54u  0.65s  3.91g
174              (spills = 543 reloads = 1082)
175           TOTAL                                   65.40u  3.13s  6.00g
176    
177    2000   CPS 100 spill                           126.69u  4.84s  3.08g
178           MLRISC ra                               0.80u  0.10s  0.55g
179              (spills = 42 reloads = 84)
180           TOTAL                                   129.42u  5.10s  4.13g
181    
182    3000   CPS 100 spill                           675.59u  19.03s  11.64g
183           MLRISC ra                               2.69u  0.27s  1.38g
184              (spills = 62 reloads = 124)
185           TOTAL                                   682.48u  19.61s  13.99g
186    
187    4000   CPS 100 spill                           2362.82u  56.28s  43.60g
188           MLRISC ra                               4.96u  0.27s  2.72g
189              (spills = 85 reloads = 170)
190           TOTAL                                   2375.26u  57.21s  48.00g
191    
192    As you can see the old cps spill module suffers from some serious
193    performance problem.  But since I cannot decipher the old code fully,
194    instead of patching the problems up, I'm reimplementing it
195    with a different algorithm.  The new code is more modular,
196    smaller when compiled, and substantially faster
197    (O(n log n) time and O(n) space).  Timing of the new spill module:
198    
199    4000  CPS 100 spill                           0.02u  0.00s  0.00g
200          MLRISC ra                               0.25u  0.02s  0.15g
201             (spills=1 reloads=3)
202          TOTAL                                   7.74u  0.34s  1.62g
203    
204    Implementation details:
205    
206    As far as I can tell, the purpose of the CPS spill module is to make sure the
207    number of live variables at any program point (the bandwidth)
208    does not exceed a certain limit, which is determined by the
209    size of the spill area.
210    
211    When the bandwidth is too large, we decrease the register pressure by
212    packing live variables into spill records.  How we achieve this is
213    completely different than what we did in the old code.
214    
215    First, there is something about the MLRiscGen code generator
216    that we should be aware of:
217    
218    o MLRiscGen performs code motion!
219    
220       In particular, it will move floating point computations and
221       address computations involving only the heap pointer to
222       their use sites (if there is only a single use).
223       What this means is that if we have a CPS record construction
224       statement
225    
226           RECORD(k,vl,w,e)
227    
228       we should never count the new record address w as live if w
229       has only one use (which is often the case).
230    
231       We should do something similar to floating point, but the transformation
232       there is much more complex, so I won't deal with that.
233    
234    Secondly, there are now two new cps primops at our disposal:
235    
236     1. rawrecord of record_kind option
237        This pure operator allocates some uninitialized storage from the heap.
238        There are two forms:
239    
240         rawrecord NONE [INT n]  allocates a tagless record of length n
241         rawrecord (SOME rk) [INT n] allocates a tagged record of length n
242                                     and initializes the tag.
243    
244     2. rawupdate of cty
245          rawupdate cty (v,i,x)
246          Assigns to x to the ith component of record v.
247          The storelist is not updated.
248    
249    We use these new primops for both spilling and increment record construction.
250    
251     1. Spilling.
252    
253        This is implemented with a linear scan algorithm (but generalized
254        to trees).  The algorithm will create a single spill record at the
255        beginning of the cps function and use rawupdate to spill to it,
256        and SELECT or SELp to reload from it.  So both spills and reloads
257        are fine-grain operations.  In contrast, in the old algorithm
258        "spills" have to be bundled together in records.
259    
260        Ideally, we should sink the spill record construction to where
261        it is needed.  We can even split the spill record into multiple ones
262        at the places where they are needed.  But CPS is not a good
263        representation for global code motion, so I'll keep it simple and
264        am not attempting this.
265    
266     2. Incremental record construction (aka record splitting).
267    
268        Long records with many component values which are simulatenously live
269        (recall that single use record addresses are not considered to
270         be live) are constructed with rawrecord and rawupdate.
271        We allocate space on the heap with rawrecord first, then gradually
272        fill it in with rawupdate.  This is the technique suggested to me
273        by Matthias.
274    
275        Some restrictions on when this is applicable:
276        1. It is not a VECTOR record.  The code generator currently does not handle
277           this case. VECTOR record uses double indirection like arrays.
278        2. All the record component values are defined in the same "basic block"
279           as the record constructor.  This is to prevent speculative
280           record construction.
281    
282    ----------------------------------------------------------------------
283    Name: Allen Leung
284    Date: 2002/02/22 01:02:00 EST
285    Tag: leunga-20020222-mlrisc-tools
286    
287    Minor bug fixes in the parser and rewriter
288    
289    ----------------------------------------------------------------------
290    Name: Allen Leung
291    Date: 2002/02/21 20:20:00 EST
292    Tag: leunga-20020221-peephole
293    
294    Regenerated the peephole files.  Some contained typos in the specification
295    and some didn't compile because of pretty printing bugs in the old version
296    of 'nowhere'.
297    
298    ----------------------------------------------------------------------
299    Name: Allen Leung
300    Date: 2002/02/19 20:20:00 EST
301    Tag: leunga-20020219-mlrisc-tools
302    Description:
303    
304       Minor bug fixes to the mlrisc-tools library:
305    
306       1.  Fixed up parsing colon suffixed keywords
307       2.  Added the ability to shut the error messages up
308       3.  Reimplemented the pretty printer and fixed up/improved
309           the pretty printing of handle and -> types.
310       4.  Fixed up generation of literal symbols in the nowhere tool.
311       5.  Added some SML keywords to to sml.sty
312    
313    ----------------------------------------------------------------------
314    Name: Matthias Blume
315    Date: 2002/02/19 16:20:00 EST
316    Tag: blume-20020219-cmffi
317    Description:
318    
319    A wild mix of changes, some minor, some major:
320    
321    * All C FFI-related libraries are now anchored under $c:
322        $/c.cm      --> $c/c.cm
323        $/c-int.cm  --> $c/internals/c-int.cm
324        $/memory.cm --> $c/memory/memory.cm
325    
326    * "make" tool (in CM) now treats its argument pathname slightly
327      differently:
328        1. If the native expansion is an absolute name, then before invoking
329           the "make" command on it, CM will apply OS.Path.mkRelative
330           (with relativeTo = OS.FileSys.getDir()) to it.
331        2. The argument will be passed through to subsequent phases of CM
332           processing without "going native".  In particular, if the argument
333           was an anchored path, then "make" will not lose track of that anchor.
334    
335    * Compiler backends now "know" their respective C calling conventions
336      instead of having to be told about it by ml-nlffigen.  This relieves
337      ml-nlffigen from one of its burdens.
338    
339    * The X86Backend has been split into X86CCallBackend and X86StdCallBackend.
340    
341    * Export C_DEBUG and C_Debug from $c/c.cm.
342    
343    * C type encoding in ml-nlffi-lib has been improved to model the conceptual
344      subtyping relationship between incomplete pointers and their complete
345      counterparts.  For this, ('t, 'c) ptr has been changed to 'o ptr --
346      with the convention of instantiating 'o with ('t, 'c) obj whenever
347      the pointer target type is complete.  In the incomplete case, 'o
348      will be instantiated with some "'c iobj" -- a type obtained by
349      using one of the functors PointerToIncompleteType or PointerToCompleteType.
350    
351      Operations that work on both incomplete and complete pointer types are
352      typed as taking an 'o ptr while operations that require the target to
353      be known are typed as taking some ('t, 'c) obj ptr.
354    
355      voidptr is now a bit "more concrete", namely "type voidptr = void ptr'"
356      where void is an eqtype without any values.  This makes it possible
357      to work on voidptr values using functions meant to operate on light
358      incomplete pointers.
359    
360    * As a result of the above, signature POINTER_TO_INCOMPLETE_TYPE has
361      been vastly simplified.
362    
363    ----------------------------------------------------------------------
364    Name: Matthias Blume
365    Date: 2002/02/19 10:48:00 EST
366    Tag: blume-20020219-pqfix
367    Description:
368    
369    Applied Chris Okasaki's bug fix for priority queues.
370    
371    ----------------------------------------------------------------------
372    Name: Matthias Blume
373    Date: 2002/02/15 17:05:00
374    Tag: Release_110_39
375    Description:
376    
377    Last-minute retagging is becoming a tradition... :-(
378    
379    This is the working release 110.39.
380    
381    ----------------------------------------------------------------------
382    Name: Matthias Blume
383    Date: 2002/02/15 16:00:00 EST
384    Tag: Release_110_39-orig
385    Description:
386    
387    Working release 110.39.  New bootfiles.
388    
389    (Update: There was a small bug in the installer so it wouldn't work
390    with all shells.  So I retagged. -Matthias)
391    
392    ----------------------------------------------------------------------
393    Name: Matthias Blume
394    Date: 2002/02/15 14:17:00 EST
395    Tag: blume-20020215-showbindings
396    Description:
397    
398    Added EnvRef.listBoundSymbols and CM.State.showBindings.  Especially
399    the latter can be useful for exploring what bindings are available at
400    the interactive prompt.  (The first function returns only the list
401    of symbols that are really bound, the second prints those but also the
402    ones that CM's autoloading mechanism knows about.)
403    
404    ----------------------------------------------------------------------
405    Name: Matthias Blume
406    Date: 2002/02/15 12:08:00 EST
407    Tag: blume-20020215-iptrs
408    Description:
409    
410    Two improvements to ml-nlffigen:
411    
412      1. Write files only if they do not exist or if their current contents
413         do not coincide with what's being written.  (That is, avoid messing
414         with the time stamps unless absolutely necessary.)
415    
416      2. Implement a "repository" mechanism for generated files related
417         to "incomplete pointer types".   See the README file for details.
418    
419    ----------------------------------------------------------------------
420    Name: Matthias Blume
421    Date: 2002/02/14 11:50:00 EST
422    Tag: blume-20020214-quote
423    Description:
424    
425    Added a type 't t_' to tag.sml (in ml-nlffi-lib.cm).  This is required
426    because of the new and improved tag generation scheme.  (Thanks to Allen
427    Leung for pointing it out.)
428    
429    ----------------------------------------------------------------------
430    Name: Lal George
431    Date: 2002/02/14 09:55:27 EST 2002
432    Tag: george-20020214-isabelle-bug
433    Description:
434    
435    Fixed the MLRISC bug sent by Markus Wenzel regarding the compilation
436    of Isabelle on the x86.
437    
438    From Allen:
439    -----------
440     I've found the problem:
441    
442         in ra-core.sml, I use the counter "blocked" to keep track of the
443         true number of elements in the freeze queue.  When the counter goes
444         to zero, I skip examining the queue.  But I've messed up the
445         bookkeeping in combine():
446    
447             else ();
448             case !ucol of
449               PSEUDO => (if !cntv > 0 then
450                     (if !cntu > 0 then blocked := !blocked - 1 else ();
451                                        ^^^^^^^^^^^^^^^^^^^^^^^
452                      moveu := mergeMoveList(!movev, !moveu)
453                     )
454                  else ();
455    
456         combine() is called to coalesce two nodes u and v.
457         I think I was thinking that if the move counts of u and v are both
458         greater than zero then after they are coalesced then one node is
459         removed from the freeze queue.  Apparently I was thinking that
460         both u and v are of low degree, but that's clearly not necessarily true.
461    
462    
463    02/12/2002:
464        Here's the patch.  HOL now compiles.
465    
466        I don't know how this impact on performance (compile
467        time or runtime).  This bug caused the RA (especially on the x86)
468        to go thru the potential spill phase when there are still nodes on the
469        freeze queue.
470    
471    
472    
473    
474    ----------------------------------------------------------------------
475    Name: Matthias Blume
476    Date: 2002/02/13 22:40:00 EST
477    Tag: blume-20020213-fptr-rtti
478    Description:
479    
480    Fixed a bug in ml-nlffigen that was introduced with one of the previous
481    updates.
482    
483    ----------------------------------------------------------------------
484    Name: Matthias Blume
485    Date: 2002/02/13 16:41:00 EST
486    Tag: blume-20020213-cmlpq
487    Description:
488    
489    Added new priority queue export symbols (which have just been added to
490    smlnj-lib.cm) to CML's version of smlnj-lib.cm.  (Otherwise CML would
491    not compile and the installer would choke.)
492    
493    ----------------------------------------------------------------------
494    Name: Matthias Blume
495    Date: 2002/02/13 16:15:00 EST
496    Tag: blume-20020213-various
497    Description:
498    
499    1. More tweaks to ml-nlffigen:
500    
501       - better internal datastructures (resulting in slight speedup)
502       - "-match" option requires exact match
503       - "localized" gensym counters (untagged structs/unions nested within
504         other structs/unions or within typedefs get a fresh counter; their
505         tag will be prefixed by a concatenation of their parents' tags)
506       - bug fixes (related to calculation of transitive closure of types
507         to be included in the output)
508    
509    2. Minor Basis updates:
510    
511       - added implementations for List.collate and Option.app
512    
513    ----------------------------------------------------------------------
514    Name: Matthias Blume
515    Date: 2002/02/11 15:55:00 EST
516    Tag: blume-20020211-gensym
517    Description:
518    
519    Added a "-gensym" option to command line of ml-nlffigen.  This can be
520    used to specify a "stem" -- a string that is inserted in all "gensym'd"
521    names (ML structure names that correspond to unnamed C structs, unions,
522    and enums), so that separate runs of ml-nlffigen do not clash.
523    
524    ----------------------------------------------------------------------
525    Name: Matthias Blume
526    Date: 2002/02/11 12:05:00 EST
527    Tag: blume-20020211-gensml
528    Description:
529    
530    A quick fix for a problem with GenSML (in the pgraph-util library):
531    Make generation of toplevel "local" optional.  (Strictly speaking,
532    signature definitions within "local" are not legal SML.)
533    
534    Other than that: updates to INSTALL and cm/TODO.
535    
536    ----------------------------------------------------------------------
537    Name: Matthias Blume
538    Date: 2002/02/08 15:00:00 EST
539    Tag: blume-20020208-uniquepid
540    Description:
541    
542    0. Version number has been bumped to 110.38.1.  NEW BOOTFILES!!!
543    
544    1. The installer (config/install.sh) has gotten smarter:
545    
546         - Configuration options are a bit easier to specify now
547           (in config/targets).
548         - Bug in recognizing .tar.bz2 files fixed.
549         - Installer automatically resolves dependencies between
550           configuration options (e.g., if you ask for eXene, you will
551           also get cml -- regardless whether you asked for it or not).
552         - Installer can run in "quieter mode" by setting the environment
553           variable INSTALL_QUIETLY to "true".  "Quieter" does not mean
554           "completely silent", though.
555         - Build HashCons library as part of smlnj-lib.
556    
557    2. A new scheme for assigning persistent identifiers to compilation
558       units (and, by extension, to types etc.) has been put into place.
559       This fixes a long-standing bug where types and even dynamic values
560       can get internally confused, thereby compromising type safety
561       (abstraction) and dynamic correctness.  See
562    
563         http://cm.bell-labs.com/cm/cs/who/blume/pid-confusion.tgz
564    
565       for an example of how things could go wrong until now.
566    
567       The downside of the new scheme is that pids are not quite as
568       persistent as they used to be: CM will generate a fresh pid
569       for every compilation unit that it thinks it sees for the first
570       time.  That means that if you compile starting from a clean, fresh
571       source tree at two different times, you end up with different
572       binaries.
573    
574       Cutoff recompilation, however, has not been compromised because
575       CM keeps pid information in special caches between runs.
576    
577    ----------------------------------------------------------------------
578    Name: Lal George
579    Date: 2002/02/07 15:34:13 EST 2002
580    Tag: <none>
581    Description:
582    
583    Compilers that generate assembly code may produce  global labels
584    whose value is resolved at link time. The various peephole optimization
585    modules did not take this in account.
586    
587    TODO. The Labels.addrOf function should really return an option
588    type so that clients are forced to deal with this issue, rather
589    than an exception being raised.
590    
591    ----------------------------------------------------------------------
592    Name: Lal George
593    Date: 2002/02/06 13:55:02 EST
594    Tag: george-20020206-ra-breakup
595    Description:
596    
597    1. A bug fix from Allen.
598    
599        A typo causes extra fstp %st(0)'s to be generated at compensation
600        edges, which might cause stack underflow traps at runtime.  This
601        occurs in fft where there are extraneous fstps right before the 'into'
602        trap instruction (in this case they are harmless since none of the
603        integers overflow.)
604    
605    2. Pulled out various utility modules that were embedded in the modules
606       of the register allocator. I need these modules for other purposes, but
607       they are not complete enough to put into a library (just yet).
608    ----------------------------------------------------------------------
609    Name: Matthias Blume
610    Date: 2002/01/31 16:05:00 EST
611    Tag: blume-20020131-sparc-ccalls
612    Description:
613    
614    1. C-calls on Sparc needlessly allocated a huge chunk (96 bytes)
615       of extra stack space by mistake.  Fixed.
616    
617    2. Bug in logic of handling of command-line options in ml-nlffigen fixed.
618    
619    ----------------------------------------------------------------------
620    Name: Allen Leung
621    Date: 2002/01/30
622    Tag: leunga-20020130-nowhere-bug-fix
623    Description:
624    
625       MLRISC bug fixes:
626       1. Fixed a bindings computation bug in the 'nowhere' program generator tool.
627       2. MachineInt.fromString was negating its value.
628    
629    ----------------------------------------------------------------------
630    Name: Matthias Blume
631    Date: 2002/01/29
632    Tag: blume-20020129-INSTALL
633    Description:
634    
635    - Added somewhat detailed installation instructions (file INSTALL).
636    - Fixed curl-detection bug in config/install.sh.
637    - It is now possible to select the URL getter using the URLGETTER
638      environment variable:
639    
640          not set / "unknown"      --> automatic detection (script tries wget,
641                                       curl, and lynx)
642          "wget" / "curl" / "lynx" --> use the specified program (script "knows"
643                                       how to properly invoke them)
644          other                    --> use $URLGETTER directly, it must take
645                                       precisely two command-line arguments
646                                       (source URL and destination file name)
647    
648    ----------------------------------------------------------------------
649    Name: Matthias Blume
650    Date: 2002/01/28
651    Tag: blume-20020128-sparc-ccalls
652    Description:
653    
654    - Fixed problem with calculation of "used" registers in sparc-c-calls.
655    - Make use of the allocParam argument in sparc-c-calls.
656    
657    ----------------------------------------------------------------------
658    Name: Matthias Blume
659    Date: 2002/01/28
660    Tag: blume-20020128-allocParam
661    Description:
662    
663    John Reppy:  Changes c-calls API to accept client-callback for
664    allocating extra stack space.
665    me: Corresponding changes to mlriscGen (using a dummy argument that
666        does not change the current behavior).
667    
668    ----------------------------------------------------------------------
669    Name: Matthias Blume
670    Date: 2002/01/28 12:00:00
671    Tag: Release_110_38
672    Description:
673    
674    This time for real!!!
675    
676    ----------------------------------------------------------------------
677    Name: Matthias Blume
678    Date: 2002/01/28 10:56:00 EST
679    Tag: blume-20020128-retraction
680    Description:
681    
682    0. Retracted earlier 110.38.  (The Release_110_38 tag has been replaced
683       with blume-Release_110_38-retracted.)
684    
685    1. Fixed a problem with incorrect rounding modes in real64.sml.
686       (Thanks to Andrew Mccreight <andrew.mccreight@yale.edu>.)
687    
688    2. A bug in ml-nlffigen related to the handling of unnamed structs, unions,
689       and enums fixed.  The naming of corresponding ML identifiers should
690       now be consistent again.
691    
692    ----------------------------------------------------------------------
693    Name: Allen Leung
694    Date: 2002/01/27
695    Tag: leunga-20020127-nowhere
696    Description:
697    
698       Added a target called nowhere in the configuration scripts.
699       Enabling this will build the MLRISC 'nowhere' tool (for translating
700       programs with where-clauses into legal SML code) during installation.
701    
702    ----------------------------------------------------------------------
703    Name: Matthias Blume
704    Date: 2002/01/25 21:27:00 EST
705    Tag: blume-Release_110_38-retracted
706    Description:
707    
708    Call it a (working) release!  Version is 110.38. Bootfiles are ready.
709    
710    README will be added later.
711    
712    !!! NOTE:  Re-tagged as blume-Release_110_38-retracted. Original tag
713    (Release_110_38) removed.  Reason: Last-minute bug fixes.
714    
715    ----------------------------------------------------------------------
716    Name: Matthias Blume
717    Date: 2002/01/25
718    Tag: blume-20020125-ffi
719    Description:
720    
721    A large number of tweaks and improvements to ml-nlffi-lib and
722    ml-nlffigen:
723    
724       - ML represenation types have been streamlined
725       - getter and setter functions work with concrete values, not abstract
726         ones where possible
727       - ml-nlffigen command line more flexible (see README file there)
728       - some bugs have been fixed (hopefully)
729    
730    ----------------------------------------------------------------------
731    Name: Lal George
732    Date: 2002/01/24
733    Tag: george-20020124-risc-ra-interface
734    Description:
735    
736       There is a dramatic simplification in the interface to the
737       register allocator for RISC architectures as a result of making
738       parallel copy instructions explicit.
739    
740    ----------------------------------------------------------------------
741    Name: Matthias Blume
742    Date: 2002/01/22
743    Tag: blume-20020122-x86-ccalls
744    Description:
745    
746    Bug fix for c-calls on x86 (having to do with how char- and
747    short-arguments are being handled).
748    
749    ----------------------------------------------------------------------
750    Name: Matthias Blume
751    Date: 2002/01/21
752    Tag: blume-20020121-ff
753    Description:
754    
755    Another day of fiddling with the FFI...
756    
757    1. Bug fix/workaround:  CKIT does not complain about negative array
758       dimensions, so ml-nlffigen has to guard itself against this possibility.
759       (Otherwise a negative dimension would send it into an infinite loop.)
760    
761    2. Some of the abstract types (light objects, light pointers, most "base"
762       types) in structure C are now eqtypes.
763    
764    3. Added constructors and test functions for NULL function pointers.
765    
766    ----------------------------------------------------------------------
767    Name: Matthias Blume
768    Date: 2002/01/18
769    Tag: blume-20020118-ready-for-new-release
770    Description:
771    
772    Made config/srcarchiveurl point to a new place.  (Will provide boot
773    files shortly.)
774    
775    Maybe we christen this to be 110.38?
776    
777    ----------------------------------------------------------------------
778    Name: Matthias Blume
779    Date: 2002/01/18
780    Tag: blume-20020118-more-ffifiddle
781    Description:
782    
783    Today's FFI fiddling:
784    
785      - Provided a structure CGetSet with "convenient" versions of C.Get.* and
786        C.Set.* that use concrete (MLRep.*) arguments and results instead
787        of abstract ones.
788    
789      - Provided word-style bit operations etc. for "int" representation
790        types in MLRep.S<Foo>Bitops where <Foo> ranges over Char, Int, Short,
791        and Long.
792    
793    ----------------------------------------------------------------------
794    Name: Matthias Blume
795    Date: 2002/01/18
796    Tag: blume-20020118-use-x86-fp
797    Description:
798    
799    Now that x86-fast-fp seems to be working, I turned it back on again
800    by default.  (Seems to work fine now, even with the FFI.)
801    
802    Other than that, I added some documentation about the FFI to
803    src/ml-nlffigen/README and updated the FFI test examples in
804    src/ml-nlffi-lib/Tests/*.
805    
806    ----------------------------------------------------------------------
807    Name: Allen Leung
808    Date: 2002/01/17
809    Tag: leunga-20020117-x86-fast-fp-call
810    Description:
811    
812       1. Fixed a problem with handling return fp values when x86's fast fp
813          mode is turned on.
814    
815       2. Minor pretty printing fix for cellset.  Print %st(0) as %st(0) instead
816          of %f32.
817    
818       3. Added a constructor INT32lit to the ast of MLRISC tools.
819    
820    ----------------------------------------------------------------------
821    Name: Matthias Blume
822    Date: 2002/01/16
823    Tag: blume-20020116-ffifiddle
824    Description:
825    
826    More fiddling with the FFI interface:
827    
828     - Make constness 'c instead of rw wherever possible.  This eliminates
829       the need for certain explicit coercions.  (However, due to ML's
830       value polymorphism, there will still be many cases where explicit
831       coercions are necessary.  Phantom types are not the whole answer
832       to modeling a subtyping relationship in ML.)
833    
834     - ro/rw coersions for pointers added.  (Avoids the detour through */&.)
835    
836     - "printf" test example added to src/ml-nlffi-lib/Tests.  (Demonstrates
837       clumsy workaround for varargs problem.)
838    
839    ----------------------------------------------------------------------
840    Name: Lal George
841    Date: 2002/01/15
842    Tag: <none>
843    Description:
844    
845    1. Since COPY instructions are no longer native to the architecture,
846       a generic functor can be used to implement the expandCopies function.
847    
848    2. Allowed EXPORT and IMPORT pseudo-op declarations to appear inside a
849       TEXT segment.
850    
851    ----------------------------------------------------------------------
852    Name: Matthias Blume
853    Date: 2002/01/15
854    Tag: blume-20020115-ffiupdates
855    Description:
856    
857    1. Fix for bug resulting in single-precision float values being returned
858       incorrectly from FFI calls.
859    
860    2. Small modifications to C FFI API:
861    
862        - memory-allocation routines return straight objects (no options)
863          and raise an exception in out-of-memory situations
864        - unsafe extensions to cast between function pointers and pointers
865          from/to ints
866        - added structure C_Debug as an alternative to structure C where
867          pointer-dereferencing (|*| and |*!) always check for null-pointers
868        - added open_lib' to DynLinkage;  open_lib' works like open_lib
869          but also takes a (possibly empty) list of existing library handles
870          that the current library depends on
871    
872    ----------------------------------------------------------------------
873    Name: Matthias Blume
874    Date: 2002/01/10
875    Tag: blume-20020110-newffigen
876    Description:
877    
878    1. Updates to portable graph code.
879    
880    2. Major update to ml-nlffigen and ml-nlffi-lib.  Things are much
881       more scalable now so that even huge interfaces such as the one
882       for GTK compile in finite time and space. :-)
883       See src/ml-nlffigen/README for details on what's new.
884    
885    ----------------------------------------------------------------------
886    Name: Lal George
887    Date: 2001/01/09 14:31:35 EST 2002
888    Tag: george-20011206-rm-native-copy
889    Description:
890    
891            Removed the native COPY and FCOPY instructions
892            from all the architectures and replaced it with the
893            explicit COPY instruction from the previous commit.
894    
895            It is now possible to simplify many of the optimizations
896            modules that manipulate copies. This has not been
897            done in this change.
898    
899    ----------------------------------------------------------------------
900    Name: Lal George
901    Date: 2001/12/06 16:50:13 EST 2001
902    Tag: george-20011206-mlrisc-instruction
903    Description:
904    
905    Changed the representation of instructions from being fully abstract
906    to being partially concrete. That is to say:
907    
908      from
909            type instruction
910    
911      to
912            type instr                              (* machine instruction *)
913    
914            datatype instruction =
915                LIVE of {regs: C.cellset, spilled: C.cellset}
916              | KILL of {regs: C.cellset, spilled: C.cellset}
917              | COPYXXX of {k: CB.cellkind, dst: CB.cell list, src: CB.cell list}
918              | ANNOTATION of {i: instruction, a: Annotations.annotation}
919              | INSTR of instr
920    
921    This makes the handling of certain special instructions that appear on
922    all architectures easier and uniform.
923    
924    LIVE and KILL say that a list of registers are live or killed at the
925    program point where they appear. No spill code is generated when an
926    element of the 'regs' field is spilled, but the register is moved to
927    the 'spilled' (which is present, more for debugging than anything else).
928    
929    LIVE replaces the (now deprecated) DEFFREG instruction on the alpha.
930    We used to generate:
931    
932            DEFFREG f1
933            f1 := f2 + f3
934            trapb
935    
936    but now generate:
937    
938            f1 := f2 + f3
939            trapb
940            LIVE {regs=[f1,f2,f3], spilled=[]}
941    
942    Furthermore, the DEFFREG (hack) required that all floating point instruction
943    use all registers mentioned in the instruction. Therefore f1 := f2 + f3,
944    defines f1 and uses [f1,f2,f3]! This hack is no longer required resulting
945    in a cleaner alpha implementation. (Hopefully, intel will not get rid of
946    this architecture).
947    
948    COPYXXX is intended to replace the parallel COPY and FCOPY  available on
949    all the architectures. This will result in further simplification of the
950    register allocator that must be aware of them for coalescing purposes, and
951    will also simplify certain aspects of the machine description that provides
952    callbacks related to parallel copies.
953    
954    ANNOTATION should be obvious, and now INSTR represents the honest to God
955    machine instruction set!
956    
957    The <arch>/instructions/<arch>Instr.sml files define certain utility
958    functions for making porting easier -- essentially converting upper case
959    to lower case. All machine instructions (of type instr) are in upper case,
960    and the lower case form generates an MLRISC instruction. For example on
961    the alpha we have:
962    
963      datatype instr =
964         LDA of {r:cell, b:cell, d:operand}
965       | ...
966    
967      val lda : {r:cell, b:cell, d:operand} -> instruction
968        ...
969    
970    where lda is just (INSTR o LDA), etc.
971    
972    ----------------------------------------------------------------------
973    Name: Matthias Blume
974    Date: 2001/11/22 21:40:00 EST
975    Tag: Release_110_37
976    Description:
977    
978    Release 110.37.  This time for real.
979    
980    ----------------------------------------------------------------------
981    Name: Matthias Blume
982    Date: 2001/11/21 16:35:00 EST
983    Tag: blume-20011121-foot-in-mouth
984    Description:
985    
986    Removed the "Release_110_37" tag because of a serious bug.
987    This will be re-tagged once the bug is fixed.
988    
989    ----------------------------------------------------------------------
990    Name: Matthias Blume
991    Date: 2001/11/21 16:14:00 EST
992    Tag: blume-20011121-forgottenfile
993    Description:
994    
995    Forgot to add a file.  (Just a .tex-file -- part of
996    the CM manual source.)
997    
998    ----------------------------------------------------------------------
999    Name: Matthias Blume
1000    Date: 2001/11/21 16:10:00 EST
1001    Tag: blume-20011121-invalid_110_37
1002    Description:
1003    
1004    Note: I removed the original tag "Release_110_37" from this commit
1005          because we found a serious bug in all non-x86 backends.
1006          - Matthias
1007    
1008    1. Modifications to the SML/NJ code generator and to the runtime system
1009       so that code object name strings are directly inserted into code
1010       objects at code generation time.  The only business the runtime system
1011       has with this is now to read the name strings on occasions.
1012       (The encoding of the name string has also changed somewhat.)
1013    
1014    2. CM now implements a simple "set calculus" for specifying export lists.
1015       In particular, it is now possible to refer to the export lists of
1016       other libraries/groups/sources and form unions as well as differences.
1017       See the latest CM manual for details.
1018    
1019    3. An separate notion of "proxy" libraries has again be eliminated from
1020       CM's model.  (Proxy libraries are now simply a special case of using
1021       the export list calculus.)
1022    
1023    4. Some of the existing libraries now take advantage of the new set
1024       calculus.
1025       (Notice that not all libraries have been converted because some
1026       of the existing .cm-files are supposed to be backward compatible
1027       with 110.0.x.)
1028    
1029    5. Some cleanup in stand-alone programs.  (Don't use "exnMessage" -- use
1030       "General.exnMessage"!  The former relies on a certain hook to be
1031       initialized, and that often does not happen in the stand-alone case.)
1032    
1033    ----------------------------------------------------------------------
1034    Name: Lal George
1035    Date: 2001/11/21  13:56:18 EST
1036    Tag: george-2001121-pseudo-ops
1037    Description:
1038    
1039      Implemented a complete redesign of MLRISC pseudo-ops. Now there
1040      ought to never be any question of incompatabilities with
1041      pseudo-op syntax expected by host assemblers.
1042    
1043      For now, only modules supporting GAS syntax are implemented
1044      but more should follow, such as MASM, and vendor assembler
1045      syntax, e.g. IBM as, Sun as, etc.
1046    
1047    ----------------------------------------------------------------------
1048    Name: Matthias Blume
1049    Date: 2001/11/14 11:52:00 EST
1050    Tag: blume-20011114-srcname
1051    Description:
1052    
1053    1. Routed the name of the current source file to mlriscgen where it
1054       should be directly emitted into the code object.  (This last part
1055       is yet to be done.)
1056    
1057    2. Some cleanup of the pgraph code to make it match the proposal that
1058       I put out the other day.  (The proposal notwithstanding, things are
1059       still in flux here.)
1060    
1061    ----------------------------------------------------------------------
1062    Name: Lal George
1063    Date: 2001/11/14 09:44:04 EST
1064    Tag:
1065    Description:
1066    
1067      Fix for a backpatching bug reported by Allen.
1068    
1069      Because the boundary between short and long span-dependent
1070      instructions is +/- 128, there are an astounding number of
1071      span-dependent instructions whose size is over estimated.
1072    
1073      Allen came up with the idea of letting the size of span
1074      dependent instructions be non-monotonic, for a maxIter
1075      number of times, after which the size must be monotonically
1076      increasing.
1077    
1078      This table shows the number of span-dependent instructions
1079      whose size was over-estimated as a function of maxIter, for the
1080      file Parse/parse/ml.grm.sml:
1081    
1082         maxIter            # of instructions:
1083            10                      687
1084            20                      438
1085            30                      198
1086            40                        0
1087    
1088      In compiling the compiler, there is no significant difference in
1089      compilation speed between maxIter=10 and maxIter=40. Actually,
1090      my measurements showed that maxIter=40 was a tad faster than
1091      maxIter=10! Also 96% of the  files in the compiler reach a fix
1092      point within 13 iterations, so fixing maxIter at 40, while high,
1093      is okay.
1094    
1095    ----------------------------------------------------------------------
1096    Name: Matthias Blume
1097    Date: 2001/10/31 15:25:00 EST
1098    Tag: blume-20011031-pgraph
1099    Description:
1100    
1101    CKIT:
1102    * Changed the "Function" constructor of type Ast.ctype to carry optional
1103      argument identifiers.
1104    * Changed the return type of TypeUtil.getFunction accordingly.
1105    * Type equality ignores the argument names.
1106    * TypeUtil.composite tries to preserve argument names but gives up quickly
1107      if there is a mismatch.
1108    
1109    installation script:
1110    * attempts to use "curl" if available (unless "wget" is available as well)
1111    
1112    CM:
1113    * has an experimental implementation of "portable graphs" which I will
1114      soon propose as an implementation-independent library format
1115    * there are also new libraries $/pgraph.cm and $/pgraph-util.cm
1116    
1117    NLFFI-LIB:
1118    * some cleanup (all cosmetic)
1119    
1120    NLFFIGEN:
1121    * temporarily disabled the mechanism that suppresses ML output for
1122      C definitions whose identifiers start with an underscore character
1123    * generate val bindings for enum constants
1124    * user can request that only one style (light or heavy) is being used;
1125      default is to use both (command-line arguments: -heavy and -light)
1126    * fixed bug in handling of function types involving incomplete pointers
1127    * generate ML entry points that take record arguments (i.e., using
1128      named arguments) for C functions that have a prototype with named
1129      arguments
1130      (see changes to CKIT)
1131    
1132    ----------------------------------------------------------------------
1133    Name: Allen Leung
1134    Date: 2001/10/27 20:34:00 EDT
1135    Tag: leunga-20011027-x86-fast-fp-call
1136    Description:
1137    
1138       Fixed the bug described in blume-20010920-slowfp.
1139    
1140       The fix involves
1141          1. generating FCOPYs in FSTP in ia32-svid
1142          2. marking a CALL with the appropriate annotation
1143    
1144    ----------------------------------------------------------------------
1145    Name: Matthias Blume
1146    Date: 2001/10/16 11:32:00 EDT
1147    Tag: blume-20011016-netbsd
1148    Description:
1149    
1150    Underscore patch from Chris Richards (fixing problem with compiling
1151    runtime system under recent NetBSD).
1152    
1153    ----------------------------------------------------------------------
1154    Name: Allen Leung
1155    Date: 2001/10/12 17:18:32 EDT 2001
1156    Tag: leung-20011012-x86-printflowgraph
1157    Description:
1158    
1159    X86RA now uses a valid (instead of dummy) PrintFlowgraph module.
1160    
1161    ----------------------------------------------------------------------
1162    Name: Lal George
1163    Date: 2001/10/11 23:51:34 EDT
1164    Tag: george-20011011-too-many-instrs
1165    Description:
1166    
1167    The representation of a program point never expected to see more
1168    than 65536 instructions in a basic block!
1169    
1170    ----------------------------------------------------------------------
1171    Name: Lal George
1172    Date: 2001/10/09 09:41:37 EDT
1173    Tag: george-20011008-mlrisc-labels
1174    Description:
1175    
1176    Changed the machine description files to support printing of
1177    local and global labels in assembly code, based on host assembler
1178    conventions.
1179    
1180    ----------------------------------------------------------------------
1181    Name: Matthias Blume
1182    Date: 2001/09/25 15:25:00 EDT
1183    Tag: blume-20010925-exninfo
1184    Description:
1185    
1186    I provided a non-hook implementation of exnName (at the toplevel) and
1187    made the "dummy" implementation of exnMessage (at the toplevel) more
1188    useful: if nothing gets "hooked in", then at least you are going to
1189    see the exception name and a message indicating why you don't see more.
1190    
1191    [For the time being, programs that need exnMessage and want to use
1192    ml-build should either use General.exnMessage (strongly recommended) or
1193    refer to structure General at some other point so that CM sees a
1194    static dependency.]
1195    
1196    [Similar remarks go for "print" and "use":  If you want to use their
1197    functionality in stand-alone programs generated by ml-build, then use
1198    TextIO.output and Backend.Interact.useFile (from $smlnj/compiler.cm).]
1199    
1200    ----------------------------------------------------------------------
1201    Name: Matthias Blume
1202    Date: 2001/09/20 17:28:00 EDT
1203    Tag: blume-20010920-slowfp
1204    Description:
1205    
1206    Allen says that x86-fast-fp is not safe yet, so I turned it off again...
1207    
1208    ----------------------------------------------------------------------
1209    Name: Matthias Blume
1210    Date: 2001/09/20 17:20:00 EDT
1211    Tag: blume-20010920-canonicalpaths
1212    Description:
1213    
1214    0. Updated the BOOT file (something that I forgot to do earlier).
1215    
1216    1. Small internal change to CM so that it avoids "/../" in filenames
1217       as much as possible (but only where it is safe).
1218    
1219    2. Changed config/_run-sml (resulting in a changed bin/.run-sml) so
1220       that arguments that contain delimiters are passed through correctly.
1221       This change also means that all "special" arguments of the form
1222       @SMLxxx... must come first.
1223    
1224    3. Changed install script to put relative anchor names for tool commands
1225       into pathconfig.
1226    
1227    ----------------------------------------------------------------------
1228    Name: Matthias Blume
1229    Date: 2001/09/18 15:35:00 EDT
1230    Tag: blume-20010918-readme11036
1231    Description:
1232    
1233    Added README files.
1234    
1235    ----------------------------------------------------------------------
1236    Name: Matthias Blume
1237    Date: 2001/09/18 11:45:00 EDT
1238    Tag: Release_110_36 (retag)
1239    Description:
1240    
1241    Fixed mistake in config/preloads. Retagged as 110.36.
1242    
1243    ----------------------------------------------------------------------
1244    Name: Matthias Blume
1245    Date: 2001/09/18 09:40:00 EDT
1246    Tag: Release_110_36_orig (tag changed)
1247    Description:
1248    
1249    New version (110.36).  New bootfiles.
1250    
1251    ----------------------------------------------------------------------
1252    Name: Matthias Blume
1253    Date: 2001/09/14 16:15:00 EDT
1254    Tag: blume-20010914-x86fastfp
1255    Description:
1256    
1257    John committed some changes that Allen made, in particular a (hopefully)
1258    correctly working version of the x86-fp module.
1259    
1260    I changed the default setting of the Control.MLRISC.getFlag "x86-fast-fp"
1261    flag to "true".  Everything seems to compile to a fixpoint ok, and
1262    "mandelbrot" speeds up by about 15%.
1263    
1264    ----------------------------------------------------------------------
1265    Name: Matthias Blume
1266    Date: 2001/09/13 11:20:00 EDT
1267    Tag: blume-20010913-minimal
1268    Description:
1269    
1270    1. Stefan Monnier's patch to fix a miscompilation problem that
1271       was brought to light by John Reppy's work on Moby.
1272    
1273    2. Implemented a minimal "structure Compiler" that contains just
1274       "version" and "architecture".  The minimal version will be
1275       available when the full version is not.  This is for backward-
1276       compatibility with code that wants to test Compiler.version.
1277    
1278    ----------------------------------------------------------------------
1279    Name: Matthias Blume
1280    Date: 2001/08/28 14:03:00 EDT
1281    Tag: blume-20010828-ml-lex
1282    Description:
1283    
1284    Fix for bug 1581, received from Neophytos Michael.
1285    
1286    ----------------------------------------------------------------------
1287    Name: Matthias Blume
1288    Date: 2001/08/27 11:20:00 EDT
1289    Tag: blume-20010827-readme11035
1290    Description:
1291    
1292    Fleshed out the README file for 110.35.
1293    
1294    ----------------------------------------------------------------------
1295    Name: Matthias Blume
1296    Date: 2001/08/24 17:10:00 EDT
1297    Tag: Release_110_35
1298    Description:
1299    
1300    New version number (110.35).  New bootfiles.
1301    
1302    ----------------------------------------------------------------------
1303    Name: Lal George
1304    Date: 2001/08/24 13:47:18 EDT 2001
1305    Tag: george-20010824-MLRISC-graphs
1306    Description:
1307    
1308     removed clusters from MLRISC completely and replaced with graphs.
1309    
1310    ----------------------------------------------------------------------
1311    Name: Matthias Blume
1312    Date: 2001/08/23 17:50:00 EDT
1313    Tag: blume-20010823-toplevel
1314    Description:
1315    
1316    - some reorganization of the code that implements various kinds of
1317      environments in the compiler (static, dynamic, symbolic, combined)
1318    - re-implemented the EnvRef module so that evalStream works properly
1319      (if the stream contains references to "use", "CM.make", etc.)
1320    - cleaned up evalloop.sml and interact.sml (but they need more cleaning)
1321    
1322    ----------------------------------------------------------------------
1323    Name: Matthias Blume
1324    Date: 2001/08/20 15:50 EDT
1325    Tag: blume20010820-slipup
1326    Description:
1327    
1328    I forgot to commit a few files.  Here they are...
1329    
1330    ----------------------------------------------------------------------
1331    Name: Matthias Blume
1332    Date: 2001/08/20 15:35:00 EDT
1333    Tag: blume-20010820-debugprof
1334    Description:
1335    
1336    !!!! NEW BOOTFILES !!!!
1337    
1338    This is another round of reorganizing the compiler sources.  This
1339    time the main goal was to factor out all the "instrumentation"
1340    passes (for profiling and backtracing) into their own library.
1341    The difficulty was to do it in such a way that it does not depend
1342    on elaborate.cm but only on elabdata.cm.
1343    
1344    Therefore there have been further changes to both elaborate.cm and
1345    elabdata.cm -- more "generic" things have been moved from the former
1346    to the latter.  As a result, I was forced to split the assignment
1347    of numbers indicating "primtyc"s into two portions: SML-generic and
1348    SML/NJ-specific.  Since it would have been awkward to maintain,
1349    I bit the bullet and actually _changed_ the mapping between these
1350    numbers and primtycs.  The bottom line of this is that you need
1351    a new set of bin- and bootfiles.
1352    
1353    I have built new bootfiles for all architectures, so doing a fresh
1354    checkout and config/install.sh should be all you need.
1355    
1356    The newly created library's name is
1357    
1358        $smlnj/viscomp/debugprof.cm
1359    
1360    and its sources live under
1361    
1362        src/compiler/DebugProf
1363    
1364    ----------------------------------------------------------------------
1365    Name: Matthias Blume
1366    Date: 2001/08/15 17:15:00 EDT
1367    Tag: blume-20010815-compreorg
1368    Description:
1369    
1370    This is a first cut at reorganizing the CM libraries that make up the
1371    core of the compiler.  The idea is to separate out pieces that could
1372    be used independently by tools, e.g., the parser, the typechecker, etc.
1373    
1374    The current status is a step in this direction, but it is not quite
1375    satisfactory yet.  Expect more changes in the future.
1376    
1377    Here is the current (new) organization...
1378    
1379        What used to be $smlnj/viscomp/core.cm is now divided into
1380        six CM libraries:
1381    
1382             $smlnj/viscomp/basics.cm
1383                           /parser.cm
1384                           /elabdata.cm
1385                           /elaborate.cm
1386                           /execute.cm
1387                           /core.cm
1388    
1389        The CM files for these libraries live under src/system/smlnj/viscomp.
1390        All these libraries are proxy libraries that contain precisely
1391        one CM library component.  Here are the locations of the components
1392        (all within the src/compiler tree):
1393    
1394             Basics/basics.cm
1395             Parse/parser.cm
1396             ElabData/elabdata.cm
1397             Elaborator/elaborate.cm
1398             Execution/execute.cm
1399             core.cm
1400    
1401         [This organization is the same that has been used already
1402         for a while for the architecture-specific parts of the visible
1403         compiler and for the old version of core.cm.]
1404    
1405         As you will notice, many source files have been moved from their
1406         respective original locations to a new home in one of the above
1407         subtrees.
1408    
1409         The division of labor between the new libraries is the following:
1410    
1411             basics.cm:
1412                - Simple, basic definitions that pertain to many (or all) of
1413                  the other libraries.
1414             parser.cm:
1415                - The SML parser, producing output of type Ast.dec.
1416                - The type family for Ast is also defined and exported here.
1417             elabdata.cm:
1418                - The datatypes that describe input and output of the elaborator.
1419                  This includes types, absyn, and static environments.
1420             elaborator.cm:
1421                - The SML/NJ type checker and elaborator.
1422                  This maps an Ast.dec (with a given static environment) to
1423                  an Absyn.dec (with a new static environment).
1424                - This libraries implements certain modules that used to be
1425                  structures as functors (to remove dependencies on FLINT).
1426             execute.cm:
1427                - Everything having to do with executing binary code objects.
1428                - Dynamic environments.
1429             core.cm:
1430                - SML/NJ-specific instantiations of the elaborator and MLRISC.
1431                - Top-level modules.
1432                - FLINT (this should eventually become its own library)
1433    
1434    Notes:
1435    
1436    I am not 100% happy with the way I separated the elaborator (and its
1437    data structures) from FLINT.  Two instances of the same problem:
1438    
1439        1. Data structures contain certain fields that carry FLINT-specific
1440           information.  I hacked around this using exn and the property list
1441           module from smlnj-lib.  But the fact that there are middle-end
1442           specific fields around at all is a bit annoying.
1443    
1444        2. The elaborator calculates certain FLINT-related information.  I tried
1445           to make this as abstract as I could using functorization, but, again,
1446           the fact that the elaborator has to perform calculations on behalf
1447           of the middle-end at all is not nice.
1448    
1449        3. Having to used exn and property lists is unfortunate because it
1450           weakens type checking.  The other alternative (parameterizing
1451           nearly *everything*) is not appealing, though.
1452    
1453    I removed the "rebinding =" warning hack because due to the new organization
1454    it was awkward to maintain it.  As a result, the compiler now issues some of
1455    these warnings when compiling init.cmi during bootstrap compilation. On
1456    the plus side, you also get a warning when you do, for example:
1457       val op = = Int32.+
1458    which was not the case up to now.
1459    
1460    I placed "assign" and "deref" into the _Core structure so that the
1461    code that deals with the "lazy" keyword can find them there.  This
1462    removes the need for having access to the primitive environment
1463    during elaboration.
1464    
1465    ----------------------------------------------------------------------
1466    Name: Matthias Blume
1467    Date: 2001/08/13
1468    Tag: blume-20010813-closures
1469    Description:
1470    
1471    This fix was sent to us by Zhong Shao.  It is supposed to improve the
1472    performance of certain loops by avoiding needless closure allocation.
1473    
1474    ----------------------------------------------------------------------
1475    Name: Lal George
1476    Date: 2001/07/31 10:03:23 EDT 2001
1477    Tag: george-20010731-x86-fmalloc
1478    Description: Fixed bug in x86 calls
1479    
1480        There was a bug where call instructions would mysteriously
1481        vanish. The call instruction had to be one that returned
1482        a floating point value.
1483    
1484    ----------------------------------------------------------------------
1485    Name: Lal George
1486    Date: 2001/07/19 16:36:29 EDT 2001
1487    Tag: george-20010719-simple-cells
1488    Description:
1489    
1490    I have dramatically simplified the interface for CELLS in MLRISC.
1491    
1492    In summary, the cells interface is broken up into three parts:
1493    
1494      1. CellsBasis : CELLS_BASIS
1495    
1496            CellsBasis is a top level structure and common for all
1497            architectures.  it contains the definitions of basic datatypes
1498            and utility  functions over these types.
1499    
1500      2. functor Cells() : CELLS
1501    
1502            Cells generates an interface for CELLS that incorporates the
1503            specific resources on the target architecture, such as the
1504            presence of special register classes, their number and size,
1505            and various useful substructures.
1506    
1507      3. <ARCH>CELLS
1508    
1509            e.g. SparcCells: SPARCCELLS
1510    
1511            <ARCH>CELLS usually contains additional bindings for special
1512            registers  on the architecture, such as:
1513    
1514                    val r0 : cell           (* register zero *)
1515                    val y : cell            (* Y register *)
1516                    val psr : cell          (* processor status register *)
1517                    ...
1518    
1519            The structure returned by applying the Cells functor is opened
1520            in this interface.
1521    
1522    The main implication of all this is that the datatypes for cells is
1523    split between CellsBasis and CELLS -- a fairly simple change for user
1524    code.
1525    
1526    In the old scheme the CELLS interface had a definitional binding of
1527    the form:
1528    
1529            signature CELLS = sig
1530    
1531               structure CellsBasis = CellsBasis
1532    
1533               ...
1534    
1535            end
1536    
1537    With all the sharing constraints that goes on in MLRISC, this old
1538    design  quickly leads to errors such as:
1539    
1540            "structure definition spec inside of sharing ... "
1541    
1542    
1543    and appears to require an unacceptable amount of sharing and where
1544    constraint hackery.
1545    
1546    I think this error message (the interaction of definitional specs and
1547    sharing) requires more explanation on our web page.
1548    
1549    ----------------------------------------------------------------------
1550    Name: Matthias Blume
1551    Date: 2001/07/19 15:00:00 EDT
1552    Tag: blume-20010719-libreorg
1553    Description:
1554    
1555    This update puts together a fairly extensive but straightforward change
1556    to the way the libraries that implement the interactive system are
1557    organized:
1558    
1559       The biggest change is the elimination of structure Compiler.  As a
1560       replacement for this structure, there is now a CM library
1561       (known as $smlnj/compiler.cm or $smlnj/compiler/current.cm)
1562       that exports all the substructures of the original structure Compiler
1563       directly.  So instead of saying Compiler.Foo.bar one now simply
1564       says Foo.bar.  (The CM libraries actually export a collection of
1565       structures that is richer than the collection of substructures of
1566       structure Compiler.)
1567    
1568       To make the transition smooth, there is a separate library called
1569       $smlnj/compiler/compiler.cm which puts together and exports the
1570       original structure Compiler (or at least something very close to it).
1571    
1572       There are five members of the original structure Compiler
1573       that are not exported directly but which instead became members
1574       of a new structure Backend (described by signature BACKEND).  These are:
1575       structure Profile (: PROFILE), structure Compile (: COMPILE), structure
1576       Interact (: INTERACT), structure Machine (: MACHINE), and val
1577       architecture (: string).
1578    
1579       Structure Compiler.Version has become structure CompilerVersion.
1580    
1581       Cross-compilers for alpha32, hppa, ppc, sparc, and x86 are provided
1582       by $smlnj/compiler/<arch>.cm where <arch> is alpha32, hppa, ppc, sparc,
1583       or x86, respectively.
1584       Each of these exports the same frontend structures that
1585       $smlnj/compiler.cm exports.  But they do not have a structure Backend
1586       and instead export some structure <Arch>Backend where <Arch> is Alpha32,
1587       Hppa, PPC, Sparc, or X86, respectively.
1588    
1589       Library $smlnj/compiler/all.cm exports the union of the exports of
1590       $smlnj/compiler/<arch>.cm
1591    
1592       There are no structures <Arch>Compiler anymore, use
1593       $smlnj/compiler/<arch>.cm instead.
1594    
1595       Library host-compiler-0.cm is gone.  Instead, the internal library
1596       that instantiates CM is now called cm0.cm.  Selection of the host
1597       compiler (backend) is no longer done here but. (Responsibility for it
1598       now lies with $smlnj/compiler/current.cm.  This seems to be more
1599       logical.)
1600    
1601       Many individual files have been moved or renamed.  Some files have
1602       been split into multiple files, and some "dead" files have been deleted.
1603    
1604    Aside from these changes to library organization, there are also changes
1605    to the way the code itself is organized:
1606    
1607       Structure Binfile has been re-implemented in such a way that it no
1608       longer needs any knowledge of the compiler.  It exclusively deals
1609       with the details of binfile layout.  It no longer invokes the
1610       compiler (for the purpose of creating new prospective binfile
1611       content), and it no longer has any knowledge of how to interpret
1612       pickles.
1613    
1614       Structure Compile (: COMPILE) has been stripped down to the bare
1615       essentials of compilation.  It no longer deals with linking/execution.
1616       The interface has been cleaned up considerably.
1617    
1618       Utility routines for dealing with linking and execution have been
1619       moved into their own substructures.
1620    
1621       (The ultimate goal of these changes is to provide a light-weight
1622       binfile loader/linker (at least for, e.g., stable libraries) that
1623       does not require CM or the compiler to be present.)
1624    
1625    CM documentation has been updated to reflect the changes to library
1626    organization.
1627    
1628    ----------------------------------------------------------------------
1629    Name: Matthias Blume
1630    Date: 2001/07/10 17:30:00 EDT
1631    Tag: Release_110_34
1632    Description:
1633    
1634    Minor tweak to 110.34 (re-tagged):
1635    
1636      - README.html file added to CVS repository
1637      - runtime compiles properly under FreeBSD 3.X and 4.X
1638    
1639    ----------------------------------------------------------------------
1640    Name: Matthias Blume
1641    Date: 2001/07/10 17:30:00 EDT
1642    Tag: Release_110_34
1643    Description:
1644    
1645    New version number (110.34). New bootfiles.
1646    
1647    ----------------------------------------------------------------------
1648    Name: Matthias Blume
1649    Date: 2001/07/09 16:00:00 EDT
1650    Tag: blume-20010709-more-varargs
1651    Description:
1652    
1653    I changed the handling of varargs in ml-nlffigen again:
1654    The ellipsis ... will now simply be ignored (with an accompanying warning).
1655    
1656    The immediate effect is that you can actually call a varargs function
1657    from ML -- but you can't actually supply any arguments beyond the ones
1658    specified explicitly.  (For example, you can call printf with its format
1659    string, but you cannot pass additional arguments.)
1660    
1661    This behavior is only marginally more useful than the one before, but
1662    it has the advantage that a function or, more importantly, a function
1663    type never gets dropped on the floor, thus avoiding follow-up problems with
1664    other types that refer to the offending one.
1665    
1666    ----------------------------------------------------------------------
1667    Name: Matthias Blume
1668    Date: 2001/07/09 11:25:00 EDT
1669    Tag: blume-20010709-varargs
1670    Description:
1671    
1672    1. ckit-lib.cm now exports structure Error
1673    2. ml-nlffigen reports occurences of "..." (i.e., varargs function types)
1674       with a warning accompanied by a source location.  Moreover, it
1675       merely skips the offending function or type and proceeds with the
1676       rest of its work.u  As a result, one can safely feed C code containing
1677       "..." to ml-nlffigen.
1678    3. There are some internal improvements to CM, providing slightly
1679       more general string substitutions in the tools subsystem.
1680    
1681    ----------------------------------------------------------------------
1682    Name: Matthias Blume
1683    Date: 2001/06/27 15:10:00 EDT
1684    Tag: blume-20010627-concur
1685    Description:
1686    
1687    Fixed a small bug in CM's handling of parallel compilation.
1688    (You could observe the bug by Control-C-interrupting an ordinary
1689    CMB.make or CM.stabilize and then attaching some compile servers.
1690    The result was that all of a sudden the previously interrupted
1691    compilation would continue on its own.  This was because of
1692    an over-optimization: CM did not bother to clean out certain queues
1693    when no servers were attached "anyway", resulting in the contents
1694    of these queues to grab control when new servers did get attached.)
1695    
1696    There is also another minor update to the CM manual.
1697    
1698    ----------------------------------------------------------------------
1699    Name: Matthias Blume
1700    Date: 2001/06/26 16:15:00 EDT
1701    Tag: blume-20010626-cmdoc
1702    Description:
1703    
1704    Minor typo fixed in CM manual (syntax diagram for libraries).
1705    
1706    ----------------------------------------------------------------------
1707    Name: Matthias Blume
1708    Date: 2001/06/25 22:55:00 EDT
1709    Tag: blume-20010625-x86pc
1710    Description:
1711    
1712    Fixed a nasty bug in the X86 assembly code that caused signal
1713    handlers to fail (crash) randomly.
1714    
1715    ----------------------------------------------------------------------
1716    Name: Matthias Blume
1717    Date: 2001/06/25 12:05:00 EDT
1718    Tag: blume-20010625-nlffigen
1719    Description:
1720    
1721    This update fixes a number of minor bugs in ml-nlffigen as reported by
1722    Nick Carter <nbc@andrew.cmu.edu>.
1723    
1724      1. Silly but ok typedefs of the form "typedef void myvoid;" are now accepted.
1725      2. Default names for generated files are now derived from the name of
1726         the C file *without its directory*.  In particular, this causes generated
1727         files to be placed locally even if the C file is in some system directory.
1728      3. Default names for generated signatures and structures are also derived
1729         from the C file name without its directory.  This avoids silly things
1730         like "structure GL/GL".
1731         (Other silly names are still possible because ml-nlffigen does not do
1732          a thorough check of whether generated names are legal ML identifiers.
1733          When in doubt, use command line arguments to force particular names.)
1734    
1735    ----------------------------------------------------------------------
1736    Name: Matthias Blume
1737    Date: 2001/06/21 12:25:00 EDT
1738    Tag: blume-20010621-eXene
1739    Description:
1740    
1741    eXene now compiles and (sort of) works again.
1742    
1743    The library name (for version > 110.33) is $/eXene.cm.
1744    
1745    I also added an new example in src/eXene/examples/nbody.  See the
1746    README file there for details.
1747    
1748    ----------------------------------------------------------------------
1749    Name: Matthias Blume
1750    Date: 2001/06/20 16:40:00 EDT
1751    Tag: blume-20010620-cml
1752    Description:
1753    
1754    CML now compiles and works again.
1755    
1756    Libraries (for version > 110.33):
1757    
1758      $cml/cml.cm            Main CML library.
1759      $cml/basis.cm          CML's version of $/basis.cm.
1760      $cml/cml-internal.cm   Internal helper library.
1761      $cml/core-cml.cm       Internal helper library.
1762      $cml-lib/trace-cml.cm  Tracing facility.
1763      $cml-lib/smlnj-lib.cm  CML's version of $/smlnj-lib.cm
1764    
1765    The installer (config/install.sh) has been taught how to properly
1766    install this stuff.
1767    
1768    ----------------------------------------------------------------------
1769    Name: Matthias Blume
1770    Date: 2001/06/19 17:55:00 EDT
1771    Tag: blume-20010619-instantiate
1772    Description:
1773    
1774    This un-breaks the fix for bug 1432.
1775    (The bug was originally fixed in 110.9 but I broke it again some
1776    time after that.)
1777    
1778    ----------------------------------------------------------------------
1779    Name: Matthias Blume
1780    Date: 2001/06/19 17:25:00 EDT
1781    Tag: blume-20010619-signals
1782    Description:
1783    
1784    This should (hopefully) fix the long-standing signal handling bug.
1785    (The runtime system was constructing a continuation record with an
1786    incorrect descriptor which would cause the GC to drop data on the floor...)
1787    
1788    ----------------------------------------------------------------------
1789    Name: Matthias Blume
1790    Date: 2001/06/15 15:05:00 EDT
1791    Tag: blume-20010615-moresparc
1792    Description:
1793    
1794    Here is a short late-hour update related to Sparc c-calls:
1795    
1796     -- made handling of double-word arguments a bit smarter
1797    
1798     -- instruction selection phase tries to collapse certain clumsily
1799        constructed ML-Trees; typical example:
1800    
1801            ADD(ty,ADD(_,e,LI d1),LI d2)  ->  ADD(ty,e,LI(d1+d2))
1802    
1803        This currently has no further impact on SML/NJ since mlriscGen does
1804        not seem to generate such patterns in the first place, and c-calls
1805        (which did generate them in the beginning) has meanwhile been fixed
1806        so as to avoid them as well.
1807    
1808    ----------------------------------------------------------------------
1809    Name: Matthias Blume
1810    Date: 2001/06/15 15:05:00 EDT
1811    Tag: blume-20010615-sparc
1812    Description:
1813    
1814    The purpose of this update is to provide an implementation of NLFFI
1815    on Sparc machines.
1816    
1817    Here are the changes in detail:
1818    
1819       * src/MLRISC/sparc/c-calls/sparc-c-calls.sml is a new file containing
1820       the Sparc implementation of the c-calls API.
1821       * The Sparc backend of SML/NJ has been modified to uniformely use %fp
1822       for accessing the ML frame.  Thus, we have a real frame pointer and
1823       can freely modify %sp without need for an omit-frame-ptr phase.
1824       The vfp logic in src/compiler/CodeGen/* has been changed to accomodate
1825       this case.
1826       * ml-nlffigen has been taught to produce code for different architectures
1827       and calling conventions.
1828       * In a way similar to what was done in the x86 case, the Sparc
1829       backend uses its own specific extension to mltree.  (For example,
1830       it needs to be able to generate UNIMP instructions which are part
1831       of the calling convention.)
1832       * ml-nlffi-lib was reorganized to make it more modular (in particular,
1833       to make it easier to plug in new machine- and os-dependent parts).
1834    
1835    There are some other fairly unrelated bug fixes and cleanups as well:
1836    
1837       * I further hacked the .cm files for MLRISC tools (like MDLGen) so
1838       that they properly share their libraries with existing SML/NJ libraries.
1839       * I fixed a minor cosmetic bug in CM, supressing certain spurious
1840       follow-up error messages.
1841       * Updates to CM/CMB documentation.
1842    
1843    TODO items:
1844    
1845       * MLRISC should use a different register as its asmTemp on the Sparc.
1846         (The current %o2 is a really bad choice because it is part of the
1847         calling conventions, so things might interfere in unexpected ways.)
1848    
1849    ----------------------------------------------------------------------
1850    Name: Matthias Blume
1851    Date: 2001/06/07
1852    Tag: blume-20010607-calls
1853    Description:
1854    
1855    A number of internal changes related to C calls and calling conventions:
1856    
1857    1. ML-Tree CALL statements now carry a "pops" field.  It indicates the
1858       number of bytes popped implicitly (by the callee).  In most cases
1859       this field is 0 but on x86/win32 it is some non-zero value.  This
1860       is information provided for the benefit of the "omit-frameptr" pass.
1861    2. The CALL instruction on the x86 carries a similar "pops" field.
1862       The instruction selection phase copies its value from the ML-Tree
1863       CALL statement.
1864    3. On all other architectures, the instruction selection phase checks
1865       whether "pops=0" and complains if not.
1866    4. The c-calls implementation for x86 now accepts two calling conventions:
1867       "ccall" and "stdcall".  When "ccall" is selected, the caller cleans
1868       up after the call and pops is set to 0.  For "stdcall", the caller
1869       does nothing, leaving the cleanup to the callee; pops is set to
1870       the number of bytes that were pushed onto the stack.
1871    5. The cproto decoder (compiler/Semant/types/cproto.sml) now can
1872       distinguish between "ccall" and "stdcall".
1873    6. The UNIMP instruction has been added to the supported Sparc instruction
1874       set. (This is needed for implementing the official C calling convention
1875       on this architecture.)
1876    7. I fixed some of the .cm files under src/MLRISC/Tools to make them
1877       work with the latest CM.
1878    
1879    ----------------------------------------------------------------------
1880    Name: Matthias Blume
1881    Date: 2001/06/05 15:10:00 EDT
1882    Tag: blume-20010605-cm-index
1883    Description:
1884    
1885    0. The "lambdasplit" parameter for class "sml" in CM has been documented.
1886    
1887    1. CM can now generate "index files".  These are human-readable files
1888       that list on a per-.cm-file basis each toplevel symbol defined or
1889       imported.  The location of the index file for
1890       <p>/<d>.cm is <p>/CM/INDEX/<d>.cm.
1891       To enable index-file generation, set CM.Control.generate_index to true
1892       or export an environment-symbol: export CM_GENERATE_INDEX=true.
1893    
1894       The CM manual has been updated accordingly.
1895    
1896    2. I made some slight modifications to the c-calls API in MLRISC.
1897    
1898         a) There is now a callback to support saving/restoring of
1899            dedicated but caller-save registers around the actual call
1900            instruction.
1901         b) One can optionally specify a comment-annotation for the
1902            call instruction.
1903    
1904    3. SML/NJ (mlriscGen.sml) uses this new API for the rawccall primop.
1905       (For example, the comment annotation shows the C prototype of
1906        the function being called.)
1907    
1908    ----------------------------------------------------------------------
1909    Name: Matthias Blume
1910    Date: 2001/06/01 13:30:00 EDT
1911    Tag: blume-20010601-nlffi-cleanup
1912    Description:
1913    
1914    This is mostly a cleanup of MLFFI stuff:
1915    
1916       - some signature files have been put into a more exposed place
1917       - the ugly 'f type parameter is gone (simplifies types tremendously!)
1918       - ml-nlffigen changed accordingly
1919       - tutorial updated
1920    
1921    Other changes:
1922    
1923       - author's affiliation in CM manual(s) updated
1924       - some more recognized keywords added to Allen's sml.sty
1925    
1926    ----------------------------------------------------------------------
1927    Name: Matthias Blume
1928    Date: 2001/05/25 15:30:00 EDT
1929    Tag: blume-20010525-iptr
1930    Description:
1931    
1932      - put the official 110.33-README (as it appears on the ftp server) under
1933        CVS
1934      - fixed a small bug related to incomplete pointer types in
1935        ml-nlffigen
1936      - small cosmetic change to the ml-nlffi-lib's "arr" type constructor
1937        (it does not need the 'f type parameter)
1938    
1939    ----------------------------------------------------------------------
1940    Name: Matthias Blume
1941    Date: 2001/05/23 14:30:00 EDT
1942    Tag: Release_110_33
1943    Description:
1944    
1945    New version number (110.33).  New bootfiles.
1946    
1947    ----------------------------------------------------------------------
1948    Name: Matthias Blume
1949    Date: 2001/05/22 18:06:00 EDT
1950    Tag: blume-20010522-targets
1951    Description:
1952    
1953    Made install.sh use file config/targets.customized if it exists, falling
1954    back to config/targets if it doesn't.  This way one can have a customized
1955    version of the targets file without touching the "real thing", thus
1956    eliminating the constant fear of accidentally checking something bogus
1957    back into the CVS repository...  (File config/targets.customized must
1958    not be added to the repository!)
1959    
1960    ----------------------------------------------------------------------
1961    Name: Matthias Blume
1962    Date: 2001/05/22 16:30:00 EDT
1963    Tag: blume-20010522-minitut
1964    Description:
1965    
1966    1. Bug fix in ml-nlffigen; now (hopefully) correctly handling
1967       struct returns.
1968    2. Added src/ml-nlffi-lib/Doc/mini-tutorial.txt.  This is some very
1969       incomplete, preliminary documentation for NLFFI.
1970    
1971    ----------------------------------------------------------------------
1972    Name: Matthias Blume
1973    Date: 2001/05/14 11:30:00 EDT
1974    Tag: blume-20010514-script
1975    Description:
1976    
1977    Some bugs in install script fixed.
1978    
1979    In addition to that I also made a slight change to the NLFFI API:
1980    Functors generated by ml-nlffigen now take the dynamic library as a
1981    straight functor argument, not as a suspended one.  (The original
1982    functor code used to force the suspension right away anyway, so there
1983    was nothing gained by this complication of the interface.)
1984    
1985    ----------------------------------------------------------------------
1986    Name: Matthias Blume
1987    Date: 2001/05/11 14:35:00 EDT
1988    Tag: blume-20010511-ml-nlffi
1989    Description:
1990    
1991    I finally took the plunge and added my new FFI code to the main
1992    repository.  For x86-linux it is now ready for prime-time.
1993    
1994    There are two new subdirectories of "src":
1995    
1996      - ml-nlffi-lib:
1997           The utility library for programs using the FFI interface.
1998           Here is the implementation of $/c.cm and its associated low-level
1999           partners $/c-int.cm and $/memory.cm.
2000      - ml-nlffigen:
2001           A stand-alone program for generating ML glue code from C source
2002           code.
2003    
2004    Building ml-nlffigen requires $/ckit-lib.cm.
2005    
2006    The config/install.sh script has been updates to do the Right Thing
2007    (hopefully).
2008    
2009    Notice that the source tree for the C-Kit will not be put under "src"
2010    but directly under the installation root directory.  (This is the
2011    structure that currently exists on the CVS server when you check out
2012    module "sml".)  Fortunately, config/install.sh knows about this oddity.
2013    
2014    Bugs: No documentation yet.
2015    
2016    ----------------------------------------------------------------------
2017    Name: Matthias Blume
2018    Date: 2001/05/09 16:35:00 EDT
2019    Tag: blume-20010509-cpscontract
2020    Description:
2021    
2022    Fixed a bug in the accounting code in cpsopt/contract.sml.  (The
2023    wrapper/unwrapper elimination did not decrement usage counts and some
2024    dead variables got overlooked by the dead-up logic.)
2025    
2026    ----------------------------------------------------------------------
2027    Name: Lal George
2028    Date: 2001/05/08  17:26:09 EDT
2029    Tag: george-20010508-omit-frameptr
2030    Description:
2031    
2032    Changes to implement the omit-frame-pointer optimization to support
2033    raw C calls. For now, there is only support on the Intel x86, but
2034    other architectures will follow as more experience is gained with this.
2035    
2036    
2037    ----------------------------------------------------------------------
2038    Name: Matthias Blume
2039    Date: 2001/05/07 14:40:00 EDT
2040    Tag: blume-20010507-proxies
2041    Description:
2042    
2043    I made into "proxy libraries" all libraries that qualify for such a
2044    change.  (A qualifying library is a library that has another library or
2045    groups as its sole member and repeats that member's export list
2046    verbatim.  A proxy library avoids this repetition by omitting its export
2047    list, effectively inheriting the list that its (only) member exports.
2048    See the CM manual for more explanation.)
2049    The main effect is that explicit export lists for these libraries
2050    do not have to be kepts in sync, making maintenance a bit easier.
2051    
2052    I also added copyright notices to many .cm-files.
2053    
2054    Last but not least, I made a new set of bootfiles.
2055    
2056    ----------------------------------------------------------------------
2057    Name: Matthias Blume
2058    Date: 2001/05/04 17:00:00 EDT
2059    Tag: blume-20010504-cm-lsplit
2060    Description:
2061    
2062    0. John merged pending changes to $/smlnj-lib.cm
2063    
2064    1. Allen's previous change accidentally backed out of one of Lal's
2065       earlier changes.  I undid this mistake (re-introducing Lal's change).
2066    
2067    2. I used the new topOrder' function from graph-scc.sml (from $/smlnj-lib.cm)
2068       within the compiler where applicable.  There is some code simplification
2069       because of that.
2070    
2071    3. The "split" phase (in FLINT) is now part of the default list of phases.
2072       Compiler.Control.LambdaSplitting.* can be used to globally control the
2073       lambda-splitting (cross-module-inlining) engine.  In addition to that,
2074       it can now also be controlled on a per-source basis: CM has been taught
2075       a new tool parameter applicable to ML source files.
2076    
2077       - To turn lambda-splitting off completely:
2078            local open Compiler.Control.LambdaSplitting in
2079                val _ = set Off
2080            end
2081       - To make "no lambda-splitting" the global default (but allow per-source
2082         overriding); this is the initial setting:
2083            local open Compiler.Control.LambdaSplitting in
2084                val _ = set (Default NONE)
2085            end
2086       - To make "lambda-splitting with aggressiveness a" the global default
2087         (and allow per-source overriding):
2088            local open Compiler.Control.LambdaSplitting in
2089                val _ = set (Default (SOME a))
2090            end
2091    
2092       - To turn lambda-splitting off for a given ML souce file (say: a.sml)
2093         write (in the respective .cm-file):
2094            a.sml (lambdasplitting:off)
2095       - To turn lambda-splitting for a.sml on with minimal aggressiveness:
2096            a.sml (lambdasplitting:on)
2097       - To turn lambda-splitting for a.sml on with aggressiveness <a> (where
2098         <a> is a decimal non-negative integer):
2099            a.sml (lambdasplitting:<a>)
2100       - To turn lambda-splitting for a.sml on with maximal aggressiveness:
2101            a.sml (lambdasplitting:infinity)
2102       - To use the global default for a.sml:
2103            a.sml (lambdasplitting:default)
2104         or simply
2105            a.sml
2106    
2107    ----------------------------------------------------------------------
2108    Name: Allen Leung
2109    Date: 2001/05/04 01:57:00 EDT
2110    Tag: leunga-20010504-sync
2111    Description:
2112    
2113      MLRISC features.
2114    
2115      1. Fix to CMPXCHG instructions.
2116      2. Changed RA interface to allow annotations in callbacks.
2117      3. Added a new method to the stream interface to allow annotations updates.
2118    
2119    ----------------------------------------------------------------------
2120    Name: Matthias Blume
2121    Date: 2001/05/01 11:45:00 EDT
2122    Tag: blume-20010501-pcedittmp
2123    Description:
2124    
2125    Changed install.sh to use the current working directory instead of
2126    /usr/tmp for a temporary file (pcedittmp).  The previous choice
2127    of /usr/tmp caused trouble with MacOS X because of file premission
2128    problems.
2129    
2130    ----------------------------------------------------------------------
2131    Name: Matthias Blume
2132    Date: 2001/04/20 11:10:00 EDT
2133    Tag: blume-20010420-inMLflag
2134    Description:
2135    
2136     - added vp_limitPtrMask to vproc-state.h
2137       (for use by the raw-C-calls mechanism to implement proper interrupt
2138        handling)
2139     - made the ML compiler aware of various data-structure offsets so it
2140       can generate code for accessing the vp_inML flag and vp_limitPtrMask
2141     - tweaked mlriscGen.sml to have it emit interrupt-handling code for
2142       raw C-calls
2143    
2144    ----------------------------------------------------------------------
2145    Name: Lal George
2146    Date: 2001/04/20 09:15:28 EDT
2147    Tag: george-20010420-macosX
2148    Description:
2149    
2150     - Changes to port to Mac OS X; Darwin.
2151    
2152     - In the process I found that sqrt was broken on the PPC, because the
2153       fsqrt instruction is not implemented.
2154    
2155    ----------------------------------------------------------------------
2156    Name: Matthias Blume
2157    Date: 2001/04/18 12:45:00 EDT
2158    Tag: blume-20010418-ccalls
2159    Description:
2160    
2161     - fixed two off-by-4 errors in the x86-specific c-calls implementation
2162       (this bug prevented structure arguments containing pointers from being
2163        passed correctly)
2164     - changed the raw-C-call code in mlriscGen.sml in such a way that
2165       structure arguments are represented as a pointer to the beginning
2166       of the structure (instead of having a series of synthesized arguments,
2167       one for each structure member)
2168    
2169     - made makeml script's verbosity level configurable via environment
2170       variable (MAKEML_VERBOSITY)
2171    
2172     - eliminated placeholder implementations for f32l, w16s, i16s, and f32s
2173       in rawmem-x86.sml; we are now using the real thing
2174    
2175    ----------------------------------------------------------------------
2176    Name: Matthias Blume
2177    Date: 2001/03/22 16:25:00 EST
2178    Tag: blume-20010322-bootfiles
2179    Description:
2180    
2181    Created a new set of bootfiles (for your automatic installation convenience).
2182    
2183    ----------------------------------------------------------------------
2184    Name: Matthias Blume
2185    Date: 2001/03/22 15:10:00 EST
2186    Tag: blume-20010322-rawmem-parcm
2187    Description:
2188    
2189    1. All "raw memory access" primitives for the new FFI are implemented now
2190       (at least on the x86).
2191    2. Some further cleanup of CM's parallel make mechanism.
2192    
2193    ----------------------------------------------------------------------
2194    Name: Matthias Blume
2195    Date: 2001/03/19 17:53:00 EST
2196    Tag: blume-20010319-parallel
2197    Description:
2198    
2199    Parallel make (using compile servers) now works again.
2200    
2201    To this end, CM.stabilize and CMB.make have been modified to work in
2202    two passes when compile servers are attached:
2203       1. Compile everything, do not perform stabilization; this pass
2204          uses compile servers
2205       2. Stabilize everything; this pass does not use compile servers
2206    If there are no compile servers, the two passes are combined into one
2207    (as before).  Splitting the passes increases the inherent parallelism
2208    in the dependency graph because the entire graph including all
2209    libraries is available at the same time.  This, in turn, improves
2210    server utilization.  The downside is that the master process will
2211    have to do some extra work after compilation is done (because for
2212    technical reasons it must re-read all the binfiles during stabilization).
2213    
2214    ----------------------------------------------------------------------
2215    Name: Matthias Blume
2216    Date: 2001/03/16 12:22:00 EST
2217    Tag: blume-20010316-bootfiles
2218    Description:
2219    
2220    Created a new set of bootfiles (for your automatic installation convenience).
2221    
2222    ----------------------------------------------------------------------
2223    Name: Matthias Blume
2224    Date: 2001/03/16 11:00:00 EST
2225    Tag: blume-20010316-MLTREE-fixup
2226    Description:
2227    
2228    This is a minor fixup for an (untagged) earlier commit by Allen.
2229    (A file was missing).
2230    
2231    ----------------------------------------------------------------------
2232    Name: Allen Leung
2233    Date: Mon Mar  5 18:54:57 EST 2001
2234    Tag: leunga-20010305-cut-support
2235    
2236    1. New support for alternative control-flow in MLTREE.
2237       Currently we support
2238    
2239          FLOW_TO(CALL ...., [k1,...,kn])
2240    
2241       This is needed for 'cuts to' in C-- and try/handle-like constructs
2242       in Moby
2243    
2244       New assembler flag "asm-show-cutsto" to turn on control-flow debugging.
2245    
2246    2. Register Allocator
2247    
2248       Changes in interface [from Fermin, John]
2249    
2250    3. Alpha 8-bit SLL support [Fermin]
2251    
2252    4. All architectures
2253    
2254       A new module (ClusterExpandCopies) for expanding parallel copies.
2255    
2256    ----------------------------------------------------------------------
2257    Name: Allen Leung
2258    Date: 2001/02/27 23:07:00 EST
2259    Tag: leunga-20010227-minor-stuff
2260    
2261    1. Alpha bug fix for CMOVNE
2262    2. Handle mltree COND(..,FCMP ...,...)
2263    3. Bug fix in simplifier
2264    
2265    ----------------------------------------------------------------------
2266    Name: Matthias Blume
2267    Date: 2001/01/30 17:50:00 EST
2268    Tag: blume-20010130-sync
2269    Description:
2270    
2271    This is just a minor update to sync my devel branch with the main brach.
2272    The only visible change is the addition of some README files.
2273    
2274    ----------------------------------------------------------------------
2275    Name: Matthias Blume
2276    Date: 2001/01/12 23:30:00 JST
2277    Tag: blume-20010112-bootfiles
2278    Description:
2279    
2280    Made a new set of bootfiles that goes with the current state of the
2281    repository.
2282    
2283    ----------------------------------------------------------------------
2284    Name: Matthias Blume
2285    Date: 2001/01/12 21:20:00 JST
2286    Tag: blume-20010112-sync
2287    Description:
2288    
2289    I am just flushing out some minor changes that had accumulated in
2290    my private branch in order to sync with the main tree.  (This is
2291    mainly because I had CVS trouble when trying to merge _into_ my
2292    private branch.)
2293    
2294    Most people should be completely unaffected by this.
2295    
2296    ----------------------------------------------------------------------
2297    Name: Allen Leung
2298    Date: Thu Jan 11 21:03:00 EST 2001
2299    Tag: leunga-20010111-labexp=mltree
2300    Description:
2301    
2302    1.  Removed the type LabelExp and replace it by MLTree.
2303    2.  Rewritten mltree-simplify with the pattern matcher tool.
2304    3.  There were some bugs in alpha code generator which would break
2305        64-bit code generation.
2306    4.  Redo the tools to generate code with the
2307    5.  The CM files in MLRISC (and in src/system/smlnj/MLRISC)
2308        are now generated by perl scripts.
2309    
2310    ----------------------------------------------------------------------
2311    Name: Matthias Blume
2312    Date: 2001/01/10 21:55:00 JST
2313    Tag: blume-20010110-rcc
2314    Description:
2315    
2316    The RCC stuff now seems to work (but only on the x86).
2317    This required hacking of the c-calls interface (and -implementation) in
2318    MLRISC.
2319    
2320    Normal compiler users should be unaffected.
2321    
2322    ----------------------------------------------------------------------
2323    Name: Matthias Blume
2324    Date: 2001/01/09 01:20:00 JST
2325    Tag: blume-20010109-rcc
2326    Description:
2327    
2328    This is a fairly big patch, flushing out a large number of pending
2329    changes that I made to my development copy over the last couple of days.
2330    
2331    Of practical relevance at this moment is a workaround for a pickling
2332    bug that Allen ran into the other day.  The cause of the bug itself is
2333    still unknown and it might be hard to fix it properly, but the
2334    workaround has some merits of its own (namely somewhat reducing pickling
2335    overhead for certain libraries).  Therefore, I think this solution should
2336    be satisfactory at this time.
2337    
2338    The rest of the changes (i.e., the vast majority) has to do with my
2339    ongoing efforts of providing direct support for C function calls from
2340    ML.  At the moment there is a new primop "RAW_CCALL", typing magic
2341    in types/cproto.sml (invoked from FLINT/trans/translate.sml), a new
2342    case in the FLINT CPS datatype (RCC), changes to cps/convert.sml to
2343    translate uses of RAW_CCALL into RCC, and changes to mlriscGen.sml to
2344    handle RCC.
2345    
2346    The last part (the changes to mlriscGen.sml) are still known to be
2347    wrong on the x86 and not implemented on all other architectures.  But
2348    the infrastructure is in place. I had to change a few functor
2349    signatures in the backend to be able to route the CCalls interface
2350    from MLRISC there, and I had to specialize the mltree type (on the
2351    x86) to include the necessary extensions. (The extensions themselves
2352    were already there and redy to go in MLRISC/x86).
2353    
2354    Everything should be very happy as soon as someone helps me with
2355    mlriscGen.sml...
2356    
2357    In any case, nothing of this should matter to anyone as long as the
2358    new primop is not being used (which is going to be the case unless you
2359    find it where I hid it :). The rest of the compiler is completely
2360    unaffected.
2361    
2362    ----------------------------------------------------------------------
2363    Name: Matthias Blume
2364    Date: 2001/01/05 00:30:00 JST
2365    Tag: blume-20010105-primops
2366    Description:
2367    
2368    Added some experimental support for work that I am doing right now.
2369    These changes mostly concern added primops, but there is also a new
2370    experimental C library in the runtime system (but currently not enabled
2371    anywhere except on Linux/X86).
2372    
2373    In the course of adding primops (and playing with them), I discovered that
2374    Zhong's INL_PRIM hack (no type info for certain primops) was, in fact, badly
2375    broken.  (Zhong was very right he labeled this stuff as "major gross hack".)
2376    To recover, I made type information in INL_PRIM mandatory and changed
2377    prim.sml as well as built-in.sml accordingly.  The InLine structure now
2378    has complete, correct type information (i.e., no bottom types).
2379    
2380    Since all these changes mean that we need new binfiles, I also bumped the
2381    version number to 110.32.1.
2382    
2383    ----------------------------------------------------------------------
2384    Name: Matthias Blume
2385    Date: 2000/12/30 22:10:00 JST
2386    Tag: blume-20001230-various
2387    Description:
2388    
2389    Added proxy libraries for MLRISC and let MLRISC libraries refer
2390    to each other using path anchors.  (See CM manual for explanation.)
2391    
2392    Updated CM documentation.
2393    
2394    Fixed some bugs in CM.
2395    
2396    Implemented "proxy" libraries (= syntactic sugar for CM).
2397    
2398    Added "-quiet" option to makeml and changed runtime system accordingly.
2399    
2400    Added cleanup handler for exportML to reset timers and compiler stats.
2401    
2402    ----------------------------------------------------------------------
2403    Name: Lal George
2404    Date: 2000/12/22 22:22:58 EST 2000
2405    Tag: Release_110_32
2406    Description:
2407    
2408            Infinite precision used throughout MLRISC.
2409            see MLRISC/mltree/machine-int.sig
2410    
2411    ----------------------------------------------------------------------
2412    Name: Matthias Blume
2413    Date: 2000/12/22 23:16:00 JST
2414    Tag: blume-20001222-warn
2415    Description:
2416    
2417    Corrected wording and formatting of some CM warning message which I
2418    broke in my previous patch.
2419    
2420    ----------------------------------------------------------------------
2421    Name: Matthias Blume
2422    Date: 2000/12/22 21:20:00 JST
2423    Tag: blume-20001222-anchorenv
2424    Description:
2425    
2426    Fixed CM's handling of anchor environments in connection with CMB.make.
2427    
2428    ----------------------------------------------------------------------
2429    Name: Matthias Blume
2430    Date: 2000/12/22 13:15:00 JST
2431    Tag: blume-20001222-cleanup
2432    Description:
2433    
2434    Removed src/cm/ffi which does not (and did not) belong here.
2435    
2436    ----------------------------------------------------------------------
2437    Name: Matthias Blume
2438    Date: 2000/12/21 23:55:00 JST
2439    Tag: blume-20001221-exn
2440    Description:
2441    
2442    Probably most important: CM no longer silently swallows all exceptions
2443    in the compiler.
2444    Plus: some other minor CM changes.  For example, CM now reports some
2445    sizes for generated binfiles (code, data, envpickle, lambdapickle).
2446    
2447    ----------------------------------------------------------------------
2448    Name: Matthias Blume
2449    Date: 2000/12/15 00:01:05 JST
2450    Tag: blume-20001215-dirtool
2451    Description:
2452    
2453    - "dir" tool added.
2454    - improvements and cleanup to Tools structure
2455    - documentation updates
2456    
2457    ----------------------------------------------------------------------
2458    Name: Allen Leung
2459    Date: Thu Dec 14 03:45:24 EST 2000
2460    Description:
2461    Tag:  leunga-20001214-int-inf
2462    Description:
2463    
2464       In IntInf, added these standard functions, which are missing from our
2465    implementation:
2466    
2467        andb : int * int -> int
2468        xorb : int * int -> int
2469        orb  : int * int -> int
2470        notb : int -> int
2471         <<   : int * word -> int
2472        ~>>  : int * word -> int
2473    
2474       Not tested, I hope they are correct.
2475    
2476    ----------------------------------------------------------------------
2477    Name: Allen Leung
2478    Date: Fri Dec  8 19:23:26 EST 2000
2479    Description:
2480    Tag:  leunga-20001208-nowhere
2481    Description:
2482    
2483      Slight improvements to the 'nowhere' tool to handle OR-patterns,
2484    to generate better error messages etc.  Plus a brief manual.
2485    
2486    ----------------------------------------------------------------------
2487    Name: Lal George
2488    Date: 2000/12/08 09:54:02 EST 2000
2489    Tag: Release_110_31
2490    Description:
2491    
2492    - Version 110.31
2493    ----------------------------------------------------------------------
2494    Name: Allen Leung
2495    Date: Thu Dec  7 22:01:04 EST 2000
2496    Tag:  leunga-20001207-cell-monster-hack
2497    Description:
2498    
2499    Major MLRISC internal changes.  Affect all clients.
2500    Summary:
2501    
2502    1.  Type CELLS.cell = int is now replaced by a datatype.
2503        As a result, the old regmap is now gone.  Almost all interfaces
2504        in MLRISC change as a consequence.
2505    
2506    2.  A new brand version of machine description tool (v3.0) that generates
2507        modules expecting the new interface.  The old version is removed.
2508    
2509    3.  The RA interface has been further abstracted into two new functors.
2510        RISC_RA and X86RA.  These functors have much simpler interfaces.
2511        [See also directory MLRISC/demo.]
2512    
2513    4.  Some other new source->source code generation tools are available:
2514    
2515        a. MLRISC/Tools/RewriteGen -- generate rewriters from rules.
2516        b. MLRISC/Tools/WhereGen -- expands conditional pattern matching rules.
2517           I use this tool to generate the peephole optimizers---with the new
2518           cell type changes, peephole rules are becoming difficult to write
2519           without conditional pattern matching.
2520    
2521    5.  More Intmap -> IntHashTable change.  Previous changes by Matthias didn't
2522        cover the entire MLRISC source tree so many things broke.
2523    
2524    6.  CM files have been moved to the subdirectory MLRISC/cm.
2525        They are moved because there are a lot of them and they clutter up the
2526        root dir.
2527    
2528    7.  More detailed documentation to come...
2529    
2530        NOTE: To rebuild from 110.30 (ftp distribution), you'll have to do
2531        a makeml -rebuild first.  This is because of other other
2532        changes that Matthias has made (see below).
2533    
2534    
2535    ----------------------------------------------------------------------
2536    Name: Matthias Blume
2537    Date: 2000/11/30 23:12:00 JST
2538    Tag: blume-20001130-filereorg
2539    Description:
2540    
2541    Some manual updates and some file reorganizations in CM.
2542    
2543    ----------------------------------------------------------------------
2544    Name: Matthias Blume
2545    Date: 2000/11/24 17:45:00 JST
2546    Tag: blume-20001124-link
2547    Description:
2548    
2549    Drastically improved link traversal code for the case that the dynamic
2550    value was already loaded at bootstrap time.  As a result, CM and CMB
2551    now both load blazingly fast -- even on a very slow machine.  Also,
2552    memory consumption has been further reduced by this.
2553    
2554    Warning: The format of the PIDMAP file has changed.  THerefore, to
2555    bootstrap you have to do this:
2556    
2557    1. Run CMB.make
2558    2. Make a symbolic link for the boot directory:
2559         ln -s sml.boot.ARCH-OS xxx
2560    3. "Rebuild" the boot directory:
2561         ./makeml -boot xxx -rebuild sml ; rm xxx
2562    4. Boot normally:
2563          ./makeml
2564    
2565    ----------------------------------------------------------------------
2566    Name: Matthias Blume
2567    Date: 2000/11/21 21:20:00 JST
2568    Tag: blume-20001121-tools
2569    Description:
2570    
2571    Continued hacking on autoloading problem -- with success this time.
2572    Also changed tool-plugin mechanism.  See new CM manual.
2573    
2574    ----------------------------------------------------------------------
2575    Name: Matthias Blume
2576    Date: 2000/11/19 14:30:00 JST
2577    Tag:  blume-20001119-autoload
2578    Description:
2579    
2580    Some hacking to make autoloading faster.  Success for CMB, no success
2581    so far for CM.  There is a reduced structure CM' that autoloads faster.
2582    (This is a temporary, non-documented hack to be eliminated again when
2583    the general problem is solved.)
2584    
2585    ----------------------------------------------------------------------
2586    Name: Matthias Blume
2587    Date: 2000/11/17 14:10:00 JST
2588    Tag: blume-20001117-pickle-lib
2589    Description:
2590    
2591    1. Eliminated comp-lib.cm
2592    2. Made pickle-lib.cm
2593    3. Eliminated all uses of intset.sml (from comp-lib.cm)
2594    4. Replaced all uses of intmap.{sig,sml} (from comp-lib.cm) with
2595       equivalent constructs from smlnj-lib.cm (INtHashTable).
2596    5. Point 4. also goes for those uses of intmap.* in MLRISC.
2597       Duplicated intmap modules thrown out.
2598    6. Hunted down all duplicated SCC code and replaced it with
2599       equivalent stuff (GraphSCCFn from smlnj-lib.cm).
2600    7. Rewrote Feedback module.
2601    8. Moved sortedlist.sml into viscomp-lib.cm.  Eventually it
2602       should be thrown out and equivalent modules from smlnj-lib.cm
2603       should be used (IntRedBlackSet, IntListSet, ...).
2604    
2605    Confirmed that compiler compiles to fixpoint.
2606    
2607    ----------------------------------------------------------------------
2608    Name: Allen Leung
2609    Date: 2000/11/10 18:00:00
2610    Tag: leunga-20001110-new-x86-fp
2611    
2612    A new x86 floating point code generator has been added.
2613    By default this is turned off.  To turn this on, do:
2614    
2615        CM.autoload "$smlnj/compiler.cm";
2616        Compiler.Control.MLRISC.getFlag "x86-fast-fp" := true;
2617    
2618    Changes:
2619    
2620    1.  Changed FTAN to FPTAN so that the assembly output is correct.
2621    2.  Changed the extension callback for FTANGENT to generate:
2622    
2623              fptan
2624              fstp  %st(0)
2625        instead of
2626              fptan
2627              fstpl ftempmem
2628    
2629    3.  Numerous assembly fixes for x86.
2630    
2631    5.  Cleaned up the machine code output module x86/x86MC.sml and added
2632        support for a whole bunch of instructions and addressing modes:
2633    
2634          fadd/fsub/fsubr/fmul/fdiv/fdivr  %st, %st(n)
2635          faddp/fsubp/fsubrp/fmulp/fdivp/fdivrp  %st, %st(n)
2636          fadd/fsub/fsubr/fmul/fdiv/fdivr  %st(n), %st
2637          fiadd/fisub/fisubr/fimul/fidiv/fidivr mem
2638          fxch %st(n)
2639          fld %st(n)
2640          fst %st(n)
2641          fst mem
2642          fstp %st(n)
2643          fucom %st(n)
2644          fucomp %st(n)
2645    
2646        All these are now generated when the fast fp mode is turned on.
2647    
2648    6.  Removed the dedicated registers %st(0), ..., %st(7) from X86CpsRegs
2649    
2650    ----------------------------------------------------------------------
2651    Name: Matthias Blume
2652    Date: 2000/11/09 11:20:00 JST
2653    Tag: blume-20001109-scc
2654    Description:
2655    
2656    Eliminated some code duplication:
2657    
2658    1. Added "where" clause to GraphSCCFn in SML/NJ Library.
2659       (Otherwise the functor is useless.)
2660    2. Used GraphSCCFn where SCCUtilFun was used previously.
2661    3. Got rid of SCCUtilFun (in comp-lib.cm).
2662    
2663    ----------------------------------------------------------------------
2664    Name: Lal George
2665    Date: 2000/11/06 09:02:21 EST 2000
2666    Tag: Release_110_30
2667    Description:
2668    
2669    - Version 110.30
2670    ----------------------------------------------------------------------
2671    Name: Matthias Blume
2672    Date: 2000/11/04 14:45:00
2673    Tag: blume-20001104-mlbuild
2674    Description:
2675    
2676    - Made ml-build faster on startup.
2677    - Documentation fixes.
2678    
2679    ----------------------------------------------------------------------
2680    Name: Matthias Blume
2681    Date: 2000/11/02 17:00:00 JST
2682    Tag: blume-20001102-condcomp
2683    Description:
2684    
2685    - Small tweaks to pickler -- new BOOTFILES!
2686    - Version bumped to 110.29.2.
2687    - Added conditional compilation facility to init.cmi (see comment there).
2688    ----------------------------------------------------------------------
2689    Name: Allen Leung
2690    Date: 2000/10/23 19:31:00
2691    Tag: leunga-20001023-demo-ra
2692    
2693    1. Minor RA changes that improves spilling on x86 (affects Moby and C-- only)
2694    2. Test programs for the graph library updated
2695    3. Some new MLRISC demo programs added
2696    
2697    ----------------------------------------------------------------------
2698    Name: Matthias Blume
2699    Date: 2000/08/31 22:15:00 JST
2700    Tag: blume-20001017-errmsg
2701    Description:
2702    
2703    More error message grief: Where there used to be no messages, there
2704    now were some that had bogus error regions.  Fixed.
2705    
2706    ----------------------------------------------------------------------
2707    Name: Matthias Blume
2708    Date: 2000/08/31 17:30:00 JST
2709    Tag: blume-20001017-v110p29p1
2710    Description:
2711    
2712    I made a version 110.29.1 with new bootfiles.
2713    
2714    Changes:  Modified pickler/unpickler for faster and leaner unpickling.
2715              CM documentation changes and a small bugfix in CM's error reporting.
2716    
2717    ----------------------------------------------------------------------
2718    Name: Lal George
2719    Date: 2000/09/27 14:42:35 EDT
2720    Tag: george-20000927-nodestatus
2721    Description:
2722    
2723    Changed the type of the nodestatus, so that:
2724    
2725            SPILLED(~1)             is now SPILLED
2726            SPILLED(m) where m>=0   is now MEMREG(m)
2727            SPILLED(s) where s<~1   is now SPILL_LOC(~s)
2728    
2729    ----------------------------------------------------------------------
2730    Name: Matthias Blume
2731    Date: 2000/09/07 14:45:00 JST
2732    Tag: blume-20000907-cmerrmsg
2733    Description:
2734    
2735    Small tweak to CM to avoid getting ML syntax error messages twice.
2736    
2737    ----------------------------------------------------------------------
2738    Name: Matthias Blume
2739    Date: 2000/08/31 18:00:00 JST
2740    Tag: blume-20000831-cvsbootfiles
2741    Description:
2742    
2743    New URL for boot files (because the 110.29 files on the BL server do
2744    now work correctly with my updated install scripts for yacc and lex).
2745    
2746    ----------------------------------------------------------------------
2747    Name: Matthias Blume
2748    Date: 2000/08/08 12:33:00 JST
2749    Tag: blume-20000808-manual
2750    Description:
2751    
2752    Tiny update to CM manual.
2753    
2754    ----------------------------------------------------------------------
2755    Name: Allen Leung
2756    Date: 2000/08/7 19:31:00
2757    Tag: leunga-20000807-a-whole-bunch-of-stuff
2758    
2759      Moby, C--, SSA, x86, machine descriptions etc.  Should only affect C--
2760    and Mobdy.
2761    
2762    1.  x86
2763    
2764       a.  Fixes to peephole module by John and Dan.
2765       b.  Assembly fix to SETcc by Allen.
2766       c.  Fix to c-call by John.
2767       d.  Fix to spilling by John.  (This one deals with the missing FSTPT case)
2768       e.  Instruction selection optimization to SETcc as suggested by John.
2769    
2770           For example,
2771    
2772            MV(32, x, COND(32, CMP(32, LT, a, b), LI 1, LI 0))
2773    
2774           should generate:
2775    
2776            MOVL a, x
2777            SUBL b, x
2778            SHRL 31, x
2779    
2780    2.  IR stuff
2781    
2782         A bunch of new DJ-graph related algorithms added.  These
2783         speed up SSA construction.
2784    
2785    3.  SSA + Scheduling
2786    
2787         Added code for SSA and scheduling to the repository
2788    
2789    ----------------------------------------------------------------------
2790    Name: Lal George
2791    Date: 2000/07/27 11:53:14 EDT
2792    
2793    Tag: lal-20000727-linux-ppc
2794    Description:
2795    
2796     Made changes to support Linux PPC.
2797     p.s. I have confirmation that the 110.29 boot files work fine.
2798    
2799    ----------------------------------------------------------------------
2800    Name: Matthias Blume
2801    Date: 2000/07/27 17:40:00 JST
2802    Tag: blume-20000727-scripts
2803    Description:
2804    
2805    !!!! WARNING !!!!
2806    You must recompile the runtime system!
2807    !!!! WARNING !!!!
2808    
2809    This is basically another round of script-enhancements:
2810    
2811    1. sml, ml-build, and ml-makedepend accept options -D and -U to define
2812       and undefine CM preprocessor symbols.
2813    
2814    2. ml-build avoids generating a new heap image if it finds that the
2815       existing one is still ok.  (The condition is that no ML file had to
2816       be recompiled and all ML files are found to be older that the heap
2817       file.)
2818    
2819       To make this work smoothly, I also hacked the runtime system as
2820       well as SMLofNJ.SysInfo to get access to the heap image suffix
2821       (.sparc-solaris, ...) that is currently being used.
2822    
2823       Moreover, the signature of CM.mk_standalone has changed.  See the
2824       CM manual.
2825    
2826    3. ml-makedepend accepts additional options -n, -a, and -o.  (See the
2827       CM manual for details.)
2828    
2829    4. More CM manual updates:
2830        - all of the above has been documented.
2831        - there is now a section describing the (CM-related) command line
2832          arguments that are accepted by the "sml" command
2833    
2834    ----------------------------------------------------------------------
2835    Name: Matthias Blume
2836    Date: 2000/07/25 16:20:00 JST
2837    Tag: blume-20000725-makedepend
2838    Description:
2839    
2840    Added a script called ml-makedepend.  This can be used in makefiles
2841    for Unix' make in a way very similar to the "makedepend" command for
2842    C.
2843    
2844    The script internally uses function CM.sources.
2845    
2846    Synopsis:
2847    
2848        ml-makedepend [-f makefile] cmfile targetname
2849    
2850    The default for the makefile is "makefile" (or "Makefile" should
2851    "makefile" not exist).
2852    
2853    ml-makedepend adds a cmfile/targetname-specific section to this
2854    makefile (after removing the previous version of this section).  The
2855    section contains a single dependency specification with targetname on
2856    the LHS (targetname is an arbitrary name), and a list of files derived
2857    from the cmfile on the RHS.  Some of the files on the RHS are
2858    ARCH/OPSYS-specific.  Therefore, ml-makedepend inserts references to
2859    "make" variables $(ARCH) and $(OPSYS) in place of the corresponding
2860    path names.  The makefile writer is responsible for making sure that
2861    these variables have correct at the time "make" is invoked.
2862    
2863    ----------------------------------------------------------------------
2864    Name: Matthias Blume
2865    Date: 2000/07/22 23:30:00 JST
2866    Tag: blume-20000722-urlupdate
2867    Description:
2868    
2869    Changed BOOT and config/srcarchiveurl to point to BL server:
2870    
2871        ftp://ftp.research.bell-labs.com/dist/smlnj/working/110.29/
2872    
2873    ----------------------------------------------------------------------
2874    Name: Matthias Blume
2875    Date: 2000/07/18 18:00:00 JST
2876    Tag: blume-20000718-Version_110_29
2877    Description:
2878    
2879    1. Updated src/compiler/TopLevel/main/version.sml to version 110.29
2880    
2881    2. Updated config/version to 110.29
2882    
2883    3. Updated config/srcarchiveurl
2884    
2885    3. New boot files!
2886       ftp://ftp.cs.princeton.edu/pub/people/blume/sml/110.29-autofetch
2887    
2888    ----------------------------------------------------------------------
2889    Name: Matthias Blume
2890    Date: 2000/07/11 13:58:00 JST
2891    Tag: blume-20000711-doctypo
2892    Description:
2893    
2894    Fixed a few typos in CM manual.
2895    
2896    ----------------------------------------------------------------------
2897    Name: Allen Leung
2898    Date: 2000/06/15 00:38:00
2899    Tag: leunga-20000704-sparc-x86
2900    
2901    1. x86 peephole improvement sp += k; sp -= k => nop  [from John]
2902    2. fix to x86 RET bug [found by Dan Grossman]
2903    3. sparc assembly bug fix for ticc instructions [found by Fermin]
2904    
2905       Affects c-- and moby only
2906    
2907    ----------------------------------------------------------------------
2908    Name: Matthias Blume
2909    Date: 2000/07/04 15:26:00
2910    Tag: blume-20000704-trigger
2911    Description:
2912    
2913    1. Improvements to CM manual.
2914    2. SMLofNJ.Internals.BTrace.trigger reinstated as an alternative way
2915       of getting a back-trace.  The function, when called, raises an
2916       internal exception which explicitly carries the full back-trace history,
2917       so it is unaffected by any intervening handle-raise pairs ("trivial"
2918       or not).  The interactive loop will print that history once it arrives
2919       at top level.
2920       Short of having all exceptions implicitly carry the full history, the
2921       recommended way of using this facility is:
2922         - compile your program with instrumentation "on"
2923         - run it, when it raises an exception, look at the history
2924         - if the history is "cut off" because of some handler, go and modify
2925           your program so that it explicitly calls BTrace.trigger
2926         - recompile (still instrumented), and rerun; look at the full history
2927    
2928    ----------------------------------------------------------------------
2929    Name: Matthias Blume
2930    Date: 2000/07/03 15:36:00 JST
2931    Tag: blume-20000702-manual
2932    Description:
2933    
2934    Small corrections and updates to CM manual.
2935    
2936    ----------------------------------------------------------------------
2937    Name: Matthias Blume
2938    Date: 2000/06/29 16:04:00 JST
2939    Tag: blume-20000629-yacctool
2940    Description:
2941    
2942    Changes:
2943    
2944    1. Class "mlyacc" now takes separate arguments to pass options to
2945       generated .sml- and .sig-files independently.
2946    2. Corresponding CM manual updates.
2947    3. BTrace module now also reports call sites.  (However, for loop clusters
2948       it only shows from where the cluster was entered.)  There are associated
2949       modifications to core.sml, internals.{sig,sml}, btrace.sml, and btimp.sml.
2950    
2951    ----------------------------------------------------------------------
2952    Name: Matthias Blume
2953    Date: 2000/06/27 16:51:00 JST
2954    Tag: blume-20000627-noweb
2955    Description:
2956    
2957    Changes:
2958    
2959     1. Implemented "subdir" and "witness" options for noweb tool.
2960        This caused some slight internal changes in CM's tool implementation.
2961     2. Fixed bug in "tool plugin" mechanism.  This is essentially cleaning
2962        some remaining issues from earlier path anchor changes.
2963     3. Updated CM manual accordingly.
2964    
2965     4. Changed implementation of back-tracing so that I now consider it
2966        ready for prime-time.
2967    
2968        In particular, you don't have to explicitly trigger the back-trace
2969        anymore.  Instead, if you are running BTrace-instrumented code and
2970        there is an uncaught exception (regardless of whether or not it was
2971        raised in instrumented code), the top-level evalloop will print
2972        the back-trace.
2973    
2974        Features:
2975    
2976          - Instrumented and uninstrumented code work together seemlessly.
2977            (Of course, uninstrumented code is never mentioned in actual
2978             back-traces.)
2979    
2980          - Asymptotic time- and space-complexity of instrumented code is
2981            equal to that of uninstrumented code.  (This means that
2982            tail-recursion is preserved by the instrumentation phase.)
2983    
2984          - Modules whose code has been instrumented in different sessions
2985            work together without problem.
2986    
2987          - There is no penalty whatsoever on uninstrumented code.
2988    
2989          - There is no penalty on "raise" expressions, even in
2990            instrumented code.
2991    
2992        A potential bug (or perhaps it is a feature, too):
2993    
2994          A back-trace reaches no further than the outermost instrumented
2995          non-trivial "raise".  Here, a "trivial" raise is one that is the
2996          sole RHS of a "handle" rule.  Thus, back-traces reach trough
2997    
2998               <exp> handle e => raise e
2999    
3000          and even
3001    
3002               <exp> handle Foo => raise Bar
3003    
3004          and, of course, through
3005    
3006               <exp> handle Foo => ...
3007    
3008         if the exception was not Foo.
3009    
3010         Back-traces always reach right through any un-instrumented code
3011         including any of its "handle" expressions, trivial or not.
3012    
3013       To try this out, do the following:
3014    
3015         - Erase all existing binfiles for your program.
3016           (You may keep binfiles for those modules where you think you
3017            definitely don't need back-tracing.)
3018         - Turn on back-trace instrumentation:
3019              SMLofNJ.Internals.BTrace.mode (SOME true);
3020         - Recompile your program.  (I.e., run "CM.make" or "use".)
3021         - You may now turn instrumentation off again (if you want):
3022              SMLofNJ.Internals.BTrace.mode (SOME false);
3023         - Run your program as usual.  If it raises an exception that
3024           reaches the interactive toplevel, then a back-trace will
3025           automatically be printed.  After that, the toplevel loop
3026           will print the exception history as usual.
3027    
3028    ----------------------------------------------------------------------
3029    Name: Matthias Blume
3030    Date: 2000/06/26 09:56:46 JST
3031    Tag: blume-20000626-setup
3032    Description:
3033    
3034    CM: - setup-parameter to "sml" added; this can be used to run arbitrary
3035          ML code before and after compiling a file (e.g., to set compiler
3036          flags)
3037    
3038    Compiler: - improved btrace API (in core.sml, internals.{sig,sml})
3039              - associated changes to btrace.sml (BTrace instrumentation pass)
3040              - cleaner implementation of btimp.sml (BTrace tracing and report
3041                module)
3042    
3043    CM manual: * new path encoding documented
3044               * description of setup-parameter to "sml" added
3045    
3046    The biggest user-visible change to back-tracing is that it is no
3047    longer necessary to compile all traced modules within the same
3048    session.  (This was a real limitation.)
3049    
3050    ----------------------------------------------------------------------
3051    Name: Matthias Blume
3052    Date: 2000/06/24 12:40:00 JST
3053    Tag: blume-20000624-startup
3054    Description:
3055    
3056    Fixes startup slowdown problem.  (I was calling SrcPath.sync a _tad_
3057    bit too often -- to put it mildly. :)
3058    
3059    ----------------------------------------------------------------------
3060    Name: Matthias Blume
3061    Date: 2000/06/23 18:20:00 JST
3062    Tag: blume-20000623-btrace
3063    Description:
3064    
3065    This updates adds a backtrace facility to aid programmers in debugging
3066    their programs.  This involves the following changes:
3067    
3068    1. Module system/smlnj/init/core.sml (structure _Core) now has hooks for
3069       keeping track of the current call stack.  When programs are compiled
3070       in a special mode, the compiler will insert calls to these hooks
3071       into the user program.
3072       "Hook" means that it is possible for different implementations of
3073       back-tracing to register themselves (at different times).
3074    
3075    2. compiler/MiscUtil/profile/btrace.sml implements the annotation phase
3076       as an Absyn.dec->Absyn.dec rewrite.  Normally this phase is turned off.
3077       It can be turned on using this call:
3078         SMLofNJ.Internals.BTrace.mode (SOME true);
3079       Turning it off again:
3080         SMLofNJ.Internals.BTrace.mode (SOME false);
3081       Querying the current status:
3082         SMLofNJ.Internals.BTrace.mode NONE;
3083       Annotated programs are about twice as big as normal ones, and they
3084       run a factor of 2 to 4 slower with a dummy back-trace plugin (one
3085       where all hooks do nothing).  The slowdown with a plugin that is
3086       actually useful (such as the one supplied by default) is even greater,
3087       but in the case of the default plugin it is still only an constant
3088       factor (amortized).
3089    
3090    3. system/Basis/Implementation/NJ/internals.{sig,sml} have been augmented
3091       with a sub-structure BTrace for controlling back-tracing.  In particular,
3092       the above-mentioned function "mode" controls whether the annotation
3093       phase is invoked by the compiler.  Another important function is
3094       "trigger": when called it aborts the current execution and causes
3095       the top-level loop to print a full back-trace.
3096    
3097    4. compiler/MiscUtil/profile/btimp.sml is the current default plugin
3098       for back-tracing.  It keeps track of the dynamic call stack and in
3099       addition to that it keeps a partial history at each "level" of that
3100       stack.  For example, if a tail-calls b, b tail-calls c, and c tail-calls
3101       d and b (at separate times, dynamically), then the report will show:
3102    
3103       GOTO   d
3104             /c
3105       GOTO  \b
3106       CALL   a
3107    
3108       This shows that there was an initial non-tail call of a, then a
3109       tail-call to b or c, looping behavior in a cluster of functions that
3110       consist of b and c, and then a goto from that cluster (i.e., either from
3111       b or from c) to d.
3112    
3113       Note that (depending on the user program) the amount of information
3114       that the back-trace module has to keep track of at each level is bounded
3115       by a constant.  Thus, the whole implementation has the same asymptotical
3116       complexity as the original program (both in space and in time).
3117    
3118    5. compiler/TopLevel/interact/evalloop.sml has been modified to
3119       handle the special exception SMLofNJ.Internals.BTrace.BTrace
3120       which is raised by the "trigger" function mentioned above.
3121    
3122    Notes on usage:
3123    
3124    - Annotated code works well together with unannotated code:
3125    Unannotated calls simply do not show up at all in the backtrace.
3126    
3127    - It is not a good idea to let modules that were annotated during
3128    different sessions run at the same time.  This is because the compiler
3129    chooses small integers to identify individual functions, and there
3130    will be clashes if different modules were compiled in separate sessions.
3131    (Nothing will crash, and you will even be told about the clashes, but
3132    back-trace information will in general not be useful.)
3133    
3134    - Back-tracing can be confused by callcc and capture.
3135    
3136    - The only way of getting a back-trace right now is to explicitly
3137    invoke the "trigger" function from your user program.  Eventually, we
3138    should make every exception carry back-trace information (if
3139    available).  But since this creates more overhead at "raise"-time
3140    (similar to the current exnHistory overhead), I have not yet
3141    implemented this.  (The implementation will be rather easy.)  With
3142    exceptions carrying back-trace information, this facility will be even
3143    more useful because users don't need to modify their programs...
3144    
3145    - While it is possible to compile the compiler with back-trace
3146    annotations turned on (I did it to get some confidence in
3147    correctness), you must make absolutely sure that core.sml and
3148    btimp.sml are compiled WITHOUT annotation!  (core.sml cannot actually
3149    be compiled with annotation because there is no core access yet, but
3150    if you compile btimp.sml with annotation, then the system will go into
3151    an infinite recursion and crash.)
3152    Since CM currently does not know about BTrace, the only way to turn
3153    annotations on and off for different modules of the compiler is to
3154    interrupt CMB.make, change the settings, and re-invoke it.  Of course,
3155    this is awkward and clumsy.
3156    
3157    Sample sessions:
3158    
3159    Standard ML of New Jersey v110.28.1 [FLINT v1.5], June 5, 2000
3160    - SMLofNJ.Internals.BTrace.mode (SOME true);
3161    [autoloading]
3162    [autoloading done]
3163    val it = false : bool
3164    - structure X = struct
3165    -     fun main n = let
3166    -         fun a (x, 0) = d x
3167    -           | a (x, n) = b (x, n - 1)
3168    -         and b (x, n) = c (x, n)
3169    -         and c (x, n) = a (x, n)
3170    -         and d x = e (x, 3)
3171    -         and e (x, 0) = f x
3172    -           | e (x, n) = e (x, n - 1)
3173    -         and f 0 = SMLofNJ.Internals.BTrace.trigger ()
3174    -           | f n = n * g (n - 1)
3175    -         and g n = a (n, 3)
3176    -     in
3177    -         f n
3178    -     end
3179    - end;
3180    structure X : sig val main : int -> int end
3181    - X.main 3;
3182    *** BACK-TRACE ***
3183    GOTO   stdIn:4.2-13.20: X.main[2].f
3184    GOTO-( stdIn:4.2-13.20: X.main[2].e
3185    GOTO   stdIn:4.2-13.20: X.main[2].d
3186         / stdIn:4.2-13.20: X.main[2].a
3187         | stdIn:4.2-13.20: X.main[2].b
3188    GOTO-\ stdIn:4.2-13.20: X.main[2].c
3189    CALL   stdIn:4.2-13.20: X.main[2].g
3190    GOTO   stdIn:4.2-13.20: X.main[2].f
3191    GOTO-( stdIn:4.2-13.20: X.main[2].e
3192    GOTO   stdIn:4.2-13.20: X.main[2].d
3193         / stdIn:4.2-13.20: X.main[2].a
3194         | stdIn:4.2-13.20: X.main[2].b
3195    GOTO-\ stdIn:4.2-13.20: X.main[2].c
3196    CALL   stdIn:4.2-13.20: X.main[2].g
3197    GOTO   stdIn:4.2-13.20: X.main[2].f
3198    GOTO-( stdIn:4.2-13.20: X.main[2].e
3199    GOTO   stdIn:4.2-13.20: X.main[2].d
3200         / stdIn:4.2-13.20: X.main[2].a
3201         | stdIn:4.2-13.20: X.main[2].b
3202    GOTO-\ stdIn:4.2-13.20: X.main[2].c
3203    CALL   stdIn:4.2-13.20: X.main[2].g
3204    GOTO   stdIn:4.2-13.20: X.main[2].f
3205    CALL   stdIn:2.15-17.4: X.main[2]
3206    -
3207    
3208    (Note that because of a FLINt bug the above code currently does not
3209    compile without BTrace turned on.)
3210    
3211    Here is another example, using my modified Tiger compiler:
3212    
3213    Standard ML of New Jersey v110.28.1 [FLINT v1.5], June 5, 2000
3214    - SMLofNJ.Internals.BTrace.mode (SOME true);
3215    [autoloading]
3216    [autoloading done]
3217    val it = false : bool
3218    - CM.make "sources.cm";
3219    [autoloading]
3220    ...
3221    [autoloading done]
3222    [scanning sources.cm]
3223    [parsing (sources.cm):parse.sml]
3224    [creating directory CM/SKEL ...]
3225    [parsing (sources.cm):tiger.lex.sml]
3226    ...
3227    [wrote CM/sparc-unix/semant.sml]
3228    [compiling (sources.cm):main.sml]
3229    [wrote CM/sparc-unix/main.sml]
3230    [New bindings added.]
3231    val it = true : bool
3232    - Main.compile ("../testcases/merge.tig", "foo.out");
3233    *** BACK-TRACE ***
3234    CALL   lib/semant.sml:99.2-396.21: SemantFun[2].transExp.trvar
3235    CALL   lib/semant.sml:99.2-396.21: SemantFun[2].transExp.trexp
3236    CALL   lib/semant.sml:289.3-295.22: SemantFun[2].transExp.trexp.check[2]
3237    GOTO   lib/semant.sml:289.3-295.22: SemantFun[2].transExp.trexp.check[2]
3238    CALL   lib/semant.sml:99.2-396.21: SemantFun[2].transExp.trexp
3239    CALL   lib/semant.sml:99.2-396.21: SemantFun[2].transExp.trexp
3240    CALL   lib/semant.sml:488.3-505.6: SemantFun[2].transDec.trdec[2].transBody[2]
3241         / lib/semant.sml:411.65-543.8: SemantFun[2].transDec
3242    CALL-\ lib/semant.sml:413.2-540.9: SemantFun[2].transDec.trdec[2]
3243    CALL   lib/semant.sml:99.2-396.21: SemantFun[2].transExp.trexp
3244    CALL   lib/semant.sml:8.52-558.4: SemantFun[2].transProg[2]
3245    CALL   main.sml:1.18-118.4: Main.compile[2]
3246    -
3247    
3248    ----------------------------------------------------------------------
3249    Name: Matthias Blumen
3250    Date: 2000/06/21 18:00:00 JST
3251    Tag: blume-20000621-manual
3252    Description:
3253    
3254    CM manual update: Path environments documented.
3255    
3256    ----------------------------------------------------------------------
3257    Name: Matthias Blume
3258    Date: 2000/06/19 13:40:00
3259    Tag: blume-20000619-manual
3260    Description:
3261    
3262    CM manual and system/README update.  This only covers the fact that
3263    there are no more implicit anchors.  (Path environments and the "bind"
3264    option to "cm" have yet to be documented.)
3265    
3266    ----------------------------------------------------------------------
3267    Name: Matthias Blume
3268    Date: 2000/06/19 11:05:00 JST
3269    Tag: blume-20000619-chdir-bugfix
3270    Description:
3271    
3272    Fixed a bug in new SrcPath module that sometimes led to a bad chDir call.
3273    
3274    ----------------------------------------------------------------------
3275    Name: Matthias Blume
3276    Date: 2000/06/18 22:00:10 JST
3277    Tag: blume-20000618-implicit-anchors-really-gone
3278    Description:
3279    
3280    I updates the previous HISTORY entry where I forgot to mention that
3281    implicit anchors are no longer with us.
3282    
3283    The current update also gets rid of the (now useless) controller
3284    CM.Control.implicit_anchors.
3285    
3286    ----------------------------------------------------------------------
3287    Name: Matthias Blume
3288    Date: 2000/06/16 17:30:00 JST
3289    Tag: blume-20000616-anchorenv
3290    Description:
3291    
3292    This patch implements the long anticipated (just kidding :) "anchor
3293    environment" mechanism.  In the course of doing this, I also
3294    re-implemented CM's internal "SrcPath" module from scratch.  The new
3295    one should be more robust in certain boundary cases.  In any case, it
3296    is a lot cleaner than its predecessor (IMHO).
3297    
3298    This time, although there is yet another boot file format change, I
3299    kept the unpickler backward-compatible.  As a result, no new bootfiles
3300    are necessary and bootstrapping is straightforward.  (You cannot read
3301    new bootfiles into an old system, but the other way around is no
3302    problem.)
3303    
3304    Visible changes:
3305    
3306    ** 0. Implicit path anchors (without the leading $-symbol) are no
3307    longer recognized at all. This means that such path names are not
3308    illegal either.  For example, the name basis.cm simply refers to a
3309    local file called "basis.cm" (i.e, the name is an ordinary path
3310    relative to .cm-files directory).  Or, to put it differently, only
3311    names that start with $ are anchored paths.
3312    
3313    ** 1. The $<singlearc> abbreviation for $/<singlearc> has finally
3314    vanished.
3315    
3316    John (Reppy) had critizised this as soon as I originally proposed and
3317    implemented it, but at that time I did not really deeply believe
3318    him. :) Now I came full-circle because I need the $<singlearc> syntax
3319    in another place where it cannot be seen as an abbreviation for
3320    $/<singlearc>.  To avoid the confusion, $<singlearc> now means what it
3321    seems to mean (i.e., it "expands" into the corresponding anchor
3322    value).
3323    
3324    However, when paths are used as members in CM description files, it
3325    continues to be true that there must be at least another arc after the
3326    anchor.  This is now enforced separately during semantic analysis
3327    (i.e., from a lexical/syntactical point of view, the notation is ok.)
3328    
3329    ** 2. The "cm" class now accepts an option "bind".  The option's value
3330    is a sub-option list of precisely two items -- one labeled "anchor"
3331    and the other one labeled "value".  As you might expect, "anchor" is
3332    used to specify an anchor name to be bound, and "value" specifies what
3333    the anchor is being bound to.
3334    
3335    The value must be a directory name and can be given in either standard
3336    syntax (including the possibility that it is itself an anchored path)
3337    or native syntax.
3338    
3339    Examples:
3340    
3341       foo.cm (bind:(anchor:bar value:$mystuff/bar))
3342       lib.cm (bind:(anchor:a value:"H:\\x\\y\\z"))  (* only works under windows *)
3343    
3344    and so on.
3345    
3346    The meaning of this is that the .cm-file will be processed with an
3347    augmented anchor environment where the given anchor(s) is/are bound to
3348    the given values(s).
3349    
3350    The rationale for having this feature is this: Suppose you are trying
3351    to use two different (already stable) libraries a.cm and b.cm (that
3352    you perhaps didn't write yourself).  Further, suppose each of these
3353    two libraries internally uses its own auxiliary library $aux/lib.cm.
3354    Normally you would now have a problem because the anchor "lib" can not
3355    be bound to more than one value globally.  Therefore, the project that
3356    uses both a.cm and b.cm must locally redirect the anchor to some other
3357    place:
3358    
3359       a.cm (bind:(anchor:lib value:/usr/lib/smlnj/a-stuff))
3360       b.cm (bind:(anchor:lib value:/usr/lib/smlnj/b-stuff))
3361    
3362    This hard-wires $lib/aux.cm to /usr/lib/smlnj/a-stuff/aux.cm or
3363    /usr/lib/smlnj/b-stuff/aux.cm, respectively.
3364    
3365    Hard-wiring path names is a bit inflexible (and CM will verbosely warn
3366    you when you do so at the time of CM.stabilize).  Therefore, you can
3367    also use an anchored path as the value:
3368    
3369      a.cm (bind:(anchor:lib value:$a-lib))
3370      b.cm (bind:(anchor:lib value:$b-lib))
3371    
3372    Now you can globally configure (using the usual CM.Anchor.anchor or
3373    pathconfig machinery) bindings for "a-lib" and "b-lib".  Since "lib"
3374    itself is always locally bound, setting it globally is no longer
3375    meaningful or necessary (but it does not hurt either).  In fact, "lib"
3376    can still be used as a global anchor for separate purposes.  As a
3377    matter of fact, one can locally define "lib" in terms of a global
3378    "lib":
3379    
3380      a.cm (bind:(anchor:lib value:$lib/a))
3381      b.cm (bind:(anchor:lib value:$lib/b))
3382    
3383    ** 3: The encoding of path names has changed.  This affects the way
3384    path names are shown in CM's progress report and also the internal
3385    protocol encoding used for parallel make.
3386    
3387    The encoding now uses one or more ':'-separated segments.  Each
3388    segments corresponds to a file that has been specified relative to the
3389    file given by its preceding segment.  The first segment is either
3390    relative to the CWD, absolute, or anchored.  Each segment itself is
3391    basically a Unix pathname; all segments but the first are relative.
3392    
3393    Example:
3394    
3395       $foo/bar/baz.cm:a/b/c.sml
3396    
3397    This path denotes the file bar/a/b/c.sml relative to the directory
3398    denoted by anchor "foo".  Notice that the encoding also includes
3399    baz.cm which is the .cm-file that listed a/b/c.sml.  As usual, such
3400    paths are resolved relative to the .cm-files directory, so baz.cm must
3401    be ignored to get the "real" pathname.
3402    
3403    To make this fact more obvious, CM puts the names of such "virtual
3404    arcs" into parentheses when they appear in progress reports. (No
3405    parentheses will appear in the internal protocol encoding.)  Thus,
3406    what you really see is:
3407    
3408      $foo/bar/(baz.cm):a/b/c.sml
3409    
3410    I find this notation to be much more informative than before.
3411    
3412    Another new feature of the encoding is that special characters
3413    including parentheses, colons, (back)slashes, and white space are
3414    written as \ddd (where ddd is the decimal encoding of the character).
3415    
3416    *** The CM manual still needs to be updated.
3417    
3418    ----------------------------------------------------------------------
3419    Name: Allen Leung
3420    Date: 2000/06/15 00:38:00
3421    Tag: leunga-20000615-x86-peephole
3422    
3423    x86 Peephole fix by Fermin.  Affects c-- and moby only.
3424    
3425    ----------------------------------------------------------------------
3426    Name: Matthias Blume
3427    Date: 2000/06/12 11:40:00
3428    Tag: blume-20000612-parmakefix
3429    Description:
3430    
3431    More cleanup after changing the file naming scheme: This time I
3432    repaired the parallel make mechanism for CMB.make which I broke earlier.
3433    
3434    ----------------------------------------------------------------------
3435    Name: Allen Leung
3436    Date: 2000/06/09 01:25:00
3437    Tag: leunga-20000609-various
3438    
3439    None of these things should affect normal SML/NJ operations
3440    
3441    1. Peephole improvements provided by Fermin (c--)
3442    2. New annotation DEFUSE for adding extra dependence (moby)
3443    3. New X86 LOCK instructions (moby)
3444    4. New machine description language for reservation tables (scheduling)
3445    5. Fixes to various optimization/analysis modules (branch chaining, dominator
3446       trees etc.)
3447    6. I've changed the CM files so that they can work with versions
3448       110.0.6, 110.25 and 110.28
3449    
3450    ----------------------------------------------------------------------
3451    Name: Matthias Blume
3452    Date: 2000/06/09 12:40:00
3453    Tag: blume-20000609-log
3454    Description:
3455    
3456    - Removed all(?) remaining RCS Log entries from sources.
3457    
3458    - Fixed bug in ml-yacc and ml-lex sources (use explicit anchors for
3459      anchored paths).
3460    
3461    ----------------------------------------------------------------------
3462    Name: Matthias Blume
3463    Date: 2000/06/07 17:00:00 JST
3464    Tag: blume-20000607-no-implicit-anchors
3465    Description:
3466    
3467    1. This update changes the default setting for
3468    CM.Control.implicit_anchors from true to false.  This means that
3469    implicit anchors are no longer permitted by default.  I also tried to
3470    make sure that nothing else still relies on implicit anchors.
3471    (This is the next step on the schedule towards a CM that does not even
3472    have the notion of implicit anchors anymore.)
3473    
3474    2. More CM manual updates.
3475    
3476    3. I managed to track down and fix the pickling bug I mentioned last
3477    time.  Because of the previously existing workaround, this entails no
3478    immediate practical changes.
3479    
3480    ----------------------------------------------------------------------
3481    Name: Matthias Blume
3482    Date: 2000/06/06 11:15:00 JST
3483    Tag: blume-20000606-lazierpickle
3484    Description:
3485    
3486    !!!! NEW BOOT FILES !!!!
3487    
3488    * The main purpose of this update is to make library pickles lazier in
3489    order to reduce the initial space penalty for autoloading a library.
3490    As a result, it is now possible to have $smlnj/compiler.cm
3491    pre-registered.  This should take care of the many complaints or
3492    inquiries about missing structure Compiler.  This required changes to
3493    CM's internal data structures and small tweaks to some algorithms.
3494    
3495    As a neat additional effect, it is no longer necessary (for the sake
3496    of lean heap image files) to distinguish between a "minimal" CM and a
3497    "full" CM.  Now, there is only one CM (i.e., the "full" version:
3498    $smlnj/cm.cm aka $smlnj/cm/full.cm), and it is always available at the
3499    interactive top level. ($smlnj/cm/minimal.cm is gone.)
3500    
3501    To make the life of compiler-hackers easier, "makeml" now also
3502    pre-registers $smlnj/cmb.cm (aka $smlnj/cmb/current.cm).  In other
3503    words, after you bootstrap a new sml for the first time, you will not
3504    have to autoload $smlnj/cmb.cm again afterwards.  (The first time
3505    around you will still have to do it, though.)
3506    
3507    * A second change consists of major updates to the CM manual.  There
3508    are now several appendices with summary information and also a full
3509    specification of the CM description file syntax.
3510    
3511    * In directory src/system I added the script "allcross".  This script
3512    invokes sml and cross-compiles the compiler for all supported
3513    architectures.  (Useful when providing a new set of boot files.)
3514    
3515    * There seems to be a latent bug in my "lazy pickles" mechanism.  I
3516    added a small tweak to pickle-util.sml to work around this problem,
3517    but it is not a proper fix yet.  I will investigate further.  (The
3518    effect of the bug was an inflation of library pickle size.)
3519    
3520    * Version number increased to 110.28.1 (to avoid compatibility problems).
3521    
3522    ----------------------------------------------------------------------
3523    Name: Allen Leung
3524    Date: 2000/05/25 17:28 EDT
3525    Tag: leunga-20000525-ra
3526    Description:
3527    
3528      Fixed a bug in freezing phase of the register allocator.
3529    
3530    ----------------------------------------------------------------------
3531    Name: Allen Leung
3532    Date: 2000/05/15 22:53 EDT
3533    Tag: leunga-20000515-alpha-x86-ra
3534    Description:
3535    
3536      1. Alpha
3537    
3538          Slight cleanup.  Removed the instruction SGNXL
3539    
3540      2. X86
3541    
3542          Added the following instructions to the instruction set:
3543    
3544            ROLx, RORx,
3545            BTx, BTSx, BTLx, BTRx,
3546            XCHGx, and variants with the LOCK prefix
3547    
3548      3. Register Allocation
3549    
3550          The module ra-rewrite-with-renaming has been improved.
3551    
3552      These have no effect on SML/NJ.
3553    
3554    ----------------------------------------------------------------------
3555    Name: Matthias Blume
3556    Date: 2000/05/15 16:20:00 JST
3557    Tag: blume-20000515-lightrebuild
3558    Description:
3559    
3560    1. I added an alternative to "-rebuild" to "makeml".  The difference is
3561       that prior to calling CMB.make' the CM-variable "LIGHT" will be
3562       defined.  In effect, the command will not build any cross-compiler
3563       backends and therefore finish more quickly.
3564    
3565       The "fixpt" script also takes a "-light" switch to be able to use
3566       this new facility while compiling for a fixpoint.
3567    
3568    2. I replaced all mentions of anchored paths in group owner specifications
3569       with simple relative paths (usually starting with "..").
3570       The rationale is that a library's internal workings should not be
3571       compromised by the lack of some anchor.  (An anchor is necessary
3572       for someone who wants to refer to the library by an anchored path,
3573       but it should not be necessary to build the same library in the first
3574       place.)
3575    
3576    3. I changed the way CM's tool mechanism determines the shell command
3577       string used for things like ml-yacc etc. so that it does not break
3578       when CM.Control.implicit_anchors is turned off.
3579    
3580    ----------------------------------------------------------------------
3581    Name: Matthias Blume
3582    Date: 2000/05/12 18:20:00 JST
3583    Tag: blume-20000512-ml-build
3584    Description:
3585    
3586    Fixed a bug in config/_ml-build that prevented ml-yacc and ml-lex from
3587    getting installed properly (by config/install.sh).
3588    
3589    ----------------------------------------------------------------------
3590    Name: Matthias Blume
3591    Date: 2000/05/12 17:30:00 JST
3592    Tag: blume-20000512-anchors
3593    Description:
3594    
3595    !!! NEW BOOT FILES !!!
3596    
3597    This change is in preparation of fading out support for "implicitly
3598    anchored path names".  I went through all sources and used the
3599    explicit (and relatively new) $-notation.  See system/README and the
3600    CM manual for more info on this.
3601    
3602    I also modified the anchoring scheme for some things such as "smlnj",
3603    "MLRISC", "cm", etc. to take advantage of the fact that explicit
3604    anchors are more expressive: anchor name and first arc do not have to
3605    coincide.  This entails the following user-visible change:
3606    
3607    You have to write $smlnj/foo/bar instead of smlnj/foo/bar.  In
3608    particular, when you fire up sml with a command-line argument, say,
3609    e.g.:
3610    
3611       sml '$smlnj/cmb.cm'
3612    
3613    At the ML toplevel prompt:
3614    
3615       CM.autoload "$smlnj/cmb.cm";