Home My Page Projects Code Snippets Project Openings SML/NJ
Summary Activity Forums Tracker Lists Tasks Docs Surveys News SCM Files

SCM Repository

[smlnj] Diff of /sml/trunk/HISTORY
ViewVC logotype

Diff of /sml/trunk/HISTORY

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 580, Wed Mar 22 06:33:52 2000 UTC revision 1204, Mon May 20 14:58:33 2002 UTC
# Line 8  Line 8 
8  The form of an entry should be:  The form of an entry should be:
9    
10  Name:  Name:
11  Date:  Date: yyyy/mm/dd
12  Tag: <post-commit CVS tag>  Tag: <post-commit CVS tag>
13  Description:  Description:
14    
15    ----------------------------------------------------------------------
16    Name: Matthias Blume
17    Date: 2002/05/20 10:59:00 EDT
18    Tag: blume-20020520-preloads
19    Description:
20    
21    Added a lot of commented-out suggestions for things to be included
22    in config/preloads.
23    
24    ----------------------------------------------------------------------
25    Name: Allen Leung
26    Date: 2002/05/18 14:20:00 EDT
27    Tag: leunga-20020518-mdl
28    Description:
29    
30    o Made the mdl tool stuff compile and run again.
31    o I've disabled all the stuff that depends on RTL specifications; they
32      are all badly broken anyway.
33    
34    ----------------------------------------------------------------------
35    Name: Matthias Blume
36    Date: 2002/05/17 16:49:00 EDT
37    Tag: blume-20020517-controls
38    Description:
39    
40    0. John Reppy made several modifications to the SML/NJ library.
41       In particular, there is a shiny new controls-lib.cm.
42    
43    1. Pushed new controls interface through compiler so that everything
44       compiles again.
45    
46    2. Added FormatComb and FORMAT_COMB to the CML version of the
47       SML/NJ library (so that CML compiles again).
48    
49    3. Modified init scripts because XXX_DEFAULT environment variables
50       are no longer with us.  (Boot-time initialization is now done
51       using the same environment variables that are also used for
52       startup-time initialization of controls.)
53    
54    ----------------------------------------------------------------------
55    Name: Lal George
56    Date: 2002/05/15 09:20:10 EDT
57    Tag: george-20020515-pseudo-op-decls
58    Description:
59    
60            All pseudo-ops emitted before the first segment declaration
61    such as TEXT, DATA, and BSS directives are assumed to be global
62    declarations and are emitted first in the assembly file. This is
63    useful in a number of situations where one has pseudo-ops that are not
64    specific to any segment, and also works around the constraint that one
65    cannot have client pseudo-ops in the TEXT segment.
66    
67            Because no segment is associated with these declarations it is
68    an error to allocate any space or objects before the first segment
69    directive and an exception will be raised. However, we cannot make
70    this check for client pseudo-ops.
71    
72            These top level declarations are a field in the CFG graph_info.
73    In theory you can continue to add to this field after the CFG has been
74    built -- provided you know what you are doing;-)
75    
76    ----------------------------------------------------------------------
77    Name: Matthias Blume
78    Date: 2002/05/13 16:40:00 EDT
79    Tag: blume-20020513-pp-etc
80    Description:
81    
82    A few minor bugfixes:
83    
84      - Stopgap measure for bug recently reported by Elsa Gunter (ppDec).
85        (Bogus printouts for redefined bindings still occur.  Compiler
86        bug should no longer occur now.  We need to redo the prettyprinter
87        from scratch.)
88    
89      - CM pathname printer now also adds escape sequences for ( and )
90    
91      - commend and docu fixes for ml-nlffi
92    
93    ----------------------------------------------------------------------
94    Name: Matthias Blume
95    Date: 2002/05/10 16:40:00 EDT
96    Tag: blume-20020510-erg-textio
97    Description:
98    
99    Applied the following bugfix provided by Emden Gansner:
100    
101        Output is corrupted when outputSubstr is used rather than output.
102    
103        The problem occurs when a substring
104    
105            ss = (s, dataStart, dataLen)
106    
107        where dataStart > 0, fills a stream buffer with avail bytes left.
108        avail bytes of s, starting at index dataStart, are copied into the
109        buffer, the buffer is flushed, and then the remaining dataLen-avail
110        bytes of ss are copied into the beginning of the buffer. Instead of
111        starting this copy at index dataStart+avail in s, the current code
112        starts the copy at index avail.
113    
114        Fix:
115        In text-io-fn.sml, change line 695 from
116             val needsFlush = copyVec(v, avail, dataLen-avail, buf, 0)
117        to
118             val needsFlush = copyVec(v, dataStart+avail, dataLen-avail, buf, 0)
119    
120    ----------------------------------------------------------------------
121    Name: Matthias Blume
122    Date: 2002/04/12 13:55:00 EDT
123    Tag: blume-20020412-assyntax
124    Description:
125    
126    1. Grabbed newer assyntax.h from the XFree86 project.
127    2. Fiddled with how to compile X86.prim.asm without warnings.
128    3. (Very) Minor cleanup in CM.
129    
130    ----------------------------------------------------------------------
131    Name: Matthias Blume
132    Date: 2002/04/01 (no joke!) 17:07:00 EST
133    Tag: blume-20020401-x86div
134    Description:
135    
136    Added full support for div/mod/rem/quot on the x86, using the machine
137    instruction's two results (without clumsily recomputing the remainder)
138    directly where appropriate.
139    
140    Some more extensive power-of-two support was added to the x86 instruction
141    selector (avoiding expensive divs, mods, and muls where they can be
142    replaced with cheaper shifts and masks).  However, this sort of thing
143    ought to be done earlier, e.g., within the CPS optimizer so that
144    all architectures benefit from it.
145    
146    The compiler compiles to a fixed point, but changes might be somewhat
147    fragile nevertheless.  Please, report any strange things that you might
148    see wrt. div/mod/quot/rem...
149    
150    ----------------------------------------------------------------------
151    Name: Matthias Blume
152    Date: 2002/03/29 17:22:00
153    Tag: blume-20020329-div
154    Description:
155    
156    Fixed my broken div/mod logic.  Unfortunately, this means that the
157    inline code for div/mod now has one more comparison than before.
158    Fast paths (quotient > 0 or remainder = 0) are not affected, though.
159    The problem was with quotient = 0, because that alone does not tell
160    us which way the rounding went.  One then has to look at whether
161    remainder and divisor have the same sign...  :(
162    
163    Anyway, I replaced the bootfiles with fresh ones...
164    
165    ----------------------------------------------------------------------
166    Name: Matthias Blume
167    Date: 2002/03/29 14:10:00 EST
168    Tag: blume-20020329-inlprims
169    Description:
170    
171    NEW BOOTFILES!!!    Version number bumped to 110.39.3.
172    
173    Primops have changed. This means that the bin/boot-file formats have
174    changed as well.
175    
176    To make sure that there is no confusion, I made a new version.
177    
178    
179    CHANGES:
180    
181    * removed REMT from mltree (remainder should never overflow).
182    
183    * added primops to deal with divisions of all flavors to the frontend
184    
185    * handled these primops all the way through so they map to their respective
186      MLRISC support
187    
188    * used these primops in the implementation of Int, Int32, Word, Word32
189    
190    * removed INLDIV, INLMOD, and INLREM as they are no longer necessary
191    
192    * parameterized INLMIN, INLMAX, and INLABS by a numkind
193    
194    * translate.sml now deals with all flavors of INL{MIN,MAX,ABS}, including
195      floating point
196    
197    * used INL{MIN,MAX,ABS} in the implementation of Int, Int32, Word, Word32,
198      and Real (but Real.abs maps to a separate floating-point-only primop)
199    
200    
201    TODO items:
202    
203    * Hacked Alpha32 instruction selection, disabling the selection of REMx
204      instructions because the machine instruction encoder cannot handle
205      them.  (Hppa, PPC, and Sparc instruction selection did not handle
206      REM in the first place, and REM is supported by the x86 machine coder.)
207    
208    * Handle DIV and MOD with DIV_TO_NEGINF directly in the x86 instruction
209      selection phase.  (The two can be streamlined because the hardware
210      delivers both quotient and remainder at the same time anyway.)
211    
212    * Think about what to do with "valOf(Int32.minInt) div ~1" and friends.
213      (Currently the behavior is inconsistent both across architectures and
214      wrt. the draft Basis spec.)
215    
216    * Word8 should eventually be handled natively, too.
217    
218    * There seems to be one serious bug in mltree-gen.sml.  It appears, though,
219      as if there currently is no execution path that could trigger it in
220      SML/NJ.  (The assumptions underlying functions arith and promotable do not
221      hold for things like multiplication and division.)
222    
223    ----------------------------------------------------------------------
224    Name: Matthias Blume
225    Date: 2002/03/27 16:27:00 EST
226    Tag: blume-20020327-mlrisc-divisions
227    Description:
228    
229    Added support for all four division operations (ML's div, mod, quot,
230    and rem) to MLRISC.  In the course of doing so, I also rationalized
231    the naming (no more annoying switch-around of DIV and QUOT), by
232    parameterizing the operation by div_rounding_mode (which can be either
233    DIV_TO_ZERO or DIV_TO_NEGINF).
234    
235    The generic MLTreeGen functor takes care of compiling all four
236    operations down to only round-to-zero div.
237    
238    Missing pieces:
239    
240      * Doing something smarter than relying on MLTreeGen on architectures
241        like, e.g., the x86 where hardware division delivers both quotient and
242        remainder at the same time.  With this, the implementation of the
243        round-to-neginf operations could be further streamlined.
244    
245      * Remove inlining support for div/mod/rem from the frontend and replace it
246        with primops that get carried through to the backend.  Do this for all
247        int and word types.
248    
249    ----------------------------------------------------------------------
250    Name: Matthias Blume
251    Date: 2002/03/25 17:25:00 EST
252    Tag: blume-20020325-divmod
253    Description:
254    
255    I improved (hopefully without breaking them) the implementation of Int.div,
256    Int.mod, and Int.rem.   For this, the code in translate.sml now takes
257    advantage of the following observations:
258    
259      Let  q = x quot y      r = x rem y
260           d = x div  y      m = x mod y
261    
262    where "quot" is the round-to-zero version of integer division that
263    hardware usually provides.  Then we have:
264    
265         r = x - q * y        where neither the * nor the - will overflow
266         d = if q >= 0 orelse x = q * y then q else q - 1
267                              where neither the * nor the - will overflow
268         m = if q >= 0 orelse r = 0 then r else r + y
269                              where the + will not overflow
270    
271    This results in substantial simplification of the generated code.
272    The following table shows the number of CFG nodes and edges generated
273    for
274            fun f (x, y) = x OPER y
275            (* with OPER \in div, mod, quot, rem *)
276    
277    
278        OPER | nodes(old) | edges(old) | nodes(new) | edges(new)
279        --------------------------------------------------------
280         div |         24 |         39 |         12 |         16
281         mod |         41 |         71 |         12 |         16
282        quot |          8 |         10 |          8 |         10
283         rem |         10 |         14 |          8 |         10
284    
285    
286    ----------------------------------------------------------------------
287    Name: Matthias Blume
288    Date: 2002/03/25 22:06:00 EST
289    Tag: blume-20020325-cprotobug
290    Description:
291    
292    Fixed a bug in cproto (c prototype decoder).
293    
294    ----------------------------------------------------------------------
295    Name: Matthias Blume
296    Date: 2002/03/25 16:00:00 EST
297    Tag: blume-20020325-raw-primops
298    Description:
299    
300    I did some cleanup to Allen's new primop code and
301    replaced yesterday's bootfiles with new ones.
302    (But they are stored in the same place.)
303    
304    ----------------------------------------------------------------------
305    Name: Matthias Blume
306    Date: 2002/03/24 22:40:00 EST
307    Tag: blume-20020324-bootfiles
308    Description:
309    
310    Made the bootfiles that Allen asked for.
311    
312    ----------------------------------------------------------------------
313    Name: Allen Leung
314    Date: 2002/03/23 15:50:00 EST
315    Tag: leunga-20020323-flint-cps-rcc-primops
316    Description:
317    
318      1. Changes to FLINT primops:
319    
320        (* make a call to a C-function;
321         * The primop carries C function prototype information and specifies
322         * which of its (ML-) arguments are floating point. C prototype
323         * information is for use by the backend, ML information is for
324         * use by the CPS converter. *)
325      | RAW_CCALL of { c_proto: CTypes.c_proto,
326                       ml_args: ccall_type list,
327                       ml_res_opt: ccall_type option,
328                       reentrant : bool
329                     } option
330       (* Allocate uninitialized storage on the heap.
331        * The record is meant to hold short-lived C objects, i.e., they
332        * are not ML pointers.  With the tag, the representation is
333        * the same as RECORD with tag tag_raw32 (sz=4), or tag_fblock (sz=8)
334        *)
335      | RAW_RECORD of {tag:bool,sz:int}
336      and ccall_type = CCALL_INT32 | CCALL_REAL64 | CCALL_ML_PTR
337    
338      2.  These CPS primops are now overloaded:
339    
340           rawload of {kind:numkind}
341           rawstore of {kind:numkind}
342    
343          The one argument form is:
344    
345             rawload {kind} address
346    
347          The two argument form is:
348    
349             rawload {kind} [ml object, byte-offset]
350    
351      3. RAW_CCALL/RCC now takes two extra arguments:
352    
353         a. The first is whether the C call is reentrant, i.e., whether
354            ML state should be saved and restored.
355         b. The second argument is a string argument specifying the name of
356            library and the C function.
357    
358         These things are currently not handled in the code generator, yet.
359    
360      4. In CProto,
361    
362         An encoding type of "bool" means "ml object" and is mapped into
363         C prototype of PTR.  Note that "bool" is different than "string",
364         even though "string" is also mapped into PTR, because "bool"
365         is assigned an CPS type of BOGt, while "string" is assigned INT32t.
366    
367      5. Pickler/unpicker
368    
369         Changed to handle RAW_RECORD and newest RAW_CCALL
370    
371      6. MLRiscGen,
372    
373         1. Changed to handle the new rawload/rawstore/rawrecord operators.
374         2. Code for handling C Calls has been moved to a new module CPSCCalls,
375            in the file CodeGen/cpscompile/cps-c-calls.sml
376    
377      7. Added the conditional move operator
378    
379             condmove of branch
380    
381         to cps.  Generation of this is still buggy so it is currently
382         disabled.
383    
384    ----------------------------------------------------------------------
385    Name: Lal George
386    Date: 2002/03/22 14:18:25 EST
387    Tag: george-20020322-cps-branch-prob
388    Description:
389    
390    Implemented the Ball-Larus branch prediction-heuristics, and
391    incorporated graphical viewers for control flow graphs.
392    
393    Ball-Larus Heuristics:
394    ---------------------
395    See the file compiler/CodeGen/cpscompile/cpsBranchProb.sml.
396    
397    By design it uses the Dempster-Shafer theory for combining
398    probabilities.  For example, in the function:
399    
400        fun f(n,acc) = if n = 0 then acc else f(n-1, n*acc)
401    
402    the ball-larus heuristics predicts that the n=0 is unlikely
403    (OH-heuristic), and the 'then' branch is unlikely because of the
404    RH-heuristic -- giving the 'then' branch an even lower combined
405    probability using the Dempster-Shafer theory.
406    
407    Finally, John Reppy's loop analysis in MLRISC, further lowers the
408    probability of the 'then' branch because of the loop in the else
409    branch.
410    
411    
412    Graphical Viewing:
413    ------------------
414    I merely plugged in Allen's graphical viewers into the compiler. The
415    additional code is not much. At the top level, saying:
416    
417            Control.MLRISC.getFlag "cfg-graphical-view" := true;
418    
419    will display the graphical view of the control flow graph just before
420    back-patching.  daVinci must be in your path for this to work. If
421    daVinci is not available, then the default viewer can be changed
422    using:
423    
424            Control.MLRISC.getString "viewer"
425    
426    which can be set to "dot" or "vcg" for the corresponding viewers. Of
427    course, these viewers must be in your path.
428    
429    The above will display the compilation unit at the level of clusters,
430    many of which are small, boring, and un-interesting. Also setting:
431    
432            Control.MLRISC.getInt "cfg-graphical-view_size"
433    
434    will display clusters that are larger than the value set by the above.
435    
436    
437    ----------------------------------------------------------------------
438    Name: Matthias Blume
439    Date: 2002/03/21 22:20:00 EST
440    Tag: blume-20020321-kmp-bugfix
441    Description:
442    
443    Changed the interface to the KMP routine in PreString and fixed
444    a minor bug in one place where it was used.
445    
446    ----------------------------------------------------------------------
447    Name: Allen Leung
448    Date: 2002/03/21 20:30:00 EST
449    Tag: leunga-20020321-cfg
450    Description:
451    
452      Fixed a potential problem in cfg edge splitting.
453    
454    ----------------------------------------------------------------------
455    Name: Allen Leung
456    Date: 2002/03/21 17:15:00 EST
457    Tag: leunga-20020321-x86-fp-cfg
458    Description:
459    
460      1. Recoded the buggy parts of x86-fp.
461    
462         a. All the block reordering code has been removed.
463            We now depend on the block placement phases to do this work.
464    
465         b. Critical edge splitting code has been simplified and moved into the
466            CFG modules, as where they belong.
467    
468         Both of these were quite buggy and complex.  The code is now much, much
469         simpler.
470    
471      2. X86 backend.
472    
473         a. Added instructions for 64-bit support.  Instruction selection for
474            64-bit has not been committed, however, since that
475            requires changes to MLTREE which haven't been approved by
476            Lal and John.
477    
478         b. Added support for FUCOMI and FUCOMIP when generating code for
479            PentiumPro and above.  We only generate these instructions in
480            the fast-fp mode.
481    
482         c. Added cases for JP and JNP in X86FreqProps.
483    
484      3. CFG
485    
486         CFG now has a bunch of methods for edge splitting and merging.
487    
488      4. Machine description.
489    
490         John's simplification of MLTREE_BASIS.fcond broke a few machine
491         description things:
492    
493         rtl-build.{sig,sml} and hppa.mdl fixed.
494    
495         NOTE: the machine description stuff in the repository is still broken.
496               Again, I can't put my fixes in because that involves
497               changes to MLTREE.
498    
499    ----------------------------------------------------------------------
500    Name: Matthias Blume
501    Date: 2002/03/20 15:55:00 EST
502    Tag: blume-20020320-kmp
503    Description:
504    
505    Implemented Knuth-Morris-Pratt string matching in PreString and used
506    it for String.isSubstring, Substring.isSubstring, and
507    Substring.position.
508    
509    (Might need some stress-testing.  Simple examples worked fine.)
510    
511    ----------------------------------------------------------------------
512    Name: Matthias Blume
513    Date: 2002/03/19 16:37:00 EST
514    Tag: blume-20020319-witnesses
515    Description:
516    
517    Added a structure C.W and functions convert/Ptr.convert to ml-nlffi-lib.
518    
519    This implements a generic mechanism for changing constness qualifiers
520    anywhere within big C types without resorting to outright "casts".
521    (So far, functions such as C.rw/C.ro or C.Ptr.rw/C.Ptr.ro only let you
522    modify the constness at the outermost level.)
523    The implementation of "convert" is based on the idea of "witness"
524    values -- values that are not used by the operation but whose types
525    "testify" to their applicability.  On the implementation side, "convert"
526    is simply a projection (returning its second curried argument).  With
527    cross-module inlining, it should not result in any machine code being
528    generated.
529    
530    ----------------------------------------------------------------------
531    Name: Matthias Blume
532    Date: 2002/03/15 16:40:00 EST
533    Tag: blume-20020315-basis
534    Description:
535    
536    Provided (preliminary?) implementations for
537    
538      {String,Substring}.{concatWith,isSuffix,isSubstring}
539    
540    and
541    
542      Substring.full
543    
544    Those are in the Basis spec but they were missing in SML/NJ.
545    
546    ----------------------------------------------------------------------
547    Name: Matthias Blume
548    Date: 2002/03/14 21:30:00 EST
549    Tag: blume-20020314-controls
550    Description:
551    
552    Controls:
553    ---------
554    
555    1. Factored out the recently-added Controls : CONTROLS stuff and put
556       it into its own library $/controls-lib.cm.  The source tree for
557       this is under src/smlnj-lib/Controls.
558    
559    2. Changed the names of types and functions in this interface, so they
560       make a bit more "sense":
561    
562          module -> registry
563          'a registry -> 'a group
564    
565    3. The interface now deals in ref cells only.  The getter/setter interface
566       is (mostly) gone.
567    
568    4. Added a function that lets one register an already-existing ref cell.
569    
570    5. Made the corresponding modifications to the rest of the code so that
571       everything compiles again.
572    
573    6. Changed the implementation of Controls.MLRISC back to something closer
574       to the original.  In particular, this module (and therefore MLRISC)
575       does not depend on Controls.  There now is some link-time code in
576       int-sys.sml that registers the MLRISC controls with the Controls
577       module.
578    
579    CM:
580    ---
581    
582      * One can now specify the lambda-split aggressiveness in init.cmi.
583    
584    ----------------------------------------------------------------------
585    Name: Allen Leung
586    Date: 2002/03/13 17:30:00 EST
587    Tag: leunga-20020313-x86-fp-unary
588    Description:
589    
590    Bug fix for:
591    
592    > leunga@weaselbane:~/Yale/tmp/sml-dist{21} bin/sml
593    > Standard ML of New Jersey v110.39.1 [FLINT v1.5], March 08, 2002
594    > - fun f(x,(y,z)) = Real.~ y;
595    > [autoloading]
596    > [autoloading done]
597    >       fchsl   (%eax), 184(%esp)
598    > Error: MLRisc bug: X86MCEmitter.emitInstr
599    >
600    > uncaught exception Error
601    >   raised at: ../MLRISC/control/mlriscErrormsg.sml:16.14-16.19
602    
603    The problem was that the code generator did not generate any fp registers
604    in this case, and the ra didn't know that it needed to run the X86FP phase to
605    translate the pseudo fp instruction.   This only happened with unary fp
606    operators in certain situations.
607    
608    ----------------------------------------------------------------------
609    Name: Matthias Blume
610    Date: 2002/03/13 14:00:00 EST
611    Tag: blume-20020313-overload-etc
612    Description:
613    
614    1. Added _overload as a synonym for overload for backward compatibility.
615       (Control.overloadKW must be true for either version to be accepted.)
616    
617    2. Fixed bug in install script that caused more things to be installed
618       than what was requested in config/targets.
619    
620    3. Made CM aware of the (_)overload construct so that autoloading
621       works.
622    
623    ----------------------------------------------------------------------
624    Name: Matthias Blume
625    Date: 2002/03/12 22:03:00 EST
626    Tag: blume-20020312-url
627    Description:
628    
629    Forgot to update BOOT and srcarchiveurl.
630    
631    ----------------------------------------------------------------------
632    Name: Matthias Blume
633    Date: 2002/03/12 17:30:00 EST
634    Tag: blume-20020312-version110392
635    Description:
636    
637    Yet another version number bump (because of small changes to the
638    binfile format).  Version number is now 110.39.2.  NEW BOOTFILES!
639    
640    Changes:
641    
642      The new pid generation scheme described a few weeks ago was overly
643      complicated.  I implemented a new mechanism that is simpler and
644      provides a bit more "stability":  Once CM has seen a compilation
645      unit, it keeps its identity constant (as long as you do not delete
646      those crucial CM/GUID/* files).  This means that when you change
647      an interface, compile, then go back to the old interface, and
648      compile again, you arrive at the original pid.
649    
650      There now also is a mechanism that instructs CM to use the plain
651      environment hash as a module's pid (effectively making its GUID
652      the empty string).  For this, "noguid" must be specified as an
653      option to the .sml file in question within its .cm file.
654      This is most useful for code that is being generated by tools such
655      as ml-nlffigen (because during development programmers tend to
656      erase the tool's entire output directory tree including CM's cached
657      GUIDs).  "noguid" is somewhat dangerous (since it can be used to locally
658      revert to the old, broken behavior of SML/NJ, but in specific cases
659      where there is no danger of interface confusion, its use is ok
660      (I think).
661    
662      ml-nlffigen by default generates "noguid" annotations.  They can be
663      turned off by specifying -guid in its command line.
664    
665    ----------------------------------------------------------------------
666    Name: Lal George
667    Date: 2002/03/12 12 14:42:36 EST
668    Tag: george-20020312-frequency-computation
669    Description:
670    
671    Integrated jump chaining and static block frequency into the
672    compiler. More details and numbers later.
673    
674    ----------------------------------------------------------------------
675    Name: Lal George
676    Date: 2002/03/11 11 22:38:53 EST
677    Tag: george-20020311-jump-chain-elim
678    Description:
679    
680    Tested the jump chain elimination on all architectures (except the
681    hppa).  This is on by default right now and is profitable for the
682    alpha and x86, however, it may not be profitable for the sparc and ppc
683    when compiling the compiler.
684    
685    The gc test will typically jump to a label at the end of the cluster,
686    where there is another jump to an external cluster containing the actual
687    code to invoke gc. This is to allow factoring of common gc invocation
688    sequences. That is to say, we generate:
689    
690            f:
691               testgc
692               ja   L1      % jump if above to L1
693    
694            L1:
695               jmp L2
696    
697    
698    After jump chain elimination the 'ja L1' instructions is converted to
699    'ja L2'. On the sparc and ppc, many of the 'ja L2' instructions may end
700    up being implemented in their long form (if L2 is far away) using:
701    
702            jbe     L3      % jump if below or equal to L3
703            jmp     L2
704         L3:
705            ...
706    
707    
708    For large compilation units L2  may be far away.
709    
710    
711    ----------------------------------------------------------------------
712    Name: Matthias Blume
713    Date: 2002/03/11 13:30:00 EST
714    Tag: blume-20020311-mltreeeval
715    Description:
716    
717    A functor parameter was missing.
718    
719    ----------------------------------------------------------------------
720    Name: Allen Leung
721    Date: 2002/03/11 10:30:00 EST
722    Tag: leunga-20020311-runtime-string0
723    Description:
724    
725       The representation of the empty string now points to a
726    legal null terminated C string instead of unit.  It is now possible
727    to convert an ML string into C string with InlineT.CharVector.getData.
728    This compiles into one single machine instruction.
729    
730    ----------------------------------------------------------------------
731    Name: Allen Leung
732    Date: 2002/03/10 23:55:00 EST
733    Tag: leunga-20020310-x86-call
734    Description:
735    
736       Added machine generation for CALL instruction (relative displacement mode)
737    
738    ----------------------------------------------------------------------
739    Name: Matthias Blume
740    Date: 2002/03/08 16:05:00
741    Tag: blume-20020308-entrypoints
742    Description:
743    
744    Version number bumped to 110.39.1.  NEW BOOTFILES!
745    
746    Entrypoints: non-zero offset into a code object where execution should begin.
747    
748    - Added the notion of an entrypoint to CodeObj.
749    - Added reading/writing of entrypoint info to Binfile.
750    - Made runtime system bootloader aware of entrypoints.
751    - Use the address of the label of the first function given to mlriscGen
752      as the entrypoint.  This address is currently always 0, but it will
753      not be 0 once we turn on block placement.
754    - Removed the linkage cluster code (which was The Other Way(tm) of dealing
755      with entry points) from mlriscGen.
756    
757    ----------------------------------------------------------------------
758    Name: Allen Leung
759    Date: 2002/03/07 20:45:00 EST
760    Tag: leunga-20020307-x86-cmov
761    Description:
762    
763       Bug fixes for CMOVcc on x86.
764    
765       1. Added machine code generation for CMOVcc
766       2. CMOVcc is now generated in preference over SETcc on PentiumPro or above.
767       3. CMOVcc cannot have an immediate operand as argument.
768    
769    ----------------------------------------------------------------------
770    Name: Matthias Blume
771    Date: 2002/03/07 16:15:00 EST
772    Tag: blume-20020307-controls
773    Description:
774    
775    This is a very large but mostly boring patch which makes (almost)
776    every tuneable compiler knob (i.e., pretty much everything under
777    Control.* plus a few other things) configurable via both the command
778    line and environment variables in the style CM did its configuration
779    until now.
780    
781    Try starting sml with '-h' (or, if you are brave, '-H')
782    
783    To this end, I added a structure Controls : CONTROLS to smlnj-lib.cm which
784    implements the underlying generic mechanism.
785    
786    The interface to some of the existing such facilities has changed somewhat.
787    For example, the MLRiscControl module now provides mkFoo instead of getFoo.
788    (The getFoo interface is still there for backward-compatibility, but its
789    use is deprecated.)
790    
791    The ml-build script passes -Cxxx=yyy command-line arguments through so
792    that one can now twiddle the compiler settings when using this "batch"
793    compiler.
794    
795    TODO items:
796    
797    We should go through and throw out all controls that are no longer
798    connected to anything.  Moreover, we should go through and provide
799    meaningful (and correct!) documentation strings for those controls
800    that still are connected.
801    
802    Currently, multiple calls to Controls.new are accepted (only the first
803    has any effect).  Eventually we should make sure that every control
804    is being made (via Controls.new) exactly once.  Future access can then
805    be done using Controls.acc.
806    
807    Finally, it would probably be a good idea to use the getter-setter
808    interface to controls rather than ref cells.  For the time being, both
809    styles are provided by the Controls module, but getter-setter pairs are
810    better if thread-safety is of any concern because they can be wrapped.
811    
812    *****************************************
813    
814    One bug fix: The function blockPlacement in three of the MLRISC
815    backpatch files used to be hard-wired to one of two possibilities at
816    link time (according to the value of the placementFlag).  But (I
817    think) it should rather sense the flag every time.
818    
819    *****************************************
820    
821    Other assorted changes (by other people who did not supply a HISTORY entry):
822    
823    1. the cross-module inliner now works much better (Monnier)
824    2. representation of weights, frequencies, and probabilities in MLRISC
825       changed in preparation of using those for weighted block placement
826       (Reppy, George)
827    
828    ----------------------------------------------------------------------
829    Name: Lal George
830    Date: 2002/03/07 14:44:24 EST 2002
831    Tag: george-20020307-weighted-block-placement
832    
833    Tested the weighted block placement optimization on all architectures
834    (except the hppa) using AMPL to generate the block and edge frequencies.
835    Changes were required in the machine properties to correctly
836    categorize trap instructions. There is an MLRISC flag
837    "weighted-block-placement" that can be used to enable weighted block
838    placement, but this will be ineffective without block/edge
839    frequencies (coming soon).
840    
841    
842    ----------------------------------------------------------------------
843    Name: Lal George
844    Date: 2002/03/05 17:24:48 EST
845    Tag: george-20020305-linkage-cluster
846    
847    In order to support the block placement optimization, a new cluster
848    is generated as the very first cluster (called the linkage cluster).
849    It contains a single jump to the 'real' entry point for the compilation
850    unit. Block placement has no effect on the linkage cluster itself, but
851    all the other clusters  have full freedom in the manner in which they
852    reorder blocks or functions.
853    
854    On the x86 the typical linkage code that is generated is:
855       ----------------------
856            .align 2
857       L0:
858            addl    $L1-L0, 72(%esp)
859            jmp     L1
860    
861    
862            .align  2
863       L1:
864       ----------------------
865    
866    72(%esp) is the memory location for the stdlink register. This
867    must contain the address of the CPS function being called. In the
868    above example, it contains the address of  L0; before
869    calling L1 (the real entry point for the compilation unit), it
870    must contain the address for L1, and hence
871    
872            addl $L1-L0, 72(%esp)
873    
874    I have tested this on all architectures except the hppa.The increase
875    in code size is of course negligible
876    
877    ----------------------------------------------------------------------
878    Name: Allen Leung
879    Date: 2002/03/03 13:20:00 EST
880    Tag: leunga-20020303-mlrisc-tools
881    
882      Added #[ ... ] expressions to mlrisc tools
883    
884    ----------------------------------------------------------------------
885    Name: Matthias Blume
886    Date: 2002/02/27 12:29:00 EST
887    Tag: blume-20020227-cdebug
888    Description:
889    
890    - made types in structure C and C_Debug to be equal
891    - got rid of code duplication (c-int.sml vs. c-int-debug.sml)
892    - there no longer is a C_Int_Debug (C_Debug is directly derived from C)
893    
894    ----------------------------------------------------------------------
895    Name: Matthias Blume
896    Date: 2002/02/26 12:00:00 EST
897    Tag: blume-20020226-ffi
898    Description:
899    
900    1. Fixed a minor bug in CM's "noweb" tool:
901       If numbering is turned off, then truly don't number (i.e., do not
902       supply the -L option to noweb).  The previous behavior was to supply
903       -L'' -- which caused noweb to use the "default" line numbering scheme.
904       Thanks to Chris Richards for pointing this out (and supplying the fix).
905    
906    2. Once again, I reworked some aspects of the FFI:
907    
908       A. The incomplete/complete type business:
909    
910       - Signatures POINTER_TO_INCOMPLETE_TYPE and accompanying functors are
911         gone!
912       - ML types representing an incomplete type are now *equal* to
913         ML types representing their corresponding complete types (just like
914         in C).  This is still safe because ml-nlffigen will not generate
915         RTTI for incomplete types, nor will it generate functions that
916         require access to such RTTI.   But when ML code generated from both
917         incomplete and complete versions of the C type meet, the ML types
918         are trivially interoperable.
919    
920         NOTE:  These changes restore the full generality of the translation
921         (which was previously lost when I eliminated functorization)!
922    
923       B. Enum types:
924    
925       - Structure C now has a type constructor "enum" that is similar to
926         how the "su" constructor works.  However, "enum" is not a phantom
927         type because each "T enum" has values (and is isomorphic to
928         MLRep.Signed.int).
929       - There are generic access operations for enum objects (using
930         MLRep.Signed.int).
931       - ml-nlffigen will generate a structure E_foo for each "enum foo".
932         * The structure contains the definition of type "mlrep" (the ML-side
933         representation type of the enum).  Normally, mlrep is the same
934         as "MLRep.Signed.int", but if ml-nlffigen was invoked with "-ec",
935         then mlrep will be defined as a datatype -- thus facilitating
936         pattern matching on mlrep values.
937         ("-ec" will be suppressed if there are duplicate values in an
938          enumeration.)
939         * Constructors ("-ec") or values (no "-ec") e_xxx of type mlrep
940         will be generated for each C enum constant xxx.
941         * Conversion functions m2i and i2m convert between mlrep and
942         MLRep.Signed.int.  (Without "-ec", these functions are identities.)
943         * Coversion functions c and ml convert between mlrep and "tag enum".
944         * Access functions (get/set) fetch and store mlrep values.
945       - By default (unless ml-nlffigen was invoked with "-nocollect"), unnamed
946         enumerations are merged into one single enumeration represented by
947         structure E_'.
948    
949    ----------------------------------------------------------------------
950    Name: Allen Leung
951    Date: 2002/02/25 04:45:00 EST
952    Tag: leunga-20020225-cps-spill
953    
954    This is a new implementation of the CPS spill phase.
955    The new phase is in the new file compiler/CodeGen/cpscompile/spill-new.sml
956    In case of problems, replace it with the old file spill.sml
957    
958    The current compiler runs into some serious performance problems when
959    constructing a large record.  This can happen when we try to compile a
960    structure with many items.  Even a very simple structure like the following
961    makes the compiler slow down.
962    
963        structure Foo = struct
964           val x_1 = 0w1 : Word32.int
965           val x_2 = 0w2 : Word32.int
966           val x_3 = 0w3 : Word32.int
967           ...
968           val x_N = 0wN : Word32.int
969        end
970    
971    The following table shows the compile time, from N=1000 to N=4000,
972    with the old compiler:
973    
974    N
975    1000   CPS 100 spill                           0.04u  0.00s  0.00g
976           MLRISC ra                               0.06u  0.00s  0.05g
977              (spills = 0 reloads = 0)
978           TOTAL                                   0.63u  0.07s  0.21g
979    
980    1100   CPS 100 spill                           8.25u  0.32s  0.64g
981           MLRISC ra                               5.68u  0.59s  3.93g
982              (spills = 0 reloads = 0)
983           TOTAL                                   14.71u  0.99s  4.81g
984    
985    1500   CPS 100 spill                           58.55u  2.34s  1.74g
986           MLRISC ra                               5.54u  0.65s  3.91g
987              (spills = 543 reloads = 1082)
988           TOTAL                                   65.40u  3.13s  6.00g
989    
990    2000   CPS 100 spill                           126.69u  4.84s  3.08g
991           MLRISC ra                               0.80u  0.10s  0.55g
992              (spills = 42 reloads = 84)
993           TOTAL                                   129.42u  5.10s  4.13g
994    
995    3000   CPS 100 spill                           675.59u  19.03s  11.64g
996           MLRISC ra                               2.69u  0.27s  1.38g
997              (spills = 62 reloads = 124)
998           TOTAL                                   682.48u  19.61s  13.99g
999    
1000    4000   CPS 100 spill                           2362.82u  56.28s  43.60g
1001           MLRISC ra                               4.96u  0.27s  2.72g
1002              (spills = 85 reloads = 170)
1003           TOTAL                                   2375.26u  57.21s  48.00g
1004    
1005    As you can see the old cps spill module suffers from some serious
1006    performance problem.  But since I cannot decipher the old code fully,
1007    instead of patching the problems up, I'm reimplementing it
1008    with a different algorithm.  The new code is more modular,
1009    smaller when compiled, and substantially faster
1010    (O(n log n) time and O(n) space).  Timing of the new spill module:
1011    
1012    4000  CPS 100 spill                           0.02u  0.00s  0.00g
1013          MLRISC ra                               0.25u  0.02s  0.15g
1014             (spills=1 reloads=3)
1015          TOTAL                                   7.74u  0.34s  1.62g
1016    
1017    Implementation details:
1018    
1019    As far as I can tell, the purpose of the CPS spill module is to make sure the
1020    number of live variables at any program point (the bandwidth)
1021    does not exceed a certain limit, which is determined by the
1022    size of the spill area.
1023    
1024    When the bandwidth is too large, we decrease the register pressure by
1025    packing live variables into spill records.  How we achieve this is
1026    completely different than what we did in the old code.
1027    
1028    First, there is something about the MLRiscGen code generator
1029    that we should be aware of:
1030    
1031    o MLRiscGen performs code motion!
1032    
1033       In particular, it will move floating point computations and
1034       address computations involving only the heap pointer to
1035       their use sites (if there is only a single use).
1036       What this means is that if we have a CPS record construction
1037       statement
1038    
1039           RECORD(k,vl,w,e)
1040    
1041       we should never count the new record address w as live if w
1042       has only one use (which is often the case).
1043    
1044       We should do something similar to floating point, but the transformation
1045       there is much more complex, so I won't deal with that.
1046    
1047    Secondly, there are now two new cps primops at our disposal:
1048    
1049     1. rawrecord of record_kind option
1050        This pure operator allocates some uninitialized storage from the heap.
1051        There are two forms:
1052    
1053         rawrecord NONE [INT n]  allocates a tagless record of length n
1054         rawrecord (SOME rk) [INT n] allocates a tagged record of length n
1055                                     and initializes the tag.
1056    
1057     2. rawupdate of cty
1058          rawupdate cty (v,i,x)
1059          Assigns to x to the ith component of record v.
1060          The storelist is not updated.
1061    
1062    We use these new primops for both spilling and increment record construction.
1063    
1064     1. Spilling.
1065    
1066        This is implemented with a linear scan algorithm (but generalized
1067        to trees).  The algorithm will create a single spill record at the
1068        beginning of the cps function and use rawupdate to spill to it,
1069        and SELECT or SELp to reload from it.  So both spills and reloads
1070        are fine-grain operations.  In contrast, in the old algorithm
1071        "spills" have to be bundled together in records.
1072    
1073        Ideally, we should sink the spill record construction to where
1074        it is needed.  We can even split the spill record into multiple ones
1075        at the places where they are needed.  But CPS is not a good
1076        representation for global code motion, so I'll keep it simple and
1077        am not attempting this.
1078    
1079     2. Incremental record construction (aka record splitting).
1080    
1081        Long records with many component values which are simulatenously live
1082        (recall that single use record addresses are not considered to
1083         be live) are constructed with rawrecord and rawupdate.
1084        We allocate space on the heap with rawrecord first, then gradually
1085        fill it in with rawupdate.  This is the technique suggested to me
1086        by Matthias.
1087    
1088        Some restrictions on when this is applicable:
1089        1. It is not a VECTOR record.  The code generator currently does not handle
1090           this case. VECTOR record uses double indirection like arrays.
1091        2. All the record component values are defined in the same "basic block"
1092           as the record constructor.  This is to prevent speculative
1093           record construction.
1094    
1095    ----------------------------------------------------------------------
1096    Name: Allen Leung
1097    Date: 2002/02/22 01:02:00 EST
1098    Tag: leunga-20020222-mlrisc-tools
1099    
1100    Minor bug fixes in the parser and rewriter
1101    
1102    ----------------------------------------------------------------------
1103    Name: Allen Leung
1104    Date: 2002/02/21 20:20:00 EST
1105    Tag: leunga-20020221-peephole
1106    
1107    Regenerated the peephole files.  Some contained typos in the specification
1108    and some didn't compile because of pretty printing bugs in the old version
1109    of 'nowhere'.
1110    
1111    ----------------------------------------------------------------------
1112    Name: Allen Leung
1113    Date: 2002/02/19 20:20:00 EST
1114    Tag: leunga-20020219-mlrisc-tools
1115    Description:
1116    
1117       Minor bug fixes to the mlrisc-tools library:
1118    
1119       1.  Fixed up parsing colon suffixed keywords
1120       2.  Added the ability to shut the error messages up
1121       3.  Reimplemented the pretty printer and fixed up/improved
1122           the pretty printing of handle and -> types.
1123       4.  Fixed up generation of literal symbols in the nowhere tool.
1124       5.  Added some SML keywords to to sml.sty
1125    
1126    ----------------------------------------------------------------------
1127    Name: Matthias Blume
1128    Date: 2002/02/19 16:20:00 EST
1129    Tag: blume-20020219-cmffi
1130    Description:
1131    
1132    A wild mix of changes, some minor, some major:
1133    
1134    * All C FFI-related libraries are now anchored under $c:
1135        $/c.cm      --> $c/c.cm
1136        $/c-int.cm  --> $c/internals/c-int.cm
1137        $/memory.cm --> $c/memory/memory.cm
1138    
1139    * "make" tool (in CM) now treats its argument pathname slightly
1140      differently:
1141        1. If the native expansion is an absolute name, then before invoking
1142           the "make" command on it, CM will apply OS.Path.mkRelative
1143           (with relativeTo = OS.FileSys.getDir()) to it.
1144        2. The argument will be passed through to subsequent phases of CM
1145           processing without "going native".  In particular, if the argument
1146           was an anchored path, then "make" will not lose track of that anchor.
1147    
1148    * Compiler backends now "know" their respective C calling conventions
1149      instead of having to be told about it by ml-nlffigen.  This relieves
1150      ml-nlffigen from one of its burdens.
1151    
1152    * The X86Backend has been split into X86CCallBackend and X86StdCallBackend.
1153    
1154    * Export C_DEBUG and C_Debug from $c/c.cm.
1155    
1156    * C type encoding in ml-nlffi-lib has been improved to model the conceptual
1157      subtyping relationship between incomplete pointers and their complete
1158      counterparts.  For this, ('t, 'c) ptr has been changed to 'o ptr --
1159      with the convention of instantiating 'o with ('t, 'c) obj whenever
1160      the pointer target type is complete.  In the incomplete case, 'o
1161      will be instantiated with some "'c iobj" -- a type obtained by
1162      using one of the functors PointerToIncompleteType or PointerToCompleteType.
1163    
1164      Operations that work on both incomplete and complete pointer types are
1165      typed as taking an 'o ptr while operations that require the target to
1166      be known are typed as taking some ('t, 'c) obj ptr.
1167    
1168      voidptr is now a bit "more concrete", namely "type voidptr = void ptr'"
1169      where void is an eqtype without any values.  This makes it possible
1170      to work on voidptr values using functions meant to operate on light
1171      incomplete pointers.
1172    
1173    * As a result of the above, signature POINTER_TO_INCOMPLETE_TYPE has
1174      been vastly simplified.
1175    
1176    ----------------------------------------------------------------------
1177    Name: Matthias Blume
1178    Date: 2002/02/19 10:48:00 EST
1179    Tag: blume-20020219-pqfix
1180    Description:
1181    
1182    Applied Chris Okasaki's bug fix for priority queues.
1183    
1184    ----------------------------------------------------------------------
1185    Name: Matthias Blume
1186    Date: 2002/02/15 17:05:00
1187    Tag: Release_110_39
1188    Description:
1189    
1190    Last-minute retagging is becoming a tradition... :-(
1191    
1192    This is the working release 110.39.
1193    
1194    ----------------------------------------------------------------------
1195    Name: Matthias Blume
1196    Date: 2002/02/15 16:00:00 EST
1197    Tag: Release_110_39-orig
1198    Description:
1199    
1200    Working release 110.39.  New bootfiles.
1201    
1202    (Update: There was a small bug in the installer so it wouldn't work
1203    with all shells.  So I retagged. -Matthias)
1204    
1205    ----------------------------------------------------------------------
1206    Name: Matthias Blume
1207    Date: 2002/02/15 14:17:00 EST
1208    Tag: blume-20020215-showbindings
1209    Description:
1210    
1211    Added EnvRef.listBoundSymbols and CM.State.showBindings.  Especially
1212    the latter can be useful for exploring what bindings are available at
1213    the interactive prompt.  (The first function returns only the list
1214    of symbols that are really bound, the second prints those but also the
1215    ones that CM's autoloading mechanism knows about.)
1216    
1217    ----------------------------------------------------------------------
1218    Name: Matthias Blume
1219    Date: 2002/02/15 12:08:00 EST
1220    Tag: blume-20020215-iptrs
1221    Description:
1222    
1223    Two improvements to ml-nlffigen:
1224    
1225      1. Write files only if they do not exist or if their current contents
1226         do not coincide with what's being written.  (That is, avoid messing
1227         with the time stamps unless absolutely necessary.)
1228    
1229      2. Implement a "repository" mechanism for generated files related
1230         to "incomplete pointer types".   See the README file for details.
1231    
1232    ----------------------------------------------------------------------
1233    Name: Matthias Blume
1234    Date: 2002/02/14 11:50:00 EST
1235    Tag: blume-20020214-quote
1236    Description:
1237    
1238    Added a type 't t_' to tag.sml (in ml-nlffi-lib.cm).  This is required
1239    because of the new and improved tag generation scheme.  (Thanks to Allen
1240    Leung for pointing it out.)
1241    
1242    ----------------------------------------------------------------------
1243    Name: Lal George
1244    Date: 2002/02/14 09:55:27 EST 2002
1245    Tag: george-20020214-isabelle-bug
1246    Description:
1247    
1248    Fixed the MLRISC bug sent by Markus Wenzel regarding the compilation
1249    of Isabelle on the x86.
1250    
1251    From Allen:
1252    -----------
1253     I've found the problem:
1254    
1255         in ra-core.sml, I use the counter "blocked" to keep track of the
1256         true number of elements in the freeze queue.  When the counter goes
1257         to zero, I skip examining the queue.  But I've messed up the
1258         bookkeeping in combine():
1259    
1260             else ();
1261             case !ucol of
1262               PSEUDO => (if !cntv > 0 then
1263                     (if !cntu > 0 then blocked := !blocked - 1 else ();
1264                                        ^^^^^^^^^^^^^^^^^^^^^^^
1265                      moveu := mergeMoveList(!movev, !moveu)
1266                     )
1267                  else ();
1268    
1269         combine() is called to coalesce two nodes u and v.
1270         I think I was thinking that if the move counts of u and v are both
1271         greater than zero then after they are coalesced then one node is
1272         removed from the freeze queue.  Apparently I was thinking that
1273         both u and v are of low degree, but that's clearly not necessarily true.
1274    
1275    
1276    02/12/2002:
1277        Here's the patch.  HOL now compiles.
1278    
1279        I don't know how this impact on performance (compile
1280        time or runtime).  This bug caused the RA (especially on the x86)
1281        to go thru the potential spill phase when there are still nodes on the
1282        freeze queue.
1283    
1284    
1285    
1286    
1287    ----------------------------------------------------------------------
1288    Name: Matthias Blume
1289    Date: 2002/02/13 22:40:00 EST
1290    Tag: blume-20020213-fptr-rtti
1291    Description:
1292    
1293    Fixed a bug in ml-nlffigen that was introduced with one of the previous
1294    updates.
1295    
1296    ----------------------------------------------------------------------
1297    Name: Matthias Blume
1298    Date: 2002/02/13 16:41:00 EST
1299    Tag: blume-20020213-cmlpq
1300    Description:
1301    
1302    Added new priority queue export symbols (which have just been added to
1303    smlnj-lib.cm) to CML's version of smlnj-lib.cm.  (Otherwise CML would
1304    not compile and the installer would choke.)
1305    
1306    ----------------------------------------------------------------------
1307    Name: Matthias Blume
1308    Date: 2002/02/13 16:15:00 EST
1309    Tag: blume-20020213-various
1310    Description:
1311    
1312    1. More tweaks to ml-nlffigen:
1313    
1314       - better internal datastructures (resulting in slight speedup)
1315       - "-match" option requires exact match
1316       - "localized" gensym counters (untagged structs/unions nested within
1317         other structs/unions or within typedefs get a fresh counter; their
1318         tag will be prefixed by a concatenation of their parents' tags)
1319       - bug fixes (related to calculation of transitive closure of types
1320         to be included in the output)
1321    
1322    2. Minor Basis updates:
1323    
1324       - added implementations for List.collate and Option.app
1325    
1326    ----------------------------------------------------------------------
1327    Name: Matthias Blume
1328    Date: 2002/02/11 15:55:00 EST
1329    Tag: blume-20020211-gensym
1330    Description:
1331    
1332    Added a "-gensym" option to command line of ml-nlffigen.  This can be
1333    used to specify a "stem" -- a string that is inserted in all "gensym'd"
1334    names (ML structure names that correspond to unnamed C structs, unions,
1335    and enums), so that separate runs of ml-nlffigen do not clash.
1336    
1337    ----------------------------------------------------------------------
1338    Name: Matthias Blume
1339    Date: 2002/02/11 12:05:00 EST
1340    Tag: blume-20020211-gensml
1341    Description:
1342    
1343    A quick fix for a problem with GenSML (in the pgraph-util library):
1344    Make generation of toplevel "local" optional.  (Strictly speaking,
1345    signature definitions within "local" are not legal SML.)
1346    
1347    Other than that: updates to INSTALL and cm/TODO.
1348    
1349    ----------------------------------------------------------------------
1350    Name: Matthias Blume
1351    Date: 2002/02/08 15:00:00 EST
1352    Tag: blume-20020208-uniquepid
1353    Description:
1354    
1355    0. Version number has been bumped to 110.38.1.  NEW BOOTFILES!!!
1356    
1357    1. The installer (config/install.sh) has gotten smarter:
1358    
1359         - Configuration options are a bit easier to specify now
1360           (in config/targets).
1361         - Bug in recognizing .tar.bz2 files fixed.
1362         - Installer automatically resolves dependencies between
1363           configuration options (e.g., if you ask for eXene, you will
1364           also get cml -- regardless whether you asked for it or not).
1365         - Installer can run in "quieter mode" by setting the environment
1366           variable INSTALL_QUIETLY to "true".  "Quieter" does not mean
1367           "completely silent", though.
1368         - Build HashCons library as part of smlnj-lib.
1369    
1370    2. A new scheme for assigning persistent identifiers to compilation
1371       units (and, by extension, to types etc.) has been put into place.
1372       This fixes a long-standing bug where types and even dynamic values
1373       can get internally confused, thereby compromising type safety
1374       (abstraction) and dynamic correctness.  See
1375    
1376         http://cm.bell-labs.com/cm/cs/who/blume/pid-confusion.tgz
1377    
1378       for an example of how things could go wrong until now.
1379    
1380       The downside of the new scheme is that pids are not quite as
1381       persistent as they used to be: CM will generate a fresh pid
1382       for every compilation unit that it thinks it sees for the first
1383       time.  That means that if you compile starting from a clean, fresh
1384       source tree at two different times, you end up with different
1385       binaries.
1386    
1387       Cutoff recompilation, however, has not been compromised because
1388       CM keeps pid information in special caches between runs.
1389    
1390    ----------------------------------------------------------------------
1391    Name: Lal George
1392    Date: 2002/02/07 15:34:13 EST 2002
1393    Tag: <none>
1394    Description:
1395    
1396    Compilers that generate assembly code may produce  global labels
1397    whose value is resolved at link time. The various peephole optimization
1398    modules did not take this in account.
1399    
1400    TODO. The Labels.addrOf function should really return an option
1401    type so that clients are forced to deal with this issue, rather
1402    than an exception being raised.
1403    
1404    ----------------------------------------------------------------------
1405    Name: Lal George
1406    Date: 2002/02/06 13:55:02 EST
1407    Tag: george-20020206-ra-breakup
1408    Description:
1409    
1410    1. A bug fix from Allen.
1411    
1412        A typo causes extra fstp %st(0)'s to be generated at compensation
1413        edges, which might cause stack underflow traps at runtime.  This
1414        occurs in fft where there are extraneous fstps right before the 'into'
1415        trap instruction (in this case they are harmless since none of the
1416        integers overflow.)
1417    
1418    2. Pulled out various utility modules that were embedded in the modules
1419       of the register allocator. I need these modules for other purposes, but
1420       they are not complete enough to put into a library (just yet).
1421    ----------------------------------------------------------------------
1422    Name: Matthias Blume
1423    Date: 2002/01/31 16:05:00 EST
1424    Tag: blume-20020131-sparc-ccalls
1425    Description:
1426    
1427    1. C-calls on Sparc needlessly allocated a huge chunk (96 bytes)
1428       of extra stack space by mistake.  Fixed.
1429    
1430    2. Bug in logic of handling of command-line options in ml-nlffigen fixed.
1431    
1432    ----------------------------------------------------------------------
1433    Name: Allen Leung
1434    Date: 2002/01/30
1435    Tag: leunga-20020130-nowhere-bug-fix
1436    Description:
1437    
1438       MLRISC bug fixes:
1439       1. Fixed a bindings computation bug in the 'nowhere' program generator tool.
1440       2. MachineInt.fromString was negating its value.
1441    
1442    ----------------------------------------------------------------------
1443    Name: Matthias Blume
1444    Date: 2002/01/29
1445    Tag: blume-20020129-INSTALL
1446    Description:
1447    
1448    - Added somewhat detailed installation instructions (file INSTALL).
1449    - Fixed curl-detection bug in config/install.sh.
1450    - It is now possible to select the URL getter using the URLGETTER
1451      environment variable:
1452    
1453          not set / "unknown"      --> automatic detection (script tries wget,
1454                                       curl, and lynx)
1455          "wget" / "curl" / "lynx" --> use the specified program (script "knows"
1456                                       how to properly invoke them)
1457          other                    --> use $URLGETTER directly, it must take
1458                                       precisely two command-line arguments
1459                                       (source URL and destination file name)
1460    
1461    ----------------------------------------------------------------------
1462    Name: Matthias Blume
1463    Date: 2002/01/28
1464    Tag: blume-20020128-sparc-ccalls
1465    Description:
1466    
1467    - Fixed problem with calculation of "used" registers in sparc-c-calls.
1468    - Make use of the allocParam argument in sparc-c-calls.
1469    
1470    ----------------------------------------------------------------------
1471    Name: Matthias Blume
1472    Date: 2002/01/28
1473    Tag: blume-20020128-allocParam
1474    Description:
1475    
1476    John Reppy:  Changes c-calls API to accept client-callback for
1477    allocating extra stack space.
1478    me: Corresponding changes to mlriscGen (using a dummy argument that
1479        does not change the current behavior).
1480    
1481    ----------------------------------------------------------------------
1482    Name: Matthias Blume
1483    Date: 2002/01/28 12:00:00
1484    Tag: Release_110_38
1485    Description:
1486    
1487    This time for real!!!
1488    
1489    ----------------------------------------------------------------------
1490    Name: Matthias Blume
1491    Date: 2002/01/28 10:56:00 EST
1492    Tag: blume-20020128-retraction
1493    Description:
1494    
1495    0. Retracted earlier 110.38.  (The Release_110_38 tag has been replaced
1496       with blume-Release_110_38-retracted.)
1497    
1498    1. Fixed a problem with incorrect rounding modes in real64.sml.
1499       (Thanks to Andrew Mccreight <andrew.mccreight@yale.edu>.)
1500    
1501    2. A bug in ml-nlffigen related to the handling of unnamed structs, unions,
1502       and enums fixed.  The naming of corresponding ML identifiers should
1503       now be consistent again.
1504    
1505    ----------------------------------------------------------------------
1506    Name: Allen Leung
1507    Date: 2002/01/27
1508    Tag: leunga-20020127-nowhere
1509    Description:
1510    
1511       Added a target called nowhere in the configuration scripts.
1512       Enabling this will build the MLRISC 'nowhere' tool (for translating
1513       programs with where-clauses into legal SML code) during installation.
1514    
1515    ----------------------------------------------------------------------
1516    Name: Matthias Blume
1517    Date: 2002/01/25 21:27:00 EST
1518    Tag: blume-Release_110_38-retracted
1519    Description:
1520    
1521    Call it a (working) release!  Version is 110.38. Bootfiles are ready.
1522    
1523    README will be added later.
1524    
1525    !!! NOTE:  Re-tagged as blume-Release_110_38-retracted. Original tag
1526    (Release_110_38) removed.  Reason: Last-minute bug fixes.
1527    
1528    ----------------------------------------------------------------------
1529    Name: Matthias Blume
1530    Date: 2002/01/25
1531    Tag: blume-20020125-ffi
1532    Description:
1533    
1534    A large number of tweaks and improvements to ml-nlffi-lib and
1535    ml-nlffigen:
1536    
1537       - ML represenation types have been streamlined
1538       - getter and setter functions work with concrete values, not abstract
1539         ones where possible
1540       - ml-nlffigen command line more flexible (see README file there)
1541       - some bugs have been fixed (hopefully)
1542    
1543    ----------------------------------------------------------------------
1544    Name: Lal George
1545    Date: 2002/01/24
1546    Tag: george-20020124-risc-ra-interface
1547    Description:
1548    
1549       There is a dramatic simplification in the interface to the
1550       register allocator for RISC architectures as a result of making
1551       parallel copy instructions explicit.
1552    
1553    ----------------------------------------------------------------------
1554    Name: Matthias Blume
1555    Date: 2002/01/22
1556    Tag: blume-20020122-x86-ccalls
1557    Description:
1558    
1559    Bug fix for c-calls on x86 (having to do with how char- and
1560    short-arguments are being handled).
1561    
1562    ----------------------------------------------------------------------
1563    Name: Matthias Blume
1564    Date: 2002/01/21
1565    Tag: blume-20020121-ff
1566    Description:
1567    
1568    Another day of fiddling with the FFI...
1569    
1570    1. Bug fix/workaround:  CKIT does not complain about negative array
1571       dimensions, so ml-nlffigen has to guard itself against this possibility.
1572       (Otherwise a negative dimension would send it into an infinite loop.)
1573    
1574    2. Some of the abstract types (light objects, light pointers, most "base"
1575       types) in structure C are now eqtypes.
1576    
1577    3. Added constructors and test functions for NULL function pointers.
1578    
1579    ----------------------------------------------------------------------
1580    Name: Matthias Blume
1581    Date: 2002/01/18
1582    Tag: blume-20020118-ready-for-new-release
1583    Description:
1584    
1585    Made config/srcarchiveurl point to a new place.  (Will provide boot
1586    files shortly.)
1587    
1588    Maybe we christen this to be 110.38?
1589    
1590    ----------------------------------------------------------------------
1591    Name: Matthias Blume
1592    Date: 2002/01/18
1593    Tag: blume-20020118-more-ffifiddle
1594    Description:
1595    
1596    Today's FFI fiddling:
1597    
1598      - Provided a structure CGetSet with "convenient" versions of C.Get.* and
1599        C.Set.* that use concrete (MLRep.*) arguments and results instead
1600        of abstract ones.
1601    
1602      - Provided word-style bit operations etc. for "int" representation
1603        types in MLRep.S<Foo>Bitops where <Foo> ranges over Char, Int, Short,
1604        and Long.
1605    
1606    ----------------------------------------------------------------------
1607    Name: Matthias Blume
1608    Date: 2002/01/18
1609    Tag: blume-20020118-use-x86-fp
1610    Description:
1611    
1612    Now that x86-fast-fp seems to be working, I turned it back on again
1613    by default.  (Seems to work fine now, even with the FFI.)
1614    
1615    Other than that, I added some documentation about the FFI to
1616    src/ml-nlffigen/README and updated the FFI test examples in
1617    src/ml-nlffi-lib/Tests/*.
1618    
1619    ----------------------------------------------------------------------
1620    Name: Allen Leung
1621    Date: 2002/01/17
1622    Tag: leunga-20020117-x86-fast-fp-call
1623    Description:
1624    
1625       1. Fixed a problem with handling return fp values when x86's fast fp
1626          mode is turned on.
1627    
1628       2. Minor pretty printing fix for cellset.  Print %st(0) as %st(0) instead
1629          of %f32.
1630    
1631       3. Added a constructor INT32lit to the ast of MLRISC tools.
1632    
1633    ----------------------------------------------------------------------
1634    Name: Matthias Blume
1635    Date: 2002/01/16
1636    Tag: blume-20020116-ffifiddle
1637    Description:
1638    
1639    More fiddling with the FFI interface:
1640    
1641     - Make constness 'c instead of rw wherever possible.  This eliminates
1642       the need for certain explicit coercions.  (However, due to ML's
1643       value polymorphism, there will still be many cases where explicit
1644       coercions are necessary.  Phantom types are not the whole answer
1645       to modeling a subtyping relationship in ML.)
1646    
1647     - ro/rw coersions for pointers added.  (Avoids the detour through */&.)
1648    
1649     - "printf" test example added to src/ml-nlffi-lib/Tests.  (Demonstrates
1650       clumsy workaround for varargs problem.)
1651    
1652    ----------------------------------------------------------------------
1653    Name: Lal George
1654    Date: 2002/01/15
1655    Tag: <none>
1656    Description:
1657    
1658    1. Since COPY instructions are no longer native to the architecture,
1659       a generic functor can be used to implement the expandCopies function.
1660    
1661    2. Allowed EXPORT and IMPORT pseudo-op declarations to appear inside a
1662       TEXT segment.
1663    
1664    ----------------------------------------------------------------------
1665    Name: Matthias Blume
1666    Date: 2002/01/15
1667    Tag: blume-20020115-ffiupdates
1668    Description:
1669    
1670    1. Fix for bug resulting in single-precision float values being returned
1671       incorrectly from FFI calls.
1672    
1673    2. Small modifications to C FFI API:
1674    
1675        - memory-allocation routines return straight objects (no options)
1676          and raise an exception in out-of-memory situations
1677        - unsafe extensions to cast between function pointers and pointers
1678          from/to ints
1679        - added structure C_Debug as an alternative to structure C where
1680          pointer-dereferencing (|*| and |*!) always check for null-pointers
1681        - added open_lib' to DynLinkage;  open_lib' works like open_lib
1682          but also takes a (possibly empty) list of existing library handles
1683          that the current library depends on
1684    
1685    ----------------------------------------------------------------------
1686    Name: Matthias Blume
1687    Date: 2002/01/10
1688    Tag: blume-20020110-newffigen
1689    Description:
1690    
1691    1. Updates to portable graph code.
1692    
1693    2. Major update to ml-nlffigen and ml-nlffi-lib.  Things are much
1694       more scalable now so that even huge interfaces such as the one
1695       for GTK compile in finite time and space. :-)
1696       See src/ml-nlffigen/README for details on what's new.
1697    
1698    ----------------------------------------------------------------------
1699    Name: Lal George
1700    Date: 2001/01/09 14:31:35 EST 2002
1701    Tag: george-20011206-rm-native-copy
1702    Description:
1703    
1704            Removed the native COPY and FCOPY instructions
1705            from all the architectures and replaced it with the
1706            explicit COPY instruction from the previous commit.
1707    
1708            It is now possible to simplify many of the optimizations
1709            modules that manipulate copies. This has not been
1710            done in this change.
1711    
1712    ----------------------------------------------------------------------
1713    Name: Lal George
1714    Date: 2001/12/06 16:50:13 EST 2001
1715    Tag: george-20011206-mlrisc-instruction
1716    Description:
1717    
1718    Changed the representation of instructions from being fully abstract
1719    to being partially concrete. That is to say:
1720    
1721      from
1722            type instruction
1723    
1724      to
1725            type instr                              (* machine instruction *)
1726    
1727            datatype instruction =
1728                LIVE of {regs: C.cellset, spilled: C.cellset}
1729              | KILL of {regs: C.cellset, spilled: C.cellset}
1730              | COPYXXX of {k: CB.cellkind, dst: CB.cell list, src: CB.cell list}
1731              | ANNOTATION of {i: instruction, a: Annotations.annotation}
1732              | INSTR of instr
1733    
1734    This makes the handling of certain special instructions that appear on
1735    all architectures easier and uniform.
1736    
1737    LIVE and KILL say that a list of registers are live or killed at the
1738    program point where they appear. No spill code is generated when an
1739    element of the 'regs' field is spilled, but the register is moved to
1740    the 'spilled' (which is present, more for debugging than anything else).
1741    
1742    LIVE replaces the (now deprecated) DEFFREG instruction on the alpha.
1743    We used to generate:
1744    
1745            DEFFREG f1
1746            f1 := f2 + f3
1747            trapb
1748    
1749    but now generate:
1750    
1751            f1 := f2 + f3
1752            trapb
1753            LIVE {regs=[f1,f2,f3], spilled=[]}
1754    
1755    Furthermore, the DEFFREG (hack) required that all floating point instruction
1756    use all registers mentioned in the instruction. Therefore f1 := f2 + f3,
1757    defines f1 and uses [f1,f2,f3]! This hack is no longer required resulting
1758    in a cleaner alpha implementation. (Hopefully, intel will not get rid of
1759    this architecture).
1760    
1761    COPYXXX is intended to replace the parallel COPY and FCOPY  available on
1762    all the architectures. This will result in further simplification of the
1763    register allocator that must be aware of them for coalescing purposes, and
1764    will also simplify certain aspects of the machine description that provides
1765    callbacks related to parallel copies.
1766    
1767    ANNOTATION should be obvious, and now INSTR represents the honest to God
1768    machine instruction set!
1769    
1770    The <arch>/instructions/<arch>Instr.sml files define certain utility
1771    functions for making porting easier -- essentially converting upper case
1772    to lower case. All machine instructions (of type instr) are in upper case,
1773    and the lower case form generates an MLRISC instruction. For example on
1774    the alpha we have:
1775    
1776      datatype instr =
1777         LDA of {r:cell, b:cell, d:operand}
1778       | ...
1779    
1780      val lda : {r:cell, b:cell, d:operand} -> instruction
1781        ...
1782    
1783    where lda is just (INSTR o LDA), etc.
1784    
1785    ----------------------------------------------------------------------
1786    Name: Matthias Blume
1787    Date: 2001/11/22 21:40:00 EST
1788    Tag: Release_110_37
1789    Description:
1790    
1791    Release 110.37.  This time for real.
1792    
1793    ----------------------------------------------------------------------
1794    Name: Matthias Blume
1795    Date: 2001/11/21 16:35:00 EST
1796    Tag: blume-20011121-foot-in-mouth
1797    Description:
1798    
1799    Removed the "Release_110_37" tag because of a serious bug.
1800    This will be re-tagged once the bug is fixed.
1801    
1802    ----------------------------------------------------------------------
1803    Name: Matthias Blume
1804    Date: 2001/11/21 16:14:00 EST
1805    Tag: blume-20011121-forgottenfile
1806    Description:
1807    
1808    Forgot to add a file.  (Just a .tex-file -- part of
1809    the CM manual source.)
1810    
1811    ----------------------------------------------------------------------
1812    Name: Matthias Blume
1813    Date: 2001/11/21 16:10:00 EST
1814    Tag: blume-20011121-invalid_110_37
1815    Description:
1816    
1817    Note: I removed the original tag "Release_110_37" from this commit
1818          because we found a serious bug in all non-x86 backends.
1819          - Matthias
1820    
1821    1. Modifications to the SML/NJ code generator and to the runtime system
1822       so that code object name strings are directly inserted into code
1823       objects at code generation time.  The only business the runtime system
1824       has with this is now to read the name strings on occasions.
1825       (The encoding of the name string has also changed somewhat.)
1826    
1827    2. CM now implements a simple "set calculus" for specifying export lists.
1828       In particular, it is now possible to refer to the export lists of
1829       other libraries/groups/sources and form unions as well as differences.
1830       See the latest CM manual for details.
1831    
1832    3. An separate notion of "proxy" libraries has again be eliminated from
1833       CM's model.  (Proxy libraries are now simply a special case of using
1834       the export list calculus.)
1835    
1836    4. Some of the existing libraries now take advantage of the new set
1837       calculus.
1838       (Notice that not all libraries have been converted because some
1839       of the existing .cm-files are supposed to be backward compatible
1840       with 110.0.x.)
1841    
1842    5. Some cleanup in stand-alone programs.  (Don't use "exnMessage" -- use
1843       "General.exnMessage"!  The former relies on a certain hook to be
1844       initialized, and that often does not happen in the stand-alone case.)
1845    
1846    ----------------------------------------------------------------------
1847    Name: Lal George
1848    Date: 2001/11/21  13:56:18 EST
1849    Tag: george-2001121-pseudo-ops
1850    Description:
1851    
1852      Implemented a complete redesign of MLRISC pseudo-ops. Now there
1853      ought to never be any question of incompatabilities with
1854      pseudo-op syntax expected by host assemblers.
1855    
1856      For now, only modules supporting GAS syntax are implemented
1857      but more should follow, such as MASM, and vendor assembler
1858      syntax, e.g. IBM as, Sun as, etc.
1859    
1860    ----------------------------------------------------------------------
1861    Name: Matthias Blume
1862    Date: 2001/11/14 11:52:00 EST
1863    Tag: blume-20011114-srcname
1864    Description:
1865    
1866    1. Routed the name of the current source file to mlriscgen where it
1867       should be directly emitted into the code object.  (This last part
1868       is yet to be done.)
1869    
1870    2. Some cleanup of the pgraph code to make it match the proposal that
1871       I put out the other day.  (The proposal notwithstanding, things are
1872       still in flux here.)
1873    
1874    ----------------------------------------------------------------------
1875    Name: Lal George
1876    Date: 2001/11/14 09:44:04 EST
1877    Tag:
1878    Description:
1879    
1880      Fix for a backpatching bug reported by Allen.
1881    
1882      Because the boundary between short and long span-dependent
1883      instructions is +/- 128, there are an astounding number of
1884      span-dependent instructions whose size is over estimated.
1885    
1886      Allen came up with the idea of letting the size of span
1887      dependent instructions be non-monotonic, for a maxIter
1888      number of times, after which the size must be monotonically
1889      increasing.
1890    
1891      This table shows the number of span-dependent instructions
1892      whose size was over-estimated as a function of maxIter, for the
1893      file Parse/parse/ml.grm.sml:
1894    
1895         maxIter            # of instructions:
1896            10                      687
1897            20                      438
1898            30                      198
1899            40                        0
1900    
1901      In compiling the compiler, there is no significant difference in
1902      compilation speed between maxIter=10 and maxIter=40. Actually,
1903      my measurements showed that maxIter=40 was a tad faster than
1904      maxIter=10! Also 96% of the  files in the compiler reach a fix
1905      point within 13 iterations, so fixing maxIter at 40, while high,
1906      is okay.
1907    
1908    ----------------------------------------------------------------------
1909    Name: Matthias Blume
1910    Date: 2001/10/31 15:25:00 EST
1911    Tag: blume-20011031-pgraph
1912    Description:
1913    
1914    CKIT:
1915    * Changed the "Function" constructor of type Ast.ctype to carry optional
1916      argument identifiers.
1917    * Changed the return type of TypeUtil.getFunction accordingly.
1918    * Type equality ignores the argument names.
1919    * TypeUtil.composite tries to preserve argument names but gives up quickly
1920      if there is a mismatch.
1921    
1922    installation script:
1923    * attempts to use "curl" if available (unless "wget" is available as well)
1924    
1925    CM:
1926    * has an experimental implementation of "portable graphs" which I will
1927      soon propose as an implementation-independent library format
1928    * there are also new libraries $/pgraph.cm and $/pgraph-util.cm
1929    
1930    NLFFI-LIB:
1931    * some cleanup (all cosmetic)
1932    
1933    NLFFIGEN:
1934    * temporarily disabled the mechanism that suppresses ML output for
1935      C definitions whose identifiers start with an underscore character
1936    * generate val bindings for enum constants
1937    * user can request that only one style (light or heavy) is being used;
1938      default is to use both (command-line arguments: -heavy and -light)
1939    * fixed bug in handling of function types involving incomplete pointers
1940    * generate ML entry points that take record arguments (i.e., using
1941      named arguments) for C functions that have a prototype with named
1942      arguments
1943      (see changes to CKIT)
1944    
1945    ----------------------------------------------------------------------
1946    Name: Allen Leung
1947    Date: 2001/10/27 20:34:00 EDT
1948    Tag: leunga-20011027-x86-fast-fp-call
1949    Description:
1950    
1951       Fixed the bug described in blume-20010920-slowfp.
1952    
1953       The fix involves
1954          1. generating FCOPYs in FSTP in ia32-svid
1955          2. marking a CALL with the appropriate annotation
1956    
1957    ----------------------------------------------------------------------
1958    Name: Matthias Blume
1959    Date: 2001/10/16 11:32:00 EDT
1960    Tag: blume-20011016-netbsd
1961    Description:
1962    
1963    Underscore patch from Chris Richards (fixing problem with compiling
1964    runtime system under recent NetBSD).
1965    
1966    ----------------------------------------------------------------------
1967    Name: Allen Leung
1968    Date: 2001/10/12 17:18:32 EDT 2001
1969    Tag: leung-20011012-x86-printflowgraph
1970    Description:
1971    
1972    X86RA now uses a valid (instead of dummy) PrintFlowgraph module.
1973    
1974    ----------------------------------------------------------------------
1975    Name: Lal George
1976    Date: 2001/10/11 23:51:34 EDT
1977    Tag: george-20011011-too-many-instrs
1978    Description:
1979    
1980    The representation of a program point never expected to see more
1981    than 65536 instructions in a basic block!
1982    
1983    ----------------------------------------------------------------------
1984    Name: Lal George
1985    Date: 2001/10/09 09:41:37 EDT
1986    Tag: george-20011008-mlrisc-labels
1987    Description:
1988    
1989    Changed the machine description files to support printing of
1990    local and global labels in assembly code, based on host assembler
1991    conventions.
1992    
1993    ----------------------------------------------------------------------
1994    Name: Matthias Blume
1995    Date: 2001/09/25 15:25:00 EDT
1996    Tag: blume-20010925-exninfo
1997    Description:
1998    
1999    I provided a non-hook implementation of exnName (at the toplevel) and
2000    made the "dummy" implementation of exnMessage (at the toplevel) more
2001    useful: if nothing gets "hooked in", then at least you are going to
2002    see the exception name and a message indicating why you don't see more.
2003    
2004    [For the time being, programs that need exnMessage and want to use
2005    ml-build should either use General.exnMessage (strongly recommended) or
2006    refer to structure General at some other point so that CM sees a
2007    static dependency.]
2008    
2009    [Similar remarks go for "print" and "use":  If you want to use their
2010    functionality in stand-alone programs generated by ml-build, then use
2011    TextIO.output and Backend.Interact.useFile (from $smlnj/compiler.cm).]
2012    
2013    ----------------------------------------------------------------------
2014    Name: Matthias Blume
2015    Date: 2001/09/20 17:28:00 EDT
2016    Tag: blume-20010920-slowfp
2017    Description:
2018    
2019    Allen says that x86-fast-fp is not safe yet, so I turned it off again...
2020    
2021    ----------------------------------------------------------------------
2022    Name: Matthias Blume
2023    Date: 2001/09/20 17:20:00 EDT
2024    Tag: blume-20010920-canonicalpaths
2025    Description:
2026    
2027    0. Updated the BOOT file (something that I forgot to do earlier).
2028    
2029    1. Small internal change to CM so that it avoids "/../" in filenames
2030       as much as possible (but only where it is safe).
2031    
2032    2. Changed config/_run-sml (resulting in a changed bin/.run-sml) so
2033       that arguments that contain delimiters are passed through correctly.
2034       This change also means that all "special" arguments of the form
2035       @SMLxxx... must come first.
2036    
2037    3. Changed install script to put relative anchor names for tool commands
2038       into pathconfig.
2039    
2040    ----------------------------------------------------------------------
2041    Name: Matthias Blume
2042    Date: 2001/09/18 15:35:00 EDT
2043    Tag: blume-20010918-readme11036
2044    Description:
2045    
2046    Added README files.
2047    
2048    ----------------------------------------------------------------------
2049    Name: Matthias Blume
2050    Date: 2001/09/18 11:45:00 EDT
2051    Tag: Release_110_36 (retag)
2052    Description:
2053    
2054    Fixed mistake in config/preloads. Retagged as 110.36.
2055    
2056    ----------------------------------------------------------------------
2057    Name: Matthias Blume
2058    Date: 2001/09/18 09:40:00 EDT
2059    Tag: Release_110_36_orig (tag changed)
2060    Description:
2061    
2062    New version (110.36).  New bootfiles.
2063    
2064    ----------------------------------------------------------------------
2065    Name: Matthias Blume
2066    Date: 2001/09/14 16:15:00 EDT
2067    Tag: blume-20010914-x86fastfp
2068    Description:
2069    
2070    John committed some changes that Allen made, in particular a (hopefully)
2071    correctly working version of the x86-fp module.
2072    
2073    I changed the default setting of the Control.MLRISC.getFlag "x86-fast-fp"
2074    flag to "true".  Everything seems to compile to a fixpoint ok, and
2075    "mandelbrot" speeds up by about 15%.
2076    
2077    ----------------------------------------------------------------------
2078    Name: Matthias Blume
2079    Date: 2001/09/13 11:20:00 EDT
2080    Tag: blume-20010913-minimal
2081    Description:
2082    
2083    1. Stefan Monnier's patch to fix a miscompilation problem that
2084       was brought to light by John Reppy's work on Moby.
2085    
2086    2. Implemented a minimal "structure Compiler" that contains just
2087       "version" and "architecture".  The minimal version will be
2088       available when the full version is not.  This is for backward-
2089       compatibility with code that wants to test Compiler.version.
2090    
2091    ----------------------------------------------------------------------
2092    Name: Matthias Blume
2093    Date: 2001/08/28 14:03:00 EDT
2094    Tag: blume-20010828-ml-lex
2095    Description:
2096    
2097    Fix for bug 1581, received from Neophytos Michael.
2098    
2099    ----------------------------------------------------------------------
2100    Name: Matthias Blume
2101    Date: 2001/08/27 11:20:00 EDT
2102    Tag: blume-20010827-readme11035
2103    Description:
2104    
2105    Fleshed out the README file for 110.35.
2106    
2107    ----------------------------------------------------------------------
2108    Name: Matthias Blume
2109    Date: 2001/08/24 17:10:00 EDT
2110    Tag: Release_110_35
2111    Description:
2112    
2113    New version number (110.35).  New bootfiles.
2114    
2115    ----------------------------------------------------------------------
2116    Name: Lal George
2117    Date: 2001/08/24 13:47:18 EDT 2001
2118    Tag: george-20010824-MLRISC-graphs
2119    Description:
2120    
2121     removed clusters from MLRISC completely and replaced with graphs.
2122    
2123    ----------------------------------------------------------------------
2124    Name: Matthias Blume
2125    Date: 2001/08/23 17:50:00 EDT
2126    Tag: blume-20010823-toplevel
2127    Description:
2128    
2129    - some reorganization of the code that implements various kinds of
2130      environments in the compiler (static, dynamic, symbolic, combined)
2131    - re-implemented the EnvRef module so that evalStream works properly
2132      (if the stream contains references to "use", "CM.make", etc.)
2133    - cleaned up evalloop.sml and interact.sml (but they need more cleaning)
2134    
2135    ----------------------------------------------------------------------
2136    Name: Matthias Blume
2137    Date: 2001/08/20 15:50 EDT
2138    Tag: blume20010820-slipup
2139    Description:
2140    
2141    I forgot to commit a few files.  Here they are...
2142    
2143    ----------------------------------------------------------------------
2144    Name: Matthias Blume
2145    Date: 2001/08/20 15:35:00 EDT
2146    Tag: blume-20010820-debugprof
2147    Description:
2148    
2149    !!!! NEW BOOTFILES !!!!
2150    
2151    This is another round of reorganizing the compiler sources.  This
2152    time the main goal was to factor out all the "instrumentation"
2153    passes (for profiling and backtracing) into their own library.
2154    The difficulty was to do it in such a way that it does not depend
2155    on elaborate.cm but only on elabdata.cm.
2156    
2157    Therefore there have been further changes to both elaborate.cm and
2158    elabdata.cm -- more "generic" things have been moved from the former
2159    to the latter.  As a result, I was forced to split the assignment
2160    of numbers indicating "primtyc"s into two portions: SML-generic and
2161    SML/NJ-specific.  Since it would have been awkward to maintain,
2162    I bit the bullet and actually _changed_ the mapping between these
2163    numbers and primtycs.  The bottom line of this is that you need
2164    a new set of bin- and bootfiles.
2165    
2166    I have built new bootfiles for all architectures, so doing a fresh
2167    checkout and config/install.sh should be all you need.
2168    
2169    The newly created library's name is
2170    
2171        $smlnj/viscomp/debugprof.cm
2172    
2173    and its sources live under
2174    
2175        src/compiler/DebugProf
2176    
2177    ----------------------------------------------------------------------
2178    Name: Matthias Blume
2179    Date: 2001/08/15 17:15:00 EDT
2180    Tag: blume-20010815-compreorg
2181    Description:
2182    
2183    This is a first cut at reorganizing the CM libraries that make up the
2184    core of the compiler.  The idea is to separate out pieces that could
2185    be used independently by tools, e.g., the parser, the typechecker, etc.
2186    
2187    The current status is a step in this direction, but it is not quite
2188    satisfactory yet.  Expect more changes in the future.
2189    
2190    Here is the current (new) organization...
2191    
2192        What used to be $smlnj/viscomp/core.cm is now divided into
2193        six CM libraries:
2194    
2195             $smlnj/viscomp/basics.cm
2196                           /parser.cm
2197                           /elabdata.cm
2198                           /elaborate.cm
2199                           /execute.cm
2200                           /core.cm
2201    
2202        The CM files for these libraries live under src/system/smlnj/viscomp.
2203        All these libraries are proxy libraries that contain precisely
2204        one CM library component.  Here are the locations of the components
2205        (all within the src/compiler tree):
2206    
2207             Basics/basics.cm
2208             Parse/parser.cm
2209             ElabData/elabdata.cm
2210             Elaborator/elaborate.cm
2211             Execution/execute.cm
2212             core.cm
2213    
2214         [This organization is the same that has been used already
2215         for a while for the architecture-specific parts of the visible
2216         compiler and for the old version of core.cm.]
2217    
2218         As you will notice, many source files have been moved from their
2219         respective original locations to a new home in one of the above
2220         subtrees.
2221    
2222         The division of labor between the new libraries is the following:
2223    
2224             basics.cm:
2225                - Simple, basic definitions that pertain to many (or all) of
2226                  the other libraries.
2227             parser.cm:
2228                - The SML parser, producing output of type Ast.dec.
2229                - The type family for Ast is also defined and exported here.
2230             elabdata.cm:
2231                - The datatypes that describe input and output of the elaborator.
2232                  This includes types, absyn, and static environments.
2233             elaborator.cm:
2234                - The SML/NJ type checker and elaborator.
2235                  This maps an Ast.dec (with a given static environment) to
2236                  an Absyn.dec (with a new static environment).
2237                - This libraries implements certain modules that used to be
2238                  structures as functors (to remove dependencies on FLINT).
2239             execute.cm:
2240                - Everything having to do with executing binary code objects.
2241                - Dynamic environments.
2242             core.cm:
2243                - SML/NJ-specific instantiations of the elaborator and MLRISC.
2244                - Top-level modules.
2245                - FLINT (this should eventually become its own library)
2246    
2247    Notes:
2248    
2249    I am not 100% happy with the way I separated the elaborator (and its
2250    data structures) from FLINT.  Two instances of the same problem:
2251    
2252        1. Data structures contain certain fields that carry FLINT-specific
2253           information.  I hacked around this using exn and the property list
2254           module from smlnj-lib.  But the fact that there are middle-end
2255           specific fields around at all is a bit annoying.
2256    
2257        2. The elaborator calculates certain FLINT-related information.  I tried
2258           to make this as abstract as I could using functorization, but, again,
2259           the fact that the elaborator has to perform calculations on behalf
2260           of the middle-end at all is not nice.
2261    
2262        3. Having to used exn and property lists is unfortunate because it
2263           weakens type checking.  The other alternative (parameterizing
2264           nearly *everything*) is not appealing, though.
2265    
2266    I removed the "rebinding =" warning hack because due to the new organization
2267    it was awkward to maintain it.  As a result, the compiler now issues some of
2268    these warnings when compiling init.cmi during bootstrap compilation. On
2269    the plus side, you also get a warning when you do, for example:
2270       val op = = Int32.+
2271    which was not the case up to now.
2272    
2273    I placed "assign" and "deref" into the _Core structure so that the
2274    code that deals with the "lazy" keyword can find them there.  This
2275    removes the need for having access to the primitive environment
2276    during elaboration.
2277    
2278    ----------------------------------------------------------------------
2279    Name: Matthias Blume
2280    Date: 2001/08/13
2281    Tag: blume-20010813-closures
2282    Description:
2283    
2284    This fix was sent to us by Zhong Shao.  It is supposed to improve the
2285    performance of certain loops by avoiding needless closure allocation.
2286    
2287    ----------------------------------------------------------------------
2288    Name: Lal George
2289    Date: 2001/07/31 10:03:23 EDT 2001
2290    Tag: george-20010731-x86-fmalloc
2291    Description: Fixed bug in x86 calls
2292    
2293        There was a bug where call instructions would mysteriously
2294        vanish. The call instruction had to be one that returned
2295        a floating point value.
2296    
2297    ----------------------------------------------------------------------
2298    Name: Lal George
2299    Date: 2001/07/19 16:36:29 EDT 2001
2300    Tag: george-20010719-simple-cells
2301    Description:
2302    
2303    I have dramatically simplified the interface for CELLS in MLRISC.
2304    
2305    In summary, the cells interface is broken up into three parts:
2306    
2307      1. CellsBasis : CELLS_BASIS
2308    
2309            CellsBasis is a top level structure and common for all
2310            architectures.  it contains the definitions of basic datatypes
2311            and utility  functions over these types.
2312    
2313      2. functor Cells() : CELLS
2314    
2315            Cells generates an interface for CELLS that incorporates the
2316            specific resources on the target architecture, such as the
2317            presence of special register classes, their number and size,
2318            and various useful substructures.
2319    
2320      3. <ARCH>CELLS
2321    
2322            e.g. SparcCells: SPARCCELLS
2323    
2324            <ARCH>CELLS usually contains additional bindings for special
2325            registers  on the architecture, such as:
2326    
2327                    val r0 : cell           (* register zero *)
2328                    val y : cell            (* Y register *)
2329                    val psr : cell          (* processor status register *)
2330                    ...
2331    
2332            The structure returned by applying the Cells functor is opened
2333            in this interface.
2334    
2335    The main implication of all this is that the datatypes for cells is
2336    split between CellsBasis and CELLS -- a fairly simple change for user
2337    code.
2338    
2339    In the old scheme the CELLS interface had a definitional binding of
2340    the form:
2341    
2342            signature CELLS = sig
2343    
2344               structure CellsBasis = CellsBasis
2345    
2346               ...
2347    
2348            end
2349    
2350    With all the sharing constraints that goes on in MLRISC, this old
2351    design  quickly leads to errors such as:
2352    
2353            "structure definition spec inside of sharing ... "
2354    
2355    
2356    and appears to require an unacceptable amount of sharing and where
2357    constraint hackery.
2358    
2359    I think this error message (the interaction of definitional specs and
2360    sharing) requires more explanation on our web page.
2361    
2362    ----------------------------------------------------------------------
2363    Name: Matthias Blume
2364    Date: 2001/07/19 15:00:00 EDT
2365    Tag: blume-20010719-libreorg
2366    Description:
2367    
2368    This update puts together a fairly extensive but straightforward change
2369    to the way the libraries that implement the interactive system are
2370    organized:
2371    
2372       The biggest change is the elimination of structure Compiler.  As a
2373       replacement for this structure, there is now a CM library
2374       (known as $smlnj/compiler.cm or $smlnj/compiler/current.cm)
2375       that exports all the substructures of the original structure Compiler
2376       directly.  So instead of saying Compiler.Foo.bar one now simply
2377       says Foo.bar.  (The CM libraries actually export a collection of
2378       structures that is richer than the collection of substructures of
2379       structure Compiler.)
2380    
2381       To make the transition smooth, there is a separate library called
2382       $smlnj/compiler/compiler.cm which puts together and exports the
2383       original structure Compiler (or at least something very close to it).
2384    
2385       There are five members of the original structure Compiler
2386       that are not exported directly but which instead became members
2387       of a new structure Backend (described by signature BACKEND).  These are:
2388       structure Profile (: PROFILE), structure Compile (: COMPILE), structure
2389       Interact (: INTERACT), structure Machine (: MACHINE), and val
2390       architecture (: string).
2391    
2392       Structure Compiler.Version has become structure CompilerVersion.
2393    
2394       Cross-compilers for alpha32, hppa, ppc, sparc, and x86 are provided
2395       by $smlnj/compiler/<arch>.cm where <arch> is alpha32, hppa, ppc, sparc,
2396       or x86, respectively.
2397       Each of these exports the same frontend structures that
2398       $smlnj/compiler.cm exports.  But they do not have a structure Backend
2399       and instead export some structure <Arch>Backend where <Arch> is Alpha32,
2400       Hppa, PPC, Sparc, or X86, respectively.
2401    
2402       Library $smlnj/compiler/all.cm exports the union of the exports of
2403       $smlnj/compiler/<arch>.cm
2404    
2405       There are no structures <Arch>Compiler anymore, use
2406       $smlnj/compiler/<arch>.cm instead.
2407    
2408       Library host-compiler-0.cm is gone.  Instead, the internal library
2409       that instantiates CM is now called cm0.cm.  Selection of the host
2410       compiler (backend) is no longer done here but. (Responsibility for it
2411       now lies with $smlnj/compiler/current.cm.  This seems to be more
2412       logical.)
2413    
2414       Many individual files have been moved or renamed.  Some files have
2415       been split into multiple files, and some "dead" files have been deleted.
2416    
2417    Aside from these changes to library organization, there are also changes
2418    to the way the code itself is organized:
2419    
2420       Structure Binfile has been re-implemented in such a way that it no
2421       longer needs any knowledge of the compiler.  It exclusively deals
2422       with the details of binfile layout.  It no longer invokes the
2423       compiler (for the purpose of creating new prospective binfile
2424       content), and it no longer has any knowledge of how to interpret
2425       pickles.
2426    
2427       Structure Compile (: COMPILE) has been stripped down to the bare
2428       essentials of compilation.  It no longer deals with linking/execution.
2429       The interface has been cleaned up considerably.
2430    
2431       Utility routines for dealing with linking and execution have been
2432       moved into their own substructures.
2433    
2434       (The ultimate goal of these changes is to provide a light-weight
2435       binfile loader/linker (at least for, e.g., stable libraries) that
2436       does not require CM or the compiler to be present.)
2437    
2438    CM documentation has been updated to reflect the changes to library
2439    organization.
2440    
2441    ----------------------------------------------------------------------
2442    Name: Matthias Blume
2443    Date: 2001/07/10 17:30:00 EDT
2444    Tag: Release_110_34
2445    Description:
2446    
2447    Minor tweak to 110.34 (re-tagged):
2448    
2449      - README.html file added to CVS repository
2450      - runtime compiles properly under FreeBSD 3.X and 4.X
2451    
2452    ----------------------------------------------------------------------
2453    Name: Matthias Blume
2454    Date: 2001/07/10 17:30:00 EDT
2455    Tag: Release_110_34
2456    Description:
2457    
2458    New version number (110.34). New bootfiles.
2459    
2460    ----------------------------------------------------------------------
2461    Name: Matthias Blume
2462    Date: 2001/07/09 16:00:00 EDT
2463    Tag: blume-20010709-more-varargs
2464    Description:
2465    
2466    I changed the handling of varargs in ml-nlffigen again:
2467    The ellipsis ... will now simply be ignored (with an accompanying warning).
2468    
2469    The immediate effect is that you can actually call a varargs function
2470    from ML -- but you can't actually supply any arguments beyond the ones
2471    specified explicitly.  (For example, you can call printf with its format
2472    string, but you cannot pass additional arguments.)
2473    
2474    This behavior is only marginally more useful than the one before, but
2475    it has the advantage that a function or, more importantly, a function
2476    type never gets dropped on the floor, thus avoiding follow-up problems with
2477    other types that refer to the offending one.
2478    
2479    ----------------------------------------------------------------------
2480    Name: Matthias Blume
2481    Date: 2001/07/09 11:25:00 EDT
2482    Tag: blume-20010709-varargs
2483    Description:
2484    
2485    1. ckit-lib.cm now exports structure Error
2486    2. ml-nlffigen reports occurences of "..." (i.e., varargs function types)
2487       with a warning accompanied by a source location.  Moreover, it
2488       merely skips the offending function or type and proceeds with the
2489       rest of its work.u  As a result, one can safely feed C code containing
2490       "..." to ml-nlffigen.
2491    3. There are some internal improvements to CM, providing slightly
2492       more general string substitutions in the tools subsystem.
2493    
2494    ----------------------------------------------------------------------
2495    Name: Matthias Blume
2496    Date: 2001/06/27 15:10:00 EDT
2497    Tag: blume-20010627-concur
2498    Description:
2499    
2500    Fixed a small bug in CM's handling of parallel compilation.
2501    (You could observe the bug by Control-C-interrupting an ordinary
2502    CMB.make or CM.stabilize and then attaching some compile servers.
2503    The result was that all of a sudden the previously interrupted
2504    compilation would continue on its own.  This was because of
2505    an over-optimization: CM did not bother to clean out certain queues
2506    when no servers were attached "anyway", resulting in the contents
2507    of these queues to grab control when new servers did get attached.)
2508    
2509    There is also another minor update to the CM manual.
2510    
2511    ----------------------------------------------------------------------
2512    Name: Matthias Blume
2513    Date: 2001/06/26 16:15:00 EDT
2514    Tag: blume-20010626-cmdoc
2515    Description:
2516    
2517    Minor typo fixed in CM manual (syntax diagram for libraries).
2518    
2519    ----------------------------------------------------------------------
2520    Name: Matthias Blume
2521    Date: 2001/06/25 22:55:00 EDT
2522    Tag: blume-20010625-x86pc
2523    Description:
2524    
2525    Fixed a nasty bug in the X86 assembly code that caused signal
2526    handlers to fail (crash) randomly.
2527    
2528    ----------------------------------------------------------------------
2529    Name: Matthias Blume
2530    Date: 2001/06/25 12:05:00 EDT
2531    Tag: blume-20010625-nlffigen
2532    Description:
2533    
2534    This update fixes a number of minor bugs in ml-nlffigen as reported by
2535    Nick Carter <nbc@andrew.cmu.edu>.
2536    
2537      1. Silly but ok typedefs of the form "typedef void myvoid;" are now accepted.
2538      2. Default names for generated files are now derived from the name of
2539         the C file *without its directory*.  In particular, this causes generated
2540         files to be placed locally even if the C file is in some system directory.
2541      3. Default names for generated signatures and structures are also derived
2542         from the C file name without its directory.  This avoids silly things
2543         like "structure GL/GL".
2544         (Other silly names are still possible because ml-nlffigen does not do
2545          a thorough check of whether generated names are legal ML identifiers.
2546          When in doubt, use command line arguments to force particular names.)
2547    
2548    ----------------------------------------------------------------------
2549    Name: Matthias Blume
2550    Date: 2001/06/21 12:25:00 EDT
2551    Tag: blume-20010621-eXene
2552    Description:
2553    
2554    eXene now compiles and (sort of) works again.
2555    
2556    The library name (for version > 110.33) is $/eXene.cm.
2557    
2558    I also added an new example in src/eXene/examples/nbody.  See the
2559    README file there for details.
2560    
2561    ----------------------------------------------------------------------
2562    Name: Matthias Blume
2563    Date: 2001/06/20 16:40:00 EDT
2564    Tag: blume-20010620-cml
2565    Description:
2566    
2567    CML now compiles and works again.
2568    
2569    Libraries (for version > 110.33):
2570    
2571      $cml/cml.cm            Main CML library.
2572      $cml/basis.cm          CML's version of $/basis.cm.
2573      $cml/cml-internal.cm   Internal helper library.
2574      $cml/core-cml.cm       Internal helper library.
2575      $cml-lib/trace-cml.cm  Tracing facility.
2576      $cml-lib/smlnj-lib.cm  CML's version of $/smlnj-lib.cm
2577    
2578    The installer (config/install.sh) has been taught how to properly
2579    install this stuff.
2580    
2581    ----------------------------------------------------------------------
2582    Name: Matthias Blume
2583    Date: 2001/06/19 17:55:00 EDT
2584    Tag: blume-20010619-instantiate
2585    Description:
2586    
2587    This un-breaks the fix for bug 1432.
2588    (The bug was originally fixed in 110.9 but I broke it again some
2589    time after that.)
2590    
2591    ----------------------------------------------------------------------
2592    Name: Matthias Blume
2593    Date: 2001/06/19 17:25:00 EDT
2594    Tag: blume-20010619-signals
2595    Description:
2596    
2597    This should (hopefully) fix the long-standing signal handling bug.
2598    (The runtime system was constructing a continuation record with an
2599    incorrect descriptor which would cause the GC to drop data on the floor...)
2600    
2601    ----------------------------------------------------------------------
2602    Name: Matthias Blume
2603    Date: 2001/06/15 15:05:00 EDT
2604    Tag: blume-20010615-moresparc
2605    Description:
2606    
2607    Here is a short late-hour update related to Sparc c-calls:
2608    
2609     -- made handling of double-word arguments a bit smarter
2610    
2611     -- instruction selection phase tries to collapse certain clumsily
2612        constructed ML-Trees; typical example:
2613    
2614            ADD(ty,ADD(_,e,LI d1),LI d2)  ->  ADD(ty,e,LI(d1+d2))
2615    
2616        This currently has no further impact on SML/NJ since mlriscGen does
2617        not seem to generate such patterns in the first place, and c-calls
2618        (which did generate them in the beginning) has meanwhile been fixed
2619        so as to avoid them as well.
2620    
2621    ----------------------------------------------------------------------
2622    Name: Matthias Blume
2623    Date: 2001/06/15 15:05:00 EDT
2624    Tag: blume-20010615-sparc
2625    Description:
2626    
2627    The purpose of this update is to provide an implementation of NLFFI
2628    on Sparc machines.
2629    
2630    Here are the changes in detail:
2631    
2632       * src/MLRISC/sparc/c-calls/sparc-c-calls.sml is a new file containing
2633       the Sparc implementation of the c-calls API.
2634       * The Sparc backend of SML/NJ has been modified to uniformely use %fp
2635       for accessing the ML frame.  Thus, we have a real frame pointer and
2636       can freely modify %sp without need for an omit-frame-ptr phase.
2637       The vfp logic in src/compiler/CodeGen/* has been changed to accomodate
2638       this case.
2639       * ml-nlffigen has been taught to produce code for different architectures
2640       and calling conventions.
2641       * In a way similar to what was done in the x86 case, the Sparc
2642       backend uses its own specific extension to mltree.  (For example,
2643       it needs to be able to generate UNIMP instructions which are part
2644       of the calling convention.)
2645       * ml-nlffi-lib was reorganized to make it more modular (in particular,
2646       to make it easier to plug in new machine- and os-dependent parts).
2647    
2648    There are some other fairly unrelated bug fixes and cleanups as well:
2649    
2650       * I further hacked the .cm files for MLRISC tools (like MDLGen) so
2651       that they properly share their libraries with existing SML/NJ libraries.
2652       * I fixed a minor cosmetic bug in CM, supressing certain spurious
2653       follow-up error messages.
2654       * Updates to CM/CMB documentation.
2655    
2656    TODO items:
2657    
2658       * MLRISC should use a different register as its asmTemp on the Sparc.
2659         (The current %o2 is a really bad choice because it is part of the
2660         calling conventions, so things might interfere in unexpected ways.)
2661    
2662    ----------------------------------------------------------------------
2663    Name: Matthias Blume
2664    Date: 2001/06/07
2665    Tag: blume-20010607-calls
2666    Description:
2667    
2668    A number of internal changes related to C calls and calling conventions:
2669    
2670    1. ML-Tree CALL statements now carry a "pops" field.  It indicates the
2671       number of bytes popped implicitly (by the callee).  In most cases
2672       this field is 0 but on x86/win32 it is some non-zero value.  This
2673       is information provided for the benefit of the "omit-frameptr" pass.
2674    2. The CALL instruction on the x86 carries a similar "pops" field.
2675       The instruction selection phase copies its value from the ML-Tree
2676       CALL statement.
2677    3. On all other architectures, the instruction selection phase checks
2678       whether "pops=0" and complains if not.
2679    4. The c-calls implementation for x86 now accepts two calling conventions:
2680       "ccall" and "stdcall".  When "ccall" is selected, the caller cleans
2681       up after the call and pops is set to 0.  For "stdcall", the caller
2682       does nothing, leaving the cleanup to the callee; pops is set to
2683       the number of bytes that were pushed onto the stack.
2684    5. The cproto decoder (compiler/Semant/types/cproto.sml) now can
2685       distinguish between "ccall" and "stdcall".
2686    6. The UNIMP instruction has been added to the supported Sparc instruction
2687       set. (This is needed for implementing the official C calling convention
2688       on this architecture.)
2689    7. I fixed some of the .cm files under src/MLRISC/Tools to make them
2690       work with the latest CM.
2691    
2692    ----------------------------------------------------------------------
2693    Name: Matthias Blume
2694    Date: 2001/06/05 15:10:00 EDT
2695    Tag: blume-20010605-cm-index
2696    Description:
2697    
2698    0. The "lambdasplit" parameter for class "sml" in CM has been documented.
2699    
2700    1. CM can now generate "index files".  These are human-readable files
2701       that list on a per-.cm-file basis each toplevel symbol defined or
2702       imported.  The location of the index file for
2703       <p>/<d>.cm is <p>/CM/INDEX/<d>.cm.
2704       To enable index-file generation, set CM.Control.generate_index to true
2705       or export an environment-symbol: export CM_GENERATE_INDEX=true.
2706    
2707       The CM manual has been updated accordingly.
2708    
2709    2. I made some slight modifications to the c-calls API in MLRISC.
2710    
2711         a) There is now a callback to support saving/restoring of
2712            dedicated but caller-save registers around the actual call
2713            instruction.
2714         b) One can optionally specify a comment-annotation for the
2715            call instruction.
2716    
2717    3. SML/NJ (mlriscGen.sml) uses this new API for the rawccall primop.
2718       (For example, the comment annotation shows the C prototype of
2719        the function being called.)
2720    
2721    ----------------------------------------------------------------------
2722    Name: Matthias Blume
2723    Date: 2001/06/01 13:30:00 EDT
2724    Tag: blume-20010601-nlffi-cleanup
2725    Description:
2726    
2727    This is mostly a cleanup of MLFFI stuff:
2728    
2729       - some signature files have been put into a more exposed place
2730       - the ugly 'f type parameter is gone (simplifies types tremendously!)
2731       - ml-nlffigen changed accordingly
2732       - tutorial updated
2733    
2734    Other changes:
2735    
2736       - author's affiliation in CM manual(s) updated
2737       - some more recognized keywords added to Allen's sml.sty
2738    
2739    ----------------------------------------------------------------------
2740    Name: Matthias Blume
2741    Date: 2001/05/25 15:30:00 EDT
2742    Tag: blume-20010525-iptr
2743    Description:
2744    
2745      - put the official 110.33-README (as it appears on the ftp server) under
2746        CVS
2747      - fixed a small bug related to incomplete pointer types in
2748        ml-nlffigen
2749      - small cosmetic change to the ml-nlffi-lib's "arr" type constructor
2750        (it does not need the 'f type parameter)
2751    
2752    ----------------------------------------------------------------------
2753    Name: Matthias Blume
2754    Date: 2001/05/23 14:30:00 EDT
2755    Tag: Release_110_33
2756    Description:
2757    
2758    New version number (110.33).  New bootfiles.
2759    
2760    ----------------------------------------------------------------------
2761    Name: Matthias Blume
2762    Date: 2001/05/22 18:06:00 EDT
2763    Tag: blume-20010522-targets
2764    Description:
2765    
2766    Made install.sh use file config/targets.customized if it exists, falling
2767    back to config/targets if it doesn't.  This way one can have a customized
2768    version of the targets file without touching the "real thing", thus
2769    eliminating the constant fear of accidentally checking something bogus
2770    back into the CVS repository...  (File config/targets.customized must
2771    not be added to the repository!)
2772    
2773    ----------------------------------------------------------------------
2774    Name: Matthias Blume
2775    Date: 2001/05/22 16:30:00 EDT
2776    Tag: blume-20010522-minitut
2777    Description:
2778    
2779    1. Bug fix in ml-nlffigen; now (hopefully) correctly handling
2780       struct returns.
2781    2. Added src/ml-nlffi-lib/Doc/mini-tutorial.txt.  This is some very
2782       incomplete, preliminary documentation for NLFFI.
2783    
2784    ----------------------------------------------------------------------
2785    Name: Matthias Blume
2786    Date: 2001/05/14 11:30:00 EDT
2787    Tag: blume-20010514-script
2788    Description:
2789    
2790    Some bugs in install script fixed.
2791    
2792    In addition to that I also made a slight change to the NLFFI API:
2793    Functors generated by ml-nlffigen now take the dynamic library as a
2794    straight functor argument, not as a suspended one.  (The original
2795    functor code used to force the suspension right away anyway, so there
2796    was nothing gained by this complication of the interface.)
2797    
2798    ----------------------------------------------------------------------
2799    Name: Matthias Blume
2800    Date: 2001/05/11 14:35:00 EDT
2801    Tag: blume-20010511-ml-nlffi
2802    Description:
2803    
2804    I finally took the plunge and added my new FFI code to the main
2805    repository.  For x86-linux it is now ready for prime-time.
2806    
2807    There are two new subdirectories of "src":
2808    
2809      - ml-nlffi-lib:
2810           The utility library for programs using the FFI interface.
2811           Here is the implementation of $/c.cm and its associated low-level
2812           partners $/c-int.cm and $/memory.cm.
2813      - ml-nlffigen:
2814           A stand-alone program for generating ML glue code from C source
2815           code.
2816    
2817    Building ml-nlffigen requires $/ckit-lib.cm.
2818    
2819    The config/install.sh script has been updates to do the Right Thing
2820    (hopefully).
2821    
2822    Notice that the source tree for the C-Kit will not be put under "src"
2823    but directly under the installation root directory.  (This is the
2824    structure that currently exists on the CVS server when you check out
2825    module "sml".)  Fortunately, config/install.sh knows about this oddity.
2826    
2827    Bugs: No documentation yet.
2828    
2829    ----------------------------------------------------------------------
2830    Name: Matthias Blume
2831    Date: 2001/05/09 16:35:00 EDT
2832    Tag: blume-20010509-cpscontract
2833    Description:
2834    
2835    Fixed a bug in the accounting code in cpsopt/contract.sml.  (The
2836    wrapper/unwrapper elimination did not decrement usage counts and some
2837    dead variables got overlooked by the dead-up logic.)
2838    
2839    ----------------------------------------------------------------------
2840    Name: Lal George
2841    Date: 2001/05/08  17:26:09 EDT
2842    Tag: george-20010508-omit-frameptr
2843    Description:
2844    
2845    Changes to implement the omit-frame-pointer optimization to support
2846    raw C calls. For now, there is only support on the Intel x86, but
2847    other architectures will follow as more experience is gained with this.
2848    
2849    
2850    ----------------------------------------------------------------------
2851    Name: Matthias Blume
2852    Date: 2001/05/07 14:40:00 EDT
2853    Tag: blume-20010507-proxies
2854    Description:
2855    
2856    I made into "proxy libraries" all libraries that qualify for such a
2857    change.  (A qualifying library is a library that has another library or
2858    groups as its sole member and repeats that member's export list
2859    verbatim.  A proxy library avoids this repetition by omitting its export
2860    list, effectively inheriting the list that its (only) member exports.
2861    See the CM manual for more explanation.)
2862    The main effect is that explicit export lists for these libraries
2863    do not have to be kepts in sync, making maintenance a bit easier.
2864    
2865    I also added copyright notices to many .cm-files.
2866    
2867    Last but not least, I made a new set of bootfiles.
2868    
2869    ----------------------------------------------------------------------
2870    Name: Matthias Blume
2871    Date: 2001/05/04 17:00:00 EDT
2872    Tag: blume-20010504-cm-lsplit
2873    Description:
2874    
2875    0. John merged pending changes to $/smlnj-lib.cm
2876    
2877    1. Allen's previous change accidentally backed out of one of Lal's
2878       earlier changes.  I undid this mistake (re-introducing Lal's change).
2879    
2880    2. I used the new topOrder' function from graph-scc.sml (from $/smlnj-lib.cm)
2881       within the compiler where applicable.  There is some code simplification
2882       because of that.
2883    
2884    3. The "split" phase (in FLINT) is now part of the default list of phases.
2885       Compiler.Control.LambdaSplitting.* can be used to globally control the
2886       lambda-splitting (cross-module-inlining) engine.  In addition to that,
2887       it can now also be controlled on a per-source basis: CM has been taught
2888       a new tool parameter applicable to ML source files.
2889    
2890       - To turn lambda-splitting off completely:
2891            local open Compiler.Control.LambdaSplitting in
2892                val _ = set Off
2893            end
2894       - To make "no lambda-splitting" the global default (but allow per-source
2895         overriding); this is the initial setting:
2896            local open Compiler.Control.LambdaSplitting in
2897                val _ = set (Default NONE)
2898            end
2899       - To make "lambda-splitting with aggressiveness a" the global default
2900         (and allow per-source overriding):
2901            local open Compiler.Control.LambdaSplitting in
2902                val _ = set (Default (SOME a))
2903            end
2904    
2905       - To turn lambda-splitting off for a given ML souce file (say: a.sml)
2906         write (in the respective .cm-file):
2907            a.sml (lambdasplitting:off)
2908       - To turn lambda-splitting for a.sml on with minimal aggressiveness:
2909            a.sml (lambdasplitting:on)
2910       - To turn lambda-splitting for a.sml on with aggressiveness <a> (where
2911         <a> is a decimal non-negative integer):
2912            a.sml (lambdasplitting:<a>)
2913       - To turn lambda-splitting for a.sml on with maximal aggressiveness:
2914            a.sml (lambdasplitting:infinity)
2915       - To use the global default for a.sml:
2916            a.sml (lambdasplitting:default)
2917         or simply
2918            a.sml
2919    
2920    ----------------------------------------------------------------------
2921    Name: Allen Leung
2922    Date: 2001/05/04 01:57:00 EDT
2923    Tag: leunga-20010504-sync
2924    Description:
2925    
2926      MLRISC features.
2927    
2928      1. Fix to CMPXCHG instructions.
2929      2. Changed RA interface to allow annotations in callbacks.
2930      3. Added a new method to the stream interface to allow annotations updates.
2931    
2932    ----------------------------------------------------------------------
2933    Name: Matthias Blume
2934    Date: 2001/05/01 11:45:00 EDT
2935    Tag: blume-20010501-pcedittmp
2936    Description:
2937    
2938    Changed install.sh to use the current working directory instead of
2939    /usr/tmp for a temporary file (pcedittmp).  The previous choice
2940    of /usr/tmp caused trouble with MacOS X because of file premission
2941    problems.
2942    
2943    ----------------------------------------------------------------------
2944    Name: Matthias Blume
2945    Date: 2001/04/20 11:10:00 EDT
2946    Tag: blume-20010420-inMLflag
2947    Description:
2948    
2949     - added vp_limitPtrMask to vproc-state.h
2950       (for use by the raw-C-calls mechanism to implement proper interrupt
2951        handling)
2952     - made the ML compiler aware of various data-structure offsets so it
2953       can generate code for accessing the vp_inML flag and vp_limitPtrMask
2954     - tweaked mlriscGen.sml to have it emit interrupt-handling code for
2955       raw C-calls
2956    
2957    ----------------------------------------------------------------------
2958    Name: Lal George
2959    Date: 2001/04/20 09:15:28 EDT
2960    Tag: george-20010420-macosX
2961    Description:
2962    
2963     - Changes to port to Mac OS X; Darwin.
2964    
2965     - In the process I found that sqrt was broken on the PPC, because the
2966       fsqrt instruction is not implemented.
2967    
2968    ----------------------------------------------------------------------
2969    Name: Matthias Blume
2970    Date: 2001/04/18 12:45:00 EDT
2971    Tag: blume-20010418-ccalls
2972    Description:
2973    
2974     - fixed two off-by-4 errors in the x86-specific c-calls implementation
2975       (this bug prevented structure arguments containing pointers from being
2976        passed correctly)
2977     - changed the raw-C-call code in mlriscGen.sml in such a way that
2978       structure arguments are represented as a pointer to the beginning
2979       of the structure (instead of having a series of synthesized arguments,
2980       one for each structure member)
2981    
2982     - made makeml script's verbosity level configurable via environment
2983       variable (MAKEML_VERBOSITY)
2984    
2985     - eliminated placeholder implementations for f32l, w16s, i16s, and f32s
2986       in rawmem-x86.sml; we are now using the real thing
2987    
2988    ----------------------------------------------------------------------
2989    Name: Matthias Blume
2990    Date: 2001/03/22 16:25:00 EST
2991    Tag: blume-20010322-bootfiles
2992    Description:
2993    
2994    Created a new set of bootfiles (for your automatic installation convenience).
2995    
2996    ----------------------------------------------------------------------
2997    Name: Matthias Blume
2998    Date: 2001/03/22 15:10:00 EST
2999    Tag: blume-20010322-rawmem-parcm
3000    Description:
3001    
3002    1. All "raw memory access" primitives for the new FFI are implemented now
3003       (at least on the x86).
3004    2. Some further cleanup of CM's parallel make mechanism.
3005    
3006    ----------------------------------------------------------------------
3007    Name: Matthias Blume
3008    Date: 2001/03/19 17:53:00 EST
3009    Tag: blume-20010319-parallel
3010    Description:
3011    
3012    Parallel make (using compile servers) now works again.
3013    
3014    To this end, CM.stabilize and CMB.make have been modified to work in
3015    two passes when compile servers are attached:
3016       1. Compile everything, do not perform stabilization; this pass
3017          uses compile servers
3018       2. Stabilize everything; this pass does not use compile servers
3019    If there are no compile servers, the two passes are combined into one
3020    (as before).  Splitting the passes increases the inherent parallelism
3021    in the dependency graph because the entire graph including all
3022    libraries is available at the same time.  This, in turn, improves
3023    server utilization.  The downside is that the master process will
3024    have to do some extra work after compilation is done (because for
3025    technical reasons it must re-read all the binfiles during stabilization).
3026    
3027    ----------------------------------------------------------------------
3028    Name: Matthias Blume
3029    Date: 2001/03/16 12:22:00 EST
3030    Tag: blume-20010316-bootfiles
3031    Description:
3032    
3033    Created a new set of bootfiles (for your automatic installation convenience).
3034    
3035    ----------------------------------------------------------------------
3036    Name: Matthias Blume
3037    Date: 2001/03/16 11:00:00 EST
3038    Tag: blume-20010316-MLTREE-fixup
3039    Description:
3040    
3041    This is a minor fixup for an (untagged) earlier commit by Allen.
3042    (A file was missing).
3043    
3044    ----------------------------------------------------------------------
3045    Name: Allen Leung
3046    Date: Mon Mar  5 18:54:57 EST 2001
3047    Tag: leunga-20010305-cut-support
3048    
3049    1. New support for alternative control-flow in MLTREE.
3050       Currently we support
3051    
3052          FLOW_TO(CALL ...., [k1,...,kn])
3053    
3054       This is needed for 'cuts to' in C-- and try/handle-like constructs
3055       in Moby
3056    
3057       New assembler flag "asm-show-cutsto" to turn on control-flow debugging.
3058    
3059    2. Register Allocator
3060    
3061       Changes in interface [from Fermin, John]
3062    
3063    3. Alpha 8-bit SLL support [Fermin]
3064    
3065    4. All architectures
3066    
3067       A new module (ClusterExpandCopies) for expanding parallel copies.
3068    
3069    ----------------------------------------------------------------------
3070    Name: Allen Leung
3071    Date: 2001/02/27 23:07:00 EST
3072    Tag: leunga-20010227-minor-stuff
3073    
3074    1. Alpha bug fix for CMOVNE
3075    2. Handle mltree COND(..,FCMP ...,...)
3076    3. Bug fix in simplifier
3077    
3078    ----------------------------------------------------------------------
3079    Name: Matthias Blume
3080    Date: 2001/01/30 17:50:00 EST
3081    Tag: blume-20010130-sync
3082    Description:
3083    
3084    This is just a minor update to sync my devel branch with the main brach.
3085    The only visible change is the addition of some README files.
3086    
3087    ----------------------------------------------------------------------
3088    Name: Matthias Blume
3089    Date: 2001/01/12 23:30:00 JST
3090    Tag: blume-20010112-bootfiles
3091    Description:
3092    
3093    Made a new set of bootfiles that goes with the current state of the
3094    repository.
3095    
3096    ----------------------------------------------------------------------
3097    Name: Matthias Blume
3098    Date: 2001/01/12 21:20:00 JST
3099    Tag: blume-20010112-sync
3100    Description:
3101    
3102    I am just flushing out some minor changes that had accumulated in
3103    my private branch in order to sync with the main tree.  (This is
3104    mainly because I had CVS trouble when trying to merge _into_ my
3105    private branch.)
3106    
3107    Most people should be completely unaffected by this.
3108    
3109    ----------------------------------------------------------------------
3110    Name: Allen Leung
3111    Date: Thu Jan 11 21:03:00 EST 2001
3112    Tag: leunga-20010111-labexp=mltree
3113    Description:
3114    
3115    1.  Removed the type LabelExp and replace it by MLTree.
3116    2.  Rewritten mltree-simplify with the pattern matcher tool.
3117    3.  There were some bugs in alpha code generator which would break
3118        64-bit code generation.
3119    4.  Redo the tools to generate code with the
3120    5.  The CM files in MLRISC (and in src/system/smlnj/MLRISC)
3121        are now generated by perl scripts.
3122    
3123    ----------------------------------------------------------------------
3124    Name: Matthias Blume
3125    Date: 2001/01/10 21:55:00 JST
3126    Tag: blume-20010110-rcc
3127    Description:
3128    
3129    The RCC stuff now seems to work (but only on the x86).
3130    This required hacking of the c-calls interface (and -implementation) in
3131    MLRISC.
3132    
3133    Normal compiler users should be unaffected.
3134    
3135    ----------------------------------------------------------------------
3136    Name: Matthias Blume
3137    Date: 2001/01/09 01:20:00 JST
3138    Tag: blume-20010109-rcc
3139    Description:
3140    
3141    This is a fairly big patch, flushing out a large number of pending
3142    changes that I made to my development copy over the last couple of days.
3143    
3144    Of practical relevance at this moment is a workaround for a pickling
3145    bug that Allen ran into the other day.  The cause of the bug itself is
3146    still unknown and it might be hard to fix it properly, but the
3147    workaround has some merits of its own (namely somewhat reducing pickling
3148    overhead for certain libraries).  Therefore, I think this solution should
3149    be satisfactory at this time.
3150    
3151    The rest of the changes (i.e., the vast majority) has to do with my
3152    ongoing efforts of providing direct support for C function calls from
3153    ML.  At the moment there is a new primop "RAW_CCALL", typing magic
3154    in types/cproto.sml (invoked from FLINT/trans/translate.sml), a new
3155    case in the FLINT CPS datatype (RCC), changes to cps/convert.sml to
3156    translate uses of RAW_CCALL into RCC, and changes to mlriscGen.sml to
3157    handle RCC.
3158    
3159    The last part (the changes to mlriscGen.sml) are still known to be
3160    wrong on the x86 and not implemented on all other architectures.  But
3161    the infrastructure is in place. I had to change a few functor
3162    signatures in the backend to be able to route the CCalls interface
3163    from MLRISC there, and I had to specialize the mltree type (on the
3164    x86) to include the necessary extensions. (The extensions themselves
3165    were already there and redy to go in MLRISC/x86).
3166    
3167    Everything should be very happy as soon as someone helps me with
3168    mlriscGen.sml...
3169    
3170    In any case, nothing of this should matter to anyone as long as the
3171    new primop is not being used (which is going to be the case unless you
3172    find it where I hid it :). The rest of the compiler is completely
3173    unaffected.
3174    
3175    ----------------------------------------------------------------------
3176    Name: Matthias Blume
3177    Date: 2001/01/05 00:30:00 JST
3178    Tag: blume-20010105-primops
3179    Description:
3180    
3181    Added some experimental support for work that I am doing right now.
3182    These changes mostly concern added primops, but there is also a new
3183    experimental C library in the runtime system (but currently not enabled
3184    anywhere except on Linux/X86).
3185    
3186    In the course of adding primops (and playing with them), I discovered that
3187    Zhong's INL_PRIM hack (no type info for certain primops) was, in fact, badly
3188    broken.  (Zhong was very right he labeled this stuff as "major gross hack".)
3189    To recover, I made type information in INL_PRIM mandatory and changed
3190    prim.sml as well as built-in.sml accordingly.  The InLine structure now
3191    has complete, correct type information (i.e., no bottom types).
3192    
3193    Since all these changes mean that we need new binfiles, I also bumped the
3194    version number to 110.32.1.
3195    
3196    ----------------------------------------------------------------------
3197    Name: Matthias Blume
3198    Date: 2000/12/30 22:10:00 JST
3199    Tag: blume-20001230-various
3200    Description:
3201    
3202    Added proxy libraries for MLRISC and let MLRISC libraries refer
3203    to each other using path anchors.  (See CM manual for explanation.)
3204    
3205    Updated CM documentation.
3206    
3207    Fixed some bugs in CM.
3208    
3209    Implemented "proxy" libraries (= syntactic sugar for CM).
3210    
3211    Added "-quiet" option to makeml and changed runtime system accordingly.
3212    
3213    Added cleanup handler for exportML to reset timers and compiler stats.
3214    
3215    ----------------------------------------------------------------------
3216    Name: Lal George
3217    Date: 2000/12/22 22:22:58 EST 2000
3218    Tag: Release_110_32
3219    Description:
3220    
3221            Infinite precision used throughout MLRISC.
3222            see MLRISC/mltree/machine-int.sig
3223    
3224    ----------------------------------------------------------------------
3225    Name: Matthias Blume
3226    Date: 2000/12/22 23:16:00 JST
3227    Tag: blume-20001222-warn
3228    Description:
3229    
3230    Corrected wording and formatting of some CM warning message which I
3231    broke in my previous patch.
3232    
3233    ----------------------------------------------------------------------
3234    Name: Matthias Blume
3235    Date: 2000/12/22 21:20:00 JST
3236    Tag: blume-20001222-anchorenv
3237    Description:
3238    
3239    Fixed CM's handling of anchor environments in connection with CMB.make.
3240    
3241    ----------------------------------------------------------------------
3242    Name: Matthias Blume
3243    Date: 2000/12/22 13:15:00 JST
3244    Tag: blume-20001222-cleanup
3245    Description:
3246    
3247    Removed src/cm/ffi which does not (and did not) belong here.
3248    
3249    ----------------------------------------------------------------------
3250    Name: Matthias Blume
3251    Date: 2000/12/21 23:55:00 JST
3252    Tag: blume-20001221-exn
3253    Description:
3254    
3255    Probably most important: CM no longer silently swallows all exceptions
3256    in the compiler.
3257    Plus: some other minor CM changes.  For example, CM now reports some
3258    sizes for generated binfiles (code, data, envpickle, lambdapickle).
3259    
3260    ----------------------------------------------------------------------
3261    Name: Matthias Blume
3262    Date: 2000/12/15 00:01:05 JST
3263    Tag: blume-20001215-dirtool
3264    Description:
3265    
3266    - "dir" tool added.
3267    - improvements and cleanup to Tools structure
3268    - documentation updates
3269    
3270    ----------------------------------------------------------------------
3271    Name: Allen Leung
3272    Date: Thu Dec 14 03:45:24 EST 2000
3273    Description:
3274    Tag:  leunga-20001214-int-inf
3275    Description:
3276    
3277       In IntInf, added these standard functions, which are missing from our
3278    implementation:
3279    
3280        andb : int * int -> int
3281        xorb : int * int -> int
3282        orb  : int * int -> int
3283        notb : int -> int
3284         <<   : int * word -> int
3285        ~>>  : int * word -> int
3286    
3287       Not tested, I hope they are correct.
3288    
3289    ----------------------------------------------------------------------
3290    Name: Allen Leung
3291    Date: Fri Dec  8 19:23:26 EST 2000
3292    Description:
3293    Tag:  leunga-20001208-nowhere
3294    Description:
3295    
3296      Slight improvements to the 'nowhere' tool to handle OR-patterns,
3297    to generate better error messages etc.  Plus a brief manual.
3298    
3299    ----------------------------------------------------------------------
3300    Name: Lal George
3301    Date: 2000/12/08 09:54:02 EST 2000
3302    Tag: Release_110_31
3303    Description:
3304    
3305    - Version 110.31
3306    ----------------------------------------------------------------------
3307    Name: Allen Leung
3308    Date: Thu Dec  7 22:01:04 EST 2000
3309    Tag:  leunga-20001207-cell-monster-hack
3310    Description:
3311    
3312    Major MLRISC internal changes.  Affect all clients.
3313    Summary:
3314    
3315    1.  Type CELLS.cell = int is now replaced by a datatype.
3316        As a result, the old regmap is now gone.  Almost all interfaces
3317        in MLRISC change as a consequence.
3318    
3319    2.  A new brand version of machine description tool (v3.0) that generates
3320        modules expecting the new interface.  The old version is removed.
3321    
3322    3.  The RA interface has been further abstracted into two new functors.
3323        RISC_RA and X86RA.  These functors have much simpler interfaces.
3324        [See also directory MLRISC/demo.]
3325    
3326    4.  Some other new source->source code generation tools are available:
3327    
3328        a. MLRISC/Tools/RewriteGen -- generate rewriters from rules.
3329        b. MLRISC/Tools/WhereGen -- expands conditional pattern matching rules.
3330           I use this tool to generate the peephole optimizers---with the new
3331           cell type changes, peephole rules are becoming difficult to write
3332           without conditional pattern matching.
3333    
3334    5.  More Intmap -> IntHashTable change.  Previous changes by Matthias didn't
3335        cover the entire MLRISC source tree so many things broke.
3336    
3337    6.  CM files have been moved to the subdirectory MLRISC/cm.
3338        They are moved because there are a lot of them and they clutter up the
3339        root dir.
3340    
3341    7.  More detailed documentation to come...
3342    
3343        NOTE: To rebuild from 110.30 (ftp distribution), you'll have to do
3344        a makeml -rebuild first.  This is because of other other
3345        changes that Matthias has made (see below).
3346    
3347    
3348    ----------------------------------------------------------------------
3349    Name: Matthias Blume
3350    Date: 2000/11/30 23:12:00 JST
3351    Tag: blume-20001130-filereorg
3352    Description:
3353    
3354    Some manual updates and some file reorganizations in CM.
3355    
3356    ----------------------------------------------------------------------
3357    Name: Matthias Blume
3358    Date: 2000/11/24 17:45:00 JST
3359    Tag: blume-20001124-link
3360    Description:
3361    
3362    Drastically improved link traversal code for the case that the dynamic
3363    value was already loaded at bootstrap time.  As a result, CM and CMB
3364    now both load blazingly fast -- even on a very slow machine.  Also,
3365    memory consumption has been further reduced by this.
3366    
3367    Warning: The format of the PIDMAP file has changed.  THerefore, to
3368    bootstrap you have to do this:
3369    
3370    1. Run CMB.make
3371    2. Make a symbolic link for the boot directory:
3372         ln -s sml.boot.ARCH-OS xxx
3373    3. "Rebuild" the boot directory:
3374         ./makeml -boot xxx -rebuild sml ; rm xxx
3375    4. Boot normally:
3376          ./makeml
3377    
3378    ----------------------------------------------------------------------
3379    Name: Matthias Blume
3380    Date: 2000/11/21 21:20:00 JST
3381    Tag: blume-20001121-tools
3382    Description:
3383    
3384    Continued hacking on autoloading problem -- with success this time.
3385    Also changed tool-plugin mechanism.  See new CM manual.
3386    
3387    ----------------------------------------------------------------------
3388    Name: Matthias Blume
3389    Date: 2000/11/19 14:30:00 JST
3390    Tag:  blume-20001119-autoload
3391    Description:
3392    
3393    Some hacking to make autoloading faster.  Success for CMB, no success
3394    so far for CM.  There is a reduced structure CM' that autoloads faster.
3395    (This is a temporary, non-documented hack to be eliminated again when
3396    the general problem is solved.)
3397    
3398    ----------------------------------------------------------------------
3399    Name: Matthias Blume
3400    Date: 2000/11/17 14:10:00 JST
3401    Tag: blume-20001117-pickle-lib
3402    Description:
3403    
3404    1. Eliminated comp-lib.cm
3405    2. Made pickle-lib.cm
3406    3. Eliminated all uses of intset.sml (from comp-lib.cm)
3407    4. Replaced all uses of intmap.{sig,sml} (from comp-lib.cm) with
3408       equivalent constructs from smlnj-lib.cm (INtHashTable).
3409    5. Point 4. also goes for those uses of intmap.* in MLRISC.
3410       Duplicated intmap modules thrown out.
3411    6. Hunted down all duplicated SCC code and replaced it with
3412       equivalent stuff (GraphSCCFn from smlnj-lib.cm).
3413    7. Rewrote Feedback module.
3414    8. Moved sortedlist.sml into viscomp-lib.cm.  Eventually it
3415       should be thrown out and equivalent modules from smlnj-lib.cm
3416       should be used (IntRedBlackSet, IntListSet, ...).
3417    
3418    Confirmed that compiler compiles to fixpoint.
3419    
3420    ----------------------------------------------------------------------
3421    Name: Allen Leung
3422    Date: 2000/11/10 18:00:00
3423    Tag: leunga-20001110-new-x86-fp
3424    
3425    A new x86 floating point code generator has been added.
3426    By default this is turned off.  To turn this on, do:
3427    
3428        CM.autoload "$smlnj/compiler.cm";
3429        Compiler.Control.MLRISC.getFlag "x86-fast-fp" := true;
3430    
3431    Changes:
3432    
3433    1.  Changed FTAN to FPTAN so that the assembly output is correct.
3434    2.  Changed the extension callback for FTANGENT to generate:
3435    
3436              fptan
3437              fstp  %st(0)
3438        instead of
3439              fptan
3440              fstpl ftempmem
3441    
3442    3.  Numerous assembly fixes for x86.
3443    
3444    5.  Cleaned up the machine code output module x86/x86MC.sml and added
3445        support for a whole bunch of instructions and addressing modes:
3446    
3447          fadd/fsub/fsubr/fmul/fdiv/fdivr  %st, %st(n)
3448          faddp/fsubp/fsubrp/fmulp/fdivp/fdivrp  %st, %st(n)
3449          fadd/fsub/fsubr/fmul/fdiv/fdivr  %st(n), %st
3450          fiadd/fisub/fisubr/fimul/fidiv/fidivr mem
3451          fxch %st(n)
3452          fld %st(n)
3453          fst %st(n)
3454          fst mem
3455          fstp %st(n)
3456          fucom %st(n)
3457          fucomp %st(n)
3458    
3459        All these are now generated when the fast fp mode is turned on.
3460    
3461    6.  Removed the dedicated registers %st(0), ..., %st(7) from X86CpsRegs
3462    
3463    ----------------------------------------------------------------------
3464    Name: Matthias Blume
3465    Date: 2000/11/09 11:20:00 JST
3466    Tag: blume-20001109-scc
3467    Description:
3468    
3469    Eliminated some code duplication:
3470    
3471    1. Added "where" clause to GraphSCCFn in SML/NJ Library.
3472       (Otherwise the functor is useless.)
3473    2. Used GraphSCCFn where SCCUtilFun was used previously.
3474    3. Got rid of SCCUtilFun (in comp-lib.cm).
3475    
3476    ----------------------------------------------------------------------
3477    Name: Lal George
3478    Date: 2000/11/06 09:02:21 EST 2000
3479    Tag: Release_110_30
3480    Description:
3481    
3482    - Version 110.30
3483    ----------------------------------------------------------------------
3484    Name: Matthias Blume
3485    Date: 2000/11/04 14:45:00
3486    Tag: blume-20001104-mlbuild
3487    Description:
3488    
3489    - Made ml-build faster on startup.
3490    - Documentation fixes.
3491    
3492    ----------------------------------------------------------------------
3493    Name: Matthias Blume
3494    Date: 2000/11/02 17:00:00 JST
3495    Tag: blume-20001102-condcomp
3496    Description:
3497    
3498    - Small tweaks to pickler -- new BOOTFILES!
3499    - Version bumped to 110.29.2.
3500    - Added conditional compilation facility to init.cmi (see comment there).
3501    ----------------------------------------------------------------------
3502    Name: Allen Leung
3503    Date: 2000/10/23 19:31:00
3504    Tag: leunga-20001023-demo-ra
3505    
3506    1. Minor RA changes that improves spilling on x86 (affects Moby and C-- only)
3507    2. Test programs for the graph library updated
3508    3. Some new MLRISC demo programs added
3509    
3510    ----------------------------------------------------------------------
3511    Name: Matthias Blume
3512    Date: 2000/08/31 22:15:00 JST
3513    Tag: blume-20001017-errmsg
3514    Description:
3515    
3516    More error message grief: Where there used to be no messages, there
3517    now were some that had bogus error regions.  Fixed.
3518    
3519    ----------------------------------------------------------------------
3520    Name: Matthias Blume
3521    Date: 2000/08/31 17:30:00 JST
3522    Tag: blume-20001017-v110p29p1
3523    Description:
3524    
3525    I made a version 110.29.1 with new bootfiles.
3526    
3527    Changes:  Modified pickler/unpickler for faster and leaner unpickling.
3528              CM documentation changes and a small bugfix in CM's error reporting.
3529    
3530    ----------------------------------------------------------------------
3531    Name: Lal George
3532    Date: 2000/09/27 14:42:35 EDT
3533    Tag: george-20000927-nodestatus
3534    Description:
3535    
3536    Changed the type of the nodestatus, so that:
3537    
3538            SPILLED(~1)             is now SPILLED
3539            SPILLED(m) where m>=0   is now MEMREG(m)
3540            SPILLED(s) where s<~1   is now SPILL_LOC(~s)
3541    
3542    ----------------------------------------------------------------------
3543    Name: Matthias Blume
3544    Date: 2000/09/07 14:45:00 JST
3545    Tag: blume-20000907-cmerrmsg
3546    Description:
3547    
3548    Small tweak to CM to avoid getting ML syntax error messages twice.
3549    
3550    ----------------------------------------------------------------------
3551    Name: Matthias Blume
3552    Date: 2000/08/31 18:00:00 JST
3553    Tag: blume-20000831-cvsbootfiles
3554    Description:
3555    
3556    New URL for boot files (because the 110.29 files on the BL server do
3557    now work correctly with my updated install scripts for yacc and lex).
3558    
3559    ----------------------------------------------------------------------
3560    Name: Matthias Blume
3561    Date: 2000/08/08 12:33:00 JST
3562    Tag: blume-20000808-manual
3563    Description:
3564    
3565    Tiny update to CM manual.
3566    
3567    ----------------------------------------------------------------------
3568    Name: Allen Leung
3569    Date: 2000/08/7 19:31:00
3570    Tag: leunga-20000807-a-whole-bunch-of-stuff
3571    
3572      Moby, C--, SSA, x86, machine descriptions etc.  Should only affect C--
3573    and Mobdy.
3574    
3575    1.  x86
3576    
3577       a.  Fixes to peephole module by John and Dan.
3578       b.  Assembly fix to SETcc by Allen.
3579       c.  Fix to c-call by John.
3580       d.  Fix to spilling by John.  (This one deals with the missing FSTPT case)
3581       e.  Instruction selection optimization to SETcc as suggested by John.
3582    
3583           For example,
3584    
3585            MV(32, x, COND(32, CMP(32, LT, a, b), LI 1, LI 0))
3586    
3587           should generate:
3588    
3589            MOVL a, x
3590            SUBL b, x
3591            SHRL 31, x
3592    
3593    2.  IR stuff
3594    
3595         A bunch of new DJ-graph related algorithms added.  These
3596         speed up SSA construction.
3597    
3598    3.  SSA + Scheduling
3599    
3600         Added code for SSA and scheduling to the repository
3601    
3602    ----------------------------------------------------------------------
3603    Name: Lal George
3604    Date: 2000/07/27 11:53:14 EDT
3605    
3606    Tag: lal-20000727-linux-ppc
3607    Description:
3608    
3609     Made changes to support Linux PPC.
3610     p.s. I have confirmation that the 110.29 boot files work fine.
3611    
3612    ----------------------------------------------------------------------
3613    Name: Matthias Blume
3614    Date: 2000/07/27 17:40:00 JST
3615    Tag: blume-20000727-scripts
3616    Description:
3617    
3618    !!!! WARNING !!!!
3619    You must recompile the runtime system!
3620    !!!! WARNING !!!!
3621    
3622    This is basically another round of script-enhancements:
3623