Home My Page Projects Code Snippets Project Openings SML/NJ
Summary Activity Forums Tracker Lists Tasks Docs Surveys News SCM Files

SCM Repository

[smlnj] Diff of /sml/trunk/HISTORY
ViewVC logotype

Diff of /sml/trunk/HISTORY

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 806, Thu Mar 22 21:22:21 2001 UTC revision 1137, Tue Mar 12 22:28:55 2002 UTC
# Line 8  Line 8 
8  The form of an entry should be:  The form of an entry should be:
9    
10  Name:  Name:
11  Date:  Date: yyyy/mm/dd
12  Tag: <post-commit CVS tag>  Tag: <post-commit CVS tag>
13  Description:  Description:
14    ----------------------------------------------------------------------
15    Name: Matthias Blume
16    Date: 2002/03/12 17:30:00 EST
17    Tag: blume-20020312-version110392
18    Description:
19    
20    Yet another version number bump (because of small changes to the
21    binfile format).  Version number is now 110.39.2.  NEW BOOTFILES!
22    
23    Changes:
24    
25      The new pid generation scheme described a few weeks ago was overly
26      complicated.  I implemented a new mechanism that is simpler and
27      provides a bit more "stability":  Once CM has seen a compilation
28      unit, it keeps its identity constant (as long as you do not delete
29      those crucial CM/GUID/* files).  This means that when you change
30      and interface, compiler, then go back to the old interface, and
31      compile again, you arrive at the original pid.
32    
33      There now also is a mechanism that instructs CM to use the plain
34      environment hash as a module's pid (effectively making its GUID
35      the empty string).  For this, "noguid" must be specified as an
36      option to the .sml file in question within its .cm file.
37      This is most useful for code that is being generated by tools such
38      as ml-nlffigen (because during development programmers tend to
39      erase the tool's entire output directory tree including CM's cached
40      GUIDs).  "noguid" is somewhat dangerous (since it can be used to locally
41      revert to the old, broken behavior of SML/NJ, but in specific cases
42      where there is no danger of interface confusion, its use is ok
43      (I think).
44    
45      ml-nlffigen by default generates "noguid" annotations.  They can be
46      turned off by specifying -guid in its command line.
47    
48    ----------------------------------------------------------------------
49    Name: Lal George
50    Date: 2002/03/12 12 14:42:36 EST
51    Tag: george-20020312-frequency-computation
52    Description:
53    
54    Integrated jump chaining and static block frequency into the
55    compiler. More details and numbers later.
56    
57    ----------------------------------------------------------------------
58    Name: Lal George
59    Date: 2002/03/11 11 22:38:53 EST
60    Tag: george-20020311-jump-chain-elim
61    Description:
62    
63    Tested the jump chain elimination on all architectures (except the
64    hppa).  This is on by default right now and is profitable for the
65    alpha and x86, however, it may not be profitable for the sparc and ppc
66    when compiling the compiler.
67    
68    The gc test will typically jump to a label at the end of the cluster,
69    where there is another jump to an external cluster containing the actual
70    code to invoke gc. This is to allow factoring of common gc invocation
71    sequences. That is to say, we generate:
72    
73            f:
74               testgc
75               ja   L1      % jump if above to L1
76    
77            L1:
78               jmp L2
79    
80    
81    After jump chain elimination the 'ja L1' instructions is converted to
82    'ja L2'. On the sparc and ppc, many of the 'ja L2' instructions may end
83    up being implemented in their long form (if L2 is far away) using:
84    
85            jbe     L3      % jump if below or equal to L3
86            jmp     L2
87         L3:
88            ...
89    
90    
91    For large compilation units L2  may be far away.
92    
93    
94    ----------------------------------------------------------------------
95    Name: Matthias Blume
96    Date: 2002/03/11 13:30:00 EST
97    Tag: blume-20020311-mltreeeval
98    Description:
99    
100    A functor parameter was missing.
101    
102    ----------------------------------------------------------------------
103    Name: Allen Leung
104    Date: 2002/03/11 10:30:00 EST
105    Tag: leunga-20020310-runtime-string0
106    Description:
107    
108       The representation of the empty string now points to a
109    legal null terminated C string instead of unit.  It is now possible
110    to convert an ML string into C string with InlineT.CharVector.getData.
111    This compiles into one single machine instruction.
112    
113    ----------------------------------------------------------------------
114    Name: Allen Leung
115    Date: 2002/03/10 23:55:00 EST
116    Tag: leunga-20020310-x86-call
117    Description:
118    
119       Added machine generation for CALL instruction (relative displacement mode)
120    
121    ----------------------------------------------------------------------
122    Name: Matthias Blume
123    Date: 2002/03/08 16:05:00
124    Tag: blume-20020308-entrypoints
125    Description:
126    
127    Version number bumped to 110.39.1.  NEW BOOTFILES!
128    
129    Entrypoints: non-zero offset into a code object where execution should begin.
130    
131    - Added the notion of an entrypoint to CodeObj.
132    - Added reading/writing of entrypoint info to Binfile.
133    - Made runtime system bootloader aware of entrypoints.
134    - Use the address of the label of the first function given to mlriscGen
135      as the entrypoint.  This address is currently always 0, but it will
136      not be 0 once we turn on block placement.
137    - Removed the linkage cluster code (which was The Other Way(tm) of dealing
138      with entry points) from mlriscGen.
139    
140    ----------------------------------------------------------------------
141    Name: Allen Leung
142    Date: 2002/03/07 20:45:00 EST
143    Tag: leunga-20020307-x86-cmov
144    Description:
145    
146       Bug fixes for CMOVcc on x86.
147    
148       1. Added machine code generation for CMOVcc
149       2. CMOVcc is now generated in preference over SETcc on PentiumPro or above.
150       3. CMOVcc cannot have an immediate operand as argument.
151    
152    ----------------------------------------------------------------------
153    Name: Matthias Blume
154    Date: 2002/03/07 16:15:00 EST
155    Tag: blume-20020307-controls
156    Description:
157    
158    This is a very large but mostly boring patch which makes (almost)
159    every tuneable compiler knob (i.e., pretty much everything under
160    Control.* plus a few other things) configurable via both the command
161    line and environment variables in the style CM did its configuration
162    until now.
163    
164    Try starting sml with '-h' (or, if you are brave, '-H')
165    
166    To this end, I added a structure Controls : CONTROLS to smlnj-lib.cm which
167    implements the underlying generic mechanism.
168    
169    The interface to some of the existing such facilities has changed somewhat.
170    For example, the MLRiscControl module now provides mkFoo instead of getFoo.
171    (The getFoo interface is still there for backward-compatibility, but its
172    use is deprecated.)
173    
174    The ml-build script passes -Cxxx=yyy command-line arguments through so
175    that one can now twiddle the compiler settings when using this "batch"
176    compiler.
177    
178    TODO items:
179    
180    We should go through and throw out all controls that are no longer
181    connected to anything.  Moreover, we should go through and provide
182    meaningful (and correct!) documentation strings for those controls
183    that still are connected.
184    
185    Currently, multiple calls to Controls.new are accepted (only the first
186    has any effect).  Eventually we should make sure that every control
187    is being made (via Controls.new) exactly once.  Future access can then
188    be done using Controls.acc.
189    
190    Finally, it would probably be a good idea to use the getter-setter
191    interface to controls rather than ref cells.  For the time being, both
192    styles are provided by the Controls module, but getter-setter pairs are
193    better if thread-safety is of any concern because they can be wrapped.
194    
195    *****************************************
196    
197    One bug fix: The function blockPlacement in three of the MLRISC
198    backpatch files used to be hard-wired to one of two possibilities at
199    link time (according to the value of the placementFlag).  But (I
200    think) it should rather sense the flag every time.
201    
202    *****************************************
203    
204    Other assorted changes (by other people who did not supply a HISTORY entry):
205    
206    1. the cross-module inliner now works much better (Monnier)
207    2. representation of weights, frequencies, and probabilities in MLRISC
208       changed in preparation of using those for weighted block placement
209       (Reppy, George)
210    
211    ----------------------------------------------------------------------
212    Name: Lal George
213    Date: 2002/03/07 14:44:24 EST 2002
214    Tag: george-20020307-weighted-block-placement
215    
216    Tested the weighted block placement optimization on all architectures
217    (except the hppa) using AMPL to generate the block and edge frequencies.
218    Changes were required in the machine properties to correctly
219    categorize trap instructions. There is an MLRISC flag
220    "weighted-block-placement" that can be used to enable weighted block
221    placement, but this will be ineffective without block/edge
222    frequencies (coming soon).
223    
224    
225    ----------------------------------------------------------------------
226    Name: Lal George
227    Date: 2002/03/05 17:24:48 EST
228    Tag: george-20020305-linkage-cluster
229    
230    In order to support the block placement optimization, a new cluster
231    is generated as the very first cluster (called the linkage cluster).
232    It contains a single jump to the 'real' entry point for the compilation
233    unit. Block placement has no effect on the linkage cluster itself, but
234    all the other clusters  have full freedom in the manner in which they
235    reorder blocks or functions.
236    
237    On the x86 the typical linkage code that is generated is:
238       ----------------------
239            .align 2
240       L0:
241            addl    $L1-L0, 72(%esp)
242            jmp     L1
243    
244    
245            .align  2
246       L1:
247       ----------------------
248    
249    72(%esp) is the memory location for the stdlink register. This
250    must contain the address of the CPS function being called. In the
251    above example, it contains the address of  L0; before
252    calling L1 (the real entry point for the compilation unit), it
253    must contain the address for L1, and hence
254    
255            addl $L1-L0, 72(%esp)
256    
257    I have tested this on all architectures except the hppa.The increase
258    in code size is of course negligible
259    
260    ----------------------------------------------------------------------
261    Name: Allen Leung
262    Date: 2002/03/03 13:20:00 EST
263    Tag: leunga-20020303-mlrisc-tools
264    
265      Added #[ ... ] expressions to mlrisc tools
266    
267    ----------------------------------------------------------------------
268    Name: Matthias Blume
269    Date: 2002/02/27 12:29:00 EST
270    Tag: blume-20020227-cdebug
271    Description:
272    
273    - made types in structure C and C_Debug to be equal
274    - got rid of code duplication (c-int.sml vs. c-int-debug.sml)
275    - there no longer is a C_Int_Debug (C_Debug is directly derived from C)
276    
277    ----------------------------------------------------------------------
278    Name: Matthias Blume
279    Date: 2002/02/26 12:00:00 EST
280    Tag: blume-20020226-ffi
281    Description:
282    
283    1. Fixed a minor bug in CM's "noweb" tool:
284       If numbering is turned off, then truly don't number (i.e., do not
285       supply the -L option to noweb).  The previous behavior was to supply
286       -L'' -- which caused noweb to use the "default" line numbering scheme.
287       Thanks to Chris Richards for pointing this out (and supplying the fix).
288    
289    2. Once again, I reworked some aspects of the FFI:
290    
291       A. The incomplete/complete type business:
292    
293       - Signatures POINTER_TO_INCOMPLETE_TYPE and accompanying functors are
294         gone!
295       - ML types representing an incomplete type are now *equal* to
296         ML types representing their corresponding complete types (just like
297         in C).  This is still safe because ml-nlffigen will not generate
298         RTTI for incomplete types, nor will it generate functions that
299         require access to such RTTI.   But when ML code generated from both
300         incomplete and complete versions of the C type meet, the ML types
301         are trivially interoperable.
302    
303         NOTE:  These changes restore the full generality of the translation
304         (which was previously lost when I eliminated functorization)!
305    
306       B. Enum types:
307    
308       - Structure C now has a type constructor "enum" that is similar to
309         how the "su" constructor works.  However, "enum" is not a phantom
310         type because each "T enum" has values (and is isomorphic to
311         MLRep.Signed.int).
312       - There are generic access operations for enum objects (using
313         MLRep.Signed.int).
314       - ml-nlffigen will generate a structure E_foo for each "enum foo".
315         * The structure contains the definition of type "mlrep" (the ML-side
316         representation type of the enum).  Normally, mlrep is the same
317         as "MLRep.Signed.int", but if ml-nlffigen was invoked with "-ec",
318         then mlrep will be defined as a datatype -- thus facilitating
319         pattern matching on mlrep values.
320         ("-ec" will be suppressed if there are duplicate values in an
321          enumeration.)
322         * Constructors ("-ec") or values (no "-ec") e_xxx of type mlrep
323         will be generated for each C enum constant xxx.
324         * Conversion functions m2i and i2m convert between mlrep and
325         MLRep.Signed.int.  (Without "-ec", these functions are identities.)
326         * Coversion functions c and ml convert between mlrep and "tag enum".
327         * Access functions (get/set) fetch and store mlrep values.
328       - By default (unless ml-nlffigen was invoked with "-nocollect"), unnamed
329         enumerations are merged into one single enumeration represented by
330         structure E_'.
331    
332    ----------------------------------------------------------------------
333    Name: Allen Leung
334    Date: 2002/02/25 04:45:00 EST
335    Tag: leunga-20020225-cps-spill
336    
337    This is a new implementation of the CPS spill phase.
338    The new phase is in the new file compiler/CodeGen/cpscompile/spill-new.sml
339    In case of problems, replace it with the old file spill.sml
340    
341    The current compiler runs into some serious performance problems when
342    constructing a large record.  This can happen when we try to compile a
343    structure with many items.  Even a very simple structure like the following
344    makes the compiler slow down.
345    
346        structure Foo = struct
347           val x_1 = 0w1 : Word32.int
348           val x_2 = 0w2 : Word32.int
349           val x_3 = 0w3 : Word32.int
350           ...
351           val x_N = 0wN : Word32.int
352        end
353    
354    The following table shows the compile time, from N=1000 to N=4000,
355    with the old compiler:
356    
357    N
358    1000   CPS 100 spill                           0.04u  0.00s  0.00g
359           MLRISC ra                               0.06u  0.00s  0.05g
360              (spills = 0 reloads = 0)
361           TOTAL                                   0.63u  0.07s  0.21g
362    
363    1100   CPS 100 spill                           8.25u  0.32s  0.64g
364           MLRISC ra                               5.68u  0.59s  3.93g
365              (spills = 0 reloads = 0)
366           TOTAL                                   14.71u  0.99s  4.81g
367    
368    1500   CPS 100 spill                           58.55u  2.34s  1.74g
369           MLRISC ra                               5.54u  0.65s  3.91g
370              (spills = 543 reloads = 1082)
371           TOTAL                                   65.40u  3.13s  6.00g
372    
373    2000   CPS 100 spill                           126.69u  4.84s  3.08g
374           MLRISC ra                               0.80u  0.10s  0.55g
375              (spills = 42 reloads = 84)
376           TOTAL                                   129.42u  5.10s  4.13g
377    
378    3000   CPS 100 spill                           675.59u  19.03s  11.64g
379           MLRISC ra                               2.69u  0.27s  1.38g
380              (spills = 62 reloads = 124)
381           TOTAL                                   682.48u  19.61s  13.99g
382    
383    4000   CPS 100 spill                           2362.82u  56.28s  43.60g
384           MLRISC ra                               4.96u  0.27s  2.72g
385              (spills = 85 reloads = 170)
386           TOTAL                                   2375.26u  57.21s  48.00g
387    
388    As you can see the old cps spill module suffers from some serious
389    performance problem.  But since I cannot decipher the old code fully,
390    instead of patching the problems up, I'm reimplementing it
391    with a different algorithm.  The new code is more modular,
392    smaller when compiled, and substantially faster
393    (O(n log n) time and O(n) space).  Timing of the new spill module:
394    
395    4000  CPS 100 spill                           0.02u  0.00s  0.00g
396          MLRISC ra                               0.25u  0.02s  0.15g
397             (spills=1 reloads=3)
398          TOTAL                                   7.74u  0.34s  1.62g
399    
400    Implementation details:
401    
402    As far as I can tell, the purpose of the CPS spill module is to make sure the
403    number of live variables at any program point (the bandwidth)
404    does not exceed a certain limit, which is determined by the
405    size of the spill area.
406    
407    When the bandwidth is too large, we decrease the register pressure by
408    packing live variables into spill records.  How we achieve this is
409    completely different than what we did in the old code.
410    
411    First, there is something about the MLRiscGen code generator
412    that we should be aware of:
413    
414    o MLRiscGen performs code motion!
415    
416       In particular, it will move floating point computations and
417       address computations involving only the heap pointer to
418       their use sites (if there is only a single use).
419       What this means is that if we have a CPS record construction
420       statement
421    
422           RECORD(k,vl,w,e)
423    
424       we should never count the new record address w as live if w
425       has only one use (which is often the case).
426    
427       We should do something similar to floating point, but the transformation
428       there is much more complex, so I won't deal with that.
429    
430    Secondly, there are now two new cps primops at our disposal:
431    
432     1. rawrecord of record_kind option
433        This pure operator allocates some uninitialized storage from the heap.
434        There are two forms:
435    
436         rawrecord NONE [INT n]  allocates a tagless record of length n
437         rawrecord (SOME rk) [INT n] allocates a tagged record of length n
438                                     and initializes the tag.
439    
440     2. rawupdate of cty
441          rawupdate cty (v,i,x)
442          Assigns to x to the ith component of record v.
443          The storelist is not updated.
444    
445    We use these new primops for both spilling and increment record construction.
446    
447     1. Spilling.
448    
449        This is implemented with a linear scan algorithm (but generalized
450        to trees).  The algorithm will create a single spill record at the
451        beginning of the cps function and use rawupdate to spill to it,
452        and SELECT or SELp to reload from it.  So both spills and reloads
453        are fine-grain operations.  In contrast, in the old algorithm
454        "spills" have to be bundled together in records.
455    
456        Ideally, we should sink the spill record construction to where
457        it is needed.  We can even split the spill record into multiple ones
458        at the places where they are needed.  But CPS is not a good
459        representation for global code motion, so I'll keep it simple and
460        am not attempting this.
461    
462     2. Incremental record construction (aka record splitting).
463    
464        Long records with many component values which are simulatenously live
465        (recall that single use record addresses are not considered to
466         be live) are constructed with rawrecord and rawupdate.
467        We allocate space on the heap with rawrecord first, then gradually
468        fill it in with rawupdate.  This is the technique suggested to me
469        by Matthias.
470    
471        Some restrictions on when this is applicable:
472        1. It is not a VECTOR record.  The code generator currently does not handle
473           this case. VECTOR record uses double indirection like arrays.
474        2. All the record component values are defined in the same "basic block"
475           as the record constructor.  This is to prevent speculative
476           record construction.
477    
478    ----------------------------------------------------------------------
479    Name: Allen Leung
480    Date: 2002/02/22 01:02:00 EST
481    Tag: leunga-20020222-mlrisc-tools
482    
483    Minor bug fixes in the parser and rewriter
484    
485    ----------------------------------------------------------------------
486    Name: Allen Leung
487    Date: 2002/02/21 20:20:00 EST
488    Tag: leunga-20020221-peephole
489    
490    Regenerated the peephole files.  Some contained typos in the specification
491    and some didn't compile because of pretty printing bugs in the old version
492    of 'nowhere'.
493    
494    ----------------------------------------------------------------------
495    Name: Allen Leung
496    Date: 2002/02/19 20:20:00 EST
497    Tag: leunga-20020219-mlrisc-tools
498    Description:
499    
500       Minor bug fixes to the mlrisc-tools library:
501    
502       1.  Fixed up parsing colon suffixed keywords
503       2.  Added the ability to shut the error messages up
504       3.  Reimplemented the pretty printer and fixed up/improved
505           the pretty printing of handle and -> types.
506       4.  Fixed up generation of literal symbols in the nowhere tool.
507       5.  Added some SML keywords to to sml.sty
508    
509    ----------------------------------------------------------------------
510    Name: Matthias Blume
511    Date: 2002/02/19 16:20:00 EST
512    Tag: blume-20020219-cmffi
513    Description:
514    
515    A wild mix of changes, some minor, some major:
516    
517    * All C FFI-related libraries are now anchored under $c:
518        $/c.cm      --> $c/c.cm
519        $/c-int.cm  --> $c/internals/c-int.cm
520        $/memory.cm --> $c/memory/memory.cm
521    
522    * "make" tool (in CM) now treats its argument pathname slightly
523      differently:
524        1. If the native expansion is an absolute name, then before invoking
525           the "make" command on it, CM will apply OS.Path.mkRelative
526           (with relativeTo = OS.FileSys.getDir()) to it.
527        2. The argument will be passed through to subsequent phases of CM
528           processing without "going native".  In particular, if the argument
529           was an anchored path, then "make" will not lose track of that anchor.
530    
531    * Compiler backends now "know" their respective C calling conventions
532      instead of having to be told about it by ml-nlffigen.  This relieves
533      ml-nlffigen from one of its burdens.
534    
535    * The X86Backend has been split into X86CCallBackend and X86StdCallBackend.
536    
537    * Export C_DEBUG and C_Debug from $c/c.cm.
538    
539    * C type encoding in ml-nlffi-lib has been improved to model the conceptual
540      subtyping relationship between incomplete pointers and their complete
541      counterparts.  For this, ('t, 'c) ptr has been changed to 'o ptr --
542      with the convention of instantiating 'o with ('t, 'c) obj whenever
543      the pointer target type is complete.  In the incomplete case, 'o
544      will be instantiated with some "'c iobj" -- a type obtained by
545      using one of the functors PointerToIncompleteType or PointerToCompleteType.
546    
547      Operations that work on both incomplete and complete pointer types are
548      typed as taking an 'o ptr while operations that require the target to
549      be known are typed as taking some ('t, 'c) obj ptr.
550    
551      voidptr is now a bit "more concrete", namely "type voidptr = void ptr'"
552      where void is an eqtype without any values.  This makes it possible
553      to work on voidptr values using functions meant to operate on light
554      incomplete pointers.
555    
556    * As a result of the above, signature POINTER_TO_INCOMPLETE_TYPE has
557      been vastly simplified.
558    
559    ----------------------------------------------------------------------
560    Name: Matthias Blume
561    Date: 2002/02/19 10:48:00 EST
562    Tag: blume-20020219-pqfix
563    Description:
564    
565    Applied Chris Okasaki's bug fix for priority queues.
566    
567    ----------------------------------------------------------------------
568    Name: Matthias Blume
569    Date: 2002/02/15 17:05:00
570    Tag: Release_110_39
571    Description:
572    
573    Last-minute retagging is becoming a tradition... :-(
574    
575    This is the working release 110.39.
576    
577    ----------------------------------------------------------------------
578    Name: Matthias Blume
579    Date: 2002/02/15 16:00:00 EST
580    Tag: Release_110_39-orig
581    Description:
582    
583    Working release 110.39.  New bootfiles.
584    
585    (Update: There was a small bug in the installer so it wouldn't work
586    with all shells.  So I retagged. -Matthias)
587    
588    ----------------------------------------------------------------------
589    Name: Matthias Blume
590    Date: 2002/02/15 14:17:00 EST
591    Tag: blume-20020215-showbindings
592    Description:
593    
594    Added EnvRef.listBoundSymbols and CM.State.showBindings.  Especially
595    the latter can be useful for exploring what bindings are available at
596    the interactive prompt.  (The first function returns only the list
597    of symbols that are really bound, the second prints those but also the
598    ones that CM's autoloading mechanism knows about.)
599    
600    ----------------------------------------------------------------------
601    Name: Matthias Blume
602    Date: 2002/02/15 12:08:00 EST
603    Tag: blume-20020215-iptrs
604    Description:
605    
606    Two improvements to ml-nlffigen:
607    
608      1. Write files only if they do not exist or if their current contents
609         do not coincide with what's being written.  (That is, avoid messing
610         with the time stamps unless absolutely necessary.)
611    
612      2. Implement a "repository" mechanism for generated files related
613         to "incomplete pointer types".   See the README file for details.
614    
615    ----------------------------------------------------------------------
616    Name: Matthias Blume
617    Date: 2002/02/14 11:50:00 EST
618    Tag: blume-20020214-quote
619    Description:
620    
621    Added a type 't t_' to tag.sml (in ml-nlffi-lib.cm).  This is required
622    because of the new and improved tag generation scheme.  (Thanks to Allen
623    Leung for pointing it out.)
624    
625    ----------------------------------------------------------------------
626    Name: Lal George
627    Date: 2002/02/14 09:55:27 EST 2002
628    Tag: george-20020214-isabelle-bug
629    Description:
630    
631    Fixed the MLRISC bug sent by Markus Wenzel regarding the compilation
632    of Isabelle on the x86.
633    
634    From Allen:
635    -----------
636     I've found the problem:
637    
638         in ra-core.sml, I use the counter "blocked" to keep track of the
639         true number of elements in the freeze queue.  When the counter goes
640         to zero, I skip examining the queue.  But I've messed up the
641         bookkeeping in combine():
642    
643             else ();
644             case !ucol of
645               PSEUDO => (if !cntv > 0 then
646                     (if !cntu > 0 then blocked := !blocked - 1 else ();
647                                        ^^^^^^^^^^^^^^^^^^^^^^^
648                      moveu := mergeMoveList(!movev, !moveu)
649                     )
650                  else ();
651    
652         combine() is called to coalesce two nodes u and v.
653         I think I was thinking that if the move counts of u and v are both
654         greater than zero then after they are coalesced then one node is
655         removed from the freeze queue.  Apparently I was thinking that
656         both u and v are of low degree, but that's clearly not necessarily true.
657    
658    
659    02/12/2002:
660        Here's the patch.  HOL now compiles.
661    
662        I don't know how this impact on performance (compile
663        time or runtime).  This bug caused the RA (especially on the x86)
664        to go thru the potential spill phase when there are still nodes on the
665        freeze queue.
666    
667    
668    
669    
670    ----------------------------------------------------------------------
671    Name: Matthias Blume
672    Date: 2002/02/13 22:40:00 EST
673    Tag: blume-20020213-fptr-rtti
674    Description:
675    
676    Fixed a bug in ml-nlffigen that was introduced with one of the previous
677    updates.
678    
679    ----------------------------------------------------------------------
680    Name: Matthias Blume
681    Date: 2002/02/13 16:41:00 EST
682    Tag: blume-20020213-cmlpq
683    Description:
684    
685    Added new priority queue export symbols (which have just been added to
686    smlnj-lib.cm) to CML's version of smlnj-lib.cm.  (Otherwise CML would
687    not compile and the installer would choke.)
688    
689    ----------------------------------------------------------------------
690    Name: Matthias Blume
691    Date: 2002/02/13 16:15:00 EST
692    Tag: blume-20020213-various
693    Description:
694    
695    1. More tweaks to ml-nlffigen:
696    
697       - better internal datastructures (resulting in slight speedup)
698       - "-match" option requires exact match
699       - "localized" gensym counters (untagged structs/unions nested within
700         other structs/unions or within typedefs get a fresh counter; their
701         tag will be prefixed by a concatenation of their parents' tags)
702       - bug fixes (related to calculation of transitive closure of types
703         to be included in the output)
704    
705    2. Minor Basis updates:
706    
707       - added implementations for List.collate and Option.app
708    
709    ----------------------------------------------------------------------
710    Name: Matthias Blume
711    Date: 2002/02/11 15:55:00 EST
712    Tag: blume-20020211-gensym
713    Description:
714    
715    Added a "-gensym" option to command line of ml-nlffigen.  This can be
716    used to specify a "stem" -- a string that is inserted in all "gensym'd"
717    names (ML structure names that correspond to unnamed C structs, unions,
718    and enums), so that separate runs of ml-nlffigen do not clash.
719    
720    ----------------------------------------------------------------------
721    Name: Matthias Blume
722    Date: 2002/02/11 12:05:00 EST
723    Tag: blume-20020211-gensml
724    Description:
725    
726    A quick fix for a problem with GenSML (in the pgraph-util library):
727    Make generation of toplevel "local" optional.  (Strictly speaking,
728    signature definitions within "local" are not legal SML.)
729    
730    Other than that: updates to INSTALL and cm/TODO.
731    
732    ----------------------------------------------------------------------
733    Name: Matthias Blume
734    Date: 2002/02/08 15:00:00 EST
735    Tag: blume-20020208-uniquepid
736    Description:
737    
738    0. Version number has been bumped to 110.38.1.  NEW BOOTFILES!!!
739    
740    1. The installer (config/install.sh) has gotten smarter:
741    
742         - Configuration options are a bit easier to specify now
743           (in config/targets).
744         - Bug in recognizing .tar.bz2 files fixed.
745         - Installer automatically resolves dependencies between
746           configuration options (e.g., if you ask for eXene, you will
747           also get cml -- regardless whether you asked for it or not).
748         - Installer can run in "quieter mode" by setting the environment
749           variable INSTALL_QUIETLY to "true".  "Quieter" does not mean
750           "completely silent", though.
751         - Build HashCons library as part of smlnj-lib.
752    
753    2. A new scheme for assigning persistent identifiers to compilation
754       units (and, by extension, to types etc.) has been put into place.
755       This fixes a long-standing bug where types and even dynamic values
756       can get internally confused, thereby compromising type safety
757       (abstraction) and dynamic correctness.  See
758    
759         http://cm.bell-labs.com/cm/cs/who/blume/pid-confusion.tgz
760    
761       for an example of how things could go wrong until now.
762    
763       The downside of the new scheme is that pids are not quite as
764       persistent as they used to be: CM will generate a fresh pid
765       for every compilation unit that it thinks it sees for the first
766       time.  That means that if you compile starting from a clean, fresh
767       source tree at two different times, you end up with different
768       binaries.
769    
770       Cutoff recompilation, however, has not been compromised because
771       CM keeps pid information in special caches between runs.
772    
773    ----------------------------------------------------------------------
774    Name: Lal George
775    Date: 2002/02/07 15:34:13 EST 2002
776    Tag: <none>
777    Description:
778    
779    Compilers that generate assembly code may produce  global labels
780    whose value is resolved at link time. The various peephole optimization
781    modules did not take this in account.
782    
783    TODO. The Labels.addrOf function should really return an option
784    type so that clients are forced to deal with this issue, rather
785    than an exception being raised.
786    
787    ----------------------------------------------------------------------
788    Name: Lal George
789    Date: 2002/02/06 13:55:02 EST
790    Tag: george-20020206-ra-breakup
791    Description:
792    
793    1. A bug fix from Allen.
794    
795        A typo causes extra fstp %st(0)'s to be generated at compensation
796        edges, which might cause stack underflow traps at runtime.  This
797        occurs in fft where there are extraneous fstps right before the 'into'
798        trap instruction (in this case they are harmless since none of the
799        integers overflow.)
800    
801    2. Pulled out various utility modules that were embedded in the modules
802       of the register allocator. I need these modules for other purposes, but
803       they are not complete enough to put into a library (just yet).
804    ----------------------------------------------------------------------
805    Name: Matthias Blume
806    Date: 2002/01/31 16:05:00 EST
807    Tag: blume-20020131-sparc-ccalls
808    Description:
809    
810    1. C-calls on Sparc needlessly allocated a huge chunk (96 bytes)
811       of extra stack space by mistake.  Fixed.
812    
813    2. Bug in logic of handling of command-line options in ml-nlffigen fixed.
814    
815    ----------------------------------------------------------------------
816    Name: Allen Leung
817    Date: 2002/01/30
818    Tag: leunga-20020130-nowhere-bug-fix
819    Description:
820    
821       MLRISC bug fixes:
822       1. Fixed a bindings computation bug in the 'nowhere' program generator tool.
823       2. MachineInt.fromString was negating its value.
824    
825    ----------------------------------------------------------------------
826    Name: Matthias Blume
827    Date: 2002/01/29
828    Tag: blume-20020129-INSTALL
829    Description:
830    
831    - Added somewhat detailed installation instructions (file INSTALL).
832    - Fixed curl-detection bug in config/install.sh.
833    - It is now possible to select the URL getter using the URLGETTER
834      environment variable:
835    
836          not set / "unknown"      --> automatic detection (script tries wget,
837                                       curl, and lynx)
838          "wget" / "curl" / "lynx" --> use the specified program (script "knows"
839                                       how to properly invoke them)
840          other                    --> use $URLGETTER directly, it must take
841                                       precisely two command-line arguments
842                                       (source URL and destination file name)
843    
844    ----------------------------------------------------------------------
845    Name: Matthias Blume
846    Date: 2002/01/28
847    Tag: blume-20020128-sparc-ccalls
848    Description:
849    
850    - Fixed problem with calculation of "used" registers in sparc-c-calls.
851    - Make use of the allocParam argument in sparc-c-calls.
852    
853    ----------------------------------------------------------------------
854    Name: Matthias Blume
855    Date: 2002/01/28
856    Tag: blume-20020128-allocParam
857    Description:
858    
859    John Reppy:  Changes c-calls API to accept client-callback for
860    allocating extra stack space.
861    me: Corresponding changes to mlriscGen (using a dummy argument that
862        does not change the current behavior).
863    
864    ----------------------------------------------------------------------
865    Name: Matthias Blume
866    Date: 2002/01/28 12:00:00
867    Tag: Release_110_38
868    Description:
869    
870    This time for real!!!
871    
872    ----------------------------------------------------------------------
873    Name: Matthias Blume
874    Date: 2002/01/28 10:56:00 EST
875    Tag: blume-20020128-retraction
876    Description:
877    
878    0. Retracted earlier 110.38.  (The Release_110_38 tag has been replaced
879       with blume-Release_110_38-retracted.)
880    
881    1. Fixed a problem with incorrect rounding modes in real64.sml.
882       (Thanks to Andrew Mccreight <andrew.mccreight@yale.edu>.)
883    
884    2. A bug in ml-nlffigen related to the handling of unnamed structs, unions,
885       and enums fixed.  The naming of corresponding ML identifiers should
886       now be consistent again.
887    
888    ----------------------------------------------------------------------
889    Name: Allen Leung
890    Date: 2002/01/27
891    Tag: leunga-20020127-nowhere
892    Description:
893    
894       Added a target called nowhere in the configuration scripts.
895       Enabling this will build the MLRISC 'nowhere' tool (for translating
896       programs with where-clauses into legal SML code) during installation.
897    
898    ----------------------------------------------------------------------
899    Name: Matthias Blume
900    Date: 2002/01/25 21:27:00 EST
901    Tag: blume-Release_110_38-retracted
902    Description:
903    
904    Call it a (working) release!  Version is 110.38. Bootfiles are ready.
905    
906    README will be added later.
907    
908    !!! NOTE:  Re-tagged as blume-Release_110_38-retracted. Original tag
909    (Release_110_38) removed.  Reason: Last-minute bug fixes.
910    
911    ----------------------------------------------------------------------
912    Name: Matthias Blume
913    Date: 2002/01/25
914    Tag: blume-20020125-ffi
915    Description:
916    
917    A large number of tweaks and improvements to ml-nlffi-lib and
918    ml-nlffigen:
919    
920       - ML represenation types have been streamlined
921       - getter and setter functions work with concrete values, not abstract
922         ones where possible
923       - ml-nlffigen command line more flexible (see README file there)
924       - some bugs have been fixed (hopefully)
925    
926    ----------------------------------------------------------------------
927    Name: Lal George
928    Date: 2002/01/24
929    Tag: george-20020124-risc-ra-interface
930    Description:
931    
932       There is a dramatic simplification in the interface to the
933       register allocator for RISC architectures as a result of making
934       parallel copy instructions explicit.
935    
936    ----------------------------------------------------------------------
937    Name: Matthias Blume
938    Date: 2002/01/22
939    Tag: blume-20020122-x86-ccalls
940    Description:
941    
942    Bug fix for c-calls on x86 (having to do with how char- and
943    short-arguments are being handled).
944    
945    ----------------------------------------------------------------------
946    Name: Matthias Blume
947    Date: 2002/01/21
948    Tag: blume-20020121-ff
949    Description:
950    
951    Another day of fiddling with the FFI...
952    
953    1. Bug fix/workaround:  CKIT does not complain about negative array
954       dimensions, so ml-nlffigen has to guard itself against this possibility.
955       (Otherwise a negative dimension would send it into an infinite loop.)
956    
957    2. Some of the abstract types (light objects, light pointers, most "base"
958       types) in structure C are now eqtypes.
959    
960    3. Added constructors and test functions for NULL function pointers.
961    
962    ----------------------------------------------------------------------
963    Name: Matthias Blume
964    Date: 2002/01/18
965    Tag: blume-20020118-ready-for-new-release
966    Description:
967    
968    Made config/srcarchiveurl point to a new place.  (Will provide boot
969    files shortly.)
970    
971    Maybe we christen this to be 110.38?
972    
973    ----------------------------------------------------------------------
974    Name: Matthias Blume
975    Date: 2002/01/18
976    Tag: blume-20020118-more-ffifiddle
977    Description:
978    
979    Today's FFI fiddling:
980    
981      - Provided a structure CGetSet with "convenient" versions of C.Get.* and
982        C.Set.* that use concrete (MLRep.*) arguments and results instead
983        of abstract ones.
984    
985      - Provided word-style bit operations etc. for "int" representation
986        types in MLRep.S<Foo>Bitops where <Foo> ranges over Char, Int, Short,
987        and Long.
988    
989    ----------------------------------------------------------------------
990    Name: Matthias Blume
991    Date: 2002/01/18
992    Tag: blume-20020118-use-x86-fp
993    Description:
994    
995    Now that x86-fast-fp seems to be working, I turned it back on again
996    by default.  (Seems to work fine now, even with the FFI.)
997    
998    Other than that, I added some documentation about the FFI to
999    src/ml-nlffigen/README and updated the FFI test examples in
1000    src/ml-nlffi-lib/Tests/*.
1001    
1002    ----------------------------------------------------------------------
1003    Name: Allen Leung
1004    Date: 2002/01/17
1005    Tag: leunga-20020117-x86-fast-fp-call
1006    Description:
1007    
1008       1. Fixed a problem with handling return fp values when x86's fast fp
1009          mode is turned on.
1010    
1011       2. Minor pretty printing fix for cellset.  Print %st(0) as %st(0) instead
1012          of %f32.
1013    
1014       3. Added a constructor INT32lit to the ast of MLRISC tools.
1015    
1016    ----------------------------------------------------------------------
1017    Name: Matthias Blume
1018    Date: 2002/01/16
1019    Tag: blume-20020116-ffifiddle
1020    Description:
1021    
1022    More fiddling with the FFI interface:
1023    
1024     - Make constness 'c instead of rw wherever possible.  This eliminates
1025       the need for certain explicit coercions.  (However, due to ML's
1026       value polymorphism, there will still be many cases where explicit
1027       coercions are necessary.  Phantom types are not the whole answer
1028       to modeling a subtyping relationship in ML.)
1029    
1030     - ro/rw coersions for pointers added.  (Avoids the detour through */&.)
1031    
1032     - "printf" test example added to src/ml-nlffi-lib/Tests.  (Demonstrates
1033       clumsy workaround for varargs problem.)
1034    
1035    ----------------------------------------------------------------------
1036    Name: Lal George
1037    Date: 2002/01/15
1038    Tag: <none>
1039    Description:
1040    
1041    1. Since COPY instructions are no longer native to the architecture,
1042       a generic functor can be used to implement the expandCopies function.
1043    
1044    2. Allowed EXPORT and IMPORT pseudo-op declarations to appear inside a
1045       TEXT segment.
1046    
1047    ----------------------------------------------------------------------
1048    Name: Matthias Blume
1049    Date: 2002/01/15
1050    Tag: blume-20020115-ffiupdates
1051    Description:
1052    
1053    1. Fix for bug resulting in single-precision float values being returned
1054       incorrectly from FFI calls.
1055    
1056    2. Small modifications to C FFI API:
1057    
1058        - memory-allocation routines return straight objects (no options)
1059          and raise an exception in out-of-memory situations
1060        - unsafe extensions to cast between function pointers and pointers
1061          from/to ints
1062        - added structure C_Debug as an alternative to structure C where
1063          pointer-dereferencing (|*| and |*!) always check for null-pointers
1064        - added open_lib' to DynLinkage;  open_lib' works like open_lib
1065          but also takes a (possibly empty) list of existing library handles
1066          that the current library depends on
1067    
1068    ----------------------------------------------------------------------
1069    Name: Matthias Blume
1070    Date: 2002/01/10
1071    Tag: blume-20020110-newffigen
1072    Description:
1073    
1074    1. Updates to portable graph code.
1075    
1076    2. Major update to ml-nlffigen and ml-nlffi-lib.  Things are much
1077       more scalable now so that even huge interfaces such as the one
1078       for GTK compile in finite time and space. :-)
1079       See src/ml-nlffigen/README for details on what's new.
1080    
1081    ----------------------------------------------------------------------
1082    Name: Lal George
1083    Date: 2001/01/09 14:31:35 EST 2002
1084    Tag: george-20011206-rm-native-copy
1085    Description:
1086    
1087            Removed the native COPY and FCOPY instructions
1088            from all the architectures and replaced it with the
1089            explicit COPY instruction from the previous commit.
1090    
1091            It is now possible to simplify many of the optimizations
1092            modules that manipulate copies. This has not been
1093            done in this change.
1094    
1095    ----------------------------------------------------------------------
1096    Name: Lal George
1097    Date: 2001/12/06 16:50:13 EST 2001
1098    Tag: george-20011206-mlrisc-instruction
1099    Description:
1100    
1101    Changed the representation of instructions from being fully abstract
1102    to being partially concrete. That is to say:
1103    
1104      from
1105            type instruction
1106    
1107      to
1108            type instr                              (* machine instruction *)
1109    
1110            datatype instruction =
1111                LIVE of {regs: C.cellset, spilled: C.cellset}
1112              | KILL of {regs: C.cellset, spilled: C.cellset}
1113              | COPYXXX of {k: CB.cellkind, dst: CB.cell list, src: CB.cell list}
1114              | ANNOTATION of {i: instruction, a: Annotations.annotation}
1115              | INSTR of instr
1116    
1117    This makes the handling of certain special instructions that appear on
1118    all architectures easier and uniform.
1119    
1120    LIVE and KILL say that a list of registers are live or killed at the
1121    program point where they appear. No spill code is generated when an
1122    element of the 'regs' field is spilled, but the register is moved to
1123    the 'spilled' (which is present, more for debugging than anything else).
1124    
1125    LIVE replaces the (now deprecated) DEFFREG instruction on the alpha.
1126    We used to generate:
1127    
1128            DEFFREG f1
1129            f1 := f2 + f3
1130            trapb
1131    
1132    but now generate:
1133    
1134            f1 := f2 + f3
1135            trapb
1136            LIVE {regs=[f1,f2,f3], spilled=[]}
1137    
1138    Furthermore, the DEFFREG (hack) required that all floating point instruction
1139    use all registers mentioned in the instruction. Therefore f1 := f2 + f3,
1140    defines f1 and uses [f1,f2,f3]! This hack is no longer required resulting
1141    in a cleaner alpha implementation. (Hopefully, intel will not get rid of
1142    this architecture).
1143    
1144    COPYXXX is intended to replace the parallel COPY and FCOPY  available on
1145    all the architectures. This will result in further simplification of the
1146    register allocator that must be aware of them for coalescing purposes, and
1147    will also simplify certain aspects of the machine description that provides
1148    callbacks related to parallel copies.
1149    
1150    ANNOTATION should be obvious, and now INSTR represents the honest to God
1151    machine instruction set!
1152    
1153    The <arch>/instructions/<arch>Instr.sml files define certain utility
1154    functions for making porting easier -- essentially converting upper case
1155    to lower case. All machine instructions (of type instr) are in upper case,
1156    and the lower case form generates an MLRISC instruction. For example on
1157    the alpha we have:
1158    
1159      datatype instr =
1160         LDA of {r:cell, b:cell, d:operand}
1161       | ...
1162    
1163      val lda : {r:cell, b:cell, d:operand} -> instruction
1164        ...
1165    
1166    where lda is just (INSTR o LDA), etc.
1167    
1168    ----------------------------------------------------------------------
1169    Name: Matthias Blume
1170    Date: 2001/11/22 21:40:00 EST
1171    Tag: Release_110_37
1172    Description:
1173    
1174    Release 110.37.  This time for real.
1175    
1176    ----------------------------------------------------------------------
1177    Name: Matthias Blume
1178    Date: 2001/11/21 16:35:00 EST
1179    Tag: blume-20011121-foot-in-mouth
1180    Description:
1181    
1182    Removed the "Release_110_37" tag because of a serious bug.
1183    This will be re-tagged once the bug is fixed.
1184    
1185    ----------------------------------------------------------------------
1186    Name: Matthias Blume
1187    Date: 2001/11/21 16:14:00 EST
1188    Tag: blume-20011121-forgottenfile
1189    Description:
1190    
1191    Forgot to add a file.  (Just a .tex-file -- part of
1192    the CM manual source.)
1193    
1194    ----------------------------------------------------------------------
1195    Name: Matthias Blume
1196    Date: 2001/11/21 16:10:00 EST
1197    Tag: blume-20011121-invalid_110_37
1198    Description:
1199    
1200    Note: I removed the original tag "Release_110_37" from this commit
1201          because we found a serious bug in all non-x86 backends.
1202          - Matthias
1203    
1204    1. Modifications to the SML/NJ code generator and to the runtime system
1205       so that code object name strings are directly inserted into code
1206       objects at code generation time.  The only business the runtime system
1207       has with this is now to read the name strings on occasions.
1208       (The encoding of the name string has also changed somewhat.)
1209    
1210    2. CM now implements a simple "set calculus" for specifying export lists.
1211       In particular, it is now possible to refer to the export lists of
1212       other libraries/groups/sources and form unions as well as differences.
1213       See the latest CM manual for details.
1214    
1215    3. An separate notion of "proxy" libraries has again be eliminated from
1216       CM's model.  (Proxy libraries are now simply a special case of using
1217       the export list calculus.)
1218    
1219    4. Some of the existing libraries now take advantage of the new set
1220       calculus.
1221       (Notice that not all libraries have been converted because some
1222       of the existing .cm-files are supposed to be backward compatible
1223       with 110.0.x.)
1224    
1225    5. Some cleanup in stand-alone programs.  (Don't use "exnMessage" -- use
1226       "General.exnMessage"!  The former relies on a certain hook to be
1227       initialized, and that often does not happen in the stand-alone case.)
1228    
1229    ----------------------------------------------------------------------
1230    Name: Lal George
1231    Date: 2001/11/21  13:56:18 EST
1232    Tag: george-2001121-pseudo-ops
1233    Description:
1234    
1235      Implemented a complete redesign of MLRISC pseudo-ops. Now there
1236      ought to never be any question of incompatabilities with
1237      pseudo-op syntax expected by host assemblers.
1238    
1239      For now, only modules supporting GAS syntax are implemented
1240      but more should follow, such as MASM, and vendor assembler
1241      syntax, e.g. IBM as, Sun as, etc.
1242    
1243    ----------------------------------------------------------------------
1244    Name: Matthias Blume
1245    Date: 2001/11/14 11:52:00 EST
1246    Tag: blume-20011114-srcname
1247    Description:
1248    
1249    1. Routed the name of the current source file to mlriscgen where it
1250       should be directly emitted into the code object.  (This last part
1251       is yet to be done.)
1252    
1253    2. Some cleanup of the pgraph code to make it match the proposal that
1254       I put out the other day.  (The proposal notwithstanding, things are
1255       still in flux here.)
1256    
1257    ----------------------------------------------------------------------
1258    Name: Lal George
1259    Date: 2001/11/14 09:44:04 EST
1260    Tag:
1261    Description:
1262    
1263      Fix for a backpatching bug reported by Allen.
1264    
1265      Because the boundary between short and long span-dependent
1266      instructions is +/- 128, there are an astounding number of
1267      span-dependent instructions whose size is over estimated.
1268    
1269      Allen came up with the idea of letting the size of span
1270      dependent instructions be non-monotonic, for a maxIter
1271      number of times, after which the size must be monotonically
1272      increasing.
1273    
1274      This table shows the number of span-dependent instructions
1275      whose size was over-estimated as a function of maxIter, for the
1276      file Parse/parse/ml.grm.sml:
1277    
1278         maxIter            # of instructions:
1279            10                      687
1280            20                      438
1281            30                      198
1282            40                        0
1283    
1284      In compiling the compiler, there is no significant difference in
1285      compilation speed between maxIter=10 and maxIter=40. Actually,
1286      my measurements showed that maxIter=40 was a tad faster than
1287      maxIter=10! Also 96% of the  files in the compiler reach a fix
1288      point within 13 iterations, so fixing maxIter at 40, while high,
1289      is okay.
1290    
1291    ----------------------------------------------------------------------
1292    Name: Matthias Blume
1293    Date: 2001/10/31 15:25:00 EST
1294    Tag: blume-20011031-pgraph
1295    Description:
1296    
1297    CKIT:
1298    * Changed the "Function" constructor of type Ast.ctype to carry optional
1299      argument identifiers.
1300    * Changed the return type of TypeUtil.getFunction accordingly.
1301    * Type equality ignores the argument names.
1302    * TypeUtil.composite tries to preserve argument names but gives up quickly
1303      if there is a mismatch.
1304    
1305    installation script:
1306    * attempts to use "curl" if available (unless "wget" is available as well)
1307    
1308    CM:
1309    * has an experimental implementation of "portable graphs" which I will
1310      soon propose as an implementation-independent library format
1311    * there are also new libraries $/pgraph.cm and $/pgraph-util.cm
1312    
1313    NLFFI-LIB:
1314    * some cleanup (all cosmetic)
1315    
1316    NLFFIGEN:
1317    * temporarily disabled the mechanism that suppresses ML output for
1318      C definitions whose identifiers start with an underscore character
1319    * generate val bindings for enum constants
1320    * user can request that only one style (light or heavy) is being used;
1321      default is to use both (command-line arguments: -heavy and -light)
1322    * fixed bug in handling of function types involving incomplete pointers
1323    * generate ML entry points that take record arguments (i.e., using
1324      named arguments) for C functions that have a prototype with named
1325      arguments
1326      (see changes to CKIT)
1327    
1328    ----------------------------------------------------------------------
1329    Name: Allen Leung
1330    Date: 2001/10/27 20:34:00 EDT
1331    Tag: leunga-20011027-x86-fast-fp-call
1332    Description:
1333    
1334       Fixed the bug described in blume-20010920-slowfp.
1335    
1336       The fix involves
1337          1. generating FCOPYs in FSTP in ia32-svid
1338          2. marking a CALL with the appropriate annotation
1339    
1340    ----------------------------------------------------------------------
1341    Name: Matthias Blume
1342    Date: 2001/10/16 11:32:00 EDT
1343    Tag: blume-20011016-netbsd
1344    Description:
1345    
1346    Underscore patch from Chris Richards (fixing problem with compiling
1347    runtime system under recent NetBSD).
1348    
1349    ----------------------------------------------------------------------
1350    Name: Allen Leung
1351    Date: 2001/10/12 17:18:32 EDT 2001
1352    Tag: leung-20011012-x86-printflowgraph
1353    Description:
1354    
1355    X86RA now uses a valid (instead of dummy) PrintFlowgraph module.
1356    
1357    ----------------------------------------------------------------------
1358    Name: Lal George
1359    Date: 2001/10/11 23:51:34 EDT
1360    Tag: george-20011011-too-many-instrs
1361    Description:
1362    
1363    The representation of a program point never expected to see more
1364    than 65536 instructions in a basic block!
1365    
1366    ----------------------------------------------------------------------
1367    Name: Lal George
1368    Date: 2001/10/09 09:41:37 EDT
1369    Tag: george-20011008-mlrisc-labels
1370    Description:
1371    
1372    Changed the machine description files to support printing of
1373    local and global labels in assembly code, based on host assembler
1374    conventions.
1375    
1376    ----------------------------------------------------------------------
1377    Name: Matthias Blume
1378    Date: 2001/09/25 15:25:00 EDT
1379    Tag: blume-20010925-exninfo
1380    Description:
1381    
1382    I provided a non-hook implementation of exnName (at the toplevel) and
1383    made the "dummy" implementation of exnMessage (at the toplevel) more
1384    useful: if nothing gets "hooked in", then at least you are going to
1385    see the exception name and a message indicating why you don't see more.
1386    
1387    [For the time being, programs that need exnMessage and want to use
1388    ml-build should either use General.exnMessage (strongly recommended) or
1389    refer to structure General at some other point so that CM sees a
1390    static dependency.]
1391    
1392    [Similar remarks go for "print" and "use":  If you want to use their
1393    functionality in stand-alone programs generated by ml-build, then use
1394    TextIO.output and Backend.Interact.useFile (from $smlnj/compiler.cm).]
1395    
1396    ----------------------------------------------------------------------
1397    Name: Matthias Blume
1398    Date: 2001/09/20 17:28:00 EDT
1399    Tag: blume-20010920-slowfp
1400    Description:
1401    
1402    Allen says that x86-fast-fp is not safe yet, so I turned it off again...
1403    
1404    ----------------------------------------------------------------------
1405    Name: Matthias Blume
1406    Date: 2001/09/20 17:20:00 EDT
1407    Tag: blume-20010920-canonicalpaths
1408    Description:
1409    
1410    0. Updated the BOOT file (something that I forgot to do earlier).
1411    
1412    1. Small internal change to CM so that it avoids "/../" in filenames
1413       as much as possible (but only where it is safe).
1414    
1415    2. Changed config/_run-sml (resulting in a changed bin/.run-sml) so
1416       that arguments that contain delimiters are passed through correctly.
1417       This change also means that all "special" arguments of the form
1418       @SMLxxx... must come first.
1419    
1420    3. Changed install script to put relative anchor names for tool commands
1421       into pathconfig.
1422    
1423    ----------------------------------------------------------------------
1424    Name: Matthias Blume
1425    Date: 2001/09/18 15:35:00 EDT
1426    Tag: blume-20010918-readme11036
1427    Description:
1428    
1429    Added README files.
1430    
1431    ----------------------------------------------------------------------
1432    Name: Matthias Blume
1433    Date: 2001/09/18 11:45:00 EDT
1434    Tag: Release_110_36 (retag)
1435    Description:
1436    
1437    Fixed mistake in config/preloads. Retagged as 110.36.
1438    
1439    ----------------------------------------------------------------------
1440    Name: Matthias Blume
1441    Date: 2001/09/18 09:40:00 EDT
1442    Tag: Release_110_36_orig (tag changed)
1443    Description:
1444    
1445    New version (110.36).  New bootfiles.
1446    
1447    ----------------------------------------------------------------------
1448    Name: Matthias Blume
1449    Date: 2001/09/14 16:15:00 EDT
1450    Tag: blume-20010914-x86fastfp
1451    Description:
1452    
1453    John committed some changes that Allen made, in particular a (hopefully)
1454    correctly working version of the x86-fp module.
1455    
1456    I changed the default setting of the Control.MLRISC.getFlag "x86-fast-fp"
1457    flag to "true".  Everything seems to compile to a fixpoint ok, and
1458    "mandelbrot" speeds up by about 15%.
1459    
1460    ----------------------------------------------------------------------
1461    Name: Matthias Blume
1462    Date: 2001/09/13 11:20:00 EDT
1463    Tag: blume-20010913-minimal
1464    Description:
1465    
1466    1. Stefan Monnier's patch to fix a miscompilation problem that
1467       was brought to light by John Reppy's work on Moby.
1468    
1469    2. Implemented a minimal "structure Compiler" that contains just
1470       "version" and "architecture".  The minimal version will be
1471       available when the full version is not.  This is for backward-
1472       compatibility with code that wants to test Compiler.version.
1473    
1474    ----------------------------------------------------------------------
1475    Name: Matthias Blume
1476    Date: 2001/08/28 14:03:00 EDT
1477    Tag: blume-20010828-ml-lex
1478    Description:
1479    
1480    Fix for bug 1581, received from Neophytos Michael.
1481    
1482    ----------------------------------------------------------------------
1483    Name: Matthias Blume
1484    Date: 2001/08/27 11:20:00 EDT
1485    Tag: blume-20010827-readme11035
1486    Description:
1487    
1488    Fleshed out the README file for 110.35.
1489    
1490    ----------------------------------------------------------------------
1491    Name: Matthias Blume
1492    Date: 2001/08/24 17:10:00 EDT
1493    Tag: Release_110_35
1494    Description:
1495    
1496    New version number (110.35).  New bootfiles.
1497    
1498    ----------------------------------------------------------------------
1499    Name: Lal George
1500    Date: 2001/08/24 13:47:18 EDT 2001
1501    Tag: george-20010824-MLRISC-graphs
1502    Description:
1503    
1504     removed clusters from MLRISC completely and replaced with graphs.
1505    
1506    ----------------------------------------------------------------------
1507    Name: Matthias Blume
1508    Date: 2001/08/23 17:50:00 EDT
1509    Tag: blume-20010823-toplevel
1510    Description:
1511    
1512    - some reorganization of the code that implements various kinds of
1513      environments in the compiler (static, dynamic, symbolic, combined)
1514    - re-implemented the EnvRef module so that evalStream works properly
1515      (if the stream contains references to "use", "CM.make", etc.)
1516    - cleaned up evalloop.sml and interact.sml (but they need more cleaning)
1517    
1518    ----------------------------------------------------------------------
1519    Name: Matthias Blume
1520    Date: 2001/08/20 15:50 EDT
1521    Tag: blume20010820-slipup
1522    Description:
1523    
1524    I forgot to commit a few files.  Here they are...
1525    
1526    ----------------------------------------------------------------------
1527    Name: Matthias Blume
1528    Date: 2001/08/20 15:35:00 EDT
1529    Tag: blume-20010820-debugprof
1530    Description:
1531    
1532    !!!! NEW BOOTFILES !!!!
1533    
1534    This is another round of reorganizing the compiler sources.  This
1535    time the main goal was to factor out all the "instrumentation"
1536    passes (for profiling and backtracing) into their own library.
1537    The difficulty was to do it in such a way that it does not depend
1538    on elaborate.cm but only on elabdata.cm.
1539    
1540    Therefore there have been further changes to both elaborate.cm and
1541    elabdata.cm -- more "generic" things have been moved from the former
1542    to the latter.  As a result, I was forced to split the assignment
1543    of numbers indicating "primtyc"s into two portions: SML-generic and
1544    SML/NJ-specific.  Since it would have been awkward to maintain,
1545    I bit the bullet and actually _changed_ the mapping between these
1546    numbers and primtycs.  The bottom line of this is that you need
1547    a new set of bin- and bootfiles.
1548    
1549    I have built new bootfiles for all architectures, so doing a fresh
1550    checkout and config/install.sh should be all you need.
1551    
1552    The newly created library's name is
1553    
1554        $smlnj/viscomp/debugprof.cm
1555    
1556    and its sources live under
1557    
1558        src/compiler/DebugProf
1559    
1560    ----------------------------------------------------------------------
1561    Name: Matthias Blume
1562    Date: 2001/08/15 17:15:00 EDT
1563    Tag: blume-20010815-compreorg
1564    Description:
1565    
1566    This is a first cut at reorganizing the CM libraries that make up the
1567    core of the compiler.  The idea is to separate out pieces that could
1568    be used independently by tools, e.g., the parser, the typechecker, etc.
1569    
1570    The current status is a step in this direction, but it is not quite
1571    satisfactory yet.  Expect more changes in the future.
1572    
1573    Here is the current (new) organization...
1574    
1575        What used to be $smlnj/viscomp/core.cm is now divided into
1576        six CM libraries:
1577    
1578             $smlnj/viscomp/basics.cm
1579                           /parser.cm
1580                           /elabdata.cm
1581                           /elaborate.cm
1582                           /execute.cm
1583                           /core.cm
1584    
1585        The CM files for these libraries live under src/system/smlnj/viscomp.
1586        All these libraries are proxy libraries that contain precisely
1587        one CM library component.  Here are the locations of the components
1588        (all within the src/compiler tree):
1589    
1590             Basics/basics.cm
1591             Parse/parser.cm
1592             ElabData/elabdata.cm
1593             Elaborator/elaborate.cm
1594             Execution/execute.cm
1595             core.cm
1596    
1597         [This organization is the same that has been used already
1598         for a while for the architecture-specific parts of the visible
1599         compiler and for the old version of core.cm.]
1600    
1601         As you will notice, many source files have been moved from their
1602         respective original locations to a new home in one of the above
1603         subtrees.
1604    
1605         The division of labor between the new libraries is the following:
1606    
1607             basics.cm:
1608                - Simple, basic definitions that pertain to many (or all) of
1609                  the other libraries.
1610             parser.cm:
1611                - The SML parser, producing output of type Ast.dec.
1612                - The type family for Ast is also defined and exported here.
1613             elabdata.cm:
1614                - The datatypes that describe input and output of the elaborator.
1615                  This includes types, absyn, and static environments.
1616             elaborator.cm:
1617                - The SML/NJ type checker and elaborator.
1618                  This maps an Ast.dec (with a given static environment) to
1619                  an Absyn.dec (with a new static environment).
1620                - This libraries implements certain modules that used to be
1621                  structures as functors (to remove dependencies on FLINT).
1622             execute.cm:
1623                - Everything having to do with executing binary code objects.
1624                - Dynamic environments.
1625             core.cm:
1626                - SML/NJ-specific instantiations of the elaborator and MLRISC.
1627                - Top-level modules.
1628                - FLINT (this should eventually become its own library)
1629    
1630    Notes:
1631    
1632    I am not 100% happy with the way I separated the elaborator (and its
1633    data structures) from FLINT.  Two instances of the same problem:
1634    
1635        1. Data structures contain certain fields that carry FLINT-specific
1636           information.  I hacked around this using exn and the property list
1637           module from smlnj-lib.  But the fact that there are middle-end
1638           specific fields around at all is a bit annoying.
1639    
1640        2. The elaborator calculates certain FLINT-related information.  I tried
1641           to make this as abstract as I could using functorization, but, again,
1642           the fact that the elaborator has to perform calculations on behalf
1643           of the middle-end at all is not nice.
1644    
1645        3. Having to used exn and property lists is unfortunate because it
1646           weakens type checking.  The other alternative (parameterizing
1647           nearly *everything*) is not appealing, though.
1648    
1649    I removed the "rebinding =" warning hack because due to the new organization
1650    it was awkward to maintain it.  As a result, the compiler now issues some of
1651    these warnings when compiling init.cmi during bootstrap compilation. On
1652    the plus side, you also get a warning when you do, for example:
1653       val op = = Int32.+
1654    which was not the case up to now.
1655    
1656    I placed "assign" and "deref" into the _Core structure so that the
1657    code that deals with the "lazy" keyword can find them there.  This
1658    removes the need for having access to the primitive environment
1659    during elaboration.
1660    
1661    ----------------------------------------------------------------------
1662    Name: Matthias Blume
1663    Date: 2001/08/13
1664    Tag: blume-20010813-closures
1665    Description:
1666    
1667    This fix was sent to us by Zhong Shao.  It is supposed to improve the
1668    performance of certain loops by avoiding needless closure allocation.
1669    
1670    ----------------------------------------------------------------------
1671    Name: Lal George
1672    Date: 2001/07/31 10:03:23 EDT 2001
1673    Tag: george-20010731-x86-fmalloc
1674    Description: Fixed bug in x86 calls
1675    
1676        There was a bug where call instructions would mysteriously
1677        vanish. The call instruction had to be one that returned
1678        a floating point value.
1679    
1680    ----------------------------------------------------------------------
1681    Name: Lal George
1682    Date: 2001/07/19 16:36:29 EDT 2001
1683    Tag: george-20010719-simple-cells
1684    Description:
1685    
1686    I have dramatically simplified the interface for CELLS in MLRISC.
1687    
1688    In summary, the cells interface is broken up into three parts:
1689    
1690      1. CellsBasis : CELLS_BASIS
1691    
1692            CellsBasis is a top level structure and common for all
1693            architectures.  it contains the definitions of basic datatypes
1694            and utility  functions over these types.
1695    
1696      2. functor Cells() : CELLS
1697    
1698            Cells generates an interface for CELLS that incorporates the
1699            specific resources on the target architecture, such as the
1700            presence of special register classes, their number and size,
1701            and various useful substructures.
1702    
1703      3. <ARCH>CELLS
1704    
1705            e.g. SparcCells: SPARCCELLS
1706    
1707            <ARCH>CELLS usually contains additional bindings for special
1708            registers  on the architecture, such as:
1709    
1710                    val r0 : cell           (* register zero *)
1711                    val y : cell            (* Y register *)
1712                    val psr : cell          (* processor status register *)
1713                    ...
1714    
1715            The structure returned by applying the Cells functor is opened
1716            in this interface.
1717    
1718    The main implication of all this is that the datatypes for cells is
1719    split between CellsBasis and CELLS -- a fairly simple change for user
1720    code.
1721    
1722    In the old scheme the CELLS interface had a definitional binding of
1723    the form:
1724    
1725            signature CELLS = sig
1726    
1727               structure CellsBasis = CellsBasis
1728    
1729               ...
1730    
1731            end
1732    
1733    With all the sharing constraints that goes on in MLRISC, this old
1734    design  quickly leads to errors such as:
1735    
1736            "structure definition spec inside of sharing ... "
1737    
1738    
1739    and appears to require an unacceptable amount of sharing and where
1740    constraint hackery.
1741    
1742    I think this error message (the interaction of definitional specs and
1743    sharing) requires more explanation on our web page.
1744    
1745    ----------------------------------------------------------------------
1746    Name: Matthias Blume
1747    Date: 2001/07/19 15:00:00 EDT
1748    Tag: blume-20010719-libreorg
1749    Description:
1750    
1751    This update puts together a fairly extensive but straightforward change
1752    to the way the libraries that implement the interactive system are
1753    organized:
1754    
1755       The biggest change is the elimination of structure Compiler.  As a
1756       replacement for this structure, there is now a CM library
1757       (known as $smlnj/compiler.cm or $smlnj/compiler/current.cm)
1758       that exports all the substructures of the original structure Compiler
1759       directly.  So instead of saying Compiler.Foo.bar one now simply
1760       says Foo.bar.  (The CM libraries actually export a collection of
1761       structures that is richer than the collection of substructures of
1762       structure Compiler.)
1763    
1764       To make the transition smooth, there is a separate library called
1765       $smlnj/compiler/compiler.cm which puts together and exports the
1766       original structure Compiler (or at least something very close to it).
1767    
1768       There are five members of the original structure Compiler
1769       that are not exported directly but which instead became members
1770       of a new structure Backend (described by signature BACKEND).  These are:
1771       structure Profile (: PROFILE), structure Compile (: COMPILE), structure
1772       Interact (: INTERACT), structure Machine (: MACHINE), and val
1773       architecture (: string).
1774    
1775       Structure Compiler.Version has become structure CompilerVersion.
1776    
1777       Cross-compilers for alpha32, hppa, ppc, sparc, and x86 are provided
1778       by $smlnj/compiler/<arch>.cm where <arch> is alpha32, hppa, ppc, sparc,
1779       or x86, respectively.
1780       Each of these exports the same frontend structures that
1781       $smlnj/compiler.cm exports.  But they do not have a structure Backend
1782       and instead export some structure <Arch>Backend where <Arch> is Alpha32,
1783       Hppa, PPC, Sparc, or X86, respectively.
1784    
1785       Library $smlnj/compiler/all.cm exports the union of the exports of
1786       $smlnj/compiler/<arch>.cm
1787    
1788       There are no structures <Arch>Compiler anymore, use
1789       $smlnj/compiler/<arch>.cm instead.
1790    
1791       Library host-compiler-0.cm is gone.  Instead, the internal library
1792       that instantiates CM is now called cm0.cm.  Selection of the host
1793       compiler (backend) is no longer done here but. (Responsibility for it
1794       now lies with $smlnj/compiler/current.cm.  This seems to be more
1795       logical.)
1796    
1797       Many individual files have been moved or renamed.  Some files have
1798       been split into multiple files, and some "dead" files have been deleted.
1799    
1800    Aside from these changes to library organization, there are also changes
1801    to the way the code itself is organized:
1802    
1803       Structure Binfile has been re-implemented in such a way that it no
1804       longer needs any knowledge of the compiler.  It exclusively deals
1805       with the details of binfile layout.  It no longer invokes the
1806       compiler (for the purpose of creating new prospective binfile
1807       content), and it no longer has any knowledge of how to interpret
1808       pickles.
1809    
1810       Structure Compile (: COMPILE) has been stripped down to the bare
1811       essentials of compilation.  It no longer deals with linking/execution.
1812       The interface has been cleaned up considerably.
1813    
1814       Utility routines for dealing with linking and execution have been
1815       moved into their own substructures.
1816    
1817       (The ultimate goal of these changes is to provide a light-weight
1818       binfile loader/linker (at least for, e.g., stable libraries) that
1819       does not require CM or the compiler to be present.)
1820    
1821    CM documentation has been updated to reflect the changes to library
1822    organization.
1823    
1824    ----------------------------------------------------------------------
1825    Name: Matthias Blume
1826    Date: 2001/07/10 17:30:00 EDT
1827    Tag: Release_110_34
1828    Description:
1829    
1830    Minor tweak to 110.34 (re-tagged):
1831    
1832      - README.html file added to CVS repository
1833      - runtime compiles properly under FreeBSD 3.X and 4.X
1834    
1835    ----------------------------------------------------------------------
1836    Name: Matthias Blume
1837    Date: 2001/07/10 17:30:00 EDT
1838    Tag: Release_110_34
1839    Description:
1840    
1841    New version number (110.34). New bootfiles.
1842    
1843    ----------------------------------------------------------------------
1844    Name: Matthias Blume
1845    Date: 2001/07/09 16:00:00 EDT
1846    Tag: blume-20010709-more-varargs
1847    Description:
1848    
1849    I changed the handling of varargs in ml-nlffigen again:
1850    The ellipsis ... will now simply be ignored (with an accompanying warning).
1851    
1852    The immediate effect is that you can actually call a varargs function
1853    from ML -- but you can't actually supply any arguments beyond the ones
1854    specified explicitly.  (For example, you can call printf with its format
1855    string, but you cannot pass additional arguments.)
1856    
1857    This behavior is only marginally more useful than the one before, but
1858    it has the advantage that a function or, more importantly, a function
1859    type never gets dropped on the floor, thus avoiding follow-up problems with
1860    other types that refer to the offending one.
1861    
1862    ----------------------------------------------------------------------
1863    Name: Matthias Blume
1864    Date: 2001/07/09 11:25:00 EDT
1865    Tag: blume-20010709-varargs
1866    Description:
1867    
1868    1. ckit-lib.cm now exports structure Error
1869    2. ml-nlffigen reports occurences of "..." (i.e., varargs function types)
1870       with a warning accompanied by a source location.  Moreover, it
1871       merely skips the offending function or type and proceeds with the
1872       rest of its work.u  As a result, one can safely feed C code containing
1873       "..." to ml-nlffigen.
1874    3. There are some internal improvements to CM, providing slightly
1875       more general string substitutions in the tools subsystem.
1876    
1877    ----------------------------------------------------------------------
1878    Name: Matthias Blume
1879    Date: 2001/06/27 15:10:00 EDT
1880    Tag: blume-20010627-concur
1881    Description:
1882    
1883    Fixed a small bug in CM's handling of parallel compilation.
1884    (You could observe the bug by Control-C-interrupting an ordinary
1885    CMB.make or CM.stabilize and then attaching some compile servers.
1886    The result was that all of a sudden the previously interrupted
1887    compilation would continue on its own.  This was because of
1888    an over-optimization: CM did not bother to clean out certain queues
1889    when no servers were attached "anyway", resulting in the contents
1890    of these queues to grab control when new servers did get attached.)
1891    
1892    There is also another minor update to the CM manual.
1893    
1894    ----------------------------------------------------------------------
1895    Name: Matthias Blume
1896    Date: 2001/06/26 16:15:00 EDT
1897    Tag: blume-20010626-cmdoc
1898    Description:
1899    
1900    Minor typo fixed in CM manual (syntax diagram for libraries).
1901    
1902    ----------------------------------------------------------------------
1903    Name: Matthias Blume
1904    Date: 2001/06/25 22:55:00 EDT
1905    Tag: blume-20010625-x86pc
1906    Description:
1907    
1908    Fixed a nasty bug in the X86 assembly code that caused signal
1909    handlers to fail (crash) randomly.
1910    
1911    ----------------------------------------------------------------------
1912    Name: Matthias Blume
1913    Date: 2001/06/25 12:05:00 EDT
1914    Tag: blume-20010625-nlffigen
1915    Description:
1916    
1917    This update fixes a number of minor bugs in ml-nlffigen as reported by
1918    Nick Carter <nbc@andrew.cmu.edu>.
1919    
1920      1. Silly but ok typedefs of the form "typedef void myvoid;" are now accepted.
1921      2. Default names for generated files are now derived from the name of
1922         the C file *without its directory*.  In particular, this causes generated
1923         files to be placed locally even if the C file is in some system directory.
1924      3. Default names for generated signatures and structures are also derived
1925         from the C file name without its directory.  This avoids silly things
1926         like "structure GL/GL".
1927         (Other silly names are still possible because ml-nlffigen does not do
1928          a thorough check of whether generated names are legal ML identifiers.
1929          When in doubt, use command line arguments to force particular names.)
1930    
1931    ----------------------------------------------------------------------
1932    Name: Matthias Blume
1933    Date: 2001/06/21 12:25:00 EDT
1934    Tag: blume-20010621-eXene
1935    Description:
1936    
1937    eXene now compiles and (sort of) works again.
1938    
1939    The library name (for version > 110.33) is $/eXene.cm.
1940    
1941    I also added an new example in src/eXene/examples/nbody.  See the
1942    README file there for details.
1943    
1944    ----------------------------------------------------------------------
1945    Name: Matthias Blume
1946    Date: 2001/06/20 16:40:00 EDT
1947    Tag: blume-20010620-cml
1948    Description:
1949    
1950    CML now compiles and works again.
1951    
1952    Libraries (for version > 110.33):
1953    
1954      $cml/cml.cm            Main CML library.
1955      $cml/basis.cm          CML's version of $/basis.cm.
1956      $cml/cml-internal.cm   Internal helper library.
1957      $cml/core-cml.cm       Internal helper library.
1958      $cml-lib/trace-cml.cm  Tracing facility.
1959      $cml-lib/smlnj-lib.cm  CML's version of $/smlnj-lib.cm
1960    
1961    The installer (config/install.sh) has been taught how to properly
1962    install this stuff.
1963    
1964    ----------------------------------------------------------------------
1965    Name: Matthias Blume
1966    Date: 2001/06/19 17:55:00 EDT
1967    Tag: blume-20010619-instantiate
1968    Description:
1969    
1970    This un-breaks the fix for bug 1432.
1971    (The bug was originally fixed in 110.9 but I broke it again some
1972    time after that.)
1973    
1974    ----------------------------------------------------------------------
1975    Name: Matthias Blume
1976    Date: 2001/06/19 17:25:00 EDT
1977    Tag: blume-20010619-signals
1978    Description:
1979    
1980    This should (hopefully) fix the long-standing signal handling bug.
1981    (The runtime system was constructing a continuation record with an
1982    incorrect descriptor which would cause the GC to drop data on the floor...)
1983    
1984    ----------------------------------------------------------------------
1985    Name: Matthias Blume
1986    Date: 2001/06/15 15:05:00 EDT
1987    Tag: blume-20010615-moresparc
1988    Description:
1989    
1990    Here is a short late-hour update related to Sparc c-calls:
1991    
1992     -- made handling of double-word arguments a bit smarter
1993    
1994     -- instruction selection phase tries to collapse certain clumsily
1995        constructed ML-Trees; typical example:
1996    
1997            ADD(ty,ADD(_,e,LI d1),LI d2)  ->  ADD(ty,e,LI(d1+d2))
1998    
1999        This currently has no further impact on SML/NJ since mlriscGen does
2000        not seem to generate such patterns in the first place, and c-calls
2001        (which did generate them in the beginning) has meanwhile been fixed
2002        so as to avoid them as well.
2003    
2004    ----------------------------------------------------------------------
2005    Name: Matthias Blume
2006    Date: 2001/06/15 15:05:00 EDT
2007    Tag: blume-20010615-sparc
2008    Description:
2009    
2010    The purpose of this update is to provide an implementation of NLFFI
2011    on Sparc machines.
2012    
2013    Here are the changes in detail:
2014    
2015       * src/MLRISC/sparc/c-calls/sparc-c-calls.sml is a new file containing
2016       the Sparc implementation of the c-calls API.
2017       * The Sparc backend of SML/NJ has been modified to uniformely use %fp
2018       for accessing the ML frame.  Thus, we have a real frame pointer and
2019       can freely modify %sp without need for an omit-frame-ptr phase.
2020       The vfp logic in src/compiler/CodeGen/* has been changed to accomodate
2021       this case.
2022       * ml-nlffigen has been taught to produce code for different architectures
2023       and calling conventions.
2024       * In a way similar to what was done in the x86 case, the Sparc
2025       backend uses its own specific extension to mltree.  (For example,
2026       it needs to be able to generate UNIMP instructions which are part
2027       of the calling convention.)
2028       * ml-nlffi-lib was reorganized to make it more modular (in particular,
2029       to make it easier to plug in new machine- and os-dependent parts).
2030    
2031    There are some other fairly unrelated bug fixes and cleanups as well:
2032    
2033       * I further hacked the .cm files for MLRISC tools (like MDLGen) so
2034       that they properly share their libraries with existing SML/NJ libraries.
2035       * I fixed a minor cosmetic bug in CM, supressing certain spurious
2036       follow-up error messages.
2037       * Updates to CM/CMB documentation.
2038    
2039    TODO items:
2040    
2041       * MLRISC should use a different register as its asmTemp on the Sparc.
2042         (The current %o2 is a really bad choice because it is part of the
2043         calling conventions, so things might interfere in unexpected ways.)
2044    
2045    ----------------------------------------------------------------------
2046    Name: Matthias Blume
2047    Date: 2001/06/07
2048    Tag: blume-20010607-calls
2049    Description:
2050    
2051    A number of internal changes related to C calls and calling conventions:
2052    
2053    1. ML-Tree CALL statements now carry a "pops" field.  It indicates the
2054       number of bytes popped implicitly (by the callee).  In most cases
2055       this field is 0 but on x86/win32 it is some non-zero value.  This
2056       is information provided for the benefit of the "omit-frameptr" pass.
2057    2. The CALL instruction on the x86 carries a similar "pops" field.
2058       The instruction selection phase copies its value from the ML-Tree
2059       CALL statement.
2060    3. On all other architectures, the instruction selection phase checks
2061       whether "pops=0" and complains if not.
2062    4. The c-calls implementation for x86 now accepts two calling conventions:
2063       "ccall" and "stdcall".  When "ccall" is selected, the caller cleans
2064       up after the call and pops is set to 0.  For "stdcall", the caller
2065       does nothing, leaving the cleanup to the callee; pops is set to
2066       the number of bytes that were pushed onto the stack.
2067    5. The cproto decoder (compiler/Semant/types/cproto.sml) now can
2068       distinguish between "ccall" and "stdcall".
2069    6. The UNIMP instruction has been added to the supported Sparc instruction
2070       set. (This is needed for implementing the official C calling convention
2071       on this architecture.)
2072    7. I fixed some of the .cm files under src/MLRISC/Tools to make them
2073       work with the latest CM.
2074    
2075    ----------------------------------------------------------------------
2076    Name: Matthias Blume
2077    Date: 2001/06/05 15:10:00 EDT
2078    Tag: blume-20010605-cm-index
2079    Description:
2080    
2081    0. The "lambdasplit" parameter for class "sml" in CM has been documented.
2082    
2083    1. CM can now generate "index files".  These are human-readable files
2084       that list on a per-.cm-file basis each toplevel symbol defined or
2085       imported.  The location of the index file for
2086       <p>/<d>.cm is <p>/CM/INDEX/<d>.cm.
2087       To enable index-file generation, set CM.Control.generate_index to true
2088       or export an environment-symbol: export CM_GENERATE_INDEX=true.
2089    
2090       The CM manual has been updated accordingly.
2091    
2092    2. I made some slight modifications to the c-calls API in MLRISC.
2093    
2094         a) There is now a callback to support saving/restoring of
2095            dedicated but caller-save registers around the actual call
2096            instruction.
2097         b) One can optionally specify a comment-annotation for the
2098            call instruction.
2099    
2100    3. SML/NJ (mlriscGen.sml) uses this new API for the rawccall primop.
2101       (For example, the comment annotation shows the C prototype of
2102        the function being called.)
2103    
2104    ----------------------------------------------------------------------
2105    Name: Matthias Blume
2106    Date: 2001/06/01 13:30:00 EDT
2107    Tag: blume-20010601-nlffi-cleanup
2108    Description:
2109    
2110    This is mostly a cleanup of MLFFI stuff:
2111    
2112       - some signature files have been put into a more exposed place
2113       - the ugly 'f type parameter is gone (simplifies types tremendously!)
2114       - ml-nlffigen changed accordingly
2115       - tutorial updated
2116    
2117    Other changes:
2118    
2119       - author's affiliation in CM manual(s) updated
2120       - some more recognized keywords added to Allen's sml.sty
2121    
2122    ----------------------------------------------------------------------
2123    Name: Matthias Blume
2124    Date: 2001/05/25 15:30:00 EDT
2125    Tag: blume-20010525-iptr
2126    Description:
2127    
2128      - put the official 110.33-README (as it appears on the ftp server) under
2129        CVS
2130      - fixed a small bug related to incomplete pointer types in
2131        ml-nlffigen
2132      - small cosmetic change to the ml-nlffi-lib's "arr" type constructor
2133        (it does not need the 'f type parameter)
2134    
2135    ----------------------------------------------------------------------
2136    Name: Matthias Blume
2137    Date: 2001/05/23 14:30:00 EDT
2138    Tag: Release_110_33
2139    Description:
2140    
2141    New version number (110.33).  New bootfiles.
2142    
2143    ----------------------------------------------------------------------
2144    Name: Matthias Blume
2145    Date: 2001/05/22 18:06:00 EDT
2146    Tag: blume-20010522-targets
2147    Description:
2148    
2149    Made install.sh use file config/targets.customized if it exists, falling
2150    back to config/targets if it doesn't.  This way one can have a customized
2151    version of the targets file without touching the "real thing", thus
2152    eliminating the constant fear of accidentally checking something bogus
2153    back into the CVS repository...  (File config/targets.customized must
2154    not be added to the repository!)
2155    
2156    ----------------------------------------------------------------------
2157    Name: Matthias Blume
2158    Date: 2001/05/22 16:30:00 EDT
2159    Tag: blume-20010522-minitut
2160    Description:
2161    
2162    1. Bug fix in ml-nlffigen; now (hopefully) correctly handling
2163       struct returns.
2164    2. Added src/ml-nlffi-lib/Doc/mini-tutorial.txt.  This is some very
2165       incomplete, preliminary documentation for NLFFI.
2166    
2167    ----------------------------------------------------------------------
2168    Name: Matthias Blume
2169    Date: 2001/05/14 11:30:00 EDT
2170    Tag: blume-20010514-script
2171    Description:
2172    
2173    Some bugs in install script fixed.
2174    
2175    In addition to that I also made a slight change to the NLFFI API:
2176    Functors generated by ml-nlffigen now take the dynamic library as a
2177    straight functor argument, not as a suspended one.  (The original
2178    functor code used to force the suspension right away anyway, so there
2179    was nothing gained by this complication of the interface.)
2180    
2181    ----------------------------------------------------------------------
2182    Name: Matthias Blume
2183    Date: 2001/05/11 14:35:00 EDT
2184    Tag: blume-20010511-ml-nlffi
2185    Description:
2186    
2187    I finally took the plunge and added my new FFI code to the main
2188    repository.  For x86-linux it is now ready for prime-time.
2189    
2190    There are two new subdirectories of "src":
2191    
2192      - ml-nlffi-lib:
2193           The utility library for programs using the FFI interface.
2194           Here is the implementation of $/c.cm and its associated low-level
2195           partners $/c-int.cm and $/memory.cm.
2196      - ml-nlffigen:
2197           A stand-alone program for generating ML glue code from C source
2198           code.
2199    
2200    Building ml-nlffigen requires $/ckit-lib.cm.
2201    
2202    The config/install.sh script has been updates to do the Right Thing
2203    (hopefully).
2204    
2205    Notice that the source tree for the C-Kit will not be put under "src"
2206    but directly under the installation root directory.  (This is the
2207    structure that currently exists on the CVS server when you check out
2208    module "sml".)  Fortunately, config/install.sh knows about this oddity.
2209    
2210    Bugs: No documentation yet.
2211    
2212    ----------------------------------------------------------------------
2213    Name: Matthias Blume
2214    Date: 2001/05/09 16:35:00 EDT
2215    Tag: blume-20010509-cpscontract
2216    Description:
2217    
2218    Fixed a bug in the accounting code in cpsopt/contract.sml.  (The
2219    wrapper/unwrapper elimination did not decrement usage counts and some
2220    dead variables got overlooked by the dead-up logic.)
2221    
2222    ----------------------------------------------------------------------
2223    Name: Lal George
2224    Date: 2001/05/08  17:26:09 EDT
2225    Tag: george-20010508-omit-frameptr
2226    Description:
2227    
2228    Changes to implement the omit-frame-pointer optimization to support
2229    raw C calls. For now, there is only support on the Intel x86, but
2230    other architectures will follow as more experience is gained with this.
2231    
2232    
2233    ----------------------------------------------------------------------
2234    Name: Matthias Blume
2235    Date: 2001/05/07 14:40:00 EDT
2236    Tag: blume-20010507-proxies
2237    Description:
2238    
2239    I made into "proxy libraries" all libraries that qualify for such a
2240    change.  (A qualifying library is a library that has another library or
2241    groups as its sole member and repeats that member's export list
2242    verbatim.  A proxy library avoids this repetition by omitting its export
2243    list, effectively inheriting the list that its (only) member exports.
2244    See the CM manual for more explanation.)
2245    The main effect is that explicit export lists for these libraries
2246    do not have to be kepts in sync, making maintenance a bit easier.
2247    
2248    I also added copyright notices to many .cm-files.
2249    
2250    Last but not least, I made a new set of bootfiles.
2251    
2252    ----------------------------------------------------------------------
2253    Name: Matthias Blume
2254    Date: 2001/05/04 17:00:00 EDT
2255    Tag: blume-20010504-cm-lsplit
2256    Description:
2257    
2258    0. John merged pending changes to $/smlnj-lib.cm
2259    
2260    1. Allen's previous change accidentally backed out of one of Lal's
2261       earlier changes.  I undid this mistake (re-introducing Lal's change).
2262    
2263    2. I used the new topOrder' function from graph-scc.sml (from $/smlnj-lib.cm)
2264       within the compiler where applicable.  There is some code simplification
2265       because of that.
2266    
2267    3. The "split" phase (in FLINT) is now part of the default list of phases.
2268       Compiler.Control.LambdaSplitting.* can be used to globally control the
2269       lambda-splitting (cross-module-inlining) engine.  In addition to that,
2270       it can now also be controlled on a per-source basis: CM has been taught
2271       a new tool parameter applicable to ML source files.
2272    
2273       - To turn lambda-splitting off completely:
2274            local open Compiler.Control.LambdaSplitting in
2275                val _ = set Off
2276            end
2277       - To make "no lambda-splitting" the global default (but allow per-source
2278         overriding); this is the initial setting:
2279            local open Compiler.Control.LambdaSplitting in
2280                val _ = set (Default NONE)
2281            end
2282       - To make "lambda-splitting with aggressiveness a" the global default
2283         (and allow per-source overriding):
2284            local open Compiler.Control.LambdaSplitting in
2285                val _ = set (Default (SOME a))
2286            end
2287    
2288       - To turn lambda-splitting off for a given ML souce file (say: a.sml)
2289         write (in the respective .cm-file):
2290            a.sml (lambdasplitting:off)
2291       - To turn lambda-splitting for a.sml on with minimal aggressiveness:
2292            a.sml (lambdasplitting:on)
2293       - To turn lambda-splitting for a.sml on with aggressiveness <a> (where
2294         <a> is a decimal non-negative integer):
2295            a.sml (lambdasplitting:<a>)
2296       - To turn lambda-splitting for a.sml on with maximal aggressiveness:
2297            a.sml (lambdasplitting:infinity)
2298       - To use the global default for a.sml:
2299            a.sml (lambdasplitting:default)
2300         or simply
2301            a.sml
2302    
2303    ----------------------------------------------------------------------
2304    Name: Allen Leung
2305    Date: 2001/05/04 01:57:00 EDT
2306    Tag: leunga-20010504-sync
2307    Description:
2308    
2309      MLRISC features.
2310    
2311      1. Fix to CMPXCHG instructions.
2312      2. Changed RA interface to allow annotations in callbacks.
2313      3. Added a new method to the stream interface to allow annotations updates.
2314    
2315    ----------------------------------------------------------------------
2316    Name: Matthias Blume
2317    Date: 2001/05/01 11:45:00 EDT
2318    Tag: blume-20010501-pcedittmp
2319    Description:
2320    
2321    Changed install.sh to use the current working directory instead of
2322    /usr/tmp for a temporary file (pcedittmp).  The previous choice
2323    of /usr/tmp caused trouble with MacOS X because of file premission
2324    problems.
2325    
2326    ----------------------------------------------------------------------
2327    Name: Matthias Blume
2328    Date: 2001/04/20 11:10:00 EDT
2329    Tag: blume-20010420-inMLflag
2330    Description:
2331    
2332     - added vp_limitPtrMask to vproc-state.h
2333       (for use by the raw-C-calls mechanism to implement proper interrupt
2334        handling)
2335     - made the ML compiler aware of various data-structure offsets so it
2336       can generate code for accessing the vp_inML flag and vp_limitPtrMask
2337     - tweaked mlriscGen.sml to have it emit interrupt-handling code for
2338       raw C-calls
2339    
2340    ----------------------------------------------------------------------
2341    Name: Lal George
2342    Date: 2001/04/20 09:15:28 EDT
2343    Tag: george-20010420-macosX
2344    Description:
2345    
2346     - Changes to port to Mac OS X; Darwin.
2347    
2348     - In the process I found that sqrt was broken on the PPC, because the
2349       fsqrt instruction is not implemented.
2350    
2351    ----------------------------------------------------------------------
2352    Name: Matthias Blume
2353    Date: 2001/04/18 12:45:00 EDT
2354    Tag: blume-20010418-ccalls
2355    Description:
2356    
2357     - fixed two off-by-4 errors in the x86-specific c-calls implementation
2358       (this bug prevented structure arguments containing pointers from being
2359        passed correctly)
2360     - changed the raw-C-call code in mlriscGen.sml in such a way that
2361       structure arguments are represented as a pointer to the beginning
2362       of the structure (instead of having a series of synthesized arguments,
2363       one for each structure member)
2364    
2365     - made makeml script's verbosity level configurable via environment
2366       variable (MAKEML_VERBOSITY)
2367    
2368     - eliminated placeholder implementations for f32l, w16s, i16s, and f32s
2369       in rawmem-x86.sml; we are now using the real thing
2370    
2371  ----------------------------------------------------------------------  ----------------------------------------------------------------------
2372  Name: Matthias Blume  Name: Matthias Blume
# Line 2042  Line 4398 
4398       elaborator). There were a lot of changes during my "linkpath" trials       elaborator). There were a lot of changes during my "linkpath" trials
4399       that could have been reverted to their original state but weren't.       that could have been reverted to their original state but weren't.
4400       Please, don't be too harsh on me for messing with this code a bit more       Please, don't be too harsh on me for messing with this code a bit more
4401       than what was strictly necessary...  (I _did_ resist the tempation       than what was strictly necessary...  (I _did_ resist the temptation
4402       of doing any "global reformatting" to avoid an untimely death at       of doing any "global reformatting" to avoid an untimely death at
4403       Dave's hands. :)       Dave's hands. :)
4404    

Legend:
Removed from v.806  
changed lines
  Added in v.1137

root@smlnj-gforge.cs.uchicago.edu
ViewVC Help
Powered by ViewVC 1.0.0