More documentation.

author: Drahflow <drahflow@gmx.de> 2014-05-05 22:59:16 +0200
committer: Drahflow <drahflow@gmx.de> 2014-05-05 22:59:16 +0200
commit: 67450c10e3cce51ea1204cee511902c25b725fad (patch)
tree: b90082074a312cf91c58286a2e2fbf73f456ea20 /doc
parent: 2eff6f053091983e658557af6c851c2f78725699 (diff)
5 files changed, 326 insertions, 1 deletions
diff --git a/doc/err.md b/doc/err.md
new file mode 100644
index 0000000..48ac1c4
--- /dev/null
+++ b/doc/err.md
@@ -0,0 +1,100 @@
+Error handling
+==============
+
+The easiest way to deal with errors in elymas is the `die` function. It takes a string, dumps it to the
+error output and terminates the program. It might however be preferable to act more gracefully if problems
+are encountered. To this end, an error handling library was created.
+
+The main idea is to have the error locations suggest possible further actions from which the calling
+function can then select a suitable one depending on circumstances. To give an example: While it would be
+perfectly ok to terminate the program on a failed `sys .read` in a single-user command line application,
+the same can certainly not be said about a webserver. However the code of `sys .read` cannot possibly know
+how to react correctly.
+
+The error handling does not automatically rewind the stack like it is usual in many other languages. If such
+behaviour is desired, it can be emulated via coroutines.
+
+Usually, different kinds of errors need different handling. Hence most functions take a string describing the
+kind of error which occured. This string is always treated as a prefix of a possibly more detailed description.
+If an error of (hypthetical) kind "io.write.diskfull" is raised, a handling routine for "io.write" will catch it,
+as would a handler for "io".
+
+
+`??`
+----
+
+Takes a string and raises an error of the specified kind.
+
+    ??fatal.testError
+
+`???`
+-----
+
+Takes a string and a scope and raises an error of the kind specified by the string. The scope members
+are possible ways to react to the error provided to upper layers of the application.
+
+    < { "ignored" dump } =*ignore { "oops" die } =*terminate > ???fatal.testError
+
+
+`?!`
+----
+
+Specifies behavior if an error occurs. Takes to function objects. The first is executed. If an error occurs
+during its execution, the second is invoked with the error handling scope provided on the stack.
+
+    {
+      < { "ignored" dump } =*ignore { "oops" die } =*terminate > ???fatal.testError
+    } /maybeFailFunction deff
+    |maybeFailFunction { .terminate } !?fatal # handle all fatal.* errors by the .terminate action
+
+
+`??!`
+-----
+
+Specifies behavior if an error occurs by mapping a lower level error to a higher level one.
+
+    { ... } # do stuff to the database
+    { ==lowLevelHandlers <      # capture handling proposals from lower levels
+      { ... } =*rollback        # provide new suggestions how to handle the error ...
+      { ... } =*closeDatabase   # possibly using low-level suggestions while doing so
+      { ... } =*terminate
+    > ??!io.database } ?!io     # map all io.* errors do io.database errors
+
+
+`??!'`
+------
+
+Just like `??!`, but does not take a string. Instead it re-raises the original error kind string.
+
+
+`?!!`
+-----
+
+Applies an array of error handling strategies in turn.
+
+    { ... } # do stuff to the database
+    [
+      { .rollback }                        # try the .rollback handler suggested by lower level
+      { .closeDatabase }                   # if it raises an error again, try the .closeDatabase handler
+      { "cannot recover from error" die }  # if this in turn fails, just die
+    ] ?!!io.database                       # apply above rules to any io.database.* errors
+
+
+`!!?`
+-----
+
+Clones the current coroutine state and returns it. This allows resetting the stack and
+instruction pointer to an earlier state.
+
+    [ "config.xml" "config.xml.bak" "config.xml.orig" ] ==configFiles
+    0 ==currentConfig
+    !!? ==checkpoint                      # clone current coroutine state
+    {
+       currentConfig configFiles * parse ...
+    } {                                   # on error:
+      --                                  # ignore lower level suggestions 
+      currentConfig 1 add =currentConfig  # try a different config file
+      currentConfig configFiles len lt {
+        checkpoint 0 !                    # rewind execution back to checkpoint
+      } rep                               # ... if candidates remain
+    } ?!io                                # apply above rules to any io.* errors
diff --git a/doc/execution.md b/doc/execution.md
index 3228d2e..317a945 100644
--- a/doc/execution.md
+++ b/doc/execution.md
@@ -1,5 +1,5 @@
 Executing things
-----------------
+================
 
 
 Objects are executed on various occasions in a program. The most obvious example is `*`, but similarly
diff --git a/doc/quoting.md b/doc/quoting.md
new file mode 100644
index 0000000..33615b9
--- /dev/null
+++ b/doc/quoting.md
@@ -0,0 +1,157 @@
+Quoting - how functions are made
+================================
+
+The behavior of the input parser depends on an integer counter, the *quote level*. If the quote level is zero,
+each identifier is resolved in the current scope and acted upon (i.e. either pushed to the stack or executed depending
+on the execution mode of the name). If, however, the quote level is non-zero, only those names which have the quoting
+execution mode associated get executed. For all other names, a new function is instead created and pushed to the stack.
+This function will upon execution look up the name and only then act upon it. This allows easy construction of complex
+function objects.
+
+In particular the functions `{` and `}` have quoting execution mode and are hence always executed directly when encountered.
+The `{` function puts a function-quote-begin marker on the stack andincreases the quote level by one. A later `}` then collects
+all stack contents up to the function-quote-begin marker from the stack and creates assembly instructions from them. These
+instructions will esentially execute the collected functions. After that `}` decrements the quote level and checks for zero.
+If zero has been reached, a new function object is created from the assembled instructions and the current scope and pushed
+to the stack. If the quote level is still non-zero after `}`, a new function is instead created which will upon execution
+create a new function object from the instructions and the then current scope.
+
+An example might make everything a little clearer:
+
+    1 1 add dump       # instant execution of `add`
+    0000000000000002
+    { 1 1 add } dump   # a function object is created
+    <function: 00006000005D5600>
+    { 1 1 add } * dump
+    0000000000000002
+
+The function `quoted` returns the current quote level and can be very useful for creating macros. For now however, let's just
+use it to make the example a little clearer.
+
+    { "===" dump quoted dump _ dump } /debug defq     # define a helper function showing current quote level and stack top
+                                                      # note that this function has quote execution mode
+    1 debug 1 debug add debug
+    "==="
+    0000000000000000
+    0000000000000001
+    "==="
+    0000000000000000
+    0000000000000001
+    "==="
+    0000000000000000
+    0000000000000002
+
+    { 1 debug 1 debug add debug } debug
+    "==="
+    0000000000000001              # inside the { the quote level is 1
+    0000000000000001              # constants are still pushed literally
+    "==="
+    0000000000000001
+    0000000000000001
+    "==="
+    0000000000000001
+    <function: 000060000062DEA0>  # but the add has been pushed as a function
+    "==="
+    0000000000000000
+    <function: 00006000005CE300>  # this is the resulting function object
+
+    { { 1 1 debug } debug } * dump
+    "==="
+    0000000000000002              # the nested { resulted in a quote level of 2
+    0000000000000001
+    "==="
+    0000000000000001
+    <function: 00006000006A4250>  # this is the output of debug, showing a function object 6A4250...
+    <function: 00006000006A6EE0>  # which is different from the one ultimately representing { 1 1 }
+
+Why are the two function objects in the last example different? Consider repeated executions of `{ { 1 1 } }`. Each invokation of the outer
+function should result in a new inner function object. Hence it is not pushed as a literal, but a function object is created (for inclusion
+in the outer braces) which will create a new inner function object on each execution. In principle, these implicitely created function
+objects can be assigned to variables and executed like all others.
+
+    { 3 } =*value         # define a function always returning 3
+    { } =*f               # predefine a function variable
+    { _ =f } /get defq    # grab a function object into f
+    { value get } --      # f now contains the function resolving value
+    value dump
+    0000000000000003
+    <
+      { 5 } /value deff   # redefine value, always return 5
+      f dump
+    0000000000000005      # notice how the new value is being used
+    >
+    value dump            # and the global one again gets resolved
+    0000000000000003
+
+
+How instructions are assembled
+------------------------------
+
+How exactly does `}` assemble instructions? First it searches the stack for the topmost function-quote-begin marker. From there on towards
+the stack top, each element is considered in turn. If it is a literal (i.e. integer, string or float) an instruction is created pushing
+the same literal. If it is a function object, an instruction is created calling that function (if necessary switching the current scope to
+the captured scope of the function beforehand and back after execution).
+
+Additionally, `}` adds a function header and footer to the instruction sequence. The header creates a new scope for the duration of the
+execution, while the footer switches back to the earlier scope again. This effectively results in local variable semantics for newly
+defined variables.
+
+
+What a function object contains
+-------------------------------
+
+Each function object can refer to
+* its instruction sequence
+* its captured scope
+* its expected input and output arguments
+
+When a function starts executing, the current scope is switched to the captured scope. Afterwards the instruction sequence will create
+a new scope as a child of this now current scope. At the end of the function this child scope is exited. Afterwards the calling function
+will switch the scope back to where it was before execution began.
+
+The function `}'` and `}"` allow construction of function objects with less scoping effects. This can be useful to create functions with
+unusal effects with respect to scopes. While `}'` creates a function which captures the enclosing scope but does not create a new child
+scope, `}"` neither captures the scope nor creates a new child. In effect the contents of `}'` will execute in the parent scope of the
+function definition while the contents of `}"` will execute in the scope of the calling site.
+
+    <
+      { == }' /set deff   # capture enclosing scope and execute == within it
+      { == }" /SET deff   # capture nothing
+    > ==s
+    s keys dump
+    [
+      "set"
+      "SET"
+    ]
+    0 /foo s .set         # execute set, i.e. execute == in the scope where { == } was written
+    s keys dump
+    [
+      "set"
+      "SET"
+      "foo"
+    ]
+    0 /bar s .SET         # execute SET, i.e. execute == in the current scope
+    s keys dump
+    [
+      "set"
+      "SET"
+      "foo"
+    ]
+    bar dump              # bar has been defined in the global scope
+    0000000000000000
+
+
+`}_`
+----
+
+The `}_` function is a macro defined in terms of `}`. It first creates a function just as `}`. Afterwards it also consumes the stack top
+element and combines it with the just created function object into a new function object which pushes the captured stack element before
+executing the created function.
+
+    { { add }_ } /makeAdder deff
+    5 makeAdder /addFive deff
+    3 addFive dump
+    0000000000000008
+
+Understanding the source code of `}_` in compiler/standerd.ey is actually a very worthwile exercise for understanding the quoting rules
+of elymas.
diff --git a/doc/sys.md b/doc/sys.md
new file mode 100644
index 0000000..184ecde
--- /dev/null
+++ b/doc/sys.md
@@ -0,0 +1,65 @@
+System Interaction
+==================
+
+Some facilities have been developed to access the outside world.
+
+`sys .exit`
+-----------
+
+Exists the program with the indicated error code.
+
+    0 sys .exit       # successful termination
+
+
+`sys .file`
+-----------
+
+Creates a scope representing a file. This object supports `.open`, `.close`, `.read`, `.write`, `.writeall`, `.eachLine`.
+Three files `sys .in`, `sys .out`, `sys .err` are predefined and represent the standard input, output and error streams respectively.
+
+    sys .file ":" via
+      "foo.txt" :open
+      8 :read dump          # first 8 bytes of foo.txt, possibly less if foo.txt is shorter
+      { dump } :eachLine    # dump each line of foo.txt (excluding the 8 bytes already read)
+      :close
+    "Hallo Welt!\n" sys .out .writeall
+    Hallo Welt!
+
+As `.write` directly maps to the write(2) syscall, it might not write all bytes. Instead it returns the number of bytes written as an integer.
+Usually, you want to use `.writeall` which will call write(2) repeatedly, until all bytes are written.
+
+`sys .fdToFile` will create a file representing scope directly from a unix file descriptor number.
+
+
+`sys .freeze`
+-------------
+
+To create stand-alone executables, `sys .freeze` takes a filename and a function object and creates an executable which will
+execute the function object when started.
+
+    { "Hello World!\n" sys .out .writeall 0 sys .exit } "hello" sys .freeze
+
+An elymas interpreter can be implemented via `include` easily:
+
+    {
+      sys .argv len { 0 sys .argv * } { "/proc/self/fd/0" } ? * include
+      0 sys .exit
+    } "interpreter" sys .freeze
+
+
+`sys .mkdir`
+------------
+
+Creates a new directory.
+
+
+`sys .ls` / `sys .readdir`
+--------------------------
+
+List the contents of a directory. `sys .ls` excludes files with a leading dot.
+
+
+`sys .rename`
+-------------
+
+Takes two filenames. Renames the first (stack second-to-top) to the second (stack top).
diff --git a/doc/tutorial.md b/doc/tutorial.md
index e3abf7c..22ed19f 100644
--- a/doc/tutorial.md
+++ b/doc/tutorial.md
@@ -164,3 +164,6 @@ Recommended reading order
 * scopes.md - where variables live
 * global.md - global functions
 * execution.md - executing things
+* quoting.md - function definition
+* sys.md - some interfaces to the operating system
+* err.md - error handling
author	Drahflow <drahflow@gmx.de>	2014-05-05 22:59:16 +0200
committer	Drahflow <drahflow@gmx.de>	2014-05-05 22:59:16 +0200
commit	67450c10e3cce51ea1204cee511902c25b725fad (patch)
tree	b90082074a312cf91c58286a2e2fbf73f456ea20 /doc
parent	2eff6f053091983e658557af6c851c2f78725699 (diff)