Staple Code Parser

1.0.0

A code parser tool for documentation markup

About Staple-Parser

This system implements a Lisp code parser that is used to mark up definition references within code snippets.

How To

You can parse a Lisp source snippet using parse:

(staple-code-parser:parse "(defun foo (a) (+ 1 a))")

This will return a list of "parse results". Parse results represent all information about the toplevel source form that was parsed. Typically you will want to pass this to parse-result->definition-list, which will return a list of definitions and their source locations that were found within the parse results.

(staple-code-parser:parse-result->definition-list *)
; => ((#<DEFINITIONS:FUNCTION +> (16 . 17)) (#<DEFINITIONS:MACRO DEFUN> (1 . 6)))

The definition objects come from the Definitions library. Please see its documentation on how to handle these kinds of objects. Staple uses this definition list to mark up the respective source parts with HTML links, but you could also use it for your own purposes.
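
As a small illustration of how the source locations can be consumed, the sketch below extracts the text that each location covers. It assumes every entry has the shape (DEFINITION (START . END)) shown above and that END is exclusive. Plain symbols stand in for the definition objects so that it runs without Staple loaded; *snippet*, *definition-list*, and definition-source-text are names made up for this example.

```lisp
;; Stand-in for the list returned by PARSE-RESULT->DEFINITION-LIST.
;; Plain symbols replace the definition objects (an assumption made
;; so the sketch runs without Staple or Definitions).
(defparameter *snippet* "(defun foo (a) (+ 1 a))")
(defparameter *definition-list* '((+ (16 . 17)) (defun (1 . 6))))

(defun definition-source-text (snippet entry)
  "Return the substring of SNIPPET covered by ENTRY's (START . END) location."
  (destructuring-bind (definition (start . end)) entry
    (declare (ignore definition))
    (subseq snippet start end)))

(mapcar (lambda (entry) (definition-source-text *snippet* entry))
        *definition-list*)
;; => ("+" "defun")
```

The same start and end offsets are what a renderer would use to wrap each span in a link.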

Extending Staple-Parser

Because the parser neither compiles nor evaluates the code, it lacks much of the information about what each symbol denotes, which limits the quality of definition retrieval. You can improve this by implementing custom walkers for known forms that expand to parse results that are more easily understood.

There are two ways to do this. You can either use define-walk-compound-form and expand into known parse results, transforming the contents as appropriate, or you can use define-walker-form to define a new parse result type. In the latter case you will also need to add define-sub-results and define-definition-resolver to handle the traversal and lookup.

Have a look at the source files special-forms, standard-forms, and to-definitions for examples on how to use these.
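
A rough sketch of the first approach might look as follows. It has not been checked against the Staple sources and rests on several assumptions: my-progn is a hypothetical user macro that behaves like PROGN, the standard walkers emit PROGN parse results of the form (PROGN SOURCE . SUB-RESULTS), and the source of a parse result is whatever cst:source returns for the form. The one-argument call to WALK-IMPLICIT-PROGN relies on the local helper described in the DEFINE-WALK-COMPOUND-FORM documentation.

```lisp
;; Assumes the staple-code-parser system is loaded.
;; MY-PROGN is a hypothetical user macro that behaves like PROGN.
(staple-code-parser:define-walk-compound-form my-progn (cst)
  ;; Expand into an (assumed) PROGN parse result: (TYPE SOURCE . ARGS).
  (list* 'progn
         (cst:source cst)
         ;; Skip the operator and walk the remaining forms. The local
         ;; helper passes the environment along.
         (staple-code-parser:walk-implicit-progn (cst:rest cst))))
```

With such a walker in place, the forms inside (my-progn ...) would be walked like any other body rather than as the arguments of an unknown call.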

System Information

Version: 1.0.0

Dependencies:

alexandria
definitions
concrete-syntax-tree
concrete-syntax-tree-lambda-list
concrete-syntax-tree-destructuring
eclector
eclector-concrete-syntax-tree
documentation-utils

Author: Yukari Hafner

License: zlib

Homepage: https://Shinmera.github.io/staple/

Bug Tracker: https://github.com/Shinmera/staple/issues

Sources: https://github.com/Shinmera/staple.git

Definition Index

STAPLE-CODE-PARSER

ORG.SHIRAKUMO.STAPLE.CODE-PARSER

No documentation provided.

EXTERNAL CLASS

ENVIRONMENT

Container for environment information used during walking.

See PARENT
See NAMESPACES
See LOOKUP
See AUGMENT-ENVIRONMENT!
See AUGMENTED-ENVIRONMENT

EXTERNAL CLASS

PLACEHOLDER

This class represents symbols that are not present in the host.

They are emitted in parsed code snippets in place of symbols that
cannot be read properly.

See PLACEHOLDER-NAME
See PLACEHOLDER-PACKAGE
See PLACEHOLDER-INTERN

EXTERNAL FUNCTION

AUGMENT-ENVIRONMENT!

ENVIRONMENT
NAMES
VALUES

Augments the given environment with the new values for the given names.

Returns the modified environment.

See LOOKUP
See ENVIRONMENT

EXTERNAL FUNCTION

AUGMENTED-ENVIRONMENT

PARENT
NAMES
VALUES
&KEY
CLASS

Returns a new environment with the changed values in place.

The old environment is a parent to the new one.

See AUGMENT-ENVIRONMENT!
See ENVIRONMENT

EXTERNAL FUNCTION

PARSE-RESULT->DEFINITION-LIST

RESULT

Turn the parse-result into a list of definitions and source locations.

For instance:
  ((:CALL (0 . 10) (:VARIABLE (1 . 5) NULL) (:LITERAL (6 . 9) NIL)))
  => ((#<DEFINITIONS:FUNCTION NULL> (1 . 5)))

This uses FIND-DEFINITIONS to find suitable definitions for a parse
result, as well as SUB-RESULTS to traverse the parse result tree.

See FIND-DEFINITIONS
See SUB-RESULTS

EXTERNAL FUNCTION

TIE-TO-SOURCE

SOURCE
DEFS

Turns each def into a list of source and def.

EXTERNAL FUNCTION

WALK-BINDINGS

BINDINGS
ENVIRONMENT

Walk the set of LET bindings in the environment.

Returns a list of cons cells where the CAR is the variable definition
of the binding and the CDR is the parse result of the value.

See WALK

EXTERNAL FUNCTION

WALK-BODY

CST
ENVIRONMENT

Same as WALK-IMPLICIT-PROGN, but filters out declarations from the cst.

See WALK-IMPLICIT-PROGN

EXTERNAL FUNCTION

WALK-IMPLICIT-PROGN

CST
ENVIRONMENT

Walks the CST as a list of forms and returns the list of parse-results for each form.

See WALK

EXTERNAL FUNCTION

WALK-LAMBDA-LIKE

CST
ENVIRONMENT
&OPTIONAL
LAMBDA-LIST-PARSER

Walk a lambda-like structure.

Parses the lambda-list and body forms appropriately and returns a
parse-result for a lambda. The given parser is used to process the
lambda-list.

See WALK-IMPLICIT-PROGN

EXTERNAL GENERIC-FUNCTION

FIND-DEFINITIONS

TYPE
SOURCE
ARGS

Returns any matching definitions for the given parse result.

All parse results have the structure of (TYPE SOURCE . ARGS).
Thus you can simply destructure it and pass the arguments to this
function to retrieve its definitions.

See DEFINE-DEFINITION-RESOLVER

EXTERNAL GENERIC-FUNCTION

LOOKUP

NAME
NAMESPACE
ENVIRONMENT

Looks up the name in the namespace of the environment.

This will traverse the environment chain upwards until no parent can
be found anymore in case the current environment's namespace does not
contain the value.

When used as a setf place the value is always stored in the given
environment's namespace.

See NAMESPACE
See ENVIRONMENT
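
The lookup rule described above can be modelled in a few lines. The sketch below is a self-contained toy, not the library's ENVIRONMENT class: toy-environment, toy-lookup, and the single hash table keyed on (NAMESPACE . NAME) are all assumptions made for illustration.

```lisp
;; A toy model of chained environments; not Staple's implementation.
(defclass toy-environment ()
  ((parent :initarg :parent :initform nil :reader toy-parent)
   (table :initform (make-hash-table :test 'equal) :reader toy-table)))

(defun toy-lookup (name namespace environment)
  "Search ENVIRONMENT and then its parents for NAME in NAMESPACE."
  (loop for env = environment then (toy-parent env)
        while env
        do (multiple-value-bind (value found)
               (gethash (cons namespace name) (toy-table env))
             (when found
               (return (values value t))))
        finally (return (values nil nil))))

(defun (setf toy-lookup) (value name namespace environment)
  "Store VALUE in ENVIRONMENT itself, never in a parent."
  (setf (gethash (cons namespace name) (toy-table environment)) value))

(defparameter *outer* (make-instance 'toy-environment))
(defparameter *inner* (make-instance 'toy-environment :parent *outer*))

(setf (toy-lookup 'x :variable *outer*) :outer-value)
(toy-lookup 'x :variable *inner*)   ; found through the parent
(setf (toy-lookup 'x :variable *inner*) :inner-value)
(toy-lookup 'x :variable *outer*)   ; the parent still holds :OUTER-VALUE
```

Storing only in the given environment is what lets an inner scope shadow a binding without disturbing the outer one.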

EXTERNAL GENERIC-FUNCTION

(SETF LOOKUP)

VALUE
NAME
NAMESPACE
ENVIRONMENT

No documentation provided.

EXTERNAL GENERIC-FUNCTION

PARSE

INPUT

Parses the input and returns a list of parse results, each for one toplevel.

First uses READ-TOPLEVEL to read all toplevel forms, then uses WALK
for each of the read CSTs to turn them into parse results.

See READ-TOPLEVEL
See WALK

EXTERNAL GENERIC-FUNCTION

PLACEHOLDER-INTERN

OBJECT

Returns whether the symbol being read is an internal or external symbol.

See PLACEHOLDER

EXTERNAL GENERIC-FUNCTION

PLACEHOLDER-NAME

OBJECT

Returns the symbol-name of the symbol this is a placeholder for.

See PLACEHOLDER

EXTERNAL GENERIC-FUNCTION

PLACEHOLDER-PACKAGE

OBJECT

Returns the symbol-package name of the symbol this is a placeholder for.

See PLACEHOLDER

EXTERNAL GENERIC-FUNCTION

READ-TOPLEVEL

INPUT

Reads the toplevel of an input.

By default, the INPUT may be a string, a pathname, or a stream.
Returns a list of CSTs representing all toplevel forms that were read.

EXTERNAL GENERIC-FUNCTION

SUB-RESULTS

TYPE
ARGS

Returns all parse results that are sub-results of this parse result.

All parse results have the structure of (TYPE SOURCE . ARGS).
Thus you can simply destructure it and pass the type and arguments to
this function to retrieve its sub-results.

See DEFINE-SUB-RESULTS

EXTERNAL GENERIC-FUNCTION

WALK

CST
ENVIRONMENT

Walks the given CST in the environment.

Should return a parse result structure.
Parse results are lists of the following form:

  PARSE-RESULT ::= (TYPE SOURCE . ARGS)
  TYPE         --- The type of the form we've walked. Typically this
                   is a symbol of the form itself, like LAMBDA, or a
                   keyword if a generic variant is encountered like
                   for :CALLs and :MACROs.
  SOURCE       ::= (START . END)
  ARGS         --- Additional arguments for the parse result,
                   including additional parse-results.

Generally see the overall concrete-syntax-tree system for explanations
on how to use this.

Note that you probably want to define a method on WALK-FORM instead,
as that is called automatically as appropriate for each CST:CONS-CST,
and WALK-ATOM is called for each CST:ATOM-CST.

See ENVIRONMENT
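
To make the (TYPE SOURCE . ARGS) shape concrete, the sketch below destructures a literal parse result and lists the type and source range of every nested result. It is self-contained and does not call the library: parse-result-p is a heuristic invented here, whereas the real traversal goes through SUB-RESULTS, and the sample data is the one from the PARSE-RESULT->DEFINITION-LIST documentation.

```lisp
(defun parse-result-p (object)
  "Heuristic: OBJECT looks like (TYPE (START . END) . ARGS)."
  (and (consp object)
       (symbolp (first object))
       (consp (rest object))
       (consp (second object))
       (integerp (car (second object)))
       (integerp (cdr (second object)))))

(defun collect-types (parse-result)
  "Return (TYPE START END) for PARSE-RESULT and nested results, depth-first."
  (destructuring-bind (type (start . end) . args) parse-result
    (cons (list type start end)
          (loop for arg in args
                when (parse-result-p arg)
                  append (collect-types arg)))))

(collect-types
 '(:call (0 . 10) (:variable (1 . 5) null) (:literal (6 . 9) nil)))
;; => ((:CALL 0 10) (:VARIABLE 1 5) (:LITERAL 6 9))
```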

EXTERNAL GENERIC-FUNCTION

WALK-ATOM

ATOM
ENVIRONMENT

Walks an atom.

If the atom is a symbol, it returns a parse result of a literal for
keywords and booleans, or a variable for symbols. For everything else
it returns a parse result for a literal.
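
The rule above amounts to a small decision, sketched here as a standalone function. classify-atom is a made-up name that returns only the kind of parse result; the real WALK-ATOM returns a full parse result and may handle cases this sketch ignores.

```lisp
(defun classify-atom (atom)
  "Return :VARIABLE for ordinary symbols and :LITERAL for everything else."
  (if (and (symbolp atom)
           (not (keywordp atom))
           (not (member atom '(t nil))))
      :variable
      :literal))

(mapcar #'classify-atom '(foo :key t nil 42 "text"))
;; => (:VARIABLE :LITERAL :LITERAL :LITERAL :LITERAL :LITERAL)
```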

EXTERNAL GENERIC-FUNCTION

WALK-FORM

OPERATOR-VAR2
CST
ENVIRONMENT

Walks a form.

The form is identified by the car of the cons. The entirety of the
form as a CST, including the operator, is passed along as well.

EXTERNAL MACRO

DEFINE-DEFINITION-RESOLVER

TYPE

SOURCE
&REST
ARGS

&BODY
BODY

Shorthand to define a find-definitions method and destructure the arguments of the parse result.

See FIND-DEFINITIONS

EXTERNAL MACRO

DEFINE-SUB-RESULTS

TYPE
ARGS
&BODY
BODY

Shorthand to define a sub-results method and destructure the arguments of the parse result.

See SUB-RESULTS

EXTERNAL MACRO

DEFINE-WALK-COMPOUND-FORM

OPERATOR

CST-VAR
&OPTIONAL
ENVIRONMENT-VAR

&BODY
BODY

Shorthand to define a WALK-FORM method.

Adds local functions for WALK and WALK-IMPLICIT-PROGN that
automatically pass the environment along so you don't need to repeat
it.

See WALK-FORM

EXTERNAL MACRO

DEFINE-WALKER-FORM

FORM

CST-VAR
&OPTIONAL
ENVIRONMENT-VAR
SOURCE

&BODY
BODY

Shorthand to define simple walker forms.

The FORM should be a destructuring description of the kind of form to
walk. The return value of the BODY should be the list of additional
arguments for the parse result. The type and source of the parse
result are automatically added for you.

If you need control over the type or source, look at
DEFINE-WALK-COMPOUND-FORM instead.

See DEFINE-WALK-COMPOUND-FORM