doc: add internals/ (WIP). Add a part for index.

author: Tristan Gingold <tgingold@free.fr> 2020-01-04 17:59:44 +0100
committer: Tristan Gingold <tgingold@free.fr> 2020-01-06 18:20:28 +0100
commit: 09af03505bbd72f676394415c16c14bea5154513 (patch)
tree: 8444da62bd105b00aa2321acc3785357c550b0cb /doc/internals/Frontend.rst
parent: 61c0e71793576646cb8374cd462bcda7cf6e410e (diff)
download: ghdl-09af03505bbd72f676394415c16c14bea5154513.tar.gz
ghdl-09af03505bbd72f676394415c16c14bea5154513.tar.bz2
ghdl-09af03505bbd72f676394415c16c14bea5154513.zip
1 files changed, 24 insertions, 0 deletions
diff --git a/doc/internals/Frontend.rst b/doc/internals/Frontend.rst
new file mode 100644
index 000000000..6d5e1da5c
--- /dev/null
+++ b/doc/internals/Frontend.rst
@@ -0,0 +1,24 @@
+.. _INT:Frontend:
+
+Front-end
+#########
+
+Input files (or source files) are read by `files_map.ad[sb]`.  Only regular files can be read, because they are read entirely before being scanned.  This simplifies the scanner, but this also allows to have a unique index for each character in any file.  Therefore the source location is a simple 32-bit integer whose type is `Location_Type`.  From the location, `files_map` can deduce the source file (type is `Source_File_Entry`) and then the offset in the source file.  There is a line table for each source file in order to speed-up the conversion from file offset to line number and column number.
+
+The scanner (file :file:`vhdl-scanner.ad[sb]`) reads the source files and creates token
+from them.  The tokens are defined in file :file:`vhdl-tokens.ads`.  Tokens are scanned
+one by one, so the scanner doesn't keep in memory the previous token.  Integer or
+floating point numbers are special tokens because beside the token itself there is
+also a variable for the value of the number.
+
+For identifiers there is a table containing all identifiers.  This is implemented by
+file :file:`name_table.ad[sb]`.  Each identifier is associated to a 32-bit number
+(they are internalized).  So the number is used to reference an identifier.  About
+one thousand identifiers are predefined (by :file:`std_names.ad[sb]`).  Most of
+them are reserved identifiers (or keywords).  When the scanner find an identifier, it
+checks if it is a keyword.  In that case it changes the token to the keyword token.
+
+The procedure `scan` is called to get the next token.  The location of the token and
+the location after the token are available to store it in the parser tree.
+
+The main clieant of the scanner is the parser.
author	Tristan Gingold <tgingold@free.fr>	2020-01-04 17:59:44 +0100
committer	Tristan Gingold <tgingold@free.fr>	2020-01-06 18:20:28 +0100
commit	09af03505bbd72f676394415c16c14bea5154513 (patch)
tree	8444da62bd105b00aa2321acc3785357c550b0cb /doc/internals/Frontend.rst
parent	61c0e71793576646cb8374cd462bcda7cf6e410e (diff)
download	ghdl-09af03505bbd72f676394415c16c14bea5154513.tar.gz ghdl-09af03505bbd72f676394415c16c14bea5154513.tar.bz2 ghdl-09af03505bbd72f676394415c16c14bea5154513.zip