# Idea: Forth Key Value Data Structure map ## forth-map This **forth-map** data structure can be used to map strings to arbitrary data. Other programming languages have similar data structures and call them *dictionaries*, *hashes* or *hashmaps*. Forth-map is an example of an abstract data type. You operate the data structure via its interface words `Map`, `Map:` and `>value` (see glossary below to see how they work) not referring to its inner implementation structure. ## Usage: `include map.fs` ( requires [[en:pfw:unit|UNIT]] words to be present ) ## Glossary: - `map ( -- map )` Create a new map with the identifier `map` that can be passed on the stack and stored in variables etc. Items in the map are accessed via `>value` in the form c-addr u map `>value` -- addr - `Map: ( -- )` Create a new named map ``. Items in `` are accessed in the form: c-addr u `` -- addr - `>value ( c-addr u map -- addr )` Return the body address of an item with key `c-addr` `u` in the key/value map `map`. ## Examples: ```forth map Constant ma s" xlerb" ma >value . ( creates new item in map ma and prints its address ) s" xlerb" ma >value . ( uses existing item in map ma and prints its address ) s" xlerb" ma >value @ . ( uses existing item in map ma and prints its value ) 99 s" xlerb" ma >value ! ( uses existing item in map ma and sets its value ) Map: mama s" xlerb" mama . ( creates new item in named map mama and prints its address ) s" xlerb" mama . ( uses existing item in named map mama and prints its address ) s" xlerb" mama @ . ( uses existing item in named map mama and prints its value ) 99 s" xlerb" mama ! ( uses existing item in named map mama and sets its value ) ``` ## Map implementation Whereas you use map by means of its fixed, specified interface words, it can be implemented in many ways. You can implement the search/hash/find algorithm that identifies the value by looking up the given key string on a low level bases doing tree/list traversal and string comparison on your own. However, assuming that dictionary lookup if Forth his highly optimized, the implementation below uses Forth word lists to do the necessary mapping. Adding an item is implemented by creating a word in the wordlist. The lookup is implemented as Forth dictionary search. Find the code also at [https://github.com/uho/forth-map](https://github.com/uho/forth-map). **map.fs** ```forth [defined] end-unit 0= [IF] : unit ( -- unit-sys ) Create ; : internal ( unit-sys1 -- unit-sys2 ) ; : external ( unit-sys1 -- unit-sys2 ) ; : end-unit ( unit-sys -- ) ; [THEN] unit -map- \ How to use maps: \ In order to create a map you have two choices: \ \ 1. Create a named map (similar to a Forth VARIABLE): \ `Map: ` \ `` gets the stack effect `( c-addr u -- addr ). \ Executing `` later will take a key string and will return \ a cell address where a value can be stored and retrieved with `!` and `@`. \ Each time you invoke `` on the same string, you get the same address. \ (*same string* in the sense of a string with the same characters, \ `c-addr` might be different). \ \ 2. Create an unnamed map (similar to OPEN-FILE) \ `map` \ This puts a map identifier on the stack which you can store in a \ variable or use it to define a constant: \ `VARIABLE my-map map my-map !` \ \ In order to access items in the map you put the map identifier on the \ stack and the perfom the desired operation on the map: \ \ - store a value (e.g. 99) for a given key (e.g. ) \ `99 S" " my-map @ >value !` \ - fetch a value for a given key (e.g. ) \ `S" " my-map @ >value @` \ \ - iterate through the keys of a map \ ```:noname ( i*x c-addr u -- j*x flag ) \ type space true ; \ my-map @ iterate-map \ ``` \ \ `iterate-map ( i*x xt map -- j*x )` \ will take a map identifier `map` and an execution token `xt` from the stack \ and invokes `xt` on every key in the map until all keys are processed \ or until `xt` returns a `false` value. The stack `i*x` below the `xt` is \ guaranteed to be accessible by `xt` so it can be used to manage context \ information required for `xt` (such as the map identifier or a count). \ ************ Implementation ************* \ Maps aka key/value structures are implemented \ here with wordlists. \ So internally we call them word lists `wid` but externally \ we call them maps `map`. 1 cells VALUE map.space \ The size in bytes of the storage space for each value \ default is 1 cell \ Check if there is an item in the given word list `wid` \ with key `c-addr` `u`. \ If the item exists return its body address and `true`. \ If the item does not exist, return 0 and `false`. : item? ( c-addr u wid -- addr true | 0 false ) search-wordlist IF >body true EXIT THEN 0 false ; \ Create a variable like item with name `c-addr` `u` \ in the current word list. : "variable ( c-addr u -- ) s" CREATE " >r pad r@ move \ CREATE pad r@ + swap dup >r move \ caddr u pad r> r> + evaluate map.space ALLOT ; \ Create a new item with name `c-addr` `u` in the word list `wid`. \ Return its body address. : new-item ( c-addr u wordlist -- addr ) get-current >r set-current ['] "variable catch r> set-current throw here map.space - ; external \ Return the body address of an item with key `c-addr` `u` \ in the key/value map `map` : >value ( c-addr u map -- addr ) >r 2dup r@ item? IF r> drop nip nip EXIT THEN drop r> new-item ; \ Create a new map with the identifier `map` that can be \ passed on the stack and stored in variables etc. \ Items in the map are accessed via `>value` in the \ ( c-addr u map -- addr ) : map ( -- map ) wordlist ; \ Create a new named map ``. Items in `` are \ accessed in the form: ( c-addr u -- addr ). : Map: ( -- ) map Create , Does> ( c-addr u -- addr ) @ >value ; internal : invoke-xt ( i*x xt nt -- j*x xt flag ) swap >r name>string r@ execute r> swap ; external : iterate-map ( i*x xt map -- j*x ) ['] invoke-xt swap traverse-wordlist drop ; end-unit cr .( Key/Value maps loaded ) cr .( Usage: map Constant ) cr .( x s" key" >value ! ) cr .( s" key" >value @ ) cr .( or: ) cr .( Map: ) cr .( x s" key" ! ) cr .( s" key" @ ) 0 [IF] \ do test marker *test* include ttester.fs map Constant ma t{ s" xlerb" ma >value dup @ -> s" xlerb" ma >value 0 }t Map: mama t{ s" xlerb" mama dup @ -> s" xlerb" mama 0 }t : print-key ( c-addr u -- flag ) type space true ; : keys ( map -- ) cr ." keys of map " dup . cr ['] print-key swap iterate-map cr ; ma keys *test* : print-key/value ( ctr map c-addr u -- ctr map flag ) >r >r cr ." " dup r> r> 2dup type ." : " rot >value @ 0 .r ." ," swap 1+ swap true ; : print-map ( map -- ) >r cr ." {" 0 r@ ['] print-key/value r> iterate-map drop cr ." }" cr . ." items" ; [THEN] ``` ===== Do you wnat to discuss this implementation of have a forth-map implementation of your own? ===== === Please add your suggestions below. === ~~DISCUSSION~~