stdout-flip-2.5K_1.txt

/flipv2/20121112-100543-2.5K-ReLST-Wallace/stdout-flip-2.5K_1.txt

https://bitbucket.org/evan13579b/soar-ziggurat · Plain Text · 34709 lines · 32683 code · 2026 blank · 0 comment · 0 complexity · 65c053b700960c62ea6c2bac5dde26ac MD5 · raw file

Seeding... 1
dir: dir isL
Python-Soar Flip environment.
To accept commands from an external sml process, you'll need to
type 'slave <log file> <n decisons>' at the prompt...
sourcing 'flip_predict.soar'
***********
Total: 11 productions sourced.

seeding Soar with 1 ...

soar> Entering slave mode:
  - log file 'rl-slave-2.5K_1.log'....
  - will exit slave mode after 2500 decisions
  waiting for commands from an externally connected sml process...
-/|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\-/|\-/|\sleeping...
-/|\-/1:    O: O1 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
rule alias: '*'

rule alias: '*'

|\-/|\-/2:    O: O4 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-3:    O: O5 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/4:    O: O7 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
|\-5:    O: O10 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
/|6:    O: O11 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
\-/|7:    O: O13 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
\-/|8:    O: O16 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-9:    O: O18 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
/|\10:    O: O20 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|11:    O: O22 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

\12:    O: O23 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|13:    O: O26 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-14:    O: O28 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
/|15:    O: O30 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
\-/16:    O: O32 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-17:    O: O33 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|18:    O: O36 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/19:    O: O38 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-20:    O: O39 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|\21:    O: O41 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
-22:    O: O43 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|23:    O: O46 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/24:    O: O47 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
|\25:    O: O50 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
-/|26:    O: O52 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
\-27:    O: O54 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
/|28:    O: O56 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/29:    O: O57 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-30:    O: O59 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\31:    O: O62 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
-32:    O: O64 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\33:    O: O66 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|34:    O: O67 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/35:    O: O70 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
|\-/36:    O: O72 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-37:    O: O74 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\38:    O: O76 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
-/|39:    O: O77 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
\-/40:    O: O80 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isU
|\-41:    O: O82 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/42:    O: O84 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\43:    O: O85 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
-/|44:    O: O88 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/45:    O: O90 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-46:    O: O92 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\47:    O: O94 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/48:    O: O95 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-49:    O: O98 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\50:    O: O100 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|\-/|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\51:    O: O102 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
rule alias: '*'

rule alias: '*'

-52:    O: O104 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
/|53:    O: O106 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/54:    O: O108 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\55:    O: O109 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
-/|56:    O: O111 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
\-/57:    O: O114 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
|\58:    O: O116 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|59:    O: O118 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-60:    O: O119 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/61:    O: O122 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

|62:    O: O123 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/63:    O: O126 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-64:    O: O128 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|65:    O: O129 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
\-/66:    O: O132 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-67:    O: O134 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|68:    O: O136 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/69:    O: O137 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\-70:    O: O139 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
/71:    O: O142 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
rule alias: '*'

|72:    O: O144 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
\-/73:    O: O146 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\74:    O: O148 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/75:    O: O149 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
|\76:    O: O152 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
-/|77:    O: O153 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
\-/78:    O: O156 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
|\-79:    O: O158 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
/|\80:    O: O160 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/81:    O: O162 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
rule alias: '*'

|82:    O: O163 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
\-/|83:    O: O166 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/84:    O: O168 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
|\-85:    O: O170 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
/|\86:    O: O172 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|87:    O: O174 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/88:    O: O176 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-89:    O: O177 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\90:    O: O179 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/91:    O: O182 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
rule alias: '*'

rule alias: '*'

rule alias: '*'

|92:    O: O184 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isU
\-93:    O: O186 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|94:    O: O188 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/95:    O: O190 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-96:    O: O191 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/|\-97:    O: O194 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\98:    O: O196 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
-/99:    O: O198 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-100:    O: O200 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\101:    O: O201 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
rule alias: '*'

rule alias: '*'

-/|\-/|\-/|\-/|\-/|\-/|\-/|\-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\sleeping...
-sleeping...
/sleeping...
|102:    O: O203 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
\-/|103:    O: O206 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
\-/104:    O: O207 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\105:    O: O210 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
-/106:    O: O211 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\-107:    O: O213 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
/|\-sleeping...
/108:    O: O216 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\109:    O: O218 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-110:    O: O220 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\111:    O: O222 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

-112:    O: O223 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\113:    O: O225 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|114:    O: O227 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
\-/115:    O: O229 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
|\-/116:    O: O232 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
|\-117:    O: O234 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|118:    O: O236 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/119:    O: O238 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-120:    O: O239 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\121:    O: O241 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

-122:    O: O244 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|123:    O: O246 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-124:    O: O247 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|\125:    O: O249 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
-/126:    O: O251 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
|\-127:    O: O254 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|128:    O: O255 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
\-/129:    O: O257 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
|\-130:    O: O260 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
/|\131:    O: O262 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-132:    O: O263 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|133:    O: O265 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
\-134:    O: O268 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
/|135:    O: O270 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
\-/136:    O: O271 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
|137:    O: O274 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/138:    O: O276 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
|\-139:    O: O277 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|140:    O: O279 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-141:    O: O282 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
/142:    O: O283 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-143:    O: O286 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|144:    O: O287 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/145:    O: O289 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
|\-146:    O: O292 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\147:    O: O294 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
-148:    O: O295 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\149:    O: O297 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|150:    O: O300 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/151:    O: O301 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|152:    O: O303 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
\-153:    O: O305 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/|\154:    O: O308 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|155:    O: O309 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
\-156:    O: O312 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|157:    O: O313 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
\-158:    O: O315 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/159:    O: O317 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-160:    O: O320 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|161:    O: O322 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\162:    O: O323 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/163:    O: O325 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-164:    O: O327 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\165:    O: O329 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
-/166:    O: O332 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-167:    O: O333 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|168:    O: O335 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-169:    O: O337 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|170:    O: O339 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
\-171:    O: O341 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/172:    O: O344 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\173:    O: O345 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
-/|174:    O: O348 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/175:    O: O350 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-/176:    O: O352 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-177:    O: O354 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\-178:    O: O355 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\179:    O: O357 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|180:    O: O360 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/181:    O: O362 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|182:    O: O363 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
\-183:    O: O366 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\-184:    O: O367 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
/|\185:    O: O370 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
-/|186:    O: O372 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isU
\-/187:    O: O374 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|188:    O: O376 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-189:    O: O377 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
/|190:    O: O379 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-191:    O: O382 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/192:    O: O384 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|193:    O: O385 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/194:    O: O388 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-195:    O: O389 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\196:    O: O391 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-197:    O: O394 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\198:    O: O395 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|199:    O: O397 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/200:    O: O399 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-201:    O: O401 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|202:    O: O404 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-203:    O: O406 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\204:    O: O408 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-205:    O: O409 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|\206:    O: O412 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|207:    O: O414 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/208:    O: O416 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\209:    O: O417 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|210:    O: O419 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/211:    O: O422 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|212:    O: O424 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/213:    O: O426 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-214:    O: O427 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|215:    O: O430 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\216:    O: O432 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|217:    O: O434 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/218:    O: O436 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-219:    O: O437 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|220:    O: O439 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
\-/|221:    O: O442 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\222:    O: O444 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|223:    O: O445 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
\-/|sleeping...
\224:    O: O448 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|225:    O: O450 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/226:    O: O451 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-/227:    O: O454 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-/228:    O: O455 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\-229:    O: O458 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\230:    O: O459 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/231:    O: O461 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
|232:    O: O463 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/233:    O: O466 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-234:    O: O468 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|235:    O: O469 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-236:    O: O471 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\237:    O: O473 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/238:    O: O475 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
|239:    O: O478 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-240:    O: O480 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\241:    O: O482 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-242:    O: O484 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\243:    O: O485 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|244:    O: O487 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
\245:    O: O490 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|246:    O: O492 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/247:    O: O494 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\248:    O: O495 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|\249:    O: O498 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|250:    O: O500 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-251:    O: O502 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/252:    O: O503 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\253:    O: O506 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-254:    O: O507 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|255:    O: O510 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isU
\-/256:    O: O511 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
|\-257:    O: O514 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|258:    O: O516 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/259:    O: O518 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-260:    O: O520 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|261:    O: O522 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\262:    O: O524 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|263:    O: O526 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/264:    O: O528 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-265:    O: O530 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|266:    O: O532 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
\-/267:    O: O534 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
|\-268:    O: O536 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/269:    O: O538 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\270:    O: O540 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/271:    O: O542 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|272:    O: O544 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/273:    O: O545 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|274:    O: O548 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-275:    O: O550 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|276:    O: O551 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/277:    O: O554 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\278:    O: O555 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/279:    O: O557 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-280:    O: O559 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|281:    O: O561 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\282:    O: O563 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
-/|283:    O: O566 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-284:    O: O568 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|285:    O: O569 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/|286:    O: O572 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/287:    O: O574 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
|\-288:    O: O576 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\289:    O: O578 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|290:    O: O580 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/291:    O: O582 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|292:    O: O584 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-293:    O: O586 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\294:    O: O587 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|295:    O: O590 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\296:    O: O592 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|297:    O: O594 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-298:    O: O596 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\299:    O: O597 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|300:    O: O599 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/|\-301:    O: O601 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/302:    O: O604 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\303:    O: O606 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|304:    O: O608 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/305:    O: O610 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-306:    O: O611 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\307:    O: O614 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|308:    O: O616 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/309:    O: O618 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-310:    O: O620 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\311:    O: O621 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-312:    O: O624 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\313:    O: O626 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/314:    O: O628 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\315:    O: O630 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/316:    O: O632 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-317:    O: O634 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
/|318:    O: O636 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/319:    O: O638 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-320:    O: O640 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|321:    O: O641 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\322:    O: O643 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
-/|323:    O: O645 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
\-/324:    O: O648 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\325:    O: O649 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|326:    O: O651 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/327:    O: O654 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-328:    O: O655 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\329:    O: O657 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|330:    O: O660 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-331:    O: O661 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/332:    O: O663 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
|\-333:    O: O665 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|334:    O: O667 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/335:    O: O670 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-336:    O: O671 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\337:    O: O673 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
-/338:    O: O676 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\339:    O: O678 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-340:    O: O680 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|341:    O: O682 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\342:    O: O684 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|343:    O: O686 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/344:    O: O687 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-345:    O: O689 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\-sleeping...
/346:    O: O691 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-347:    O: O693 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|\348:    O: O696 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|349:    O: O698 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/350:    O: O700 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-351:    O: O702 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/352:    O: O704 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\353:    O: O706 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|354:    O: O708 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/355:    O: O710 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-356:    O: O712 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\357:    O: O714 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|358:    O: O716 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/359:    O: O718 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
|\360:    O: O719 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|361:    O: O722 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\362:    O: O724 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|363:    O: O726 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/364:    O: O728 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\365:    O: O730 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|366:    O: O732 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/367:    O: O733 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\368:    O: O735 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-/|369:    O: O738 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/370:    O: O740 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\371:    O: O742 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-372:    O: O744 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\373:    O: O745 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/374:    O: O748 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-375:    O: O749 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\376:    O: O752 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|377:    O: O754 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-378:    O: O755 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\379:    O: O757 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|380:    O: O759 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/381:    O: O762 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|382:    O: O764 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/383:    O: O766 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\384:    O: O767 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|385:    O: O770 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-386:    O: O772 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\387:    O: O773 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|388:    O: O776 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-389:    O: O777 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\390:    O: O779 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
-/|391:    O: O782 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\392:    O: O784 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|393:    O: O786 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/394:    O: O788 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\395:    O: O789 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|396:    O: O791 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-397:    O: O794 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\398:    O: O795 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|399:    O: O797 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/400:    O: O800 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-401:    O: O802 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/402:    O: O804 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\403:    O: O805 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/404:    O: O807 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-405:    O: O809 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|406:    O: O812 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-407:    O: O813 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\408:    O: O816 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/409:    O: O817 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-410:    O: O820 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\411:    O: O822 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-412:    O: O824 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|413:    O: O826 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/414:    O: O828 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-415:    O: O830 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
/|\416:    O: O831 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-/417:    O: O834 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-418:    O: O836 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|419:    O: O838 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-420:    O: O840 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/421:    O: O841 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|422:    O: O844 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/423:    O: O845 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-424:    O: O848 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\425:    O: O850 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|426:    O: O851 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/427:    O: O854 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-428:    O: O855 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\429:    O: O858 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|430:    O: O860 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/431:    O: O861 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|432:    O: O864 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-433:    O: O865 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\434:    O: O868 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-435:    O: O870 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\436:    O: O872 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|437:    O: O874 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/438:    O: O875 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|439:    O: O877 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-440:    O: O880 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|441:    O: O882 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\442:    O: O884 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/443:    O: O886 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\444:    O: O888 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|445:    O: O890 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
\-/446:    O: O892 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-447:    O: O894 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|448:    O: O895 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
\-449:    O: O898 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|450:    O: O900 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/|451:    O: O902 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\452:    O: O904 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|453:    O: O905 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/454:    O: O908 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-455:    O: O909 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/|456:    O: O912 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-457:    O: O914 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\458:    O: O916 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|459:    O: O917 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/460:    O: O920 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-461:    O: O921 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/462:    O: O924 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-463:    O: O926 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\464:    O: O928 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|465:    O: O930 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/466:    O: O932 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-467:    O: O933 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|468:    O: O935 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\469:    O: O938 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
-/470:    O: O940 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-471:    O: O942 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/472:    O: O943 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\473:    O: O945 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
-/|474:    O: O947 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/475:    O: O949 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-476:    O: O952 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
/|\477:    O: O953 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|478:    O: O956 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/479:    O: O958 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\480:    O: O960 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|481:    O: O962 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\482:    O: O963 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|483:    O: O966 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/484:    O: O968 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-485:    O: O970 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\486:    O: O972 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|\sleeping...
-487:    O: O974 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|488:    O: O975 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-489:    O: O978 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|490:    O: O980 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/491:    O: O982 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|492:    O: O984 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/493:    O: O985 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\494:    O: O988 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|495:    O: O990 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/496:    O: O992 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-497:    O: O993 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|498:    O: O995 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/499:    O: O998 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-500:    O: O999 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\-/501:    O: O1001 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|502:    O: O1004 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/503:    O: O1006 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\504:    O: O1007 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-505:    O: O1009 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\506:    O: O1012 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/507:    O: O1013 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\508:    O: O1015 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|509:    O: O1018 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/510:    O: O1020 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-511:    O: O1022 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/512:    O: O1023 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\513:    O: O1026 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-514:    O: O1027 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\515:    O: O1030 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|516:    O: O1032 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-517:    O: O1034 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
/|\518:    O: O1036 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/519:    O: O1038 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-520:    O: O1040 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\521:    O: O1042 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-522:    O: O1044 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\523:    O: O1046 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|524:    O: O1048 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/525:    O: O1050 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-526:    O: O1052 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\527:    O: O1053 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/528:    O: O1056 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-529:    O: O1057 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\530:    O: O1060 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|531:    O: O1062 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\532:    O: O1063 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/533:    O: O1065 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-534:    O: O1068 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\535:    O: O1070 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|536:    O: O1072 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-537:    O: O1074 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\538:    O: O1076 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|\539:    O: O1078 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|540:    O: O1080 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-541:    O: O1082 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/542:    O: O1083 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\-/543:    O: O1086 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-544:    O: O1088 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|545:    O: O1090 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/546:    O: O1092 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\547:    O: O1094 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|548:    O: O1096 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/549:    O: O1098 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\550:    O: O1099 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-/|551:    O: O1102 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\552:    O: O1104 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|553:    O: O1105 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
\-/554:    O: O1108 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-555:    O: O1110 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\556:    O: O1111 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/557:    O: O1114 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-558:    O: O1116 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\559:    O: O1117 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|560:    O: O1119 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/561:    O: O1122 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|562:    O: O1124 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
\-/563:    O: O1126 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-564:    O: O1127 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\565:    O: O1129 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/566:    O: O1132 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-567:    O: O1134 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\568:    O: O1136 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-569:    O: O1138 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\570:    O: O1139 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/571:    O: O1141 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|572:    O: O1144 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/573:    O: O1146 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-574:    O: O1148 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\575:    O: O1150 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|576:    O: O1152 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/577:    O: O1153 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-578:    O: O1156 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\579:    O: O1158 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|580:    O: O1160 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/|581:    O: O1162 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\582:    O: O1164 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/583:    O: O1165 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-584:    O: O1168 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|585:    O: O1170 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-586:    O: O1172 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/587:    O: O1173 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|588:    O: O1175 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/589:    O: O1178 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-590:    O: O1180 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\591:    O: O1181 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-592:    O: O1183 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\593:    O: O1185 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|594:    O: O1187 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/595:    O: O1189 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-596:    O: O1192 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\597:    O: O1194 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/598:    O: O1196 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\599:    O: O1198 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|600:    O: O1200 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/601:    O: O1202 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|602:    O: O1204 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/603:    O: O1206 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|604:    O: O1208 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-605:    O: O1209 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/606:    O: O1211 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-607:    O: O1213 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\608:    O: O1216 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|609:    O: O1218 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\610:    O: O1219 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|611:    O: O1221 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\612:    O: O1224 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isU
-/|613:    O: O1226 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/614:    O: O1227 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\615:    O: O1230 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|616:    O: O1232 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/617:    O: O1233 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
|\-618:    O: O1235 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\619:    O: O1237 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|620:    O: O1239 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\621:    O: O1242 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-622:    O: O1244 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\623:    O: O1245 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|624:    O: O1248 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/625:    O: O1249 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|626:    O: O1252 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-627:    O: O1254 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\628:    O: O1256 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|629:    O: O1258 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/630:    O: O1259 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\631:    O: O1262 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-632:    O: O1263 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|633:    O: O1266 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/634:    O: O1268 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-635:    O: O1269 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\636:    O: O1272 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|637:    O: O1273 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/638:    O: O1276 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-639:    O: O1278 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\640:    O: O1280 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|641:    O: O1282 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\642:    O: O1283 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/643:    O: O1286 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\644:    O: O1288 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/645:    O: O1289 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-646:    O: O1292 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/647:    O: O1294 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\648:    O: O1295 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-649:    O: O1298 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\650:    O: O1300 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|651:    O: O1301 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\652:    O: O1304 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|\653:    O: O1306 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|654:    O: O1308 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
\-/655:    O: O1310 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-656:    O: O1311 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\657:    O: O1314 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/658:    O: O1316 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-659:    O: O1317 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\660:    O: O1320 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/661:    O: O1322 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|662:    O: O1323 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/663:    O: O1326 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\664:    O: O1328 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-665:    O: O1330 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\666:    O: O1331 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-667:    O: O1334 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|668:    O: O1336 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/669:    O: O1338 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-670:    O: O1340 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\671:    O: O1341 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
-672:    O: O1343 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|673:    O: O1346 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/674:    O: O1348 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-675:    O: O1350 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/676:    O: O1351 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-677:    O: O1353 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|678:    O: O1355 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/679:    O: O1357 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|680:    O: O1359 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/681:    O: O1362 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|682:    O: O1364 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/683:    O: O1365 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-684:    O: O1368 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\685:    O: O1370 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/686:    O: O1372 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-687:    O: O1374 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/688:    O: O1376 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-689:    O: O1378 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\690:    O: O1380 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|691:    O: O1381 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\692:    O: O1384 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|\693:    O: O1386 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|694:    O: O1388 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-695:    O: O1390 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\696:    O: O1392 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/697:    O: O1394 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-698:    O: O1396 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\699:    O: O1398 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|700:    O: O1399 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-701:    O: O1402 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/702:    O: O1404 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\703:    O: O1405 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|704:    O: O1408 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/705:    O: O1409 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\-706:    O: O1412 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\707:    O: O1414 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-708:    O: O1415 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\709:    O: O1417 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-710:    O: O1420 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\711:    O: O1421 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-712:    O: O1424 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|713:    O: O1425 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-714:    O: O1428 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\715:    O: O1430 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|\716:    O: O1432 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|\717:    O: O1434 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|718:    O: O1436 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-719:    O: O1437 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|720:    O: O1440 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-721:    O: O1442 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/722:    O: O1444 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-723:    O: O1446 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\724:    O: O1448 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|725:    O: O1450 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/|726:    O: O1452 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/727:    O: O1454 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-728:    O: O1455 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\729:    O: O1458 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/730:    O: O1460 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-731:    O: O1461 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/732:    O: O1463 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\733:    O: O1466 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|734:    O: O1467 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/735:    O: O1469 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-/736:    O: O1472 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\737:    O: O1474 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/738:    O: O1475 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-739:    O: O1477 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\740:    O: O1479 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/741:    O: O1482 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|742:    O: O1484 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-743:    O: O1486 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\744:    O: O1487 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|745:    O: O1490 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-746:    O: O1491 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\747:    O: O1494 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|748:    O: O1496 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/749:    O: O1498 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-750:    O: O1500 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\751:    O: O1502 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-752:    O: O1503 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|753:    O: O1505 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/754:    O: O1507 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-755:    O: O1509 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\756:    O: O1511 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|757:    O: O1514 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/758:    O: O1516 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-759:    O: O1518 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\760:    O: O1519 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|761:    O: O1521 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\762:    O: O1524 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|763:    O: O1526 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/764:    O: O1528 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-765:    O: O1530 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/766:    O: O1532 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|767:    O: O1534 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/768:    O: O1536 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-769:    O: O1538 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|770:    O: O1539 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\771:    O: O1542 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-772:    O: O1543 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|773:    O: O1546 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/|774:    O: O1547 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/775:    O: O1549 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-776:    O: O1552 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\777:    O: O1553 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/778:    O: O1556 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-779:    O: O1557 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\780:    O: O1559 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|781:    O: O1562 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\782:    O: O1563 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/783:    O: O1565 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-784:    O: O1568 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|785:    O: O1569 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\786:    O: O1572 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/787:    O: O1573 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-788:    O: O1576 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\789:    O: O1578 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/790:    O: O1580 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-791:    O: O1582 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/792:    O: O1584 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-793:    O: O1585 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|794:    O: O1588 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/795:    O: O1590 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-796:    O: O1592 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\797:    O: O1594 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-798:    O: O1596 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\799:    O: O1598 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|800:    O: O1600 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/801:    O: O1601 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|802:    O: O1603 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/803:    O: O1606 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-804:    O: O1608 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|805:    O: O1610 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/806:    O: O1612 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-807:    O: O1614 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\808:    O: O1616 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|809:    O: O1618 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/810:    O: O1620 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-811:    O: O1622 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/812:    O: O1624 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-813:    O: O1626 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\814:    O: O1628 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|815:    O: O1629 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/816:    O: O1632 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\817:    O: O1634 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|818:    O: O1635 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/819:    O: O1638 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-820:    O: O1639 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\821:    O: O1641 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-822:    O: O1644 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\823:    O: O1645 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-824:    O: O1648 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\825:    O: O1649 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|826:    O: O1651 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/827:    O: O1654 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-828:    O: O1656 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\829:    O: O1657 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|830:    O: O1660 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/831:    O: O1661 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|832:    O: O1664 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/833:    O: O1666 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-834:    O: O1667 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\835:    O: O1669 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/836:    O: O1672 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-837:    O: O1674 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\838:    O: O1675 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|839:    O: O1678 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/840:    O: O1680 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-841:    O: O1681 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/842:    O: O1684 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\843:    O: O1685 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/844:    O: O1688 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-845:    O: O1690 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\846:    O: O1692 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|847:    O: O1694 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/848:    O: O1696 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|849:    O: O1698 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/850:    O: O1700 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-851:    O: O1702 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/852:    O: O1704 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-853:    O: O1706 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\854:    O: O1707 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|855:    O: O1710 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-856:    O: O1712 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\857:    O: O1714 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|858:    O: O1715 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/859:    O: O1718 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|860:    O: O1720 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-861:    O: O1722 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/862:    O: O1724 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-/863:    O: O1726 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-864:    O: O1727 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/865:    O: O1730 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-866:    O: O1731 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\867:    O: O1733 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|868:    O: O1736 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/869:    O: O1738 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-870:    O: O1740 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\-871:    O: O1742 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/872:    O: O1744 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-873:    O: O1746 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\874:    O: O1748 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/875:    O: O1750 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\876:    O: O1751 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|877:    O: O1754 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\878:    O: O1756 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|879:    O: O1758 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/880:    O: O1760 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-881:    O: O1762 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/882:    O: O1764 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-883:    O: O1766 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\884:    O: O1768 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|885:    O: O1769 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/886:    O: O1772 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\887:    O: O1773 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|888:    O: O1776 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/889:    O: O1778 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-890:    O: O1780 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|891:    O: O1781 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\892:    O: O1783 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|893:    O: O1786 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\894:    O: O1788 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|895:    O: O1790 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/896:    O: O1792 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-897:    O: O1794 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\898:    O: O1796 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|899:    O: O1798 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/900:    O: O1800 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-901:    O: O1802 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/902:    O: O1804 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\903:    O: O1806 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/904:    O: O1808 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-905:    O: O1810 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\906:    O: O1812 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|907:    O: O1814 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/908:    O: O1816 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\909:    O: O1818 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|910:    O: O1820 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/911:    O: O1822 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|912:    O: O1823 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\913:    O: O1825 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|914:    O: O1828 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/915:    O: O1829 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-916:    O: O1832 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\917:    O: O1834 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/918:    O: O1836 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-919:    O: O1837 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\920:    O: O1839 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|921:    O: O1842 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\922:    O: O1844 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/923:    O: O1845 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-924:    O: O1848 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\925:    O: O1850 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|926:    O: O1852 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/927:    O: O1854 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-928:    O: O1856 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|929:    O: O1858 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/930:    O: O1860 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\931:    O: O1862 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-932:    O: O1864 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\933:    O: O1865 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|934:    O: O1868 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/935:    O: O1870 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\936:    O: O1872 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|937:    O: O1874 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/938:    O: O1876 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|939:    O: O1877 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/940:    O: O1880 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|941:    O: O1882 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\942:    O: O1884 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|943:    O: O1886 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\944:    O: O1887 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|945:    O: O1889 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-946:    O: O1892 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\947:    O: O1894 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|948:    O: O1896 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/949:    O: O1898 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-950:    O: O1900 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\-/|\-/--- Input Phase --- 
=>WM: (13307: I2 ^dir U)
=>WM: (13306: I2 ^reward 1)
=>WM: (13305: I2 ^see 0)
=>WM: (13304: N950 ^status complete)
<=WM: (13293: I2 ^dir U)
<=WM: (13292: I2 ^reward 1)
<=WM: (13291: I2 ^see 0)
=>WM: (13308: I2 ^level-1 R0-root)
<=WM: (13294: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R954 ^value 1 +)
 (R1 ^reward R954 +)
Firing propose*predict-yes
 -->
 (O1901 ^name predict-yes +)
 (S1 ^operator O1901 +)
Firing propose*predict-no
 -->
 (O1902 ^name predict-no +)
 (S1 ^operator O1902 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1900 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1899 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1900 ^name predict-no +)
 (S1 ^operator O1900 +)
Retracting propose*predict-yes
 -->
 (O1899 ^name predict-yes +)
 (S1 ^operator O1899 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R953 ^value 1 +)
 (R1 ^reward R953 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1900 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1899 = 0.)
=>WM: (13314: S1 ^operator O1902 +)
=>WM: (13313: S1 ^operator O1901 +)
=>WM: (13312: O1902 ^name predict-no)
=>WM: (13311: O1901 ^name predict-yes)
=>WM: (13310: R954 ^value 1)
=>WM: (13309: R1 ^reward R954)
<=WM: (13300: S1 ^operator O1899 +)
<=WM: (13301: S1 ^operator O1900 +)
<=WM: (13302: S1 ^operator O1900)
<=WM: (13295: R1 ^reward R953)
<=WM: (13298: O1900 ^name predict-no)
<=WM: (13297: O1899 ^name predict-yes)
<=WM: (13296: R953 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1901 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1902 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1900 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1899 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13315: S1 ^operator O1902)

   951:    O: O1902 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N951 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N950 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13316: I3 ^predict-no N951)
<=WM: (13304: N950 ^status complete)
<=WM: (13303: I3 ^predict-no N950)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|--- Input Phase --- 
=>WM: (13320: I2 ^dir L)
=>WM: (13319: I2 ^reward 1)
=>WM: (13318: I2 ^see 0)
=>WM: (13317: N951 ^status complete)
<=WM: (13307: I2 ^dir U)
<=WM: (13306: I2 ^reward 1)
<=WM: (13305: I2 ^see 0)
=>WM: (13321: I2 ^level-1 R0-root)
<=WM: (13308: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1901 = 0.6195564468661043)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1902 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R955 ^value 1 +)
 (R1 ^reward R955 +)
Firing propose*predict-yes
 -->
 (O1903 ^name predict-yes +)
 (S1 ^operator O1903 +)
Firing propose*predict-no
 -->
 (O1904 ^name predict-no +)
 (S1 ^operator O1904 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1902 = 0.314040627026034)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1901 = 0.3804224030022332)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1902 ^name predict-no +)
 (S1 ^operator O1902 +)
Retracting propose*predict-yes
 -->
 (O1901 ^name predict-yes +)
 (S1 ^operator O1901 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R954 ^value 1 +)
 (R1 ^reward R954 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1902 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1901 = 0.)
=>WM: (13328: S1 ^operator O1904 +)
=>WM: (13327: S1 ^operator O1903 +)
=>WM: (13326: I3 ^dir L)
=>WM: (13325: O1904 ^name predict-no)
=>WM: (13324: O1903 ^name predict-yes)
=>WM: (13323: R955 ^value 1)
=>WM: (13322: R1 ^reward R955)
<=WM: (13313: S1 ^operator O1901 +)
<=WM: (13314: S1 ^operator O1902 +)
<=WM: (13315: S1 ^operator O1902)
<=WM: (13299: I3 ^dir U)
<=WM: (13309: R1 ^reward R954)
<=WM: (13312: O1902 ^name predict-no)
<=WM: (13311: O1901 ^name predict-yes)
<=WM: (13310: R954 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1903 = 0.6195564468661043)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1903 = 0.3804224030022332)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1904 = -0.2190661556260421)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1904 = 0.314040627026034)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1902 = 0.314040627026034)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1902 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1901 = 0.3804224030022332)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1901 = 0.6195564468661043)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13329: S1 ^operator O1903)

   952:    O: O1903 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N952 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N951 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13330: I3 ^predict-yes N952)
<=WM: (13317: N951 ^status complete)
<=WM: (13316: I3 ^predict-no N951)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13334: I2 ^dir R)
=>WM: (13333: I2 ^reward 1)
=>WM: (13332: I2 ^see 1)
=>WM: (13331: N952 ^status complete)
<=WM: (13320: I2 ^dir L)
<=WM: (13319: I2 ^reward 1)
<=WM: (13318: I2 ^see 0)
=>WM: (13335: I2 ^level-1 L1-root)
<=WM: (13321: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1903 = 0.7066224695034091)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1904 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R956 ^value 1 +)
 (R1 ^reward R956 +)
Firing propose*predict-yes
 -->
 (O1905 ^name predict-yes +)
 (S1 ^operator O1905 +)
Firing propose*predict-no
 -->
 (O1906 ^name predict-no +)
 (S1 ^operator O1906 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1904 = 0.2298785768141863)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1903 = 0.2940444083423254)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1904 ^name predict-no +)
 (S1 ^operator O1904 +)
Retracting propose*predict-yes
 -->
 (O1903 ^name predict-yes +)
 (S1 ^operator O1903 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R955 ^value 1 +)
 (R1 ^reward R955 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1904 = 0.314040627026034)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1904 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1903 = 0.3804224030022332)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1903 = 0.6195564468661043)
=>WM: (13343: S1 ^operator O1906 +)
=>WM: (13342: S1 ^operator O1905 +)
=>WM: (13341: I3 ^dir R)
=>WM: (13340: O1906 ^name predict-no)
=>WM: (13339: O1905 ^name predict-yes)
=>WM: (13338: R956 ^value 1)
=>WM: (13337: R1 ^reward R956)
=>WM: (13336: I3 ^see 1)
<=WM: (13327: S1 ^operator O1903 +)
<=WM: (13329: S1 ^operator O1903)
<=WM: (13328: S1 ^operator O1904 +)
<=WM: (13326: I3 ^dir L)
<=WM: (13322: R1 ^reward R955)
<=WM: (13254: I3 ^see 0)
<=WM: (13325: O1904 ^name predict-no)
<=WM: (13324: O1903 ^name predict-yes)
<=WM: (13323: R955 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.2940444083423254)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1905 = 0.7066224695034091)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.2298785768141863)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1906 = -0.1937987592593187)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1904 = 0.2298785768141863)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1904 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1903 = 0.2940444083423254)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1903 = 0.7066224695034091)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521353 -0.140931 0.380422 -> 0.521355 -0.140931 0.380424(R,m,v=1,0.819355,0.148974)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478624 0.140933 0.619556 -> 0.478626 0.140932 0.619559(R,m,v=1,1,0)
=>WM: (13344: S1 ^operator O1905)

   953:    O: O1905 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N953 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N952 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13345: I3 ^predict-yes N953)
<=WM: (13331: N952 ^status complete)
<=WM: (13330: I3 ^predict-yes N952)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13349: I2 ^dir R)
=>WM: (13348: I2 ^reward 1)
=>WM: (13347: I2 ^see 1)
=>WM: (13346: N953 ^status complete)
<=WM: (13334: I2 ^dir R)
<=WM: (13333: I2 ^reward 1)
<=WM: (13332: I2 ^see 1)
=>WM: (13350: I2 ^level-1 R1-root)
<=WM: (13335: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1905 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1906 = 0.7702047625716166)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R957 ^value 1 +)
 (R1 ^reward R957 +)
Firing propose*predict-yes
 -->
 (O1907 ^name predict-yes +)
 (S1 ^operator O1907 +)
Firing propose*predict-no
 -->
 (O1908 ^name predict-no +)
 (S1 ^operator O1908 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.2298785768141863)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.2940444083423254)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1906 ^name predict-no +)
 (S1 ^operator O1906 +)
Retracting propose*predict-yes
 -->
 (O1905 ^name predict-yes +)
 (S1 ^operator O1905 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R956 ^value 1 +)
 (R1 ^reward R956 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1906 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.2298785768141863)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1905 = 0.7066224695034091)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.2940444083423254)
=>WM: (13356: S1 ^operator O1908 +)
=>WM: (13355: S1 ^operator O1907 +)
=>WM: (13354: O1908 ^name predict-no)
=>WM: (13353: O1907 ^name predict-yes)
=>WM: (13352: R957 ^value 1)
=>WM: (13351: R1 ^reward R957)
<=WM: (13342: S1 ^operator O1905 +)
<=WM: (13344: S1 ^operator O1905)
<=WM: (13343: S1 ^operator O1906 +)
<=WM: (13337: R1 ^reward R956)
<=WM: (13340: O1906 ^name predict-no)
<=WM: (13339: O1905 ^name predict-yes)
<=WM: (13338: R956 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1907 = 0.2940444083423254)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1907 = -0.252585164213872)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1908 = 0.2298785768141863)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1908 = 0.7702047625716166)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.2298785768141863)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1906 = 0.7702047625716166)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.2940444083423254)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1905 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501112 -0.207068 0.294044 -> 0.501062 -0.207073 0.293989(R,m,v=1,0.835616,0.138309)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499487 0.207136 0.706622 -> 0.499427 0.207129 0.706557(R,m,v=1,1,0)
=>WM: (13357: S1 ^operator O1908)

   954:    O: O1908 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N954 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N953 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13358: I3 ^predict-no N954)
<=WM: (13346: N953 ^status complete)
<=WM: (13345: I3 ^predict-yes N953)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13362: I2 ^dir U)
=>WM: (13361: I2 ^reward 1)
=>WM: (13360: I2 ^see 0)
=>WM: (13359: N954 ^status complete)
<=WM: (13349: I2 ^dir R)
<=WM: (13348: I2 ^reward 1)
<=WM: (13347: I2 ^see 1)
=>WM: (13363: I2 ^level-1 R0-root)
<=WM: (13350: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R958 ^value 1 +)
 (R1 ^reward R958 +)
Firing propose*predict-yes
 -->
 (O1909 ^name predict-yes +)
 (S1 ^operator O1909 +)
Firing propose*predict-no
 -->
 (O1910 ^name predict-no +)
 (S1 ^operator O1910 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1908 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1907 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1908 ^name predict-no +)
 (S1 ^operator O1908 +)
Retracting propose*predict-yes
 -->
 (O1907 ^name predict-yes +)
 (S1 ^operator O1907 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R957 ^value 1 +)
 (R1 ^reward R957 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1908 = 0.7702047625716166)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1908 = 0.2298785768141863)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1907 = -0.252585164213872)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1907 = 0.2939886829338975)
=>WM: (13371: S1 ^operator O1910 +)
=>WM: (13370: S1 ^operator O1909 +)
=>WM: (13369: I3 ^dir U)
=>WM: (13368: O1910 ^name predict-no)
=>WM: (13367: O1909 ^name predict-yes)
=>WM: (13366: R958 ^value 1)
=>WM: (13365: R1 ^reward R958)
=>WM: (13364: I3 ^see 0)
<=WM: (13355: S1 ^operator O1907 +)
<=WM: (13356: S1 ^operator O1908 +)
<=WM: (13357: S1 ^operator O1908)
<=WM: (13341: I3 ^dir R)
<=WM: (13351: R1 ^reward R957)
<=WM: (13336: I3 ^see 1)
<=WM: (13354: O1908 ^name predict-no)
<=WM: (13353: O1907 ^name predict-yes)
<=WM: (13352: R957 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1909 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1910 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1908 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1907 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611927 -0.382049 0.229879 -> 0.611922 -0.38205 0.229872(R,m,v=1,0.842105,0.133746)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388141 0.382064 0.770205 -> 0.388134 0.382063 0.770196(R,m,v=1,1,0)
=>WM: (13372: S1 ^operator O1910)

   955:    O: O1910 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N955 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N954 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13373: I3 ^predict-no N955)
<=WM: (13359: N954 ^status complete)
<=WM: (13358: I3 ^predict-no N954)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13377: I2 ^dir L)
=>WM: (13376: I2 ^reward 1)
=>WM: (13375: I2 ^see 0)
=>WM: (13374: N955 ^status complete)
<=WM: (13362: I2 ^dir U)
<=WM: (13361: I2 ^reward 1)
<=WM: (13360: I2 ^see 0)
=>WM: (13378: I2 ^level-1 R0-root)
<=WM: (13363: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1909 = 0.6195585094345952)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1910 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R959 ^value 1 +)
 (R1 ^reward R959 +)
Firing propose*predict-yes
 -->
 (O1911 ^name predict-yes +)
 (S1 ^operator O1911 +)
Firing propose*predict-no
 -->
 (O1912 ^name predict-no +)
 (S1 ^operator O1912 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1910 = 0.314040627026034)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1909 = 0.3804241528486575)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1910 ^name predict-no +)
 (S1 ^operator O1910 +)
Retracting propose*predict-yes
 -->
 (O1909 ^name predict-yes +)
 (S1 ^operator O1909 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R958 ^value 1 +)
 (R1 ^reward R958 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1910 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1909 = 0.)
=>WM: (13385: S1 ^operator O1912 +)
=>WM: (13384: S1 ^operator O1911 +)
=>WM: (13383: I3 ^dir L)
=>WM: (13382: O1912 ^name predict-no)
=>WM: (13381: O1911 ^name predict-yes)
=>WM: (13380: R959 ^value 1)
=>WM: (13379: R1 ^reward R959)
<=WM: (13370: S1 ^operator O1909 +)
<=WM: (13371: S1 ^operator O1910 +)
<=WM: (13372: S1 ^operator O1910)
<=WM: (13369: I3 ^dir U)
<=WM: (13365: R1 ^reward R958)
<=WM: (13368: O1910 ^name predict-no)
<=WM: (13367: O1909 ^name predict-yes)
<=WM: (13366: R958 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1911 = 0.6195585094345952)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.3804241528486575)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1912 = -0.2190661556260421)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 0.314040627026034)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1910 = 0.314040627026034)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1910 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1909 = 0.3804241528486575)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1909 = 0.6195585094345952)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13386: S1 ^operator O1911)

   956:    O: O1911 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N956 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N955 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13387: I3 ^predict-yes N956)
<=WM: (13374: N955 ^status complete)
<=WM: (13373: I3 ^predict-no N955)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13391: I2 ^dir L)
=>WM: (13390: I2 ^reward 1)
=>WM: (13389: I2 ^see 1)
=>WM: (13388: N956 ^status complete)
<=WM: (13377: I2 ^dir L)
<=WM: (13376: I2 ^reward 1)
<=WM: (13375: I2 ^see 0)
=>WM: (13392: I2 ^level-1 L1-root)
<=WM: (13378: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1911 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1912 = 0.6861879370801713)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R960 ^value 1 +)
 (R1 ^reward R960 +)
Firing propose*predict-yes
 -->
 (O1913 ^name predict-yes +)
 (S1 ^operator O1913 +)
Firing propose*predict-no
 -->
 (O1914 ^name predict-no +)
 (S1 ^operator O1914 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 0.314040627026034)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.3804241528486575)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1912 ^name predict-no +)
 (S1 ^operator O1912 +)
Retracting propose*predict-yes
 -->
 (O1911 ^name predict-yes +)
 (S1 ^operator O1911 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R959 ^value 1 +)
 (R1 ^reward R959 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 0.314040627026034)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1912 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.3804241528486575)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1911 = 0.6195585094345952)
=>WM: (13399: S1 ^operator O1914 +)
=>WM: (13398: S1 ^operator O1913 +)
=>WM: (13397: O1914 ^name predict-no)
=>WM: (13396: O1913 ^name predict-yes)
=>WM: (13395: R960 ^value 1)
=>WM: (13394: R1 ^reward R960)
=>WM: (13393: I3 ^see 1)
<=WM: (13384: S1 ^operator O1911 +)
<=WM: (13386: S1 ^operator O1911)
<=WM: (13385: S1 ^operator O1912 +)
<=WM: (13379: R1 ^reward R959)
<=WM: (13364: I3 ^see 0)
<=WM: (13382: O1912 ^name predict-no)
<=WM: (13381: O1911 ^name predict-yes)
<=WM: (13380: R959 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1913 = 0.3804241528486575)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1913 = -0.3470159027404986)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1914 = 0.314040627026034)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1914 = 0.6861879370801713)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 0.314040627026034)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1912 = 0.6861879370801713)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.3804241528486575)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1911 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521355 -0.140931 0.380424 -> 0.521357 -0.140931 0.380426(R,m,v=1,0.820513,0.148222)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478626 0.140932 0.619559 -> 0.478628 0.140932 0.61956(R,m,v=1,1,0)
=>WM: (13400: S1 ^operator O1914)

   957:    O: O1914 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N957 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N956 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13401: I3 ^predict-no N957)
<=WM: (13388: N956 ^status complete)
<=WM: (13387: I3 ^predict-yes N956)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13405: I2 ^dir L)
=>WM: (13404: I2 ^reward 1)
=>WM: (13403: I2 ^see 0)
=>WM: (13402: N957 ^status complete)
<=WM: (13391: I2 ^dir L)
<=WM: (13390: I2 ^reward 1)
<=WM: (13389: I2 ^see 1)
=>WM: (13406: I2 ^level-1 L0-root)
<=WM: (13392: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O1913 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O1914 = 0.6857507825115492)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R961 ^value 1 +)
 (R1 ^reward R961 +)
Firing propose*predict-yes
 -->
 (O1915 ^name predict-yes +)
 (S1 ^operator O1915 +)
Firing propose*predict-no
 -->
 (O1916 ^name predict-no +)
 (S1 ^operator O1916 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1914 = 0.314040627026034)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1913 = 0.3804255857519139)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1914 ^name predict-no +)
 (S1 ^operator O1914 +)
Retracting propose*predict-yes
 -->
 (O1913 ^name predict-yes +)
 (S1 ^operator O1913 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R960 ^value 1 +)
 (R1 ^reward R960 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1914 = 0.6861879370801713)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1914 = 0.314040627026034)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1913 = -0.3470159027404986)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1913 = 0.3804255857519139)
=>WM: (13413: S1 ^operator O1916 +)
=>WM: (13412: S1 ^operator O1915 +)
=>WM: (13411: O1916 ^name predict-no)
=>WM: (13410: O1915 ^name predict-yes)
=>WM: (13409: R961 ^value 1)
=>WM: (13408: R1 ^reward R961)
=>WM: (13407: I3 ^see 0)
<=WM: (13398: S1 ^operator O1913 +)
<=WM: (13399: S1 ^operator O1914 +)
<=WM: (13400: S1 ^operator O1914)
<=WM: (13394: R1 ^reward R960)
<=WM: (13393: I3 ^see 1)
<=WM: (13397: O1914 ^name predict-no)
<=WM: (13396: O1913 ^name predict-yes)
<=WM: (13395: R960 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1915 = 0.3804255857519139)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O1915 = -0.3332708974800781)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1916 = 0.314040627026034)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O1916 = 0.6857507825115492)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1914 = 0.314040627026034)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O1914 = 0.6857507825115492)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1913 = 0.3804255857519139)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O1913 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485046 -0.171006 0.314041 -> 0.485031 -0.17101 0.314022(R,m,v=1,0.858108,0.122587)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515134 0.171054 0.686188 -> 0.515116 0.171049 0.686165(R,m,v=1,1,0)
=>WM: (13414: S1 ^operator O1916)

   958:    O: O1916 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N958 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N957 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13415: I3 ^predict-no N958)
<=WM: (13402: N957 ^status complete)
<=WM: (13401: I3 ^predict-no N957)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13419: I2 ^dir R)
=>WM: (13418: I2 ^reward 1)
=>WM: (13417: I2 ^see 0)
=>WM: (13416: N958 ^status complete)
<=WM: (13405: I2 ^dir L)
<=WM: (13404: I2 ^reward 1)
<=WM: (13403: I2 ^see 0)
=>WM: (13420: I2 ^level-1 L0-root)
<=WM: (13406: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O1915 = 0.7053811599250611)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O1916 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R962 ^value 1 +)
 (R1 ^reward R962 +)
Firing propose*predict-yes
 -->
 (O1917 ^name predict-yes +)
 (S1 ^operator O1917 +)
Firing propose*predict-no
 -->
 (O1918 ^name predict-no +)
 (S1 ^operator O1918 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1916 = 0.2298717920574965)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1915 = 0.2939886829338975)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1916 ^name predict-no +)
 (S1 ^operator O1916 +)
Retracting propose*predict-yes
 -->
 (O1915 ^name predict-yes +)
 (S1 ^operator O1915 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R961 ^value 1 +)
 (R1 ^reward R961 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O1916 = 0.6857507825115492)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1916 = 0.3140215711634288)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O1915 = -0.3332708974800781)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1915 = 0.3804255857519139)
=>WM: (13427: S1 ^operator O1918 +)
=>WM: (13426: S1 ^operator O1917 +)
=>WM: (13425: I3 ^dir R)
=>WM: (13424: O1918 ^name predict-no)
=>WM: (13423: O1917 ^name predict-yes)
=>WM: (13422: R962 ^value 1)
=>WM: (13421: R1 ^reward R962)
<=WM: (13412: S1 ^operator O1915 +)
<=WM: (13413: S1 ^operator O1916 +)
<=WM: (13414: S1 ^operator O1916)
<=WM: (13383: I3 ^dir L)
<=WM: (13408: R1 ^reward R961)
<=WM: (13411: O1916 ^name predict-no)
<=WM: (13410: O1915 ^name predict-yes)
<=WM: (13409: R961 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O1917 = 0.7053811599250611)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1917 = 0.2939886829338975)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O1918 = -0.2023211881870005)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1918 = 0.2298717920574965)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1916 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O1916 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1915 = 0.2939886829338975)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O1915 = 0.7053811599250611)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485031 -0.17101 0.314022 -> 0.485046 -0.171006 0.314041(R,m,v=1,0.85906,0.121894)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514789 0.170962 0.685751 -> 0.514806 0.170967 0.685773(R,m,v=1,1,0)
=>WM: (13428: S1 ^operator O1917)

   959:    O: O1917 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N959 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N958 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13429: I3 ^predict-yes N959)
<=WM: (13416: N958 ^status complete)
<=WM: (13415: I3 ^predict-no N958)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13433: I2 ^dir U)
=>WM: (13432: I2 ^reward 1)
=>WM: (13431: I2 ^see 1)
=>WM: (13430: N959 ^status complete)
<=WM: (13419: I2 ^dir R)
<=WM: (13418: I2 ^reward 1)
<=WM: (13417: I2 ^see 0)
=>WM: (13434: I2 ^level-1 R1-root)
<=WM: (13420: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R963 ^value 1 +)
 (R1 ^reward R963 +)
Firing propose*predict-yes
 -->
 (O1919 ^name predict-yes +)
 (S1 ^operator O1919 +)
Firing propose*predict-no
 -->
 (O1920 ^name predict-no +)
 (S1 ^operator O1920 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1918 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1917 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1918 ^name predict-no +)
 (S1 ^operator O1918 +)
Retracting propose*predict-yes
 -->
 (O1917 ^name predict-yes +)
 (S1 ^operator O1917 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R962 ^value 1 +)
 (R1 ^reward R962 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1918 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O1918 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1917 = 0.2939886829338975)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O1917 = 0.7053811599250611)
=>WM: (13442: S1 ^operator O1920 +)
=>WM: (13441: S1 ^operator O1919 +)
=>WM: (13440: I3 ^dir U)
=>WM: (13439: O1920 ^name predict-no)
=>WM: (13438: O1919 ^name predict-yes)
=>WM: (13437: R963 ^value 1)
=>WM: (13436: R1 ^reward R963)
=>WM: (13435: I3 ^see 1)
<=WM: (13426: S1 ^operator O1917 +)
<=WM: (13428: S1 ^operator O1917)
<=WM: (13427: S1 ^operator O1918 +)
<=WM: (13425: I3 ^dir R)
<=WM: (13421: R1 ^reward R962)
<=WM: (13407: I3 ^see 0)
<=WM: (13424: O1918 ^name predict-no)
<=WM: (13423: O1917 ^name predict-yes)
<=WM: (13422: R962 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1919 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1920 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1918 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1917 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501062 -0.207073 0.293989 -> 0.50111 -0.207069 0.294041(R,m,v=1,0.836735,0.137545)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498366 0.207015 0.705381 -> 0.498423 0.207021 0.705444(R,m,v=1,1,0)
=>WM: (13443: S1 ^operator O1920)

   960:    O: O1920 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N960 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N959 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13444: I3 ^predict-no N960)
<=WM: (13430: N959 ^status complete)
<=WM: (13429: I3 ^predict-yes N959)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (13448: I2 ^dir U)
=>WM: (13447: I2 ^reward 1)
=>WM: (13446: I2 ^see 0)
=>WM: (13445: N960 ^status complete)
<=WM: (13433: I2 ^dir U)
<=WM: (13432: I2 ^reward 1)
<=WM: (13431: I2 ^see 1)
=>WM: (13449: I2 ^level-1 R1-root)
<=WM: (13434: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R964 ^value 1 +)
 (R1 ^reward R964 +)
Firing propose*predict-yes
 -->
 (O1921 ^name predict-yes +)
 (S1 ^operator O1921 +)
Firing propose*predict-no
 -->
 (O1922 ^name predict-no +)
 (S1 ^operator O1922 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1920 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1919 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1920 ^name predict-no +)
 (S1 ^operator O1920 +)
Retracting propose*predict-yes
 -->
 (O1919 ^name predict-yes +)
 (S1 ^operator O1919 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R963 ^value 1 +)
 (R1 ^reward R963 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1920 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1919 = 0.)
=>WM: (13456: S1 ^operator O1922 +)
=>WM: (13455: S1 ^operator O1921 +)
=>WM: (13454: O1922 ^name predict-no)
=>WM: (13453: O1921 ^name predict-yes)
=>WM: (13452: R964 ^value 1)
=>WM: (13451: R1 ^reward R964)
=>WM: (13450: I3 ^see 0)
<=WM: (13441: S1 ^operator O1919 +)
<=WM: (13442: S1 ^operator O1920 +)
<=WM: (13443: S1 ^operator O1920)
<=WM: (13436: R1 ^reward R963)
<=WM: (13435: I3 ^see 1)
<=WM: (13439: O1920 ^name predict-no)
<=WM: (13438: O1919 ^name predict-yes)
<=WM: (13437: R963 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1921 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1922 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1920 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1919 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13457: S1 ^operator O1922)

   961:    O: O1922 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N961 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N960 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13458: I3 ^predict-no N961)
<=WM: (13445: N960 ^status complete)
<=WM: (13444: I3 ^predict-no N960)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/--- Input Phase --- 
=>WM: (13462: I2 ^dir U)
=>WM: (13461: I2 ^reward 1)
=>WM: (13460: I2 ^see 0)
=>WM: (13459: N961 ^status complete)
<=WM: (13448: I2 ^dir U)
<=WM: (13447: I2 ^reward 1)
<=WM: (13446: I2 ^see 0)
=>WM: (13463: I2 ^level-1 R1-root)
<=WM: (13449: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R965 ^value 1 +)
 (R1 ^reward R965 +)
Firing propose*predict-yes
 -->
 (O1923 ^name predict-yes +)
 (S1 ^operator O1923 +)
Firing propose*predict-no
 -->
 (O1924 ^name predict-no +)
 (S1 ^operator O1924 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1922 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1921 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1922 ^name predict-no +)
 (S1 ^operator O1922 +)
Retracting propose*predict-yes
 -->
 (O1921 ^name predict-yes +)
 (S1 ^operator O1921 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R964 ^value 1 +)
 (R1 ^reward R964 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1922 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1921 = 0.)
=>WM: (13469: S1 ^operator O1924 +)
=>WM: (13468: S1 ^operator O1923 +)
=>WM: (13467: O1924 ^name predict-no)
=>WM: (13466: O1923 ^name predict-yes)
=>WM: (13465: R965 ^value 1)
=>WM: (13464: R1 ^reward R965)
<=WM: (13455: S1 ^operator O1921 +)
<=WM: (13456: S1 ^operator O1922 +)
<=WM: (13457: S1 ^operator O1922)
<=WM: (13451: R1 ^reward R964)
<=WM: (13454: O1922 ^name predict-no)
<=WM: (13453: O1921 ^name predict-yes)
<=WM: (13452: R964 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1922 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1921 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13470: S1 ^operator O1924)

   962:    O: O1924 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N962 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N961 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13471: I3 ^predict-no N962)
<=WM: (13459: N961 ^status complete)
<=WM: (13458: I3 ^predict-no N961)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (13475: I2 ^dir U)
=>WM: (13474: I2 ^reward 1)
=>WM: (13473: I2 ^see 0)
=>WM: (13472: N962 ^status complete)
<=WM: (13462: I2 ^dir U)
<=WM: (13461: I2 ^reward 1)
<=WM: (13460: I2 ^see 0)
=>WM: (13476: I2 ^level-1 R1-root)
<=WM: (13463: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R966 ^value 1 +)
 (R1 ^reward R966 +)
Firing propose*predict-yes
 -->
 (O1925 ^name predict-yes +)
 (S1 ^operator O1925 +)
Firing propose*predict-no
 -->
 (O1926 ^name predict-no +)
 (S1 ^operator O1926 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1924 ^name predict-no +)
 (S1 ^operator O1924 +)
Retracting propose*predict-yes
 -->
 (O1923 ^name predict-yes +)
 (S1 ^operator O1923 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R965 ^value 1 +)
 (R1 ^reward R965 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.)
=>WM: (13482: S1 ^operator O1926 +)
=>WM: (13481: S1 ^operator O1925 +)
=>WM: (13480: O1926 ^name predict-no)
=>WM: (13479: O1925 ^name predict-yes)
=>WM: (13478: R966 ^value 1)
=>WM: (13477: R1 ^reward R966)
<=WM: (13468: S1 ^operator O1923 +)
<=WM: (13469: S1 ^operator O1924 +)
<=WM: (13470: S1 ^operator O1924)
<=WM: (13464: R1 ^reward R965)
<=WM: (13467: O1924 ^name predict-no)
<=WM: (13466: O1923 ^name predict-yes)
<=WM: (13465: R965 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1925 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1926 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13483: S1 ^operator O1926)

   963:    O: O1926 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N963 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N962 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13484: I3 ^predict-no N963)
<=WM: (13472: N962 ^status complete)
<=WM: (13471: I3 ^predict-no N962)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
---- Input Phase --- 
=>WM: (13488: I2 ^dir L)
=>WM: (13487: I2 ^reward 1)
=>WM: (13486: I2 ^see 0)
=>WM: (13485: N963 ^status complete)
<=WM: (13475: I2 ^dir U)
<=WM: (13474: I2 ^reward 1)
<=WM: (13473: I2 ^see 0)
=>WM: (13489: I2 ^level-1 R1-root)
<=WM: (13476: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1925 = 0.619629119351056)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1926 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R967 ^value 1 +)
 (R1 ^reward R967 +)
Firing propose*predict-yes
 -->
 (O1927 ^name predict-yes +)
 (S1 ^operator O1927 +)
Firing propose*predict-no
 -->
 (O1928 ^name predict-no +)
 (S1 ^operator O1928 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1926 = 0.3140405292214645)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1925 = 0.3804255857519139)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1926 ^name predict-no +)
 (S1 ^operator O1926 +)
Retracting propose*predict-yes
 -->
 (O1925 ^name predict-yes +)
 (S1 ^operator O1925 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R966 ^value 1 +)
 (R1 ^reward R966 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1926 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1925 = 0.)
=>WM: (13496: S1 ^operator O1928 +)
=>WM: (13495: S1 ^operator O1927 +)
=>WM: (13494: I3 ^dir L)
=>WM: (13493: O1928 ^name predict-no)
=>WM: (13492: O1927 ^name predict-yes)
=>WM: (13491: R967 ^value 1)
=>WM: (13490: R1 ^reward R967)
<=WM: (13481: S1 ^operator O1925 +)
<=WM: (13482: S1 ^operator O1926 +)
<=WM: (13483: S1 ^operator O1926)
<=WM: (13440: I3 ^dir U)
<=WM: (13477: R1 ^reward R966)
<=WM: (13480: O1926 ^name predict-no)
<=WM: (13479: O1925 ^name predict-yes)
<=WM: (13478: R966 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1927 = 0.619629119351056)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1927 = 0.3804255857519139)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1928 = -0.1479504104026684)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1928 = 0.3140405292214645)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1926 = 0.3140405292214645)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1926 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1925 = 0.3804255857519139)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1925 = 0.619629119351056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13497: S1 ^operator O1927)

   964:    O: O1927 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N964 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N963 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13498: I3 ^predict-yes N964)
<=WM: (13485: N963 ^status complete)
<=WM: (13484: I3 ^predict-no N963)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13502: I2 ^dir R)
=>WM: (13501: I2 ^reward 1)
=>WM: (13500: I2 ^see 1)
=>WM: (13499: N964 ^status complete)
<=WM: (13488: I2 ^dir L)
<=WM: (13487: I2 ^reward 1)
<=WM: (13486: I2 ^see 0)
=>WM: (13503: I2 ^level-1 L1-root)
<=WM: (13489: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1927 = 0.7065565782519569)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1928 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R968 ^value 1 +)
 (R1 ^reward R968 +)
Firing propose*predict-yes
 -->
 (O1929 ^name predict-yes +)
 (S1 ^operator O1929 +)
Firing propose*predict-no
 -->
 (O1930 ^name predict-no +)
 (S1 ^operator O1930 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1928 = 0.2298717920574965)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1927 = 0.2940412798984666)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1928 ^name predict-no +)
 (S1 ^operator O1928 +)
Retracting propose*predict-yes
 -->
 (O1927 ^name predict-yes +)
 (S1 ^operator O1927 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R967 ^value 1 +)
 (R1 ^reward R967 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1928 = 0.3140405292214645)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1928 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1927 = 0.3804255857519139)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1927 = 0.619629119351056)
=>WM: (13511: S1 ^operator O1930 +)
=>WM: (13510: S1 ^operator O1929 +)
=>WM: (13509: I3 ^dir R)
=>WM: (13508: O1930 ^name predict-no)
=>WM: (13507: O1929 ^name predict-yes)
=>WM: (13506: R968 ^value 1)
=>WM: (13505: R1 ^reward R968)
=>WM: (13504: I3 ^see 1)
<=WM: (13495: S1 ^operator O1927 +)
<=WM: (13497: S1 ^operator O1927)
<=WM: (13496: S1 ^operator O1928 +)
<=WM: (13494: I3 ^dir L)
<=WM: (13490: R1 ^reward R967)
<=WM: (13450: I3 ^see 0)
<=WM: (13493: O1928 ^name predict-no)
<=WM: (13492: O1927 ^name predict-yes)
<=WM: (13491: R967 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1929 = 0.2940412798984666)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1929 = 0.7065565782519569)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1930 = 0.2298717920574965)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1930 = -0.1937987592593187)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1928 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1928 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1927 = 0.2940412798984666)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1927 = 0.7065565782519569)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521357 -0.140931 0.380426 -> 0.521352 -0.140931 0.380421(R,m,v=1,0.821656,0.147477)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478703 0.140926 0.619629 -> 0.478697 0.140926 0.619624(R,m,v=1,1,0)
=>WM: (13512: S1 ^operator O1929)

   965:    O: O1929 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N965 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N964 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13513: I3 ^predict-yes N965)
<=WM: (13499: N964 ^status complete)
<=WM: (13498: I3 ^predict-yes N964)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13517: I2 ^dir U)
=>WM: (13516: I2 ^reward 1)
=>WM: (13515: I2 ^see 1)
=>WM: (13514: N965 ^status complete)
<=WM: (13502: I2 ^dir R)
<=WM: (13501: I2 ^reward 1)
<=WM: (13500: I2 ^see 1)
=>WM: (13518: I2 ^level-1 R1-root)
<=WM: (13503: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R969 ^value 1 +)
 (R1 ^reward R969 +)
Firing propose*predict-yes
 -->
 (O1931 ^name predict-yes +)
 (S1 ^operator O1931 +)
Firing propose*predict-no
 -->
 (O1932 ^name predict-no +)
 (S1 ^operator O1932 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1930 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1929 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1930 ^name predict-no +)
 (S1 ^operator O1930 +)
Retracting propose*predict-yes
 -->
 (O1929 ^name predict-yes +)
 (S1 ^operator O1929 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R968 ^value 1 +)
 (R1 ^reward R968 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1930 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1930 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1929 = 0.7065565782519569)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1929 = 0.2940412798984666)
=>WM: (13525: S1 ^operator O1932 +)
=>WM: (13524: S1 ^operator O1931 +)
=>WM: (13523: I3 ^dir U)
=>WM: (13522: O1932 ^name predict-no)
=>WM: (13521: O1931 ^name predict-yes)
=>WM: (13520: R969 ^value 1)
=>WM: (13519: R1 ^reward R969)
<=WM: (13510: S1 ^operator O1929 +)
<=WM: (13512: S1 ^operator O1929)
<=WM: (13511: S1 ^operator O1930 +)
<=WM: (13509: I3 ^dir R)
<=WM: (13505: R1 ^reward R968)
<=WM: (13508: O1930 ^name predict-no)
<=WM: (13507: O1929 ^name predict-yes)
<=WM: (13506: R968 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1931 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1932 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1930 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1929 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.50111 -0.207069 0.294041 -> 0.501065 -0.207074 0.293991(R,m,v=1,0.837838,0.13679)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499427 0.207129 0.706557 -> 0.499374 0.207123 0.706498(R,m,v=1,1,0)
=>WM: (13526: S1 ^operator O1932)

   966:    O: O1932 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N966 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N965 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13527: I3 ^predict-no N966)
<=WM: (13514: N965 ^status complete)
<=WM: (13513: I3 ^predict-yes N965)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13531: I2 ^dir L)
=>WM: (13530: I2 ^reward 1)
=>WM: (13529: I2 ^see 0)
=>WM: (13528: N966 ^status complete)
<=WM: (13517: I2 ^dir U)
<=WM: (13516: I2 ^reward 1)
<=WM: (13515: I2 ^see 1)
=>WM: (13532: I2 ^level-1 R1-root)
<=WM: (13518: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1931 = 0.6196238010864294)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1932 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R970 ^value 1 +)
 (R1 ^reward R970 +)
Firing propose*predict-yes
 -->
 (O1933 ^name predict-yes +)
 (S1 ^operator O1933 +)
Firing propose*predict-no
 -->
 (O1934 ^name predict-no +)
 (S1 ^operator O1934 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1932 = 0.3140405292214645)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1931 = 0.380421069331616)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1932 ^name predict-no +)
 (S1 ^operator O1932 +)
Retracting propose*predict-yes
 -->
 (O1931 ^name predict-yes +)
 (S1 ^operator O1931 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R969 ^value 1 +)
 (R1 ^reward R969 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1932 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1931 = 0.)
=>WM: (13540: S1 ^operator O1934 +)
=>WM: (13539: S1 ^operator O1933 +)
=>WM: (13538: I3 ^dir L)
=>WM: (13537: O1934 ^name predict-no)
=>WM: (13536: O1933 ^name predict-yes)
=>WM: (13535: R970 ^value 1)
=>WM: (13534: R1 ^reward R970)
=>WM: (13533: I3 ^see 0)
<=WM: (13524: S1 ^operator O1931 +)
<=WM: (13525: S1 ^operator O1932 +)
<=WM: (13526: S1 ^operator O1932)
<=WM: (13523: I3 ^dir U)
<=WM: (13519: R1 ^reward R969)
<=WM: (13504: I3 ^see 1)
<=WM: (13522: O1932 ^name predict-no)
<=WM: (13521: O1931 ^name predict-yes)
<=WM: (13520: R969 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1933 = 0.6196238010864294)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1933 = 0.380421069331616)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1934 = -0.1479504104026684)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1934 = 0.3140405292214645)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1932 = 0.3140405292214645)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1932 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1931 = 0.380421069331616)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1931 = 0.6196238010864294)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13541: S1 ^operator O1933)

   967:    O: O1933 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N967 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N966 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13542: I3 ^predict-yes N967)
<=WM: (13528: N966 ^status complete)
<=WM: (13527: I3 ^predict-no N966)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13546: I2 ^dir R)
=>WM: (13545: I2 ^reward 1)
=>WM: (13544: I2 ^see 1)
=>WM: (13543: N967 ^status complete)
<=WM: (13531: I2 ^dir L)
<=WM: (13530: I2 ^reward 1)
<=WM: (13529: I2 ^see 0)
=>WM: (13547: I2 ^level-1 L1-root)
<=WM: (13532: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1933 = 0.7064977054068989)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1934 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R971 ^value 1 +)
 (R1 ^reward R971 +)
Firing propose*predict-yes
 -->
 (O1935 ^name predict-yes +)
 (S1 ^operator O1935 +)
Firing propose*predict-no
 -->
 (O1936 ^name predict-no +)
 (S1 ^operator O1936 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1934 = 0.2298717920574965)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1933 = 0.2939914352270483)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1934 ^name predict-no +)
 (S1 ^operator O1934 +)
Retracting propose*predict-yes
 -->
 (O1933 ^name predict-yes +)
 (S1 ^operator O1933 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R970 ^value 1 +)
 (R1 ^reward R970 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1934 = 0.3140405292214645)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1934 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1933 = 0.380421069331616)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1933 = 0.6196238010864294)
=>WM: (13555: S1 ^operator O1936 +)
=>WM: (13554: S1 ^operator O1935 +)
=>WM: (13553: I3 ^dir R)
=>WM: (13552: O1936 ^name predict-no)
=>WM: (13551: O1935 ^name predict-yes)
=>WM: (13550: R971 ^value 1)
=>WM: (13549: R1 ^reward R971)
=>WM: (13548: I3 ^see 1)
<=WM: (13539: S1 ^operator O1933 +)
<=WM: (13541: S1 ^operator O1933)
<=WM: (13540: S1 ^operator O1934 +)
<=WM: (13538: I3 ^dir L)
<=WM: (13534: R1 ^reward R970)
<=WM: (13533: I3 ^see 0)
<=WM: (13537: O1934 ^name predict-no)
<=WM: (13536: O1933 ^name predict-yes)
<=WM: (13535: R970 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1935 = 0.2939914352270483)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1935 = 0.7064977054068989)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1936 = 0.2298717920574965)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1936 = -0.1937987592593187)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1934 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1934 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1933 = 0.2939914352270483)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1933 = 0.7064977054068989)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521352 -0.140931 0.380421 -> 0.521348 -0.14093 0.380417(R,m,v=1,0.822785,0.146739)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478697 0.140926 0.619624 -> 0.478693 0.140927 0.619619(R,m,v=1,1,0)
=>WM: (13556: S1 ^operator O1935)

   968:    O: O1935 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N968 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N967 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13557: I3 ^predict-yes N968)
<=WM: (13543: N967 ^status complete)
<=WM: (13542: I3 ^predict-yes N967)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13561: I2 ^dir U)
=>WM: (13560: I2 ^reward 1)
=>WM: (13559: I2 ^see 1)
=>WM: (13558: N968 ^status complete)
<=WM: (13546: I2 ^dir R)
<=WM: (13545: I2 ^reward 1)
<=WM: (13544: I2 ^see 1)
=>WM: (13562: I2 ^level-1 R1-root)
<=WM: (13547: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R972 ^value 1 +)
 (R1 ^reward R972 +)
Firing propose*predict-yes
 -->
 (O1937 ^name predict-yes +)
 (S1 ^operator O1937 +)
Firing propose*predict-no
 -->
 (O1938 ^name predict-no +)
 (S1 ^operator O1938 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1936 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1935 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1936 ^name predict-no +)
 (S1 ^operator O1936 +)
Retracting propose*predict-yes
 -->
 (O1935 ^name predict-yes +)
 (S1 ^operator O1935 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R971 ^value 1 +)
 (R1 ^reward R971 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1936 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1936 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1935 = 0.7064977054068989)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1935 = 0.2939914352270483)
=>WM: (13569: S1 ^operator O1938 +)
=>WM: (13568: S1 ^operator O1937 +)
=>WM: (13567: I3 ^dir U)
=>WM: (13566: O1938 ^name predict-no)
=>WM: (13565: O1937 ^name predict-yes)
=>WM: (13564: R972 ^value 1)
=>WM: (13563: R1 ^reward R972)
<=WM: (13554: S1 ^operator O1935 +)
<=WM: (13556: S1 ^operator O1935)
<=WM: (13555: S1 ^operator O1936 +)
<=WM: (13553: I3 ^dir R)
<=WM: (13549: R1 ^reward R971)
<=WM: (13552: O1936 ^name predict-no)
<=WM: (13551: O1935 ^name predict-yes)
<=WM: (13550: R971 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1937 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1938 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1936 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1935 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501065 -0.207074 0.293991 -> 0.501028 -0.207078 0.293951(R,m,v=1,0.838926,0.136042)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499374 0.207123 0.706498 -> 0.499331 0.207118 0.70645(R,m,v=1,1,0)
=>WM: (13570: S1 ^operator O1938)

   969:    O: O1938 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N969 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N968 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13571: I3 ^predict-no N969)
<=WM: (13558: N968 ^status complete)
<=WM: (13557: I3 ^predict-yes N968)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13575: I2 ^dir L)
=>WM: (13574: I2 ^reward 1)
=>WM: (13573: I2 ^see 0)
=>WM: (13572: N969 ^status complete)
<=WM: (13561: I2 ^dir U)
<=WM: (13560: I2 ^reward 1)
<=WM: (13559: I2 ^see 1)
=>WM: (13576: I2 ^level-1 R1-root)
<=WM: (13562: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1937 = 0.6196194522363663)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1938 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R973 ^value 1 +)
 (R1 ^reward R973 +)
Firing propose*predict-yes
 -->
 (O1939 ^name predict-yes +)
 (S1 ^operator O1939 +)
Firing propose*predict-no
 -->
 (O1940 ^name predict-no +)
 (S1 ^operator O1940 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1938 = 0.3140405292214645)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1937 = 0.3804173687365902)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1938 ^name predict-no +)
 (S1 ^operator O1938 +)
Retracting propose*predict-yes
 -->
 (O1937 ^name predict-yes +)
 (S1 ^operator O1937 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R972 ^value 1 +)
 (R1 ^reward R972 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1938 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1937 = 0.)
=>WM: (13584: S1 ^operator O1940 +)
=>WM: (13583: S1 ^operator O1939 +)
=>WM: (13582: I3 ^dir L)
=>WM: (13581: O1940 ^name predict-no)
=>WM: (13580: O1939 ^name predict-yes)
=>WM: (13579: R973 ^value 1)
=>WM: (13578: R1 ^reward R973)
=>WM: (13577: I3 ^see 0)
<=WM: (13568: S1 ^operator O1937 +)
<=WM: (13569: S1 ^operator O1938 +)
<=WM: (13570: S1 ^operator O1938)
<=WM: (13567: I3 ^dir U)
<=WM: (13563: R1 ^reward R972)
<=WM: (13548: I3 ^see 1)
<=WM: (13566: O1938 ^name predict-no)
<=WM: (13565: O1937 ^name predict-yes)
<=WM: (13564: R972 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1939 = 0.6196194522363663)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1939 = 0.3804173687365902)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1940 = -0.1479504104026684)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1940 = 0.3140405292214645)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1938 = 0.3140405292214645)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1938 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1937 = 0.3804173687365902)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1937 = 0.6196194522363663)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13585: S1 ^operator O1939)

   970:    O: O1939 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N970 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N969 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13586: I3 ^predict-yes N970)
<=WM: (13572: N969 ^status complete)
<=WM: (13571: I3 ^predict-no N969)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13590: I2 ^dir U)
=>WM: (13589: I2 ^reward 1)
=>WM: (13588: I2 ^see 1)
=>WM: (13587: N970 ^status complete)
<=WM: (13575: I2 ^dir L)
<=WM: (13574: I2 ^reward 1)
<=WM: (13573: I2 ^see 0)
=>WM: (13591: I2 ^level-1 L1-root)
<=WM: (13576: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R974 ^value 1 +)
 (R1 ^reward R974 +)
Firing propose*predict-yes
 -->
 (O1941 ^name predict-yes +)
 (S1 ^operator O1941 +)
Firing propose*predict-no
 -->
 (O1942 ^name predict-no +)
 (S1 ^operator O1942 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1940 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1939 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1940 ^name predict-no +)
 (S1 ^operator O1940 +)
Retracting propose*predict-yes
 -->
 (O1939 ^name predict-yes +)
 (S1 ^operator O1939 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R973 ^value 1 +)
 (R1 ^reward R973 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1940 = 0.3140405292214645)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1940 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1939 = 0.3804173687365902)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1939 = 0.6196194522363663)
=>WM: (13599: S1 ^operator O1942 +)
=>WM: (13598: S1 ^operator O1941 +)
=>WM: (13597: I3 ^dir U)
=>WM: (13596: O1942 ^name predict-no)
=>WM: (13595: O1941 ^name predict-yes)
=>WM: (13594: R974 ^value 1)
=>WM: (13593: R1 ^reward R974)
=>WM: (13592: I3 ^see 1)
<=WM: (13583: S1 ^operator O1939 +)
<=WM: (13585: S1 ^operator O1939)
<=WM: (13584: S1 ^operator O1940 +)
<=WM: (13582: I3 ^dir L)
<=WM: (13578: R1 ^reward R973)
<=WM: (13577: I3 ^see 0)
<=WM: (13581: O1940 ^name predict-no)
<=WM: (13580: O1939 ^name predict-yes)
<=WM: (13579: R973 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1941 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1942 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1940 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1939 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521348 -0.14093 0.380417 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.823899,0.146007)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478693 0.140927 0.619619 -> 0.478689 0.140927 0.619616(R,m,v=1,1,0)
=>WM: (13600: S1 ^operator O1942)

   971:    O: O1942 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N971 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N970 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13601: I3 ^predict-no N971)
<=WM: (13587: N970 ^status complete)
<=WM: (13586: I3 ^predict-yes N970)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|--- Input Phase --- 
=>WM: (13605: I2 ^dir L)
=>WM: (13604: I2 ^reward 1)
=>WM: (13603: I2 ^see 0)
=>WM: (13602: N971 ^status complete)
<=WM: (13590: I2 ^dir U)
<=WM: (13589: I2 ^reward 1)
<=WM: (13588: I2 ^see 1)
=>WM: (13606: I2 ^level-1 L1-root)
<=WM: (13591: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1941 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1942 = 0.6861654297024582)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R975 ^value 1 +)
 (R1 ^reward R975 +)
Firing propose*predict-yes
 -->
 (O1943 ^name predict-yes +)
 (S1 ^operator O1943 +)
Firing propose*predict-no
 -->
 (O1944 ^name predict-no +)
 (S1 ^operator O1944 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1942 = 0.3140405292214645)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1941 = 0.3804143351598744)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1942 ^name predict-no +)
 (S1 ^operator O1942 +)
Retracting propose*predict-yes
 -->
 (O1941 ^name predict-yes +)
 (S1 ^operator O1941 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R974 ^value 1 +)
 (R1 ^reward R974 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1942 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1941 = 0.)
=>WM: (13614: S1 ^operator O1944 +)
=>WM: (13613: S1 ^operator O1943 +)
=>WM: (13612: I3 ^dir L)
=>WM: (13611: O1944 ^name predict-no)
=>WM: (13610: O1943 ^name predict-yes)
=>WM: (13609: R975 ^value 1)
=>WM: (13608: R1 ^reward R975)
=>WM: (13607: I3 ^see 0)
<=WM: (13598: S1 ^operator O1941 +)
<=WM: (13599: S1 ^operator O1942 +)
<=WM: (13600: S1 ^operator O1942)
<=WM: (13597: I3 ^dir U)
<=WM: (13593: R1 ^reward R974)
<=WM: (13592: I3 ^see 1)
<=WM: (13596: O1942 ^name predict-no)
<=WM: (13595: O1941 ^name predict-yes)
<=WM: (13594: R974 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1943 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1943 = 0.3804143351598744)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1944 = 0.6861654297024582)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1944 = 0.3140405292214645)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1942 = 0.3140405292214645)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1942 = 0.6861654297024582)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1941 = 0.3804143351598744)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1941 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13615: S1 ^operator O1944)

   972:    O: O1944 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N972 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N971 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13616: I3 ^predict-no N972)
<=WM: (13602: N971 ^status complete)
<=WM: (13601: I3 ^predict-no N971)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13620: I2 ^dir R)
=>WM: (13619: I2 ^reward 1)
=>WM: (13618: I2 ^see 0)
=>WM: (13617: N972 ^status complete)
<=WM: (13605: I2 ^dir L)
<=WM: (13604: I2 ^reward 1)
<=WM: (13603: I2 ^see 0)
=>WM: (13621: I2 ^level-1 L0-root)
<=WM: (13606: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O1943 = 0.7054436376897688)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O1944 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R976 ^value 1 +)
 (R1 ^reward R976 +)
Firing propose*predict-yes
 -->
 (O1945 ^name predict-yes +)
 (S1 ^operator O1945 +)
Firing propose*predict-no
 -->
 (O1946 ^name predict-no +)
 (S1 ^operator O1946 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1944 = 0.2298717920574965)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1943 = 0.2939507002996337)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1944 ^name predict-no +)
 (S1 ^operator O1944 +)
Retracting propose*predict-yes
 -->
 (O1943 ^name predict-yes +)
 (S1 ^operator O1943 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R975 ^value 1 +)
 (R1 ^reward R975 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1944 = 0.3140405292214645)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1944 = 0.6861654297024582)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1943 = 0.3804143351598744)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1943 = -0.3470159027404986)
=>WM: (13628: S1 ^operator O1946 +)
=>WM: (13627: S1 ^operator O1945 +)
=>WM: (13626: I3 ^dir R)
=>WM: (13625: O1946 ^name predict-no)
=>WM: (13624: O1945 ^name predict-yes)
=>WM: (13623: R976 ^value 1)
=>WM: (13622: R1 ^reward R976)
<=WM: (13613: S1 ^operator O1943 +)
<=WM: (13614: S1 ^operator O1944 +)
<=WM: (13615: S1 ^operator O1944)
<=WM: (13612: I3 ^dir L)
<=WM: (13608: R1 ^reward R975)
<=WM: (13611: O1944 ^name predict-no)
<=WM: (13610: O1943 ^name predict-yes)
<=WM: (13609: R975 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1945 = 0.2939507002996337)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O1945 = 0.7054436376897688)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1946 = 0.2298717920574965)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O1946 = -0.2023211881870005)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1944 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O1944 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1943 = 0.2939507002996337)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O1943 = 0.7054436376897688)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485046 -0.171006 0.314041 -> 0.485033 -0.171009 0.314023(R,m,v=1,0.86,0.121208)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515116 0.171049 0.686165 -> 0.5151 0.171045 0.686145(R,m,v=1,1,0)
=>WM: (13629: S1 ^operator O1945)

   973:    O: O1945 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N973 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N972 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13630: I3 ^predict-yes N973)
<=WM: (13617: N972 ^status complete)
<=WM: (13616: I3 ^predict-no N972)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13634: I2 ^dir U)
=>WM: (13633: I2 ^reward 1)
=>WM: (13632: I2 ^see 1)
=>WM: (13631: N973 ^status complete)
<=WM: (13620: I2 ^dir R)
<=WM: (13619: I2 ^reward 1)
<=WM: (13618: I2 ^see 0)
=>WM: (13635: I2 ^level-1 R1-root)
<=WM: (13621: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R977 ^value 1 +)
 (R1 ^reward R977 +)
Firing propose*predict-yes
 -->
 (O1947 ^name predict-yes +)
 (S1 ^operator O1947 +)
Firing propose*predict-no
 -->
 (O1948 ^name predict-no +)
 (S1 ^operator O1948 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1946 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1945 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1946 ^name predict-no +)
 (S1 ^operator O1946 +)
Retracting propose*predict-yes
 -->
 (O1945 ^name predict-yes +)
 (S1 ^operator O1945 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R976 ^value 1 +)
 (R1 ^reward R976 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O1946 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1946 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O1945 = 0.7054436376897688)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1945 = 0.2939507002996337)
=>WM: (13643: S1 ^operator O1948 +)
=>WM: (13642: S1 ^operator O1947 +)
=>WM: (13641: I3 ^dir U)
=>WM: (13640: O1948 ^name predict-no)
=>WM: (13639: O1947 ^name predict-yes)
=>WM: (13638: R977 ^value 1)
=>WM: (13637: R1 ^reward R977)
=>WM: (13636: I3 ^see 1)
<=WM: (13627: S1 ^operator O1945 +)
<=WM: (13629: S1 ^operator O1945)
<=WM: (13628: S1 ^operator O1946 +)
<=WM: (13626: I3 ^dir R)
<=WM: (13622: R1 ^reward R976)
<=WM: (13607: I3 ^see 0)
<=WM: (13625: O1946 ^name predict-no)
<=WM: (13624: O1945 ^name predict-yes)
<=WM: (13623: R976 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1947 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1948 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1946 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1945 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501028 -0.207078 0.293951 -> 0.501074 -0.207073 0.294001(R,m,v=1,0.84,0.135302)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498423 0.207021 0.705444 -> 0.498477 0.207026 0.705503(R,m,v=1,1,0)
=>WM: (13644: S1 ^operator O1948)

   974:    O: O1948 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N974 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N973 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13645: I3 ^predict-no N974)
<=WM: (13631: N973 ^status complete)
<=WM: (13630: I3 ^predict-yes N973)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13649: I2 ^dir L)
=>WM: (13648: I2 ^reward 1)
=>WM: (13647: I2 ^see 0)
=>WM: (13646: N974 ^status complete)
<=WM: (13634: I2 ^dir U)
<=WM: (13633: I2 ^reward 1)
<=WM: (13632: I2 ^see 1)
=>WM: (13650: I2 ^level-1 R1-root)
<=WM: (13635: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1947 = 0.6196158942331635)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1948 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R978 ^value 1 +)
 (R1 ^reward R978 +)
Firing propose*predict-yes
 -->
 (O1949 ^name predict-yes +)
 (S1 ^operator O1949 +)
Firing propose*predict-no
 -->
 (O1950 ^name predict-no +)
 (S1 ^operator O1950 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1948 = 0.3140233963466647)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1947 = 0.3804143351598744)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1948 ^name predict-no +)
 (S1 ^operator O1948 +)
Retracting propose*predict-yes
 -->
 (O1947 ^name predict-yes +)
 (S1 ^operator O1947 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R977 ^value 1 +)
 (R1 ^reward R977 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1948 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1947 = 0.)
=>WM: (13658: S1 ^operator O1950 +)
=>WM: (13657: S1 ^operator O1949 +)
=>WM: (13656: I3 ^dir L)
=>WM: (13655: O1950 ^name predict-no)
=>WM: (13654: O1949 ^name predict-yes)
=>WM: (13653: R978 ^value 1)
=>WM: (13652: R1 ^reward R978)
=>WM: (13651: I3 ^see 0)
<=WM: (13642: S1 ^operator O1947 +)
<=WM: (13643: S1 ^operator O1948 +)
<=WM: (13644: S1 ^operator O1948)
<=WM: (13641: I3 ^dir U)
<=WM: (13637: R1 ^reward R977)
<=WM: (13636: I3 ^see 1)
<=WM: (13640: O1948 ^name predict-no)
<=WM: (13639: O1947 ^name predict-yes)
<=WM: (13638: R977 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1949 = 0.6196158942331635)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1949 = 0.3804143351598744)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1950 = -0.1479504104026684)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1950 = 0.3140233963466647)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1948 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1948 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1947 = 0.3804143351598744)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1947 = 0.6196158942331635)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13659: S1 ^operator O1949)

   975:    O: O1949 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N975 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N974 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13660: I3 ^predict-yes N975)
<=WM: (13646: N974 ^status complete)
<=WM: (13645: I3 ^predict-no N974)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13664: I2 ^dir R)
=>WM: (13663: I2 ^reward 1)
=>WM: (13662: I2 ^see 1)
=>WM: (13661: N975 ^status complete)
<=WM: (13649: I2 ^dir L)
<=WM: (13648: I2 ^reward 1)
<=WM: (13647: I2 ^see 0)
=>WM: (13665: I2 ^level-1 L1-root)
<=WM: (13650: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1949 = 0.7064496972060428)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1950 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R979 ^value 1 +)
 (R1 ^reward R979 +)
Firing propose*predict-yes
 -->
 (O1951 ^name predict-yes +)
 (S1 ^operator O1951 +)
Firing propose*predict-no
 -->
 (O1952 ^name predict-no +)
 (S1 ^operator O1952 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1950 = 0.2298717920574965)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1949 = 0.2940010828283485)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1950 ^name predict-no +)
 (S1 ^operator O1950 +)
Retracting propose*predict-yes
 -->
 (O1949 ^name predict-yes +)
 (S1 ^operator O1949 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R978 ^value 1 +)
 (R1 ^reward R978 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1950 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1950 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1949 = 0.3804143351598744)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1949 = 0.6196158942331635)
=>WM: (13673: S1 ^operator O1952 +)
=>WM: (13672: S1 ^operator O1951 +)
=>WM: (13671: I3 ^dir R)
=>WM: (13670: O1952 ^name predict-no)
=>WM: (13669: O1951 ^name predict-yes)
=>WM: (13668: R979 ^value 1)
=>WM: (13667: R1 ^reward R979)
=>WM: (13666: I3 ^see 1)
<=WM: (13657: S1 ^operator O1949 +)
<=WM: (13659: S1 ^operator O1949)
<=WM: (13658: S1 ^operator O1950 +)
<=WM: (13656: I3 ^dir L)
<=WM: (13652: R1 ^reward R978)
<=WM: (13651: I3 ^see 0)
<=WM: (13655: O1950 ^name predict-no)
<=WM: (13654: O1949 ^name predict-yes)
<=WM: (13653: R978 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1951 = 0.2940010828283485)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1951 = 0.7064496972060428)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1952 = 0.2298717920574965)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1952 = -0.1937987592593187)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1950 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1950 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1949 = 0.2940010828283485)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1949 = 0.7064496972060428)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521342 -0.14093 0.380412(R,m,v=1,0.825,0.145283)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478689 0.140927 0.619616 -> 0.478686 0.140927 0.619613(R,m,v=1,1,0)
=>WM: (13674: S1 ^operator O1951)

   976:    O: O1951 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N976 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N975 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13675: I3 ^predict-yes N976)
<=WM: (13661: N975 ^status complete)
<=WM: (13660: I3 ^predict-yes N975)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13679: I2 ^dir R)
=>WM: (13678: I2 ^reward 1)
=>WM: (13677: I2 ^see 1)
=>WM: (13676: N976 ^status complete)
<=WM: (13664: I2 ^dir R)
<=WM: (13663: I2 ^reward 1)
<=WM: (13662: I2 ^see 1)
=>WM: (13680: I2 ^level-1 R1-root)
<=WM: (13665: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1951 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1952 = 0.7701964997777864)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R980 ^value 1 +)
 (R1 ^reward R980 +)
Firing propose*predict-yes
 -->
 (O1953 ^name predict-yes +)
 (S1 ^operator O1953 +)
Firing propose*predict-no
 -->
 (O1954 ^name predict-no +)
 (S1 ^operator O1954 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1952 = 0.2298717920574965)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1951 = 0.2940010828283485)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1952 ^name predict-no +)
 (S1 ^operator O1952 +)
Retracting propose*predict-yes
 -->
 (O1951 ^name predict-yes +)
 (S1 ^operator O1951 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R979 ^value 1 +)
 (R1 ^reward R979 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1952 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1952 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1951 = 0.7064496972060428)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1951 = 0.2940010828283485)
=>WM: (13686: S1 ^operator O1954 +)
=>WM: (13685: S1 ^operator O1953 +)
=>WM: (13684: O1954 ^name predict-no)
=>WM: (13683: O1953 ^name predict-yes)
=>WM: (13682: R980 ^value 1)
=>WM: (13681: R1 ^reward R980)
<=WM: (13672: S1 ^operator O1951 +)
<=WM: (13674: S1 ^operator O1951)
<=WM: (13673: S1 ^operator O1952 +)
<=WM: (13667: R1 ^reward R979)
<=WM: (13670: O1952 ^name predict-no)
<=WM: (13669: O1951 ^name predict-yes)
<=WM: (13668: R979 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1953 = 0.2940010828283485)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1953 = -0.252585164213872)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1954 = 0.2298717920574965)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1954 = 0.7701964997777864)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1952 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1952 = 0.7701964997777864)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1951 = 0.2940010828283485)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1951 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501074 -0.207073 0.294001 -> 0.50104 -0.207077 0.293964(R,m,v=1,0.84106,0.13457)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499331 0.207118 0.70645 -> 0.499292 0.207114 0.706406(R,m,v=1,1,0)
=>WM: (13687: S1 ^operator O1954)

   977:    O: O1954 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N977 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N976 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13688: I3 ^predict-no N977)
<=WM: (13676: N976 ^status complete)
<=WM: (13675: I3 ^predict-yes N976)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (13692: I2 ^dir U)
=>WM: (13691: I2 ^reward 1)
=>WM: (13690: I2 ^see 0)
=>WM: (13689: N977 ^status complete)
<=WM: (13679: I2 ^dir R)
<=WM: (13678: I2 ^reward 1)
<=WM: (13677: I2 ^see 1)
=>WM: (13693: I2 ^level-1 R0-root)
<=WM: (13680: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R981 ^value 1 +)
 (R1 ^reward R981 +)
Firing propose*predict-yes
 -->
 (O1955 ^name predict-yes +)
 (S1 ^operator O1955 +)
Firing propose*predict-no
 -->
 (O1956 ^name predict-no +)
 (S1 ^operator O1956 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1954 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1953 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1954 ^name predict-no +)
 (S1 ^operator O1954 +)
Retracting propose*predict-yes
 -->
 (O1953 ^name predict-yes +)
 (S1 ^operator O1953 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R980 ^value 1 +)
 (R1 ^reward R980 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1954 = 0.7701964997777864)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1954 = 0.2298717920574965)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1953 = -0.252585164213872)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1953 = 0.2939636257009906)
=>WM: (13701: S1 ^operator O1956 +)
=>WM: (13700: S1 ^operator O1955 +)
=>WM: (13699: I3 ^dir U)
=>WM: (13698: O1956 ^name predict-no)
=>WM: (13697: O1955 ^name predict-yes)
=>WM: (13696: R981 ^value 1)
=>WM: (13695: R1 ^reward R981)
=>WM: (13694: I3 ^see 0)
<=WM: (13685: S1 ^operator O1953 +)
<=WM: (13686: S1 ^operator O1954 +)
<=WM: (13687: S1 ^operator O1954)
<=WM: (13671: I3 ^dir R)
<=WM: (13681: R1 ^reward R980)
<=WM: (13666: I3 ^see 1)
<=WM: (13684: O1954 ^name predict-no)
<=WM: (13683: O1953 ^name predict-yes)
<=WM: (13682: R980 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1954 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1953 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611922 -0.38205 0.229872 -> 0.611917 -0.382051 0.229866(R,m,v=1,0.843023,0.133109)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388134 0.382063 0.770196 -> 0.388128 0.382061 0.77019(R,m,v=1,1,0)
=>WM: (13702: S1 ^operator O1956)

   978:    O: O1956 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N978 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N977 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13703: I3 ^predict-no N978)
<=WM: (13689: N977 ^status complete)
<=WM: (13688: I3 ^predict-no N977)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13707: I2 ^dir U)
=>WM: (13706: I2 ^reward 1)
=>WM: (13705: I2 ^see 0)
=>WM: (13704: N978 ^status complete)
<=WM: (13692: I2 ^dir U)
<=WM: (13691: I2 ^reward 1)
<=WM: (13690: I2 ^see 0)
=>WM: (13708: I2 ^level-1 R0-root)
<=WM: (13693: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R982 ^value 1 +)
 (R1 ^reward R982 +)
Firing propose*predict-yes
 -->
 (O1957 ^name predict-yes +)
 (S1 ^operator O1957 +)
Firing propose*predict-no
 -->
 (O1958 ^name predict-no +)
 (S1 ^operator O1958 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1956 ^name predict-no +)
 (S1 ^operator O1956 +)
Retracting propose*predict-yes
 -->
 (O1955 ^name predict-yes +)
 (S1 ^operator O1955 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R981 ^value 1 +)
 (R1 ^reward R981 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.)
=>WM: (13714: S1 ^operator O1958 +)
=>WM: (13713: S1 ^operator O1957 +)
=>WM: (13712: O1958 ^name predict-no)
=>WM: (13711: O1957 ^name predict-yes)
=>WM: (13710: R982 ^value 1)
=>WM: (13709: R1 ^reward R982)
<=WM: (13700: S1 ^operator O1955 +)
<=WM: (13701: S1 ^operator O1956 +)
<=WM: (13702: S1 ^operator O1956)
<=WM: (13695: R1 ^reward R981)
<=WM: (13698: O1956 ^name predict-no)
<=WM: (13697: O1955 ^name predict-yes)
<=WM: (13696: R981 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1957 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1958 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13715: S1 ^operator O1958)

   979:    O: O1958 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N979 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N978 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13716: I3 ^predict-no N979)
<=WM: (13704: N978 ^status complete)
<=WM: (13703: I3 ^predict-no N978)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (13720: I2 ^dir L)
=>WM: (13719: I2 ^reward 1)
=>WM: (13718: I2 ^see 0)
=>WM: (13717: N979 ^status complete)
<=WM: (13707: I2 ^dir U)
<=WM: (13706: I2 ^reward 1)
<=WM: (13705: I2 ^see 0)
=>WM: (13721: I2 ^level-1 R0-root)
<=WM: (13708: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1957 = 0.6195601949549704)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1958 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R983 ^value 1 +)
 (R1 ^reward R983 +)
Firing propose*predict-yes
 -->
 (O1959 ^name predict-yes +)
 (S1 ^operator O1959 +)
Firing propose*predict-no
 -->
 (O1960 ^name predict-no +)
 (S1 ^operator O1960 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1958 = 0.3140233963466647)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1957 = 0.3804118472151704)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1958 ^name predict-no +)
 (S1 ^operator O1958 +)
Retracting propose*predict-yes
 -->
 (O1957 ^name predict-yes +)
 (S1 ^operator O1957 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R982 ^value 1 +)
 (R1 ^reward R982 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1958 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1957 = 0.)
=>WM: (13728: S1 ^operator O1960 +)
=>WM: (13727: S1 ^operator O1959 +)
=>WM: (13726: I3 ^dir L)
=>WM: (13725: O1960 ^name predict-no)
=>WM: (13724: O1959 ^name predict-yes)
=>WM: (13723: R983 ^value 1)
=>WM: (13722: R1 ^reward R983)
<=WM: (13713: S1 ^operator O1957 +)
<=WM: (13714: S1 ^operator O1958 +)
<=WM: (13715: S1 ^operator O1958)
<=WM: (13699: I3 ^dir U)
<=WM: (13709: R1 ^reward R982)
<=WM: (13712: O1958 ^name predict-no)
<=WM: (13711: O1957 ^name predict-yes)
<=WM: (13710: R982 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1959 = 0.6195601949549704)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1959 = 0.3804118472151704)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1960 = -0.2190661556260421)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1960 = 0.3140233963466647)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1958 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1958 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1957 = 0.3804118472151704)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1957 = 0.6195601949549704)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13729: S1 ^operator O1959)

   980:    O: O1959 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N980 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N979 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13730: I3 ^predict-yes N980)
<=WM: (13717: N979 ^status complete)
<=WM: (13716: I3 ^predict-no N979)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13734: I2 ^dir R)
=>WM: (13733: I2 ^reward 1)
=>WM: (13732: I2 ^see 1)
=>WM: (13731: N980 ^status complete)
<=WM: (13720: I2 ^dir L)
<=WM: (13719: I2 ^reward 1)
<=WM: (13718: I2 ^see 0)
=>WM: (13735: I2 ^level-1 L1-root)
<=WM: (13721: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1959 = 0.7064055971121673)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1960 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R984 ^value 1 +)
 (R1 ^reward R984 +)
Firing propose*predict-yes
 -->
 (O1961 ^name predict-yes +)
 (S1 ^operator O1961 +)
Firing propose*predict-no
 -->
 (O1962 ^name predict-no +)
 (S1 ^operator O1962 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.2298662376128736)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.2939636257009906)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1960 ^name predict-no +)
 (S1 ^operator O1960 +)
Retracting propose*predict-yes
 -->
 (O1959 ^name predict-yes +)
 (S1 ^operator O1959 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R983 ^value 1 +)
 (R1 ^reward R983 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1960 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1960 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1959 = 0.3804118472151704)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1959 = 0.6195601949549704)
=>WM: (13743: S1 ^operator O1962 +)
=>WM: (13742: S1 ^operator O1961 +)
=>WM: (13741: I3 ^dir R)
=>WM: (13740: O1962 ^name predict-no)
=>WM: (13739: O1961 ^name predict-yes)
=>WM: (13738: R984 ^value 1)
=>WM: (13737: R1 ^reward R984)
=>WM: (13736: I3 ^see 1)
<=WM: (13727: S1 ^operator O1959 +)
<=WM: (13729: S1 ^operator O1959)
<=WM: (13728: S1 ^operator O1960 +)
<=WM: (13726: I3 ^dir L)
<=WM: (13722: R1 ^reward R983)
<=WM: (13694: I3 ^see 0)
<=WM: (13725: O1960 ^name predict-no)
<=WM: (13724: O1959 ^name predict-yes)
<=WM: (13723: R983 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1961 = 0.2939636257009906)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1961 = 0.7064055971121673)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1962 = 0.2298662376128736)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1962 = -0.1937987592593187)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.2298662376128736)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1960 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.2939636257009906)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1959 = 0.7064055971121673)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521342 -0.14093 0.380412 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.826087,0.144565)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478628 0.140932 0.61956 -> 0.478631 0.140932 0.619563(R,m,v=1,1,0)
=>WM: (13744: S1 ^operator O1961)

   981:    O: O1961 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N981 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N980 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13745: I3 ^predict-yes N981)
<=WM: (13731: N980 ^status complete)
<=WM: (13730: I3 ^predict-yes N980)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
---- Input Phase --- 
=>WM: (13749: I2 ^dir U)
=>WM: (13748: I2 ^reward 1)
=>WM: (13747: I2 ^see 1)
=>WM: (13746: N981 ^status complete)
<=WM: (13734: I2 ^dir R)
<=WM: (13733: I2 ^reward 1)
<=WM: (13732: I2 ^see 1)
=>WM: (13750: I2 ^level-1 R1-root)
<=WM: (13735: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R985 ^value 1 +)
 (R1 ^reward R985 +)
Firing propose*predict-yes
 -->
 (O1963 ^name predict-yes +)
 (S1 ^operator O1963 +)
Firing propose*predict-no
 -->
 (O1964 ^name predict-no +)
 (S1 ^operator O1964 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1962 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1961 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1962 ^name predict-no +)
 (S1 ^operator O1962 +)
Retracting propose*predict-yes
 -->
 (O1961 ^name predict-yes +)
 (S1 ^operator O1961 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R984 ^value 1 +)
 (R1 ^reward R984 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1962 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1962 = 0.2298662376128736)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1961 = 0.7064055971121673)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1961 = 0.2939636257009906)
=>WM: (13757: S1 ^operator O1964 +)
=>WM: (13756: S1 ^operator O1963 +)
=>WM: (13755: I3 ^dir U)
=>WM: (13754: O1964 ^name predict-no)
=>WM: (13753: O1963 ^name predict-yes)
=>WM: (13752: R985 ^value 1)
=>WM: (13751: R1 ^reward R985)
<=WM: (13742: S1 ^operator O1961 +)
<=WM: (13744: S1 ^operator O1961)
<=WM: (13743: S1 ^operator O1962 +)
<=WM: (13741: I3 ^dir R)
<=WM: (13737: R1 ^reward R984)
<=WM: (13740: O1962 ^name predict-no)
<=WM: (13739: O1961 ^name predict-yes)
<=WM: (13738: R984 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1963 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1964 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1962 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1961 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.50104 -0.207077 0.293964 -> 0.501013 -0.20708 0.293933(R,m,v=1,0.842105,0.133845)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499292 0.207114 0.706406 -> 0.499259 0.20711 0.70637(R,m,v=1,1,0)
=>WM: (13758: S1 ^operator O1964)

   982:    O: O1964 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N982 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N981 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13759: I3 ^predict-no N982)
<=WM: (13746: N981 ^status complete)
<=WM: (13745: I3 ^predict-yes N981)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (13763: I2 ^dir R)
=>WM: (13762: I2 ^reward 1)
=>WM: (13761: I2 ^see 0)
=>WM: (13760: N982 ^status complete)
<=WM: (13749: I2 ^dir U)
<=WM: (13748: I2 ^reward 1)
<=WM: (13747: I2 ^see 1)
=>WM: (13764: I2 ^level-1 R1-root)
<=WM: (13750: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1963 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1964 = 0.7701897521634826)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R986 ^value 1 +)
 (R1 ^reward R986 +)
Firing propose*predict-yes
 -->
 (O1965 ^name predict-yes +)
 (S1 ^operator O1965 +)
Firing propose*predict-no
 -->
 (O1966 ^name predict-no +)
 (S1 ^operator O1966 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1964 = 0.2298662376128736)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1963 = 0.2939329791093226)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1964 ^name predict-no +)
 (S1 ^operator O1964 +)
Retracting propose*predict-yes
 -->
 (O1963 ^name predict-yes +)
 (S1 ^operator O1963 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R985 ^value 1 +)
 (R1 ^reward R985 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1964 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1963 = 0.)
=>WM: (13772: S1 ^operator O1966 +)
=>WM: (13771: S1 ^operator O1965 +)
=>WM: (13770: I3 ^dir R)
=>WM: (13769: O1966 ^name predict-no)
=>WM: (13768: O1965 ^name predict-yes)
=>WM: (13767: R986 ^value 1)
=>WM: (13766: R1 ^reward R986)
=>WM: (13765: I3 ^see 0)
<=WM: (13756: S1 ^operator O1963 +)
<=WM: (13757: S1 ^operator O1964 +)
<=WM: (13758: S1 ^operator O1964)
<=WM: (13755: I3 ^dir U)
<=WM: (13751: R1 ^reward R985)
<=WM: (13736: I3 ^see 1)
<=WM: (13754: O1964 ^name predict-no)
<=WM: (13753: O1963 ^name predict-yes)
<=WM: (13752: R985 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1965 = -0.252585164213872)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1965 = 0.2939329791093226)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1966 = 0.7701897521634826)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1966 = 0.2298662376128736)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1964 = 0.2298662376128736)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1964 = 0.7701897521634826)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1963 = 0.2939329791093226)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1963 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13773: S1 ^operator O1966)

   983:    O: O1966 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N983 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N982 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13774: I3 ^predict-no N983)
<=WM: (13760: N982 ^status complete)
<=WM: (13759: I3 ^predict-no N982)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (13778: I2 ^dir L)
=>WM: (13777: I2 ^reward 1)
=>WM: (13776: I2 ^see 0)
=>WM: (13775: N983 ^status complete)
<=WM: (13763: I2 ^dir R)
<=WM: (13762: I2 ^reward 1)
<=WM: (13761: I2 ^see 0)
=>WM: (13779: I2 ^level-1 R0-root)
<=WM: (13764: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1965 = 0.6195629046335391)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1966 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R987 ^value 1 +)
 (R1 ^reward R987 +)
Firing propose*predict-yes
 -->
 (O1967 ^name predict-yes +)
 (S1 ^operator O1967 +)
Firing propose*predict-no
 -->
 (O1968 ^name predict-no +)
 (S1 ^operator O1968 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1966 = 0.3140233963466647)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1965 = 0.3804141458478695)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1966 ^name predict-no +)
 (S1 ^operator O1966 +)
Retracting propose*predict-yes
 -->
 (O1965 ^name predict-yes +)
 (S1 ^operator O1965 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R986 ^value 1 +)
 (R1 ^reward R986 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1966 = 0.2298662376128736)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1966 = 0.7701897521634826)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1965 = 0.2939329791093226)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1965 = -0.252585164213872)
=>WM: (13786: S1 ^operator O1968 +)
=>WM: (13785: S1 ^operator O1967 +)
=>WM: (13784: I3 ^dir L)
=>WM: (13783: O1968 ^name predict-no)
=>WM: (13782: O1967 ^name predict-yes)
=>WM: (13781: R987 ^value 1)
=>WM: (13780: R1 ^reward R987)
<=WM: (13771: S1 ^operator O1965 +)
<=WM: (13772: S1 ^operator O1966 +)
<=WM: (13773: S1 ^operator O1966)
<=WM: (13770: I3 ^dir R)
<=WM: (13766: R1 ^reward R986)
<=WM: (13769: O1966 ^name predict-no)
<=WM: (13768: O1965 ^name predict-yes)
<=WM: (13767: R986 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1967 = 0.3804141458478695)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1967 = 0.6195629046335391)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1968 = 0.3140233963466647)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1968 = -0.2190661556260421)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1966 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1966 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1965 = 0.3804141458478695)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1965 = 0.6195629046335391)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611917 -0.382051 0.229866 -> 0.611913 -0.382052 0.229862(R,m,v=1,0.843931,0.132477)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388128 0.382061 0.77019 -> 0.388124 0.38206 0.770184(R,m,v=1,1,0)
=>WM: (13787: S1 ^operator O1967)

   984:    O: O1967 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N984 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N983 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13788: I3 ^predict-yes N984)
<=WM: (13775: N983 ^status complete)
<=WM: (13774: I3 ^predict-no N983)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13792: I2 ^dir U)
=>WM: (13791: I2 ^reward 1)
=>WM: (13790: I2 ^see 1)
=>WM: (13789: N984 ^status complete)
<=WM: (13778: I2 ^dir L)
<=WM: (13777: I2 ^reward 1)
<=WM: (13776: I2 ^see 0)
=>WM: (13793: I2 ^level-1 L1-root)
<=WM: (13779: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R988 ^value 1 +)
 (R1 ^reward R988 +)
Firing propose*predict-yes
 -->
 (O1969 ^name predict-yes +)
 (S1 ^operator O1969 +)
Firing propose*predict-no
 -->
 (O1970 ^name predict-no +)
 (S1 ^operator O1970 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1968 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1967 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1968 ^name predict-no +)
 (S1 ^operator O1968 +)
Retracting propose*predict-yes
 -->
 (O1967 ^name predict-yes +)
 (S1 ^operator O1967 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R987 ^value 1 +)
 (R1 ^reward R987 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1968 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1968 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1967 = 0.6195629046335391)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1967 = 0.3804141458478695)
=>WM: (13801: S1 ^operator O1970 +)
=>WM: (13800: S1 ^operator O1969 +)
=>WM: (13799: I3 ^dir U)
=>WM: (13798: O1970 ^name predict-no)
=>WM: (13797: O1969 ^name predict-yes)
=>WM: (13796: R988 ^value 1)
=>WM: (13795: R1 ^reward R988)
=>WM: (13794: I3 ^see 1)
<=WM: (13785: S1 ^operator O1967 +)
<=WM: (13787: S1 ^operator O1967)
<=WM: (13786: S1 ^operator O1968 +)
<=WM: (13784: I3 ^dir L)
<=WM: (13780: R1 ^reward R987)
<=WM: (13765: I3 ^see 0)
<=WM: (13783: O1968 ^name predict-no)
<=WM: (13782: O1967 ^name predict-yes)
<=WM: (13781: R987 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1969 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1970 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1968 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1967 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521346 -0.14093 0.380416(R,m,v=1,0.82716,0.143854)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478631 0.140932 0.619563 -> 0.478633 0.140932 0.619565(R,m,v=1,1,0)
=>WM: (13802: S1 ^operator O1970)

   985:    O: O1970 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N985 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N984 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13803: I3 ^predict-no N985)
<=WM: (13789: N984 ^status complete)
<=WM: (13788: I3 ^predict-yes N984)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13807: I2 ^dir R)
=>WM: (13806: I2 ^reward 1)
=>WM: (13805: I2 ^see 0)
=>WM: (13804: N985 ^status complete)
<=WM: (13792: I2 ^dir U)
<=WM: (13791: I2 ^reward 1)
<=WM: (13790: I2 ^see 1)
=>WM: (13808: I2 ^level-1 L1-root)
<=WM: (13793: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1969 = 0.7063695903698597)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1970 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R989 ^value 1 +)
 (R1 ^reward R989 +)
Firing propose*predict-yes
 -->
 (O1971 ^name predict-yes +)
 (S1 ^operator O1971 +)
Firing propose*predict-no
 -->
 (O1972 ^name predict-no +)
 (S1 ^operator O1972 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1970 = 0.2298616880335552)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1969 = 0.2939329791093226)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1970 ^name predict-no +)
 (S1 ^operator O1970 +)
Retracting propose*predict-yes
 -->
 (O1969 ^name predict-yes +)
 (S1 ^operator O1969 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R988 ^value 1 +)
 (R1 ^reward R988 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1970 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1969 = 0.)
=>WM: (13816: S1 ^operator O1972 +)
=>WM: (13815: S1 ^operator O1971 +)
=>WM: (13814: I3 ^dir R)
=>WM: (13813: O1972 ^name predict-no)
=>WM: (13812: O1971 ^name predict-yes)
=>WM: (13811: R989 ^value 1)
=>WM: (13810: R1 ^reward R989)
=>WM: (13809: I3 ^see 0)
<=WM: (13800: S1 ^operator O1969 +)
<=WM: (13801: S1 ^operator O1970 +)
<=WM: (13802: S1 ^operator O1970)
<=WM: (13799: I3 ^dir U)
<=WM: (13795: R1 ^reward R988)
<=WM: (13794: I3 ^see 1)
<=WM: (13798: O1970 ^name predict-no)
<=WM: (13797: O1969 ^name predict-yes)
<=WM: (13796: R988 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1971 = 0.7063695903698597)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1971 = 0.2939329791093226)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1972 = -0.1937987592593187)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1972 = 0.2298616880335552)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1970 = 0.2298616880335552)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1970 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1969 = 0.2939329791093226)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1969 = 0.7063695903698597)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13817: S1 ^operator O1971)

   986:    O: O1971 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N986 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N985 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13818: I3 ^predict-yes N986)
<=WM: (13804: N985 ^status complete)
<=WM: (13803: I3 ^predict-no N985)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13822: I2 ^dir R)
=>WM: (13821: I2 ^reward 1)
=>WM: (13820: I2 ^see 1)
=>WM: (13819: N986 ^status complete)
<=WM: (13807: I2 ^dir R)
<=WM: (13806: I2 ^reward 1)
<=WM: (13805: I2 ^see 0)
=>WM: (13823: I2 ^level-1 R1-root)
<=WM: (13808: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1971 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1972 = 0.7701842386860367)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R990 ^value 1 +)
 (R1 ^reward R990 +)
Firing propose*predict-yes
 -->
 (O1973 ^name predict-yes +)
 (S1 ^operator O1973 +)
Firing propose*predict-no
 -->
 (O1974 ^name predict-no +)
 (S1 ^operator O1974 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1972 = 0.2298616880335552)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1971 = 0.2939329791093226)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1972 ^name predict-no +)
 (S1 ^operator O1972 +)
Retracting propose*predict-yes
 -->
 (O1971 ^name predict-yes +)
 (S1 ^operator O1971 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R989 ^value 1 +)
 (R1 ^reward R989 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1972 = 0.2298616880335552)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1972 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1971 = 0.2939329791093226)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1971 = 0.7063695903698597)
=>WM: (13830: S1 ^operator O1974 +)
=>WM: (13829: S1 ^operator O1973 +)
=>WM: (13828: O1974 ^name predict-no)
=>WM: (13827: O1973 ^name predict-yes)
=>WM: (13826: R990 ^value 1)
=>WM: (13825: R1 ^reward R990)
=>WM: (13824: I3 ^see 1)
<=WM: (13815: S1 ^operator O1971 +)
<=WM: (13817: S1 ^operator O1971)
<=WM: (13816: S1 ^operator O1972 +)
<=WM: (13810: R1 ^reward R989)
<=WM: (13809: I3 ^see 0)
<=WM: (13813: O1972 ^name predict-no)
<=WM: (13812: O1971 ^name predict-yes)
<=WM: (13811: R989 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1973 = 0.2939329791093226)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1973 = -0.252585164213872)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1974 = 0.2298616880335552)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1974 = 0.7701842386860367)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1972 = 0.2298616880335552)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1972 = 0.7701842386860367)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1971 = 0.2939329791093226)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1971 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501013 -0.20708 0.293933 -> 0.50099 -0.207082 0.293908(R,m,v=1,0.843137,0.133127)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499259 0.20711 0.70637 -> 0.499233 0.207107 0.70634(R,m,v=1,1,0)
=>WM: (13831: S1 ^operator O1974)

   987:    O: O1974 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N987 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N986 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13832: I3 ^predict-no N987)
<=WM: (13819: N986 ^status complete)
<=WM: (13818: I3 ^predict-yes N986)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13836: I2 ^dir L)
=>WM: (13835: I2 ^reward 1)
=>WM: (13834: I2 ^see 0)
=>WM: (13833: N987 ^status complete)
<=WM: (13822: I2 ^dir R)
<=WM: (13821: I2 ^reward 1)
<=WM: (13820: I2 ^see 1)
=>WM: (13837: I2 ^level-1 R0-root)
<=WM: (13823: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1973 = 0.6195651222408995)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1974 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R991 ^value 1 +)
 (R1 ^reward R991 +)
Firing propose*predict-yes
 -->
 (O1975 ^name predict-yes +)
 (S1 ^operator O1975 +)
Firing propose*predict-no
 -->
 (O1976 ^name predict-no +)
 (S1 ^operator O1976 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1974 = 0.3140233963466647)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1973 = 0.3804160307887663)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1974 ^name predict-no +)
 (S1 ^operator O1974 +)
Retracting propose*predict-yes
 -->
 (O1973 ^name predict-yes +)
 (S1 ^operator O1973 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R990 ^value 1 +)
 (R1 ^reward R990 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1974 = 0.7701842386860367)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1974 = 0.2298616880335552)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1973 = -0.252585164213872)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1973 = 0.2939078922513593)
=>WM: (13845: S1 ^operator O1976 +)
=>WM: (13844: S1 ^operator O1975 +)
=>WM: (13843: I3 ^dir L)
=>WM: (13842: O1976 ^name predict-no)
=>WM: (13841: O1975 ^name predict-yes)
=>WM: (13840: R991 ^value 1)
=>WM: (13839: R1 ^reward R991)
=>WM: (13838: I3 ^see 0)
<=WM: (13829: S1 ^operator O1973 +)
<=WM: (13830: S1 ^operator O1974 +)
<=WM: (13831: S1 ^operator O1974)
<=WM: (13814: I3 ^dir R)
<=WM: (13825: R1 ^reward R990)
<=WM: (13824: I3 ^see 1)
<=WM: (13828: O1974 ^name predict-no)
<=WM: (13827: O1973 ^name predict-yes)
<=WM: (13826: R990 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1975 = 0.3804160307887663)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1975 = 0.6195651222408995)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1976 = 0.3140233963466647)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1976 = -0.2190661556260421)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1974 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1974 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1973 = 0.3804160307887663)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1973 = 0.6195651222408995)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611913 -0.382052 0.229862 -> 0.61191 -0.382052 0.229858(R,m,v=1,0.844828,0.131852)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388124 0.38206 0.770184 -> 0.38812 0.38206 0.77018(R,m,v=1,1,0)
=>WM: (13846: S1 ^operator O1975)

   988:    O: O1975 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N988 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N987 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13847: I3 ^predict-yes N988)
<=WM: (13833: N987 ^status complete)
<=WM: (13832: I3 ^predict-no N987)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13851: I2 ^dir U)
=>WM: (13850: I2 ^reward 1)
=>WM: (13849: I2 ^see 1)
=>WM: (13848: N988 ^status complete)
<=WM: (13836: I2 ^dir L)
<=WM: (13835: I2 ^reward 1)
<=WM: (13834: I2 ^see 0)
=>WM: (13852: I2 ^level-1 L1-root)
<=WM: (13837: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R992 ^value 1 +)
 (R1 ^reward R992 +)
Firing propose*predict-yes
 -->
 (O1977 ^name predict-yes +)
 (S1 ^operator O1977 +)
Firing propose*predict-no
 -->
 (O1978 ^name predict-no +)
 (S1 ^operator O1978 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1976 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1975 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1976 ^name predict-no +)
 (S1 ^operator O1976 +)
Retracting propose*predict-yes
 -->
 (O1975 ^name predict-yes +)
 (S1 ^operator O1975 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R991 ^value 1 +)
 (R1 ^reward R991 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1976 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1976 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1975 = 0.6195651222408995)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1975 = 0.3804160307887663)
=>WM: (13860: S1 ^operator O1978 +)
=>WM: (13859: S1 ^operator O1977 +)
=>WM: (13858: I3 ^dir U)
=>WM: (13857: O1978 ^name predict-no)
=>WM: (13856: O1977 ^name predict-yes)
=>WM: (13855: R992 ^value 1)
=>WM: (13854: R1 ^reward R992)
=>WM: (13853: I3 ^see 1)
<=WM: (13844: S1 ^operator O1975 +)
<=WM: (13846: S1 ^operator O1975)
<=WM: (13845: S1 ^operator O1976 +)
<=WM: (13843: I3 ^dir L)
<=WM: (13839: R1 ^reward R991)
<=WM: (13838: I3 ^see 0)
<=WM: (13842: O1976 ^name predict-no)
<=WM: (13841: O1975 ^name predict-yes)
<=WM: (13840: R991 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1977 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1978 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1976 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1975 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521346 -0.14093 0.380416 -> 0.521348 -0.14093 0.380418(R,m,v=1,0.828221,0.143149)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478633 0.140932 0.619565 -> 0.478635 0.140932 0.619567(R,m,v=1,1,0)
=>WM: (13861: S1 ^operator O1978)

   989:    O: O1978 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N989 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N988 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13862: I3 ^predict-no N989)
<=WM: (13848: N988 ^status complete)
<=WM: (13847: I3 ^predict-yes N988)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13866: I2 ^dir R)
=>WM: (13865: I2 ^reward 1)
=>WM: (13864: I2 ^see 0)
=>WM: (13863: N989 ^status complete)
<=WM: (13851: I2 ^dir U)
<=WM: (13850: I2 ^reward 1)
<=WM: (13849: I2 ^see 1)
=>WM: (13867: I2 ^level-1 L1-root)
<=WM: (13852: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1977 = 0.7063401754803731)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1978 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R993 ^value 1 +)
 (R1 ^reward R993 +)
Firing propose*predict-yes
 -->
 (O1979 ^name predict-yes +)
 (S1 ^operator O1979 +)
Firing propose*predict-no
 -->
 (O1980 ^name predict-no +)
 (S1 ^operator O1980 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1978 = 0.2298579596436188)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1977 = 0.2939078922513593)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1978 ^name predict-no +)
 (S1 ^operator O1978 +)
Retracting propose*predict-yes
 -->
 (O1977 ^name predict-yes +)
 (S1 ^operator O1977 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R992 ^value 1 +)
 (R1 ^reward R992 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1978 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1977 = 0.)
=>WM: (13875: S1 ^operator O1980 +)
=>WM: (13874: S1 ^operator O1979 +)
=>WM: (13873: I3 ^dir R)
=>WM: (13872: O1980 ^name predict-no)
=>WM: (13871: O1979 ^name predict-yes)
=>WM: (13870: R993 ^value 1)
=>WM: (13869: R1 ^reward R993)
=>WM: (13868: I3 ^see 0)
<=WM: (13859: S1 ^operator O1977 +)
<=WM: (13860: S1 ^operator O1978 +)
<=WM: (13861: S1 ^operator O1978)
<=WM: (13858: I3 ^dir U)
<=WM: (13854: R1 ^reward R992)
<=WM: (13853: I3 ^see 1)
<=WM: (13857: O1978 ^name predict-no)
<=WM: (13856: O1977 ^name predict-yes)
<=WM: (13855: R992 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1979 = 0.7063401754803731)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1979 = 0.2939078922513593)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1980 = -0.1937987592593187)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1980 = 0.2298579596436188)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1978 = 0.2298579596436188)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1978 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1977 = 0.2939078922513593)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1977 = 0.7063401754803731)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13876: S1 ^operator O1979)

   990:    O: O1979 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N990 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N989 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13877: I3 ^predict-yes N990)
<=WM: (13863: N989 ^status complete)
<=WM: (13862: I3 ^predict-no N989)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13881: I2 ^dir U)
=>WM: (13880: I2 ^reward 1)
=>WM: (13879: I2 ^see 1)
=>WM: (13878: N990 ^status complete)
<=WM: (13866: I2 ^dir R)
<=WM: (13865: I2 ^reward 1)
<=WM: (13864: I2 ^see 0)
=>WM: (13882: I2 ^level-1 R1-root)
<=WM: (13867: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R994 ^value 1 +)
 (R1 ^reward R994 +)
Firing propose*predict-yes
 -->
 (O1981 ^name predict-yes +)
 (S1 ^operator O1981 +)
Firing propose*predict-no
 -->
 (O1982 ^name predict-no +)
 (S1 ^operator O1982 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1980 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1979 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1980 ^name predict-no +)
 (S1 ^operator O1980 +)
Retracting propose*predict-yes
 -->
 (O1979 ^name predict-yes +)
 (S1 ^operator O1979 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R993 ^value 1 +)
 (R1 ^reward R993 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1980 = 0.2298579596436188)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1980 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1979 = 0.2939078922513593)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1979 = 0.7063401754803731)
=>WM: (13890: S1 ^operator O1982 +)
=>WM: (13889: S1 ^operator O1981 +)
=>WM: (13888: I3 ^dir U)
=>WM: (13887: O1982 ^name predict-no)
=>WM: (13886: O1981 ^name predict-yes)
=>WM: (13885: R994 ^value 1)
=>WM: (13884: R1 ^reward R994)
=>WM: (13883: I3 ^see 1)
<=WM: (13874: S1 ^operator O1979 +)
<=WM: (13876: S1 ^operator O1979)
<=WM: (13875: S1 ^operator O1980 +)
<=WM: (13873: I3 ^dir R)
<=WM: (13869: R1 ^reward R993)
<=WM: (13868: I3 ^see 0)
<=WM: (13872: O1980 ^name predict-no)
<=WM: (13871: O1979 ^name predict-yes)
<=WM: (13870: R993 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1981 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1982 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1980 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1979 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.50099 -0.207082 0.293908 -> 0.500972 -0.207084 0.293887(R,m,v=1,0.844156,0.132417)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499233 0.207107 0.70634 -> 0.499211 0.207105 0.706316(R,m,v=1,1,0)
=>WM: (13891: S1 ^operator O1982)

   991:    O: O1982 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N991 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N990 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13892: I3 ^predict-no N991)
<=WM: (13878: N990 ^status complete)
<=WM: (13877: I3 ^predict-yes N990)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|--- Input Phase --- 
=>WM: (13896: I2 ^dir U)
=>WM: (13895: I2 ^reward 1)
=>WM: (13894: I2 ^see 0)
=>WM: (13893: N991 ^status complete)
<=WM: (13881: I2 ^dir U)
<=WM: (13880: I2 ^reward 1)
<=WM: (13879: I2 ^see 1)
=>WM: (13897: I2 ^level-1 R1-root)
<=WM: (13882: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R995 ^value 1 +)
 (R1 ^reward R995 +)
Firing propose*predict-yes
 -->
 (O1983 ^name predict-yes +)
 (S1 ^operator O1983 +)
Firing propose*predict-no
 -->
 (O1984 ^name predict-no +)
 (S1 ^operator O1984 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1982 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1981 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1982 ^name predict-no +)
 (S1 ^operator O1982 +)
Retracting propose*predict-yes
 -->
 (O1981 ^name predict-yes +)
 (S1 ^operator O1981 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R994 ^value 1 +)
 (R1 ^reward R994 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1982 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1981 = 0.)
=>WM: (13904: S1 ^operator O1984 +)
=>WM: (13903: S1 ^operator O1983 +)
=>WM: (13902: O1984 ^name predict-no)
=>WM: (13901: O1983 ^name predict-yes)
=>WM: (13900: R995 ^value 1)
=>WM: (13899: R1 ^reward R995)
=>WM: (13898: I3 ^see 0)
<=WM: (13889: S1 ^operator O1981 +)
<=WM: (13890: S1 ^operator O1982 +)
<=WM: (13891: S1 ^operator O1982)
<=WM: (13884: R1 ^reward R994)
<=WM: (13883: I3 ^see 1)
<=WM: (13887: O1982 ^name predict-no)
<=WM: (13886: O1981 ^name predict-yes)
<=WM: (13885: R994 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1983 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1984 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1982 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1981 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13905: S1 ^operator O1984)

   992:    O: O1984 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N992 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N991 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13906: I3 ^predict-no N992)
<=WM: (13893: N991 ^status complete)
<=WM: (13892: I3 ^predict-no N991)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13910: I2 ^dir L)
=>WM: (13909: I2 ^reward 1)
=>WM: (13908: I2 ^see 0)
=>WM: (13907: N992 ^status complete)
<=WM: (13896: I2 ^dir U)
<=WM: (13895: I2 ^reward 1)
<=WM: (13894: I2 ^see 0)
=>WM: (13911: I2 ^level-1 R1-root)
<=WM: (13897: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1983 = 0.6196129817664832)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1984 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R996 ^value 1 +)
 (R1 ^reward R996 +)
Firing propose*predict-yes
 -->
 (O1985 ^name predict-yes +)
 (S1 ^operator O1985 +)
Firing propose*predict-no
 -->
 (O1986 ^name predict-no +)
 (S1 ^operator O1986 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1984 = 0.3140233963466647)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1983 = 0.380417577206794)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1984 ^name predict-no +)
 (S1 ^operator O1984 +)
Retracting propose*predict-yes
 -->
 (O1983 ^name predict-yes +)
 (S1 ^operator O1983 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R995 ^value 1 +)
 (R1 ^reward R995 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1984 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1983 = 0.)
=>WM: (13918: S1 ^operator O1986 +)
=>WM: (13917: S1 ^operator O1985 +)
=>WM: (13916: I3 ^dir L)
=>WM: (13915: O1986 ^name predict-no)
=>WM: (13914: O1985 ^name predict-yes)
=>WM: (13913: R996 ^value 1)
=>WM: (13912: R1 ^reward R996)
<=WM: (13903: S1 ^operator O1983 +)
<=WM: (13904: S1 ^operator O1984 +)
<=WM: (13905: S1 ^operator O1984)
<=WM: (13888: I3 ^dir U)
<=WM: (13899: R1 ^reward R995)
<=WM: (13902: O1984 ^name predict-no)
<=WM: (13901: O1983 ^name predict-yes)
<=WM: (13900: R995 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1985 = 0.6196129817664832)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1985 = 0.380417577206794)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1986 = -0.1479504104026684)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1986 = 0.3140233963466647)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1984 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1984 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1983 = 0.380417577206794)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1983 = 0.6196129817664832)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13919: S1 ^operator O1985)

   993:    O: O1985 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N993 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N992 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13920: I3 ^predict-yes N993)
<=WM: (13907: N992 ^status complete)
<=WM: (13906: I3 ^predict-no N992)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13924: I2 ^dir R)
=>WM: (13923: I2 ^reward 1)
=>WM: (13922: I2 ^see 1)
=>WM: (13921: N993 ^status complete)
<=WM: (13910: I2 ^dir L)
<=WM: (13909: I2 ^reward 1)
<=WM: (13908: I2 ^see 0)
=>WM: (13925: I2 ^level-1 L1-root)
<=WM: (13911: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1985 = 0.7063161327052487)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1986 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R997 ^value 1 +)
 (R1 ^reward R997 +)
Firing propose*predict-yes
 -->
 (O1987 ^name predict-yes +)
 (S1 ^operator O1987 +)
Firing propose*predict-no
 -->
 (O1988 ^name predict-no +)
 (S1 ^operator O1988 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1986 = 0.2298579596436188)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1985 = 0.29388734647702)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1986 ^name predict-no +)
 (S1 ^operator O1986 +)
Retracting propose*predict-yes
 -->
 (O1985 ^name predict-yes +)
 (S1 ^operator O1985 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R996 ^value 1 +)
 (R1 ^reward R996 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1986 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O1986 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1985 = 0.380417577206794)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O1985 = 0.6196129817664832)
=>WM: (13933: S1 ^operator O1988 +)
=>WM: (13932: S1 ^operator O1987 +)
=>WM: (13931: I3 ^dir R)
=>WM: (13930: O1988 ^name predict-no)
=>WM: (13929: O1987 ^name predict-yes)
=>WM: (13928: R997 ^value 1)
=>WM: (13927: R1 ^reward R997)
=>WM: (13926: I3 ^see 1)
<=WM: (13917: S1 ^operator O1985 +)
<=WM: (13919: S1 ^operator O1985)
<=WM: (13918: S1 ^operator O1986 +)
<=WM: (13916: I3 ^dir L)
<=WM: (13912: R1 ^reward R996)
<=WM: (13898: I3 ^see 0)
<=WM: (13915: O1986 ^name predict-no)
<=WM: (13914: O1985 ^name predict-yes)
<=WM: (13913: R996 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1987 = 0.29388734647702)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1987 = 0.7063161327052487)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1988 = 0.2298579596436188)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1988 = -0.1937987592593187)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1986 = 0.2298579596436188)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1986 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1985 = 0.29388734647702)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1985 = 0.7063161327052487)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521348 -0.14093 0.380418 -> 0.521345 -0.14093 0.380415(R,m,v=1,0.829268,0.142451)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478686 0.140927 0.619613 -> 0.478682 0.140928 0.61961(R,m,v=1,1,0)
=>WM: (13934: S1 ^operator O1987)

   994:    O: O1987 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N994 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N993 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13935: I3 ^predict-yes N994)
<=WM: (13921: N993 ^status complete)
<=WM: (13920: I3 ^predict-yes N993)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13939: I2 ^dir R)
=>WM: (13938: I2 ^reward 1)
=>WM: (13937: I2 ^see 1)
=>WM: (13936: N994 ^status complete)
<=WM: (13924: I2 ^dir R)
<=WM: (13923: I2 ^reward 1)
<=WM: (13922: I2 ^see 1)
=>WM: (13940: I2 ^level-1 R1-root)
<=WM: (13925: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1987 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1988 = 0.7701797310679288)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R998 ^value 1 +)
 (R1 ^reward R998 +)
Firing propose*predict-yes
 -->
 (O1989 ^name predict-yes +)
 (S1 ^operator O1989 +)
Firing propose*predict-no
 -->
 (O1990 ^name predict-no +)
 (S1 ^operator O1990 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1988 = 0.2298579596436188)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1987 = 0.29388734647702)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1988 ^name predict-no +)
 (S1 ^operator O1988 +)
Retracting propose*predict-yes
 -->
 (O1987 ^name predict-yes +)
 (S1 ^operator O1987 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R997 ^value 1 +)
 (R1 ^reward R997 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O1988 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1988 = 0.2298579596436188)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O1987 = 0.7063161327052487)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1987 = 0.29388734647702)
=>WM: (13946: S1 ^operator O1990 +)
=>WM: (13945: S1 ^operator O1989 +)
=>WM: (13944: O1990 ^name predict-no)
=>WM: (13943: O1989 ^name predict-yes)
=>WM: (13942: R998 ^value 1)
=>WM: (13941: R1 ^reward R998)
<=WM: (13932: S1 ^operator O1987 +)
<=WM: (13934: S1 ^operator O1987)
<=WM: (13933: S1 ^operator O1988 +)
<=WM: (13927: R1 ^reward R997)
<=WM: (13930: O1988 ^name predict-no)
<=WM: (13929: O1987 ^name predict-yes)
<=WM: (13928: R997 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1989 = 0.29388734647702)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1989 = -0.252585164213872)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1990 = 0.2298579596436188)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1990 = 0.7701797310679288)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1988 = 0.2298579596436188)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1988 = 0.7701797310679288)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1987 = 0.29388734647702)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1987 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.500972 -0.207084 0.293887 -> 0.500957 -0.207086 0.293871(R,m,v=1,0.845161,0.131713)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499211 0.207105 0.706316 -> 0.499194 0.207103 0.706296(R,m,v=1,1,0)
=>WM: (13947: S1 ^operator O1990)

   995:    O: O1990 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N995 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N994 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13948: I3 ^predict-no N995)
<=WM: (13936: N994 ^status complete)
<=WM: (13935: I3 ^predict-yes N994)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13952: I2 ^dir U)
=>WM: (13951: I2 ^reward 1)
=>WM: (13950: I2 ^see 0)
=>WM: (13949: N995 ^status complete)
<=WM: (13939: I2 ^dir R)
<=WM: (13938: I2 ^reward 1)
<=WM: (13937: I2 ^see 1)
=>WM: (13953: I2 ^level-1 R0-root)
<=WM: (13940: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R999 ^value 1 +)
 (R1 ^reward R999 +)
Firing propose*predict-yes
 -->
 (O1991 ^name predict-yes +)
 (S1 ^operator O1991 +)
Firing propose*predict-no
 -->
 (O1992 ^name predict-no +)
 (S1 ^operator O1992 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1990 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1989 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1990 ^name predict-no +)
 (S1 ^operator O1990 +)
Retracting propose*predict-yes
 -->
 (O1989 ^name predict-yes +)
 (S1 ^operator O1989 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R998 ^value 1 +)
 (R1 ^reward R998 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O1990 = 0.7701797310679288)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1990 = 0.2298579596436188)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O1989 = -0.252585164213872)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1989 = 0.2938705117203769)
=>WM: (13961: S1 ^operator O1992 +)
=>WM: (13960: S1 ^operator O1991 +)
=>WM: (13959: I3 ^dir U)
=>WM: (13958: O1992 ^name predict-no)
=>WM: (13957: O1991 ^name predict-yes)
=>WM: (13956: R999 ^value 1)
=>WM: (13955: R1 ^reward R999)
=>WM: (13954: I3 ^see 0)
<=WM: (13945: S1 ^operator O1989 +)
<=WM: (13946: S1 ^operator O1990 +)
<=WM: (13947: S1 ^operator O1990)
<=WM: (13931: I3 ^dir R)
<=WM: (13941: R1 ^reward R998)
<=WM: (13926: I3 ^see 1)
<=WM: (13944: O1990 ^name predict-no)
<=WM: (13943: O1989 ^name predict-yes)
<=WM: (13942: R998 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1990 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1989 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.61191 -0.382052 0.229858 -> 0.611908 -0.382053 0.229855(R,m,v=1,0.845714,0.131232)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.38812 0.38206 0.77018 -> 0.388117 0.382059 0.770176(R,m,v=1,1,0)
=>WM: (13962: S1 ^operator O1992)

   996:    O: O1992 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N996 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N995 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13963: I3 ^predict-no N996)
<=WM: (13949: N995 ^status complete)
<=WM: (13948: I3 ^predict-no N995)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13967: I2 ^dir U)
=>WM: (13966: I2 ^reward 1)
=>WM: (13965: I2 ^see 0)
=>WM: (13964: N996 ^status complete)
<=WM: (13952: I2 ^dir U)
<=WM: (13951: I2 ^reward 1)
<=WM: (13950: I2 ^see 0)
=>WM: (13968: I2 ^level-1 R0-root)
<=WM: (13953: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1000 ^value 1 +)
 (R1 ^reward R1000 +)
Firing propose*predict-yes
 -->
 (O1993 ^name predict-yes +)
 (S1 ^operator O1993 +)
Firing propose*predict-no
 -->
 (O1994 ^name predict-no +)
 (S1 ^operator O1994 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1992 ^name predict-no +)
 (S1 ^operator O1992 +)
Retracting propose*predict-yes
 -->
 (O1991 ^name predict-yes +)
 (S1 ^operator O1991 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R999 ^value 1 +)
 (R1 ^reward R999 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.)
=>WM: (13974: S1 ^operator O1994 +)
=>WM: (13973: S1 ^operator O1993 +)
=>WM: (13972: O1994 ^name predict-no)
=>WM: (13971: O1993 ^name predict-yes)
=>WM: (13970: R1000 ^value 1)
=>WM: (13969: R1 ^reward R1000)
<=WM: (13960: S1 ^operator O1991 +)
<=WM: (13961: S1 ^operator O1992 +)
<=WM: (13962: S1 ^operator O1992)
<=WM: (13955: R1 ^reward R999)
<=WM: (13958: O1992 ^name predict-no)
<=WM: (13957: O1991 ^name predict-yes)
<=WM: (13956: R999 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1993 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1994 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13975: S1 ^operator O1994)

   997:    O: O1994 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N997 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N996 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13976: I3 ^predict-no N997)
<=WM: (13964: N996 ^status complete)
<=WM: (13963: I3 ^predict-no N996)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (13980: I2 ^dir L)
=>WM: (13979: I2 ^reward 1)
=>WM: (13978: I2 ^see 0)
=>WM: (13977: N997 ^status complete)
<=WM: (13967: I2 ^dir U)
<=WM: (13966: I2 ^reward 1)
<=WM: (13965: I2 ^see 0)
=>WM: (13981: I2 ^level-1 R0-root)
<=WM: (13968: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1993 = 0.6195669380621123)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1994 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1001 ^value 1 +)
 (R1 ^reward R1001 +)
Firing propose*predict-yes
 -->
 (O1995 ^name predict-yes +)
 (S1 ^operator O1995 +)
Firing propose*predict-no
 -->
 (O1996 ^name predict-no +)
 (S1 ^operator O1996 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1994 = 0.3140233963466647)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1993 = 0.380415072318069)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1994 ^name predict-no +)
 (S1 ^operator O1994 +)
Retracting propose*predict-yes
 -->
 (O1993 ^name predict-yes +)
 (S1 ^operator O1993 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1000 ^value 1 +)
 (R1 ^reward R1000 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1994 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1993 = 0.)
=>WM: (13988: S1 ^operator O1996 +)
=>WM: (13987: S1 ^operator O1995 +)
=>WM: (13986: I3 ^dir L)
=>WM: (13985: O1996 ^name predict-no)
=>WM: (13984: O1995 ^name predict-yes)
=>WM: (13983: R1001 ^value 1)
=>WM: (13982: R1 ^reward R1001)
<=WM: (13973: S1 ^operator O1993 +)
<=WM: (13974: S1 ^operator O1994 +)
<=WM: (13975: S1 ^operator O1994)
<=WM: (13959: I3 ^dir U)
<=WM: (13969: R1 ^reward R1000)
<=WM: (13972: O1994 ^name predict-no)
<=WM: (13971: O1993 ^name predict-yes)
<=WM: (13970: R1000 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1995 = 0.6195669380621123)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.380415072318069)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1996 = -0.2190661556260421)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 0.3140233963466647)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1994 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1994 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1993 = 0.380415072318069)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1993 = 0.6195669380621123)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13989: S1 ^operator O1995)

   998:    O: O1995 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N998 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N997 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13990: I3 ^predict-yes N998)
<=WM: (13977: N997 ^status complete)
<=WM: (13976: I3 ^predict-no N997)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13994: I2 ^dir L)
=>WM: (13993: I2 ^reward 1)
=>WM: (13992: I2 ^see 1)
=>WM: (13991: N998 ^status complete)
<=WM: (13980: I2 ^dir L)
<=WM: (13979: I2 ^reward 1)
<=WM: (13978: I2 ^see 0)
=>WM: (13995: I2 ^level-1 L1-root)
<=WM: (13981: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1995 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1996 = 0.686145215235081)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1002 ^value 1 +)
 (R1 ^reward R1002 +)
Firing propose*predict-yes
 -->
 (O1997 ^name predict-yes +)
 (S1 ^operator O1997 +)
Firing propose*predict-no
 -->
 (O1998 ^name predict-no +)
 (S1 ^operator O1998 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 0.3140233963466647)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.380415072318069)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1996 ^name predict-no +)
 (S1 ^operator O1996 +)
Retracting propose*predict-yes
 -->
 (O1995 ^name predict-yes +)
 (S1 ^operator O1995 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1001 ^value 1 +)
 (R1 ^reward R1001 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O1996 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.380415072318069)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O1995 = 0.6195669380621123)
=>WM: (14002: S1 ^operator O1998 +)
=>WM: (14001: S1 ^operator O1997 +)
=>WM: (14000: O1998 ^name predict-no)
=>WM: (13999: O1997 ^name predict-yes)
=>WM: (13998: R1002 ^value 1)
=>WM: (13997: R1 ^reward R1002)
=>WM: (13996: I3 ^see 1)
<=WM: (13987: S1 ^operator O1995 +)
<=WM: (13989: S1 ^operator O1995)
<=WM: (13988: S1 ^operator O1996 +)
<=WM: (13982: R1 ^reward R1001)
<=WM: (13954: I3 ^see 0)
<=WM: (13985: O1996 ^name predict-no)
<=WM: (13984: O1995 ^name predict-yes)
<=WM: (13983: R1001 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1997 = 0.380415072318069)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1997 = -0.3470159027404986)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1998 = 0.3140233963466647)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1998 = 0.686145215235081)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1996 = 0.686145215235081)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.380415072318069)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1995 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521345 -0.14093 0.380415 -> 0.521347 -0.14093 0.380417(R,m,v=1,0.830303,0.141759)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478635 0.140932 0.619567 -> 0.478637 0.140932 0.619569(R,m,v=1,1,0)
=>WM: (14003: S1 ^operator O1998)

   999:    O: O1998 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N999 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N998 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14004: I3 ^predict-no N999)
<=WM: (13991: N998 ^status complete)
<=WM: (13990: I3 ^predict-yes N998)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14008: I2 ^dir U)
=>WM: (14007: I2 ^reward 1)
=>WM: (14006: I2 ^see 0)
=>WM: (14005: N999 ^status complete)
<=WM: (13994: I2 ^dir L)
<=WM: (13993: I2 ^reward 1)
<=WM: (13992: I2 ^see 1)
=>WM: (14009: I2 ^level-1 L0-root)
<=WM: (13995: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1003 ^value 1 +)
 (R1 ^reward R1003 +)
Firing propose*predict-yes
 -->
 (O1999 ^name predict-yes +)
 (S1 ^operator O1999 +)
Firing propose*predict-no
 -->
 (O2000 ^name predict-no +)
 (S1 ^operator O2000 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1998 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1997 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1998 ^name predict-no +)
 (S1 ^operator O1998 +)
Retracting propose*predict-yes
 -->
 (O1997 ^name predict-yes +)
 (S1 ^operator O1997 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1002 ^value 1 +)
 (R1 ^reward R1002 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O1998 = 0.686145215235081)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1998 = 0.3140233963466647)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O1997 = -0.3470159027404986)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1997 = 0.3804165454412648)
=>WM: (14017: S1 ^operator O2000 +)
=>WM: (14016: S1 ^operator O1999 +)
=>WM: (14015: I3 ^dir U)
=>WM: (14014: O2000 ^name predict-no)
=>WM: (14013: O1999 ^name predict-yes)
=>WM: (14012: R1003 ^value 1)
=>WM: (14011: R1 ^reward R1003)
=>WM: (14010: I3 ^see 0)
<=WM: (14001: S1 ^operator O1997 +)
<=WM: (14002: S1 ^operator O1998 +)
<=WM: (14003: S1 ^operator O1998)
<=WM: (13986: I3 ^dir L)
<=WM: (13997: R1 ^reward R1002)
<=WM: (13996: I3 ^see 1)
<=WM: (14000: O1998 ^name predict-no)
<=WM: (13999: O1997 ^name predict-yes)
<=WM: (13998: R1002 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1999 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2000 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1998 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1997 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485033 -0.171009 0.314023 -> 0.485022 -0.171012 0.314009(R,m,v=1,0.860927,0.12053)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.5151 0.171045 0.686145 -> 0.515087 0.171042 0.686129(R,m,v=1,1,0)
=>WM: (14018: S1 ^operator O2000)

  1000:    O: O2000 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1000 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N999 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14019: I3 ^predict-no N1000)
<=WM: (14005: N999 ^status complete)
<=WM: (14004: I3 ^predict-no N999)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\-/|\-/|\--- Input Phase --- 
=>WM: (14023: I2 ^dir R)
=>WM: (14022: I2 ^reward 1)
=>WM: (14021: I2 ^see 0)
=>WM: (14020: N1000 ^status complete)
<=WM: (14008: I2 ^dir U)
<=WM: (14007: I2 ^reward 1)
<=WM: (14006: I2 ^see 0)
=>WM: (14024: I2 ^level-1 L0-root)
<=WM: (14009: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O1999 = 0.7055034804752064)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2000 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1004 ^value 1 +)
 (R1 ^reward R1004 +)
Firing propose*predict-yes
 -->
 (O2001 ^name predict-yes +)
 (S1 ^operator O2001 +)
Firing propose*predict-no
 -->
 (O2002 ^name predict-no +)
 (S1 ^operator O2002 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2000 = 0.229854902707684)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1999 = 0.2938705117203769)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2000 ^name predict-no +)
 (S1 ^operator O2000 +)
Retracting propose*predict-yes
 -->
 (O1999 ^name predict-yes +)
 (S1 ^operator O1999 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1003 ^value 1 +)
 (R1 ^reward R1003 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2000 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1999 = 0.)
=>WM: (14031: S1 ^operator O2002 +)
=>WM: (14030: S1 ^operator O2001 +)
=>WM: (14029: I3 ^dir R)
=>WM: (14028: O2002 ^name predict-no)
=>WM: (14027: O2001 ^name predict-yes)
=>WM: (14026: R1004 ^value 1)
=>WM: (14025: R1 ^reward R1004)
<=WM: (14016: S1 ^operator O1999 +)
<=WM: (14017: S1 ^operator O2000 +)
<=WM: (14018: S1 ^operator O2000)
<=WM: (14015: I3 ^dir U)
<=WM: (14011: R1 ^reward R1003)
<=WM: (14014: O2000 ^name predict-no)
<=WM: (14013: O1999 ^name predict-yes)
<=WM: (14012: R1003 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2001 = 0.7055034804752064)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2001 = 0.2938705117203769)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2002 = -0.2023211881870005)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2002 = 0.229854902707684)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2000 = 0.229854902707684)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2000 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1999 = 0.2938705117203769)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O1999 = 0.7055034804752064)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14032: S1 ^operator O2001)

  1001:    O: O2001 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1001 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1000 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14033: I3 ^predict-yes N1001)
<=WM: (14020: N1000 ^status complete)
<=WM: (14019: I3 ^predict-no N1000)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
---- Input Phase --- 
=>WM: (14037: I2 ^dir L)
=>WM: (14036: I2 ^reward 1)
=>WM: (14035: I2 ^see 1)
=>WM: (14034: N1001 ^status complete)
<=WM: (14023: I2 ^dir R)
<=WM: (14022: I2 ^reward 1)
<=WM: (14021: I2 ^see 0)
=>WM: (14038: I2 ^level-1 R1-root)
<=WM: (14024: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2001 = 0.6196100460529347)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2002 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1005 ^value 1 +)
 (R1 ^reward R1005 +)
Firing propose*predict-yes
 -->
 (O2003 ^name predict-yes +)
 (S1 ^operator O2003 +)
Firing propose*predict-no
 -->
 (O2004 ^name predict-no +)
 (S1 ^operator O2004 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2002 = 0.3140093857317092)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2001 = 0.3804165454412648)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2002 ^name predict-no +)
 (S1 ^operator O2002 +)
Retracting propose*predict-yes
 -->
 (O2001 ^name predict-yes +)
 (S1 ^operator O2001 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1004 ^value 1 +)
 (R1 ^reward R1004 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2002 = 0.229854902707684)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2002 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2001 = 0.2938705117203769)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2001 = 0.7055034804752064)
=>WM: (14046: S1 ^operator O2004 +)
=>WM: (14045: S1 ^operator O2003 +)
=>WM: (14044: I3 ^dir L)
=>WM: (14043: O2004 ^name predict-no)
=>WM: (14042: O2003 ^name predict-yes)
=>WM: (14041: R1005 ^value 1)
=>WM: (14040: R1 ^reward R1005)
=>WM: (14039: I3 ^see 1)
<=WM: (14030: S1 ^operator O2001 +)
<=WM: (14032: S1 ^operator O2001)
<=WM: (14031: S1 ^operator O2002 +)
<=WM: (14029: I3 ^dir R)
<=WM: (14025: R1 ^reward R1004)
<=WM: (14010: I3 ^see 0)
<=WM: (14028: O2002 ^name predict-no)
<=WM: (14027: O2001 ^name predict-yes)
<=WM: (14026: R1004 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3804165454412648)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2003 = 0.6196100460529347)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3140093857317092)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2004 = -0.1479504104026684)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2002 = 0.3140093857317092)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2002 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2001 = 0.3804165454412648)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2001 = 0.6196100460529347)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.500957 -0.207086 0.293871 -> 0.501003 -0.207081 0.293922(R,m,v=1,0.846154,0.131017)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498477 0.207026 0.705503 -> 0.498533 0.207032 0.705565(R,m,v=1,1,0)
=>WM: (14047: S1 ^operator O2003)

  1002:    O: O2003 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1002 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1001 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14048: I3 ^predict-yes N1002)
<=WM: (14034: N1001 ^status complete)
<=WM: (14033: I3 ^predict-yes N1001)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14052: I2 ^dir L)
=>WM: (14051: I2 ^reward 1)
=>WM: (14050: I2 ^see 1)
=>WM: (14049: N1002 ^status complete)
<=WM: (14037: I2 ^dir L)
<=WM: (14036: I2 ^reward 1)
<=WM: (14035: I2 ^see 1)
=>WM: (14053: I2 ^level-1 L1-root)
<=WM: (14038: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2003 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2004 = 0.6861287198581429)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1006 ^value 1 +)
 (R1 ^reward R1006 +)
Firing propose*predict-yes
 -->
 (O2005 ^name predict-yes +)
 (S1 ^operator O2005 +)
Firing propose*predict-no
 -->
 (O2006 ^name predict-no +)
 (S1 ^operator O2006 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3140093857317092)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3804165454412648)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2004 ^name predict-no +)
 (S1 ^operator O2004 +)
Retracting propose*predict-yes
 -->
 (O2003 ^name predict-yes +)
 (S1 ^operator O2003 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1005 ^value 1 +)
 (R1 ^reward R1005 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2004 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3140093857317092)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2003 = 0.6196100460529347)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3804165454412648)
=>WM: (14059: S1 ^operator O2006 +)
=>WM: (14058: S1 ^operator O2005 +)
=>WM: (14057: O2006 ^name predict-no)
=>WM: (14056: O2005 ^name predict-yes)
=>WM: (14055: R1006 ^value 1)
=>WM: (14054: R1 ^reward R1006)
<=WM: (14045: S1 ^operator O2003 +)
<=WM: (14047: S1 ^operator O2003)
<=WM: (14046: S1 ^operator O2004 +)
<=WM: (14040: R1 ^reward R1005)
<=WM: (14043: O2004 ^name predict-no)
<=WM: (14042: O2003 ^name predict-yes)
<=WM: (14041: R1005 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2005 = 0.3804165454412648)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2005 = -0.3470159027404986)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2006 = 0.3140093857317092)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2006 = 0.6861287198581429)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3140093857317092)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2004 = 0.6861287198581429)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3804165454412648)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2003 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521347 -0.14093 0.380417 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.831325,0.141073)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478682 0.140928 0.61961 -> 0.47868 0.140928 0.619607(R,m,v=1,1,0)
=>WM: (14060: S1 ^operator O2006)

  1003:    O: O2006 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1003 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1002 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14061: I3 ^predict-no N1003)
<=WM: (14049: N1002 ^status complete)
<=WM: (14048: I3 ^predict-yes N1002)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (14065: I2 ^dir R)
=>WM: (14064: I2 ^reward 1)
=>WM: (14063: I2 ^see 0)
=>WM: (14062: N1003 ^status complete)
<=WM: (14052: I2 ^dir L)
<=WM: (14051: I2 ^reward 1)
<=WM: (14050: I2 ^see 1)
=>WM: (14066: I2 ^level-1 L0-root)
<=WM: (14053: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2005 = 0.7055651252992311)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2006 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1007 ^value 1 +)
 (R1 ^reward R1007 +)
Firing propose*predict-yes
 -->
 (O2007 ^name predict-yes +)
 (S1 ^operator O2007 +)
Firing propose*predict-no
 -->
 (O2008 ^name predict-no +)
 (S1 ^operator O2008 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2006 = 0.229854902707684)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2005 = 0.2939222491339341)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2006 ^name predict-no +)
 (S1 ^operator O2006 +)
Retracting propose*predict-yes
 -->
 (O2005 ^name predict-yes +)
 (S1 ^operator O2005 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1006 ^value 1 +)
 (R1 ^reward R1006 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2006 = 0.6861287198581429)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2006 = 0.3140093857317092)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2005 = -0.3470159027404986)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2005 = 0.380414370085626)
=>WM: (14074: S1 ^operator O2008 +)
=>WM: (14073: S1 ^operator O2007 +)
=>WM: (14072: I3 ^dir R)
=>WM: (14071: O2008 ^name predict-no)
=>WM: (14070: O2007 ^name predict-yes)
=>WM: (14069: R1007 ^value 1)
=>WM: (14068: R1 ^reward R1007)
=>WM: (14067: I3 ^see 0)
<=WM: (14058: S1 ^operator O2005 +)
<=WM: (14059: S1 ^operator O2006 +)
<=WM: (14060: S1 ^operator O2006)
<=WM: (14044: I3 ^dir L)
<=WM: (14054: R1 ^reward R1006)
<=WM: (14039: I3 ^see 1)
<=WM: (14057: O2006 ^name predict-no)
<=WM: (14056: O2005 ^name predict-yes)
<=WM: (14055: R1006 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.2939222491339341)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2007 = 0.7055651252992311)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.229854902707684)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2008 = -0.2023211881870005)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2006 = 0.229854902707684)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2006 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2005 = 0.2939222491339341)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2005 = 0.7055651252992311)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485022 -0.171012 0.314009 -> 0.485013 -0.171015 0.313998(R,m,v=1,0.861842,0.119859)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515087 0.171042 0.686129 -> 0.515077 0.171039 0.686115(R,m,v=1,1,0)
=>WM: (14075: S1 ^operator O2007)

  1004:    O: O2007 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1004 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1003 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14076: I3 ^predict-yes N1004)
<=WM: (14062: N1003 ^status complete)
<=WM: (14061: I3 ^predict-no N1003)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14080: I2 ^dir R)
=>WM: (14079: I2 ^reward 1)
=>WM: (14078: I2 ^see 1)
=>WM: (14077: N1004 ^status complete)
<=WM: (14065: I2 ^dir R)
<=WM: (14064: I2 ^reward 1)
<=WM: (14063: I2 ^see 0)
=>WM: (14081: I2 ^level-1 R1-root)
<=WM: (14066: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2007 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2008 = 0.7701760437619466)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1008 ^value 1 +)
 (R1 ^reward R1008 +)
Firing propose*predict-yes
 -->
 (O2009 ^name predict-yes +)
 (S1 ^operator O2009 +)
Firing propose*predict-no
 -->
 (O2010 ^name predict-no +)
 (S1 ^operator O2010 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.229854902707684)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.2939222491339341)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2008 ^name predict-no +)
 (S1 ^operator O2008 +)
Retracting propose*predict-yes
 -->
 (O2007 ^name predict-yes +)
 (S1 ^operator O2007 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1007 ^value 1 +)
 (R1 ^reward R1007 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2008 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.229854902707684)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2007 = 0.7055651252992311)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.2939222491339341)
=>WM: (14088: S1 ^operator O2010 +)
=>WM: (14087: S1 ^operator O2009 +)
=>WM: (14086: O2010 ^name predict-no)
=>WM: (14085: O2009 ^name predict-yes)
=>WM: (14084: R1008 ^value 1)
=>WM: (14083: R1 ^reward R1008)
=>WM: (14082: I3 ^see 1)
<=WM: (14073: S1 ^operator O2007 +)
<=WM: (14075: S1 ^operator O2007)
<=WM: (14074: S1 ^operator O2008 +)
<=WM: (14068: R1 ^reward R1007)
<=WM: (14067: I3 ^see 0)
<=WM: (14071: O2008 ^name predict-no)
<=WM: (14070: O2007 ^name predict-yes)
<=WM: (14069: R1007 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2009 = 0.2939222491339341)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2009 = -0.252585164213872)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2010 = 0.229854902707684)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2010 = 0.7701760437619466)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.229854902707684)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2008 = 0.7701760437619466)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.2939222491339341)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2007 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501003 -0.207081 0.293922 -> 0.501042 -0.207077 0.293965(R,m,v=1,0.847134,0.130328)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498533 0.207032 0.705565 -> 0.498578 0.207037 0.705615(R,m,v=1,1,0)
=>WM: (14089: S1 ^operator O2010)

  1005:    O: O2010 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1005 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1004 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14090: I3 ^predict-no N1005)
<=WM: (14077: N1004 ^status complete)
<=WM: (14076: I3 ^predict-yes N1004)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (14094: I2 ^dir U)
=>WM: (14093: I2 ^reward 1)
=>WM: (14092: I2 ^see 0)
=>WM: (14091: N1005 ^status complete)
<=WM: (14080: I2 ^dir R)
<=WM: (14079: I2 ^reward 1)
<=WM: (14078: I2 ^see 1)
=>WM: (14095: I2 ^level-1 R0-root)
<=WM: (14081: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1009 ^value 1 +)
 (R1 ^reward R1009 +)
Firing propose*predict-yes
 -->
 (O2011 ^name predict-yes +)
 (S1 ^operator O2011 +)
Firing propose*predict-no
 -->
 (O2012 ^name predict-no +)
 (S1 ^operator O2012 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2010 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2009 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2010 ^name predict-no +)
 (S1 ^operator O2010 +)
Retracting propose*predict-yes
 -->
 (O2009 ^name predict-yes +)
 (S1 ^operator O2009 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1008 ^value 1 +)
 (R1 ^reward R1008 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2010 = 0.7701760437619466)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2010 = 0.229854902707684)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2009 = -0.252585164213872)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2009 = 0.2939645711914686)
=>WM: (14103: S1 ^operator O2012 +)
=>WM: (14102: S1 ^operator O2011 +)
=>WM: (14101: I3 ^dir U)
=>WM: (14100: O2012 ^name predict-no)
=>WM: (14099: O2011 ^name predict-yes)
=>WM: (14098: R1009 ^value 1)
=>WM: (14097: R1 ^reward R1009)
=>WM: (14096: I3 ^see 0)
<=WM: (14087: S1 ^operator O2009 +)
<=WM: (14088: S1 ^operator O2010 +)
<=WM: (14089: S1 ^operator O2010)
<=WM: (14072: I3 ^dir R)
<=WM: (14083: R1 ^reward R1008)
<=WM: (14082: I3 ^see 1)
<=WM: (14086: O2010 ^name predict-no)
<=WM: (14085: O2009 ^name predict-yes)
<=WM: (14084: R1008 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2011 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2012 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2010 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2009 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611908 -0.382053 0.229855 -> 0.611906 -0.382053 0.229852(R,m,v=1,0.846591,0.130617)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388117 0.382059 0.770176 -> 0.388115 0.382058 0.770173(R,m,v=1,1,0)
=>WM: (14104: S1 ^operator O2012)

  1006:    O: O2012 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1006 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1005 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14105: I3 ^predict-no N1006)
<=WM: (14091: N1005 ^status complete)
<=WM: (14090: I3 ^predict-no N1005)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14109: I2 ^dir R)
=>WM: (14108: I2 ^reward 1)
=>WM: (14107: I2 ^see 0)
=>WM: (14106: N1006 ^status complete)
<=WM: (14094: I2 ^dir U)
<=WM: (14093: I2 ^reward 1)
<=WM: (14092: I2 ^see 0)
=>WM: (14110: I2 ^level-1 R0-root)
<=WM: (14095: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2011 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2012 = 0.7700907188039023)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1010 ^value 1 +)
 (R1 ^reward R1010 +)
Firing propose*predict-yes
 -->
 (O2013 ^name predict-yes +)
 (S1 ^operator O2013 +)
Firing propose*predict-no
 -->
 (O2014 ^name predict-no +)
 (S1 ^operator O2014 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2012 = 0.2298523950867538)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2011 = 0.2939645711914686)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2012 ^name predict-no +)
 (S1 ^operator O2012 +)
Retracting propose*predict-yes
 -->
 (O2011 ^name predict-yes +)
 (S1 ^operator O2011 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1009 ^value 1 +)
 (R1 ^reward R1009 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2012 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2011 = 0.)
=>WM: (14117: S1 ^operator O2014 +)
=>WM: (14116: S1 ^operator O2013 +)
=>WM: (14115: I3 ^dir R)
=>WM: (14114: O2014 ^name predict-no)
=>WM: (14113: O2013 ^name predict-yes)
=>WM: (14112: R1010 ^value 1)
=>WM: (14111: R1 ^reward R1010)
<=WM: (14102: S1 ^operator O2011 +)
<=WM: (14103: S1 ^operator O2012 +)
<=WM: (14104: S1 ^operator O2012)
<=WM: (14101: I3 ^dir U)
<=WM: (14097: R1 ^reward R1009)
<=WM: (14100: O2012 ^name predict-no)
<=WM: (14099: O2011 ^name predict-yes)
<=WM: (14098: R1009 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2013 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2013 = 0.2939645711914686)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2014 = 0.7700907188039023)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2014 = 0.2298523950867538)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2012 = 0.2298523950867538)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2012 = 0.7700907188039023)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2011 = 0.2939645711914686)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2011 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14118: S1 ^operator O2014)

  1007:    O: O2014 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1007 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1006 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14119: I3 ^predict-no N1007)
<=WM: (14106: N1006 ^status complete)
<=WM: (14105: I3 ^predict-no N1006)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14123: I2 ^dir R)
=>WM: (14122: I2 ^reward 1)
=>WM: (14121: I2 ^see 0)
=>WM: (14120: N1007 ^status complete)
<=WM: (14109: I2 ^dir R)
<=WM: (14108: I2 ^reward 1)
<=WM: (14107: I2 ^see 0)
=>WM: (14124: I2 ^level-1 R0-root)
<=WM: (14110: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2013 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2014 = 0.7700907188039023)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1011 ^value 1 +)
 (R1 ^reward R1011 +)
Firing propose*predict-yes
 -->
 (O2015 ^name predict-yes +)
 (S1 ^operator O2015 +)
Firing propose*predict-no
 -->
 (O2016 ^name predict-no +)
 (S1 ^operator O2016 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2014 = 0.2298523950867538)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2013 = 0.2939645711914686)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2014 ^name predict-no +)
 (S1 ^operator O2014 +)
Retracting propose*predict-yes
 -->
 (O2013 ^name predict-yes +)
 (S1 ^operator O2013 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1010 ^value 1 +)
 (R1 ^reward R1010 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2014 = 0.2298523950867538)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2014 = 0.7700907188039023)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2013 = 0.2939645711914686)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2013 = -0.1254042659579056)
=>WM: (14130: S1 ^operator O2016 +)
=>WM: (14129: S1 ^operator O2015 +)
=>WM: (14128: O2016 ^name predict-no)
=>WM: (14127: O2015 ^name predict-yes)
=>WM: (14126: R1011 ^value 1)
=>WM: (14125: R1 ^reward R1011)
<=WM: (14116: S1 ^operator O2013 +)
<=WM: (14117: S1 ^operator O2014 +)
<=WM: (14118: S1 ^operator O2014)
<=WM: (14111: R1 ^reward R1010)
<=WM: (14114: O2014 ^name predict-no)
<=WM: (14113: O2013 ^name predict-yes)
<=WM: (14112: R1010 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2015 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2015 = 0.2939645711914686)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2016 = 0.7700907188039023)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2016 = 0.2298523950867538)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2014 = 0.2298523950867538)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2014 = 0.7700907188039023)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2013 = 0.2939645711914686)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2013 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611906 -0.382053 0.229852 -> 0.61191 -0.382053 0.229857(R,m,v=1,0.847458,0.130008)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388048 0.382043 0.770091 -> 0.388052 0.382044 0.770096(R,m,v=1,1,0)
=>WM: (14131: S1 ^operator O2016)

  1008:    O: O2016 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1008 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1007 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14132: I3 ^predict-no N1008)
<=WM: (14120: N1007 ^status complete)
<=WM: (14119: I3 ^predict-no N1007)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14136: I2 ^dir L)
=>WM: (14135: I2 ^reward 1)
=>WM: (14134: I2 ^see 0)
=>WM: (14133: N1008 ^status complete)
<=WM: (14123: I2 ^dir R)
<=WM: (14122: I2 ^reward 1)
<=WM: (14121: I2 ^see 0)
=>WM: (14137: I2 ^level-1 R0-root)
<=WM: (14124: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2015 = 0.6195686662736642)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2016 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1012 ^value 1 +)
 (R1 ^reward R1012 +)
Firing propose*predict-yes
 -->
 (O2017 ^name predict-yes +)
 (S1 ^operator O2017 +)
Firing propose*predict-no
 -->
 (O2018 ^name predict-no +)
 (S1 ^operator O2018 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2016 = 0.3139979225569853)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2015 = 0.380414370085626)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2016 ^name predict-no +)
 (S1 ^operator O2016 +)
Retracting propose*predict-yes
 -->
 (O2015 ^name predict-yes +)
 (S1 ^operator O2015 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1011 ^value 1 +)
 (R1 ^reward R1011 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2016 = 0.229857000391985)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2016 = 0.7700959914561893)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2015 = 0.2939645711914686)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2015 = -0.1254042659579056)
=>WM: (14144: S1 ^operator O2018 +)
=>WM: (14143: S1 ^operator O2017 +)
=>WM: (14142: I3 ^dir L)
=>WM: (14141: O2018 ^name predict-no)
=>WM: (14140: O2017 ^name predict-yes)
=>WM: (14139: R1012 ^value 1)
=>WM: (14138: R1 ^reward R1012)
<=WM: (14129: S1 ^operator O2015 +)
<=WM: (14130: S1 ^operator O2016 +)
<=WM: (14131: S1 ^operator O2016)
<=WM: (14115: I3 ^dir R)
<=WM: (14125: R1 ^reward R1011)
<=WM: (14128: O2016 ^name predict-no)
<=WM: (14127: O2015 ^name predict-yes)
<=WM: (14126: R1011 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2017 = 0.6195686662736642)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2017 = 0.380414370085626)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2018 = -0.2190661556260421)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2018 = 0.3139979225569853)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2016 = 0.3139979225569853)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2016 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2015 = 0.380414370085626)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2015 = 0.6195686662736642)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.61191 -0.382053 0.229857 -> 0.611913 -0.382052 0.229861(R,m,v=1,0.848315,0.129404)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388052 0.382044 0.770096 -> 0.388056 0.382045 0.7701(R,m,v=1,1,0)
=>WM: (14145: S1 ^operator O2017)

  1009:    O: O2017 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1009 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1008 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14146: I3 ^predict-yes N1009)
<=WM: (14133: N1008 ^status complete)
<=WM: (14132: I3 ^predict-no N1008)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14150: I2 ^dir R)
=>WM: (14149: I2 ^reward 1)
=>WM: (14148: I2 ^see 1)
=>WM: (14147: N1009 ^status complete)
<=WM: (14136: I2 ^dir L)
<=WM: (14135: I2 ^reward 1)
<=WM: (14134: I2 ^see 0)
=>WM: (14151: I2 ^level-1 L1-root)
<=WM: (14137: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2017 = 0.7062964705528377)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2018 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1013 ^value 1 +)
 (R1 ^reward R1013 +)
Firing propose*predict-yes
 -->
 (O2019 ^name predict-yes +)
 (S1 ^operator O2019 +)
Firing propose*predict-no
 -->
 (O2020 ^name predict-no +)
 (S1 ^operator O2020 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2018 = 0.2298608025432123)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2017 = 0.2939645711914686)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2018 ^name predict-no +)
 (S1 ^operator O2018 +)
Retracting propose*predict-yes
 -->
 (O2017 ^name predict-yes +)
 (S1 ^operator O2017 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1012 ^value 1 +)
 (R1 ^reward R1012 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2018 = 0.3139979225569853)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2018 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2017 = 0.380414370085626)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2017 = 0.6195686662736642)
=>WM: (14159: S1 ^operator O2020 +)
=>WM: (14158: S1 ^operator O2019 +)
=>WM: (14157: I3 ^dir R)
=>WM: (14156: O2020 ^name predict-no)
=>WM: (14155: O2019 ^name predict-yes)
=>WM: (14154: R1013 ^value 1)
=>WM: (14153: R1 ^reward R1013)
=>WM: (14152: I3 ^see 1)
<=WM: (14143: S1 ^operator O2017 +)
<=WM: (14145: S1 ^operator O2017)
<=WM: (14144: S1 ^operator O2018 +)
<=WM: (14142: I3 ^dir L)
<=WM: (14138: R1 ^reward R1012)
<=WM: (14096: I3 ^see 0)
<=WM: (14141: O2018 ^name predict-no)
<=WM: (14140: O2017 ^name predict-yes)
<=WM: (14139: R1012 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2019 = 0.2939645711914686)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2019 = 0.7062964705528377)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2020 = 0.2298608025432123)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2020 = -0.1937987592593187)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2018 = 0.2298608025432123)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2018 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2017 = 0.2939645711914686)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2017 = 0.7062964705528377)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521346 -0.14093 0.380416(R,m,v=1,0.832335,0.140394)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478637 0.140932 0.619569 -> 0.478639 0.140931 0.61957(R,m,v=1,1,0)
=>WM: (14160: S1 ^operator O2019)

  1010:    O: O2019 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1010 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1009 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14161: I3 ^predict-yes N1010)
<=WM: (14147: N1009 ^status complete)
<=WM: (14146: I3 ^predict-yes N1009)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14165: I2 ^dir L)
=>WM: (14164: I2 ^reward 1)
=>WM: (14163: I2 ^see 1)
=>WM: (14162: N1010 ^status complete)
<=WM: (14150: I2 ^dir R)
<=WM: (14149: I2 ^reward 1)
<=WM: (14148: I2 ^see 1)
=>WM: (14166: I2 ^level-1 R1-root)
<=WM: (14151: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2019 = 0.6196074987347102)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2020 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1014 ^value 1 +)
 (R1 ^reward R1014 +)
Firing propose*predict-yes
 -->
 (O2021 ^name predict-yes +)
 (S1 ^operator O2021 +)
Firing propose*predict-no
 -->
 (O2022 ^name predict-no +)
 (S1 ^operator O2022 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2020 = 0.3139979225569853)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2019 = 0.3804157564584494)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2020 ^name predict-no +)
 (S1 ^operator O2020 +)
Retracting propose*predict-yes
 -->
 (O2019 ^name predict-yes +)
 (S1 ^operator O2019 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1013 ^value 1 +)
 (R1 ^reward R1013 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2020 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2020 = 0.2298608025432123)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2019 = 0.7062964705528377)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2019 = 0.2939645711914686)
=>WM: (14173: S1 ^operator O2022 +)
=>WM: (14172: S1 ^operator O2021 +)
=>WM: (14171: I3 ^dir L)
=>WM: (14170: O2022 ^name predict-no)
=>WM: (14169: O2021 ^name predict-yes)
=>WM: (14168: R1014 ^value 1)
=>WM: (14167: R1 ^reward R1014)
<=WM: (14158: S1 ^operator O2019 +)
<=WM: (14160: S1 ^operator O2019)
<=WM: (14159: S1 ^operator O2020 +)
<=WM: (14157: I3 ^dir R)
<=WM: (14153: R1 ^reward R1013)
<=WM: (14156: O2020 ^name predict-no)
<=WM: (14155: O2019 ^name predict-yes)
<=WM: (14154: R1013 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2021 = 0.3804157564584494)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2021 = 0.6196074987347102)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2022 = 0.3139979225569853)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2022 = -0.1479504104026684)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2020 = 0.3139979225569853)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2020 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2019 = 0.3804157564584494)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2019 = 0.6196074987347102)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501042 -0.207077 0.293965 -> 0.501022 -0.207079 0.293943(R,m,v=1,0.848101,0.129646)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499194 0.207103 0.706296 -> 0.499171 0.2071 0.706271(R,m,v=1,1,0)
=>WM: (14174: S1 ^operator O2021)

  1011:    O: O2021 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1011 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1010 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14175: I3 ^predict-yes N1011)
<=WM: (14162: N1010 ^status complete)
<=WM: (14161: I3 ^predict-yes N1010)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|--- Input Phase --- 
=>WM: (14179: I2 ^dir U)
=>WM: (14178: I2 ^reward 1)
=>WM: (14177: I2 ^see 1)
=>WM: (14176: N1011 ^status complete)
<=WM: (14165: I2 ^dir L)
<=WM: (14164: I2 ^reward 1)
<=WM: (14163: I2 ^see 1)
=>WM: (14180: I2 ^level-1 L1-root)
<=WM: (14166: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1015 ^value 1 +)
 (R1 ^reward R1015 +)
Firing propose*predict-yes
 -->
 (O2023 ^name predict-yes +)
 (S1 ^operator O2023 +)
Firing propose*predict-no
 -->
 (O2024 ^name predict-no +)
 (S1 ^operator O2024 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2022 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2021 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2022 ^name predict-no +)
 (S1 ^operator O2022 +)
Retracting propose*predict-yes
 -->
 (O2021 ^name predict-yes +)
 (S1 ^operator O2021 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1014 ^value 1 +)
 (R1 ^reward R1014 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2022 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2022 = 0.3139979225569853)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2021 = 0.6196074987347102)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2021 = 0.3804157564584494)
=>WM: (14187: S1 ^operator O2024 +)
=>WM: (14186: S1 ^operator O2023 +)
=>WM: (14185: I3 ^dir U)
=>WM: (14184: O2024 ^name predict-no)
=>WM: (14183: O2023 ^name predict-yes)
=>WM: (14182: R1015 ^value 1)
=>WM: (14181: R1 ^reward R1015)
<=WM: (14172: S1 ^operator O2021 +)
<=WM: (14174: S1 ^operator O2021)
<=WM: (14173: S1 ^operator O2022 +)
<=WM: (14171: I3 ^dir L)
<=WM: (14167: R1 ^reward R1014)
<=WM: (14170: O2022 ^name predict-no)
<=WM: (14169: O2021 ^name predict-yes)
<=WM: (14168: R1014 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2023 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2024 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2022 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2021 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521346 -0.14093 0.380416 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.833333,0.139721)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.47868 0.140928 0.619607 -> 0.478677 0.140928 0.619605(R,m,v=1,1,0)
=>WM: (14188: S1 ^operator O2024)

  1012:    O: O2024 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1012 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1011 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14189: I3 ^predict-no N1012)
<=WM: (14176: N1011 ^status complete)
<=WM: (14175: I3 ^predict-yes N1011)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (14193: I2 ^dir L)
=>WM: (14192: I2 ^reward 1)
=>WM: (14191: I2 ^see 0)
=>WM: (14190: N1012 ^status complete)
<=WM: (14179: I2 ^dir U)
<=WM: (14178: I2 ^reward 1)
<=WM: (14177: I2 ^see 1)
=>WM: (14194: I2 ^level-1 L1-root)
<=WM: (14180: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2023 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2024 = 0.68611525175106)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1016 ^value 1 +)
 (R1 ^reward R1016 +)
Firing propose*predict-yes
 -->
 (O2025 ^name predict-yes +)
 (S1 ^operator O2025 +)
Firing propose*predict-no
 -->
 (O2026 ^name predict-no +)
 (S1 ^operator O2026 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2024 = 0.3139979225569853)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2023 = 0.3804138577541756)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2024 ^name predict-no +)
 (S1 ^operator O2024 +)
Retracting propose*predict-yes
 -->
 (O2023 ^name predict-yes +)
 (S1 ^operator O2023 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1015 ^value 1 +)
 (R1 ^reward R1015 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2024 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2023 = 0.)
=>WM: (14202: S1 ^operator O2026 +)
=>WM: (14201: S1 ^operator O2025 +)
=>WM: (14200: I3 ^dir L)
=>WM: (14199: O2026 ^name predict-no)
=>WM: (14198: O2025 ^name predict-yes)
=>WM: (14197: R1016 ^value 1)
=>WM: (14196: R1 ^reward R1016)
=>WM: (14195: I3 ^see 0)
<=WM: (14186: S1 ^operator O2023 +)
<=WM: (14187: S1 ^operator O2024 +)
<=WM: (14188: S1 ^operator O2024)
<=WM: (14185: I3 ^dir U)
<=WM: (14181: R1 ^reward R1015)
<=WM: (14152: I3 ^see 1)
<=WM: (14184: O2024 ^name predict-no)
<=WM: (14183: O2023 ^name predict-yes)
<=WM: (14182: R1015 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2025 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2025 = 0.3804138577541756)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2026 = 0.68611525175106)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2026 = 0.3139979225569853)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2024 = 0.3139979225569853)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2024 = 0.68611525175106)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2023 = 0.3804138577541756)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2023 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14203: S1 ^operator O2026)

  1013:    O: O2026 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1013 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1012 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14204: I3 ^predict-no N1013)
<=WM: (14190: N1012 ^status complete)
<=WM: (14189: I3 ^predict-no N1012)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14208: I2 ^dir L)
=>WM: (14207: I2 ^reward 1)
=>WM: (14206: I2 ^see 0)
=>WM: (14205: N1013 ^status complete)
<=WM: (14193: I2 ^dir L)
<=WM: (14192: I2 ^reward 1)
<=WM: (14191: I2 ^see 0)
=>WM: (14209: I2 ^level-1 L0-root)
<=WM: (14194: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2025 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2026 = 0.6857730532944987)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1017 ^value 1 +)
 (R1 ^reward R1017 +)
Firing propose*predict-yes
 -->
 (O2027 ^name predict-yes +)
 (S1 ^operator O2027 +)
Firing propose*predict-no
 -->
 (O2028 ^name predict-no +)
 (S1 ^operator O2028 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2026 = 0.3139979225569853)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2025 = 0.3804138577541756)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2026 ^name predict-no +)
 (S1 ^operator O2026 +)
Retracting propose*predict-yes
 -->
 (O2025 ^name predict-yes +)
 (S1 ^operator O2025 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1016 ^value 1 +)
 (R1 ^reward R1016 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2026 = 0.3139979225569853)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2026 = 0.68611525175106)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2025 = 0.3804138577541756)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2025 = -0.3470159027404986)
=>WM: (14215: S1 ^operator O2028 +)
=>WM: (14214: S1 ^operator O2027 +)
=>WM: (14213: O2028 ^name predict-no)
=>WM: (14212: O2027 ^name predict-yes)
=>WM: (14211: R1017 ^value 1)
=>WM: (14210: R1 ^reward R1017)
<=WM: (14201: S1 ^operator O2025 +)
<=WM: (14202: S1 ^operator O2026 +)
<=WM: (14203: S1 ^operator O2026)
<=WM: (14196: R1 ^reward R1016)
<=WM: (14199: O2026 ^name predict-no)
<=WM: (14198: O2025 ^name predict-yes)
<=WM: (14197: R1016 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2027 = 0.3804138577541756)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2027 = -0.3332708974800781)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2028 = 0.3139979225569853)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2028 = 0.6857730532944987)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2026 = 0.3139979225569853)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2026 = 0.6857730532944987)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2025 = 0.3804138577541756)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2025 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485013 -0.171015 0.313998 -> 0.485005 -0.171017 0.313989(R,m,v=1,0.862745,0.119195)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515077 0.171039 0.686115 -> 0.515068 0.171036 0.686104(R,m,v=1,1,0)
=>WM: (14216: S1 ^operator O2028)

  1014:    O: O2028 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1014 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1013 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14217: I3 ^predict-no N1014)
<=WM: (14205: N1013 ^status complete)
<=WM: (14204: I3 ^predict-no N1013)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14221: I2 ^dir R)
=>WM: (14220: I2 ^reward 1)
=>WM: (14219: I2 ^see 0)
=>WM: (14218: N1014 ^status complete)
<=WM: (14208: I2 ^dir L)
<=WM: (14207: I2 ^reward 1)
<=WM: (14206: I2 ^see 0)
=>WM: (14222: I2 ^level-1 L0-root)
<=WM: (14209: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2027 = 0.7056154385005245)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2028 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1018 ^value 1 +)
 (R1 ^reward R1018 +)
Firing propose*predict-yes
 -->
 (O2029 ^name predict-yes +)
 (S1 ^operator O2029 +)
Firing propose*predict-no
 -->
 (O2030 ^name predict-no +)
 (S1 ^operator O2030 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2028 = 0.2298608025432123)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2027 = 0.2939430423129205)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2028 ^name predict-no +)
 (S1 ^operator O2028 +)
Retracting propose*predict-yes
 -->
 (O2027 ^name predict-yes +)
 (S1 ^operator O2027 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1017 ^value 1 +)
 (R1 ^reward R1017 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2028 = 0.6857730532944987)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2028 = 0.3139885389674749)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2027 = -0.3332708974800781)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2027 = 0.3804138577541756)
=>WM: (14229: S1 ^operator O2030 +)
=>WM: (14228: S1 ^operator O2029 +)
=>WM: (14227: I3 ^dir R)
=>WM: (14226: O2030 ^name predict-no)
=>WM: (14225: O2029 ^name predict-yes)
=>WM: (14224: R1018 ^value 1)
=>WM: (14223: R1 ^reward R1018)
<=WM: (14214: S1 ^operator O2027 +)
<=WM: (14215: S1 ^operator O2028 +)
<=WM: (14216: S1 ^operator O2028)
<=WM: (14200: I3 ^dir L)
<=WM: (14210: R1 ^reward R1017)
<=WM: (14213: O2028 ^name predict-no)
<=WM: (14212: O2027 ^name predict-yes)
<=WM: (14211: R1017 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2029 = 0.7056154385005245)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2029 = 0.2939430423129205)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2030 = -0.2023211881870005)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2030 = 0.2298608025432123)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2028 = 0.2298608025432123)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2028 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2027 = 0.2939430423129205)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2027 = 0.7056154385005245)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485005 -0.171017 0.313989 -> 0.485021 -0.171013 0.314008(R,m,v=1,0.863636,0.118538)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514806 0.170967 0.685773 -> 0.514825 0.170972 0.685796(R,m,v=1,1,0)
=>WM: (14230: S1 ^operator O2029)

  1015:    O: O2029 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1015 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1014 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14231: I3 ^predict-yes N1015)
<=WM: (14218: N1014 ^status complete)
<=WM: (14217: I3 ^predict-no N1014)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14235: I2 ^dir R)
=>WM: (14234: I2 ^reward 1)
=>WM: (14233: I2 ^see 1)
=>WM: (14232: N1015 ^status complete)
<=WM: (14221: I2 ^dir R)
<=WM: (14220: I2 ^reward 1)
<=WM: (14219: I2 ^see 0)
=>WM: (14236: I2 ^level-1 R1-root)
<=WM: (14222: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2029 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2030 = 0.7701730258510331)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1019 ^value 1 +)
 (R1 ^reward R1019 +)
Firing propose*predict-yes
 -->
 (O2031 ^name predict-yes +)
 (S1 ^operator O2031 +)
Firing propose*predict-no
 -->
 (O2032 ^name predict-no +)
 (S1 ^operator O2032 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2030 = 0.2298608025432123)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2029 = 0.2939430423129205)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2030 ^name predict-no +)
 (S1 ^operator O2030 +)
Retracting propose*predict-yes
 -->
 (O2029 ^name predict-yes +)
 (S1 ^operator O2029 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1018 ^value 1 +)
 (R1 ^reward R1018 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2030 = 0.2298608025432123)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2030 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2029 = 0.2939430423129205)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2029 = 0.7056154385005245)
=>WM: (14243: S1 ^operator O2032 +)
=>WM: (14242: S1 ^operator O2031 +)
=>WM: (14241: O2032 ^name predict-no)
=>WM: (14240: O2031 ^name predict-yes)
=>WM: (14239: R1019 ^value 1)
=>WM: (14238: R1 ^reward R1019)
=>WM: (14237: I3 ^see 1)
<=WM: (14228: S1 ^operator O2029 +)
<=WM: (14230: S1 ^operator O2029)
<=WM: (14229: S1 ^operator O2030 +)
<=WM: (14223: R1 ^reward R1018)
<=WM: (14195: I3 ^see 0)
<=WM: (14226: O2030 ^name predict-no)
<=WM: (14225: O2029 ^name predict-yes)
<=WM: (14224: R1018 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2031 = 0.2939430423129205)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2031 = -0.252585164213872)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2032 = 0.2298608025432123)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2032 = 0.7701730258510331)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2030 = 0.2298608025432123)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2030 = 0.7701730258510331)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2029 = 0.2939430423129205)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2029 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501022 -0.207079 0.293943 -> 0.501055 -0.207076 0.293979(R,m,v=1,0.849057,0.128971)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498578 0.207037 0.705615 -> 0.498617 0.207041 0.705659(R,m,v=1,1,0)
=>WM: (14244: S1 ^operator O2032)

  1016:    O: O2032 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1016 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1015 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14245: I3 ^predict-no N1016)
<=WM: (14232: N1015 ^status complete)
<=WM: (14231: I3 ^predict-yes N1015)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14249: I2 ^dir R)
=>WM: (14248: I2 ^reward 1)
=>WM: (14247: I2 ^see 0)
=>WM: (14246: N1016 ^status complete)
<=WM: (14235: I2 ^dir R)
<=WM: (14234: I2 ^reward 1)
<=WM: (14233: I2 ^see 1)
=>WM: (14250: I2 ^level-1 R0-root)
<=WM: (14236: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2031 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2032 = 0.7701003386536001)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1020 ^value 1 +)
 (R1 ^reward R1020 +)
Firing propose*predict-yes
 -->
 (O2033 ^name predict-yes +)
 (S1 ^operator O2033 +)
Firing propose*predict-no
 -->
 (O2034 ^name predict-no +)
 (S1 ^operator O2034 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2032 = 0.2298608025432123)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2031 = 0.2939794178406799)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2032 ^name predict-no +)
 (S1 ^operator O2032 +)
Retracting propose*predict-yes
 -->
 (O2031 ^name predict-yes +)
 (S1 ^operator O2031 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1019 ^value 1 +)
 (R1 ^reward R1019 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2032 = 0.7701730258510331)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2032 = 0.2298608025432123)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2031 = -0.252585164213872)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2031 = 0.2939794178406799)
=>WM: (14257: S1 ^operator O2034 +)
=>WM: (14256: S1 ^operator O2033 +)
=>WM: (14255: O2034 ^name predict-no)
=>WM: (14254: O2033 ^name predict-yes)
=>WM: (14253: R1020 ^value 1)
=>WM: (14252: R1 ^reward R1020)
=>WM: (14251: I3 ^see 0)
<=WM: (14242: S1 ^operator O2031 +)
<=WM: (14243: S1 ^operator O2032 +)
<=WM: (14244: S1 ^operator O2032)
<=WM: (14238: R1 ^reward R1019)
<=WM: (14237: I3 ^see 1)
<=WM: (14241: O2032 ^name predict-no)
<=WM: (14240: O2031 ^name predict-yes)
<=WM: (14239: R1019 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2033 = 0.2939794178406799)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2033 = -0.1254042659579056)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2034 = 0.2298608025432123)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2034 = 0.7701003386536001)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2032 = 0.2298608025432123)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2032 = 0.7701003386536001)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2031 = 0.2939794178406799)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2031 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611913 -0.382052 0.229861 -> 0.611911 -0.382052 0.229858(R,m,v=1,0.849162,0.128805)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388115 0.382058 0.770173 -> 0.388112 0.382058 0.77017(R,m,v=1,1,0)
=>WM: (14258: S1 ^operator O2034)

  1017:    O: O2034 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1017 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1016 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14259: I3 ^predict-no N1017)
<=WM: (14246: N1016 ^status complete)
<=WM: (14245: I3 ^predict-no N1016)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14263: I2 ^dir R)
=>WM: (14262: I2 ^reward 1)
=>WM: (14261: I2 ^see 0)
=>WM: (14260: N1017 ^status complete)
<=WM: (14249: I2 ^dir R)
<=WM: (14248: I2 ^reward 1)
<=WM: (14247: I2 ^see 0)
=>WM: (14264: I2 ^level-1 R0-root)
<=WM: (14250: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2033 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2034 = 0.7701003386536001)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1021 ^value 1 +)
 (R1 ^reward R1021 +)
Firing propose*predict-yes
 -->
 (O2035 ^name predict-yes +)
 (S1 ^operator O2035 +)
Firing propose*predict-no
 -->
 (O2036 ^name predict-no +)
 (S1 ^operator O2036 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2034 = 0.2298580688851452)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2033 = 0.2939794178406799)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2034 ^name predict-no +)
 (S1 ^operator O2034 +)
Retracting propose*predict-yes
 -->
 (O2033 ^name predict-yes +)
 (S1 ^operator O2033 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1020 ^value 1 +)
 (R1 ^reward R1020 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2034 = 0.7701003386536001)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2034 = 0.2298580688851452)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2033 = -0.1254042659579056)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2033 = 0.2939794178406799)
=>WM: (14270: S1 ^operator O2036 +)
=>WM: (14269: S1 ^operator O2035 +)
=>WM: (14268: O2036 ^name predict-no)
=>WM: (14267: O2035 ^name predict-yes)
=>WM: (14266: R1021 ^value 1)
=>WM: (14265: R1 ^reward R1021)
<=WM: (14256: S1 ^operator O2033 +)
<=WM: (14257: S1 ^operator O2034 +)
<=WM: (14258: S1 ^operator O2034)
<=WM: (14252: R1 ^reward R1020)
<=WM: (14255: O2034 ^name predict-no)
<=WM: (14254: O2033 ^name predict-yes)
<=WM: (14253: R1020 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2035 = 0.2939794178406799)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2035 = -0.1254042659579056)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2036 = 0.2298580688851452)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2036 = 0.7701003386536001)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2034 = 0.2298580688851452)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2034 = 0.7701003386536001)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2033 = 0.2939794178406799)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2033 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611911 -0.382052 0.229858 -> 0.611913 -0.382052 0.229861(R,m,v=1,0.85,0.128212)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388056 0.382045 0.7701 -> 0.388059 0.382045 0.770104(R,m,v=1,1,0)
=>WM: (14271: S1 ^operator O2036)

  1018:    O: O2036 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1018 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1017 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14272: I3 ^predict-no N1018)
<=WM: (14260: N1017 ^status complete)
<=WM: (14259: I3 ^predict-no N1017)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
---- Input Phase --- 
=>WM: (14276: I2 ^dir L)
=>WM: (14275: I2 ^reward 1)
=>WM: (14274: I2 ^see 0)
=>WM: (14273: N1018 ^status complete)
<=WM: (14263: I2 ^dir R)
<=WM: (14262: I2 ^reward 1)
<=WM: (14261: I2 ^see 0)
=>WM: (14277: I2 ^level-1 R0-root)
<=WM: (14264: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2035 = 0.6195702912967189)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2036 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1022 ^value 1 +)
 (R1 ^reward R1022 +)
Firing propose*predict-yes
 -->
 (O2037 ^name predict-yes +)
 (S1 ^operator O2037 +)
Firing propose*predict-no
 -->
 (O2038 ^name predict-no +)
 (S1 ^operator O2038 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2036 = 0.3140082846697959)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2035 = 0.3804138577541756)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2036 ^name predict-no +)
 (S1 ^operator O2036 +)
Retracting propose*predict-yes
 -->
 (O2035 ^name predict-yes +)
 (S1 ^operator O2035 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1021 ^value 1 +)
 (R1 ^reward R1021 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2036 = 0.7701041764174563)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2036 = 0.2298614269306036)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2035 = -0.1254042659579056)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2035 = 0.2939794178406799)
=>WM: (14284: S1 ^operator O2038 +)
=>WM: (14283: S1 ^operator O2037 +)
=>WM: (14282: I3 ^dir L)
=>WM: (14281: O2038 ^name predict-no)
=>WM: (14280: O2037 ^name predict-yes)
=>WM: (14279: R1022 ^value 1)
=>WM: (14278: R1 ^reward R1022)
<=WM: (14269: S1 ^operator O2035 +)
<=WM: (14270: S1 ^operator O2036 +)
<=WM: (14271: S1 ^operator O2036)
<=WM: (14227: I3 ^dir R)
<=WM: (14265: R1 ^reward R1021)
<=WM: (14268: O2036 ^name predict-no)
<=WM: (14267: O2035 ^name predict-yes)
<=WM: (14266: R1021 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2037 = 0.6195702912967189)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2037 = 0.3804138577541756)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2038 = -0.2190661556260421)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2038 = 0.3140082846697959)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2036 = 0.3140082846697959)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2036 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2035 = 0.3804138577541756)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2035 = 0.6195702912967189)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611913 -0.382052 0.229861 -> 0.611915 -0.382051 0.229864(R,m,v=1,0.850829,0.127624)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388059 0.382045 0.770104 -> 0.388061 0.382046 0.770107(R,m,v=1,1,0)
=>WM: (14285: S1 ^operator O2037)

  1019:    O: O2037 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1019 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1018 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14286: I3 ^predict-yes N1019)
<=WM: (14273: N1018 ^status complete)
<=WM: (14272: I3 ^predict-no N1018)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14290: I2 ^dir U)
=>WM: (14289: I2 ^reward 1)
=>WM: (14288: I2 ^see 1)
=>WM: (14287: N1019 ^status complete)
<=WM: (14276: I2 ^dir L)
<=WM: (14275: I2 ^reward 1)
<=WM: (14274: I2 ^see 0)
=>WM: (14291: I2 ^level-1 L1-root)
<=WM: (14277: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1023 ^value 1 +)
 (R1 ^reward R1023 +)
Firing propose*predict-yes
 -->
 (O2039 ^name predict-yes +)
 (S1 ^operator O2039 +)
Firing propose*predict-no
 -->
 (O2040 ^name predict-no +)
 (S1 ^operator O2040 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2038 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2037 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2038 ^name predict-no +)
 (S1 ^operator O2038 +)
Retracting propose*predict-yes
 -->
 (O2037 ^name predict-yes +)
 (S1 ^operator O2037 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1022 ^value 1 +)
 (R1 ^reward R1022 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2038 = 0.3140082846697959)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2038 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2037 = 0.3804138577541756)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2037 = 0.6195702912967189)
=>WM: (14299: S1 ^operator O2040 +)
=>WM: (14298: S1 ^operator O2039 +)
=>WM: (14297: I3 ^dir U)
=>WM: (14296: O2040 ^name predict-no)
=>WM: (14295: O2039 ^name predict-yes)
=>WM: (14294: R1023 ^value 1)
=>WM: (14293: R1 ^reward R1023)
=>WM: (14292: I3 ^see 1)
<=WM: (14283: S1 ^operator O2037 +)
<=WM: (14285: S1 ^operator O2037)
<=WM: (14284: S1 ^operator O2038 +)
<=WM: (14282: I3 ^dir L)
<=WM: (14278: R1 ^reward R1022)
<=WM: (14251: I3 ^see 0)
<=WM: (14281: O2038 ^name predict-no)
<=WM: (14280: O2037 ^name predict-yes)
<=WM: (14279: R1022 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2039 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2040 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2038 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2037 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521345 -0.14093 0.380415(R,m,v=1,0.83432,0.139053)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478639 0.140931 0.61957 -> 0.478641 0.140931 0.619572(R,m,v=1,1,0)
=>WM: (14300: S1 ^operator O2040)

  1020:    O: O2040 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1020 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1019 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14301: I3 ^predict-no N1020)
<=WM: (14287: N1019 ^status complete)
<=WM: (14286: I3 ^predict-yes N1019)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14305: I2 ^dir U)
=>WM: (14304: I2 ^reward 1)
=>WM: (14303: I2 ^see 0)
=>WM: (14302: N1020 ^status complete)
<=WM: (14290: I2 ^dir U)
<=WM: (14289: I2 ^reward 1)
<=WM: (14288: I2 ^see 1)
=>WM: (14306: I2 ^level-1 L1-root)
<=WM: (14291: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1024 ^value 1 +)
 (R1 ^reward R1024 +)
Firing propose*predict-yes
 -->
 (O2041 ^name predict-yes +)
 (S1 ^operator O2041 +)
Firing propose*predict-no
 -->
 (O2042 ^name predict-no +)
 (S1 ^operator O2042 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2040 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2039 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2040 ^name predict-no +)
 (S1 ^operator O2040 +)
Retracting propose*predict-yes
 -->
 (O2039 ^name predict-yes +)
 (S1 ^operator O2039 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1023 ^value 1 +)
 (R1 ^reward R1023 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2040 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2039 = 0.)
=>WM: (14313: S1 ^operator O2042 +)
=>WM: (14312: S1 ^operator O2041 +)
=>WM: (14311: O2042 ^name predict-no)
=>WM: (14310: O2041 ^name predict-yes)
=>WM: (14309: R1024 ^value 1)
=>WM: (14308: R1 ^reward R1024)
=>WM: (14307: I3 ^see 0)
<=WM: (14298: S1 ^operator O2039 +)
<=WM: (14299: S1 ^operator O2040 +)
<=WM: (14300: S1 ^operator O2040)
<=WM: (14293: R1 ^reward R1023)
<=WM: (14292: I3 ^see 1)
<=WM: (14296: O2040 ^name predict-no)
<=WM: (14295: O2039 ^name predict-yes)
<=WM: (14294: R1023 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2041 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2042 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2040 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2039 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14314: S1 ^operator O2042)

  1021:    O: O2042 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1021 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1020 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14315: I3 ^predict-no N1021)
<=WM: (14302: N1020 ^status complete)
<=WM: (14301: I3 ^predict-no N1020)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\--- Input Phase --- 
=>WM: (14319: I2 ^dir R)
=>WM: (14318: I2 ^reward 1)
=>WM: (14317: I2 ^see 0)
=>WM: (14316: N1021 ^status complete)
<=WM: (14305: I2 ^dir U)
<=WM: (14304: I2 ^reward 1)
<=WM: (14303: I2 ^see 0)
=>WM: (14320: I2 ^level-1 L1-root)
<=WM: (14306: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2041 = 0.7062713203494733)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2042 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1025 ^value 1 +)
 (R1 ^reward R1025 +)
Firing propose*predict-yes
 -->
 (O2043 ^name predict-yes +)
 (S1 ^operator O2043 +)
Firing propose*predict-no
 -->
 (O2044 ^name predict-no +)
 (S1 ^operator O2044 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2042 = 0.229864201526749)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2041 = 0.2939794178406799)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2042 ^name predict-no +)
 (S1 ^operator O2042 +)
Retracting propose*predict-yes
 -->
 (O2041 ^name predict-yes +)
 (S1 ^operator O2041 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1024 ^value 1 +)
 (R1 ^reward R1024 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2042 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2041 = 0.)
=>WM: (14327: S1 ^operator O2044 +)
=>WM: (14326: S1 ^operator O2043 +)
=>WM: (14325: I3 ^dir R)
=>WM: (14324: O2044 ^name predict-no)
=>WM: (14323: O2043 ^name predict-yes)
=>WM: (14322: R1025 ^value 1)
=>WM: (14321: R1 ^reward R1025)
<=WM: (14312: S1 ^operator O2041 +)
<=WM: (14313: S1 ^operator O2042 +)
<=WM: (14314: S1 ^operator O2042)
<=WM: (14297: I3 ^dir U)
<=WM: (14308: R1 ^reward R1024)
<=WM: (14311: O2042 ^name predict-no)
<=WM: (14310: O2041 ^name predict-yes)
<=WM: (14309: R1024 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2043 = 0.7062713203494733)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2043 = 0.2939794178406799)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2044 = -0.1937987592593187)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2044 = 0.229864201526749)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2042 = 0.229864201526749)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2042 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2041 = 0.2939794178406799)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2041 = 0.7062713203494733)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14328: S1 ^operator O2043)

  1022:    O: O2043 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1022 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1021 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14329: I3 ^predict-yes N1022)
<=WM: (14316: N1021 ^status complete)
<=WM: (14315: I3 ^predict-no N1021)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14333: I2 ^dir L)
=>WM: (14332: I2 ^reward 1)
=>WM: (14331: I2 ^see 1)
=>WM: (14330: N1022 ^status complete)
<=WM: (14319: I2 ^dir R)
<=WM: (14318: I2 ^reward 1)
<=WM: (14317: I2 ^see 0)
=>WM: (14334: I2 ^level-1 R1-root)
<=WM: (14320: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2043 = 0.6196052772291735)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2044 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1026 ^value 1 +)
 (R1 ^reward R1026 +)
Firing propose*predict-yes
 -->
 (O2045 ^name predict-yes +)
 (S1 ^operator O2045 +)
Firing propose*predict-no
 -->
 (O2046 ^name predict-no +)
 (S1 ^operator O2046 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2044 = 0.3140082846697959)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2043 = 0.3804151506751392)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2044 ^name predict-no +)
 (S1 ^operator O2044 +)
Retracting propose*predict-yes
 -->
 (O2043 ^name predict-yes +)
 (S1 ^operator O2043 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1025 ^value 1 +)
 (R1 ^reward R1025 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2044 = 0.229864201526749)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2044 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2043 = 0.2939794178406799)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2043 = 0.7062713203494733)
=>WM: (14342: S1 ^operator O2046 +)
=>WM: (14341: S1 ^operator O2045 +)
=>WM: (14340: I3 ^dir L)
=>WM: (14339: O2046 ^name predict-no)
=>WM: (14338: O2045 ^name predict-yes)
=>WM: (14337: R1026 ^value 1)
=>WM: (14336: R1 ^reward R1026)
=>WM: (14335: I3 ^see 1)
<=WM: (14326: S1 ^operator O2043 +)
<=WM: (14328: S1 ^operator O2043)
<=WM: (14327: S1 ^operator O2044 +)
<=WM: (14325: I3 ^dir R)
<=WM: (14321: R1 ^reward R1025)
<=WM: (14307: I3 ^see 0)
<=WM: (14324: O2044 ^name predict-no)
<=WM: (14323: O2043 ^name predict-yes)
<=WM: (14322: R1025 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2045 = 0.3804151506751392)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2045 = 0.6196052772291735)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2046 = 0.3140082846697959)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2046 = -0.1479504104026684)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2044 = 0.3140082846697959)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2044 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2043 = 0.3804151506751392)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2043 = 0.6196052772291735)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501055 -0.207076 0.293979 -> 0.501037 -0.207078 0.293959(R,m,v=1,0.85,0.128302)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499171 0.2071 0.706271 -> 0.499149 0.207098 0.706247(R,m,v=1,1,0)
=>WM: (14343: S1 ^operator O2045)

  1023:    O: O2045 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1023 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1022 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14344: I3 ^predict-yes N1023)
<=WM: (14330: N1022 ^status complete)
<=WM: (14329: I3 ^predict-yes N1022)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14348: I2 ^dir L)
=>WM: (14347: I2 ^reward 1)
=>WM: (14346: I2 ^see 1)
=>WM: (14345: N1023 ^status complete)
<=WM: (14333: I2 ^dir L)
<=WM: (14332: I2 ^reward 1)
<=WM: (14331: I2 ^see 1)
=>WM: (14349: I2 ^level-1 L1-root)
<=WM: (14334: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2045 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2046 = 0.6861042492871868)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1027 ^value 1 +)
 (R1 ^reward R1027 +)
Firing propose*predict-yes
 -->
 (O2047 ^name predict-yes +)
 (S1 ^operator O2047 +)
Firing propose*predict-no
 -->
 (O2048 ^name predict-no +)
 (S1 ^operator O2048 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2046 = 0.3140082846697959)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2045 = 0.3804151506751392)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2046 ^name predict-no +)
 (S1 ^operator O2046 +)
Retracting propose*predict-yes
 -->
 (O2045 ^name predict-yes +)
 (S1 ^operator O2045 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1026 ^value 1 +)
 (R1 ^reward R1026 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2046 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2046 = 0.3140082846697959)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2045 = 0.6196052772291735)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2045 = 0.3804151506751392)
=>WM: (14355: S1 ^operator O2048 +)
=>WM: (14354: S1 ^operator O2047 +)
=>WM: (14353: O2048 ^name predict-no)
=>WM: (14352: O2047 ^name predict-yes)
=>WM: (14351: R1027 ^value 1)
=>WM: (14350: R1 ^reward R1027)
<=WM: (14341: S1 ^operator O2045 +)
<=WM: (14343: S1 ^operator O2045)
<=WM: (14342: S1 ^operator O2046 +)
<=WM: (14336: R1 ^reward R1026)
<=WM: (14339: O2046 ^name predict-no)
<=WM: (14338: O2045 ^name predict-yes)
<=WM: (14337: R1026 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2047 = 0.3804151506751392)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2047 = -0.3470159027404986)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2048 = 0.3140082846697959)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2048 = 0.6861042492871868)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2046 = 0.3140082846697959)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2046 = 0.6861042492871868)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2045 = 0.3804151506751392)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2045 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521345 -0.14093 0.380415 -> 0.521343 -0.14093 0.380413(R,m,v=1,0.835294,0.138392)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478677 0.140928 0.619605 -> 0.478675 0.140928 0.619603(R,m,v=1,1,0)
=>WM: (14356: S1 ^operator O2048)

  1024:    O: O2048 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1024 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1023 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14357: I3 ^predict-no N1024)
<=WM: (14345: N1023 ^status complete)
<=WM: (14344: I3 ^predict-yes N1023)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14361: I2 ^dir U)
=>WM: (14360: I2 ^reward 1)
=>WM: (14359: I2 ^see 0)
=>WM: (14358: N1024 ^status complete)
<=WM: (14348: I2 ^dir L)
<=WM: (14347: I2 ^reward 1)
<=WM: (14346: I2 ^see 1)
=>WM: (14362: I2 ^level-1 L0-root)
<=WM: (14349: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1028 ^value 1 +)
 (R1 ^reward R1028 +)
Firing propose*predict-yes
 -->
 (O2049 ^name predict-yes +)
 (S1 ^operator O2049 +)
Firing propose*predict-no
 -->
 (O2050 ^name predict-no +)
 (S1 ^operator O2050 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2048 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2047 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2048 ^name predict-no +)
 (S1 ^operator O2048 +)
Retracting propose*predict-yes
 -->
 (O2047 ^name predict-yes +)
 (S1 ^operator O2047 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1027 ^value 1 +)
 (R1 ^reward R1027 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2048 = 0.6861042492871868)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2048 = 0.3140082846697959)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2047 = -0.3470159027404986)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2047 = 0.3804134860259072)
=>WM: (14370: S1 ^operator O2050 +)
=>WM: (14369: S1 ^operator O2049 +)
=>WM: (14368: I3 ^dir U)
=>WM: (14367: O2050 ^name predict-no)
=>WM: (14366: O2049 ^name predict-yes)
=>WM: (14365: R1028 ^value 1)
=>WM: (14364: R1 ^reward R1028)
=>WM: (14363: I3 ^see 0)
<=WM: (14354: S1 ^operator O2047 +)
<=WM: (14355: S1 ^operator O2048 +)
<=WM: (14356: S1 ^operator O2048)
<=WM: (14340: I3 ^dir L)
<=WM: (14350: R1 ^reward R1027)
<=WM: (14335: I3 ^see 1)
<=WM: (14353: O2048 ^name predict-no)
<=WM: (14352: O2047 ^name predict-yes)
<=WM: (14351: R1027 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2049 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2050 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2048 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2047 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485021 -0.171013 0.314008 -> 0.485014 -0.171015 0.313999(R,m,v=1,0.864516,0.117889)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515068 0.171036 0.686104 -> 0.515059 0.171034 0.686093(R,m,v=1,1,0)
=>WM: (14371: S1 ^operator O2050)

  1025:    O: O2050 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1025 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1024 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14372: I3 ^predict-no N1025)
<=WM: (14358: N1024 ^status complete)
<=WM: (14357: I3 ^predict-no N1024)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14376: I2 ^dir R)
=>WM: (14375: I2 ^reward 1)
=>WM: (14374: I2 ^see 0)
=>WM: (14373: N1025 ^status complete)
<=WM: (14361: I2 ^dir U)
<=WM: (14360: I2 ^reward 1)
<=WM: (14359: I2 ^see 0)
=>WM: (14377: I2 ^level-1 L0-root)
<=WM: (14362: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2049 = 0.70565863259984)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2050 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1029 ^value 1 +)
 (R1 ^reward R1029 +)
Firing propose*predict-yes
 -->
 (O2051 ^name predict-yes +)
 (S1 ^operator O2051 +)
Firing propose*predict-no
 -->
 (O2052 ^name predict-no +)
 (S1 ^operator O2052 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2050 = 0.229864201526749)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2049 = 0.2939587815430382)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2050 ^name predict-no +)
 (S1 ^operator O2050 +)
Retracting propose*predict-yes
 -->
 (O2049 ^name predict-yes +)
 (S1 ^operator O2049 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1028 ^value 1 +)
 (R1 ^reward R1028 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2050 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2049 = 0.)
=>WM: (14384: S1 ^operator O2052 +)
=>WM: (14383: S1 ^operator O2051 +)
=>WM: (14382: I3 ^dir R)
=>WM: (14381: O2052 ^name predict-no)
=>WM: (14380: O2051 ^name predict-yes)
=>WM: (14379: R1029 ^value 1)
=>WM: (14378: R1 ^reward R1029)
<=WM: (14369: S1 ^operator O2049 +)
<=WM: (14370: S1 ^operator O2050 +)
<=WM: (14371: S1 ^operator O2050)
<=WM: (14368: I3 ^dir U)
<=WM: (14364: R1 ^reward R1028)
<=WM: (14367: O2050 ^name predict-no)
<=WM: (14366: O2049 ^name predict-yes)
<=WM: (14365: R1028 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2051 = 0.70565863259984)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2051 = 0.2939587815430382)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2052 = -0.2023211881870005)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2052 = 0.229864201526749)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2050 = 0.229864201526749)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2050 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2049 = 0.2939587815430382)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2049 = 0.70565863259984)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14385: S1 ^operator O2051)

  1026:    O: O2051 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1026 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1025 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14386: I3 ^predict-yes N1026)
<=WM: (14373: N1025 ^status complete)
<=WM: (14372: I3 ^predict-no N1025)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14390: I2 ^dir U)
=>WM: (14389: I2 ^reward 1)
=>WM: (14388: I2 ^see 1)
=>WM: (14387: N1026 ^status complete)
<=WM: (14376: I2 ^dir R)
<=WM: (14375: I2 ^reward 1)
<=WM: (14374: I2 ^see 0)
=>WM: (14391: I2 ^level-1 R1-root)
<=WM: (14377: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1030 ^value 1 +)
 (R1 ^reward R1030 +)
Firing propose*predict-yes
 -->
 (O2053 ^name predict-yes +)
 (S1 ^operator O2053 +)
Firing propose*predict-no
 -->
 (O2054 ^name predict-no +)
 (S1 ^operator O2054 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2052 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2051 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2052 ^name predict-no +)
 (S1 ^operator O2052 +)
Retracting propose*predict-yes
 -->
 (O2051 ^name predict-yes +)
 (S1 ^operator O2051 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1029 ^value 1 +)
 (R1 ^reward R1029 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2052 = 0.229864201526749)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2052 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2051 = 0.2939587815430382)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2051 = 0.70565863259984)
=>WM: (14399: S1 ^operator O2054 +)
=>WM: (14398: S1 ^operator O2053 +)
=>WM: (14397: I3 ^dir U)
=>WM: (14396: O2054 ^name predict-no)
=>WM: (14395: O2053 ^name predict-yes)
=>WM: (14394: R1030 ^value 1)
=>WM: (14393: R1 ^reward R1030)
=>WM: (14392: I3 ^see 1)
<=WM: (14383: S1 ^operator O2051 +)
<=WM: (14385: S1 ^operator O2051)
<=WM: (14384: S1 ^operator O2052 +)
<=WM: (14382: I3 ^dir R)
<=WM: (14378: R1 ^reward R1029)
<=WM: (14363: I3 ^see 0)
<=WM: (14381: O2052 ^name predict-no)
<=WM: (14380: O2051 ^name predict-yes)
<=WM: (14379: R1029 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2053 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2054 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2052 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2051 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501037 -0.207078 0.293959 -> 0.501065 -0.207075 0.29399(R,m,v=1,0.850932,0.12764)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498617 0.207041 0.705659 -> 0.498651 0.207045 0.705696(R,m,v=1,1,0)
=>WM: (14400: S1 ^operator O2054)

  1027:    O: O2054 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1027 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1026 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14401: I3 ^predict-no N1027)
<=WM: (14387: N1026 ^status complete)
<=WM: (14386: I3 ^predict-yes N1026)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14405: I2 ^dir U)
=>WM: (14404: I2 ^reward 1)
=>WM: (14403: I2 ^see 0)
=>WM: (14402: N1027 ^status complete)
<=WM: (14390: I2 ^dir U)
<=WM: (14389: I2 ^reward 1)
<=WM: (14388: I2 ^see 1)
=>WM: (14406: I2 ^level-1 R1-root)
<=WM: (14391: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1031 ^value 1 +)
 (R1 ^reward R1031 +)
Firing propose*predict-yes
 -->
 (O2055 ^name predict-yes +)
 (S1 ^operator O2055 +)
Firing propose*predict-no
 -->
 (O2056 ^name predict-no +)
 (S1 ^operator O2056 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2054 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2053 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2054 ^name predict-no +)
 (S1 ^operator O2054 +)
Retracting propose*predict-yes
 -->
 (O2053 ^name predict-yes +)
 (S1 ^operator O2053 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1030 ^value 1 +)
 (R1 ^reward R1030 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2054 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2053 = 0.)
=>WM: (14413: S1 ^operator O2056 +)
=>WM: (14412: S1 ^operator O2055 +)
=>WM: (14411: O2056 ^name predict-no)
=>WM: (14410: O2055 ^name predict-yes)
=>WM: (14409: R1031 ^value 1)
=>WM: (14408: R1 ^reward R1031)
=>WM: (14407: I3 ^see 0)
<=WM: (14398: S1 ^operator O2053 +)
<=WM: (14399: S1 ^operator O2054 +)
<=WM: (14400: S1 ^operator O2054)
<=WM: (14393: R1 ^reward R1030)
<=WM: (14392: I3 ^see 1)
<=WM: (14396: O2054 ^name predict-no)
<=WM: (14395: O2053 ^name predict-yes)
<=WM: (14394: R1030 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2055 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2056 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2054 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2053 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14414: S1 ^operator O2056)

  1028:    O: O2056 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1028 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1027 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14415: I3 ^predict-no N1028)
<=WM: (14402: N1027 ^status complete)
<=WM: (14401: I3 ^predict-no N1027)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14419: I2 ^dir L)
=>WM: (14418: I2 ^reward 1)
=>WM: (14417: I2 ^see 0)
=>WM: (14416: N1028 ^status complete)
<=WM: (14405: I2 ^dir U)
<=WM: (14404: I2 ^reward 1)
<=WM: (14403: I2 ^see 0)
=>WM: (14420: I2 ^level-1 R1-root)
<=WM: (14406: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2055 = 0.6196033311566926)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2056 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1032 ^value 1 +)
 (R1 ^reward R1032 +)
Firing propose*predict-yes
 -->
 (O2057 ^name predict-yes +)
 (S1 ^operator O2057 +)
Firing propose*predict-no
 -->
 (O2058 ^name predict-no +)
 (S1 ^operator O2058 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2056 = 0.313998974224576)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2055 = 0.3804134860259072)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2056 ^name predict-no +)
 (S1 ^operator O2056 +)
Retracting propose*predict-yes
 -->
 (O2055 ^name predict-yes +)
 (S1 ^operator O2055 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1031 ^value 1 +)
 (R1 ^reward R1031 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2056 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2055 = 0.)
=>WM: (14427: S1 ^operator O2058 +)
=>WM: (14426: S1 ^operator O2057 +)
=>WM: (14425: I3 ^dir L)
=>WM: (14424: O2058 ^name predict-no)
=>WM: (14423: O2057 ^name predict-yes)
=>WM: (14422: R1032 ^value 1)
=>WM: (14421: R1 ^reward R1032)
<=WM: (14412: S1 ^operator O2055 +)
<=WM: (14413: S1 ^operator O2056 +)
<=WM: (14414: S1 ^operator O2056)
<=WM: (14397: I3 ^dir U)
<=WM: (14408: R1 ^reward R1031)
<=WM: (14411: O2056 ^name predict-no)
<=WM: (14410: O2055 ^name predict-yes)
<=WM: (14409: R1031 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2057 = 0.6196033311566926)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2057 = 0.3804134860259072)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2058 = -0.1479504104026684)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2058 = 0.313998974224576)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2056 = 0.313998974224576)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2056 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2055 = 0.3804134860259072)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2055 = 0.6196033311566926)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14428: S1 ^operator O2057)

  1029:    O: O2057 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1029 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1028 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14429: I3 ^predict-yes N1029)
<=WM: (14416: N1028 ^status complete)
<=WM: (14415: I3 ^predict-no N1028)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14433: I2 ^dir R)
=>WM: (14432: I2 ^reward 1)
=>WM: (14431: I2 ^see 1)
=>WM: (14430: N1029 ^status complete)
<=WM: (14419: I2 ^dir L)
<=WM: (14418: I2 ^reward 1)
<=WM: (14417: I2 ^see 0)
=>WM: (14434: I2 ^level-1 L1-root)
<=WM: (14420: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2057 = 0.7062472326455022)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2058 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1033 ^value 1 +)
 (R1 ^reward R1033 +)
Firing propose*predict-yes
 -->
 (O2059 ^name predict-yes +)
 (S1 ^operator O2059 +)
Firing propose*predict-no
 -->
 (O2060 ^name predict-no +)
 (S1 ^operator O2060 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2058 = 0.229864201526749)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2057 = 0.2939902369301627)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2058 ^name predict-no +)
 (S1 ^operator O2058 +)
Retracting propose*predict-yes
 -->
 (O2057 ^name predict-yes +)
 (S1 ^operator O2057 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1032 ^value 1 +)
 (R1 ^reward R1032 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2058 = 0.313998974224576)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2058 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2057 = 0.3804134860259072)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2057 = 0.6196033311566926)
=>WM: (14442: S1 ^operator O2060 +)
=>WM: (14441: S1 ^operator O2059 +)
=>WM: (14440: I3 ^dir R)
=>WM: (14439: O2060 ^name predict-no)
=>WM: (14438: O2059 ^name predict-yes)
=>WM: (14437: R1033 ^value 1)
=>WM: (14436: R1 ^reward R1033)
=>WM: (14435: I3 ^see 1)
<=WM: (14426: S1 ^operator O2057 +)
<=WM: (14428: S1 ^operator O2057)
<=WM: (14427: S1 ^operator O2058 +)
<=WM: (14425: I3 ^dir L)
<=WM: (14421: R1 ^reward R1032)
<=WM: (14407: I3 ^see 0)
<=WM: (14424: O2058 ^name predict-no)
<=WM: (14423: O2057 ^name predict-yes)
<=WM: (14422: R1032 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2059 = 0.2939902369301627)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2059 = 0.7062472326455022)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2060 = 0.229864201526749)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2060 = -0.1937987592593187)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2058 = 0.229864201526749)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2058 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2057 = 0.2939902369301627)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2057 = 0.7062472326455022)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521343 -0.14093 0.380413 -> 0.521342 -0.14093 0.380412(R,m,v=1,0.836257,0.137736)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478675 0.140928 0.619603 -> 0.478673 0.140928 0.619602(R,m,v=1,1,0)
=>WM: (14443: S1 ^operator O2059)

  1030:    O: O2059 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1030 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1029 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14444: I3 ^predict-yes N1030)
<=WM: (14430: N1029 ^status complete)
<=WM: (14429: I3 ^predict-yes N1029)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14448: I2 ^dir L)
=>WM: (14447: I2 ^reward 1)
=>WM: (14446: I2 ^see 1)
=>WM: (14445: N1030 ^status complete)
<=WM: (14433: I2 ^dir R)
<=WM: (14432: I2 ^reward 1)
<=WM: (14431: I2 ^see 1)
=>WM: (14449: I2 ^level-1 R1-root)
<=WM: (14434: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2059 = 0.6196017333792301)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2060 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1034 ^value 1 +)
 (R1 ^reward R1034 +)
Firing propose*predict-yes
 -->
 (O2061 ^name predict-yes +)
 (S1 ^operator O2061 +)
Firing propose*predict-no
 -->
 (O2062 ^name predict-no +)
 (S1 ^operator O2062 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2060 = 0.313998974224576)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2059 = 0.380412116919439)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2060 ^name predict-no +)
 (S1 ^operator O2060 +)
Retracting propose*predict-yes
 -->
 (O2059 ^name predict-yes +)
 (S1 ^operator O2059 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1033 ^value 1 +)
 (R1 ^reward R1033 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2060 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2060 = 0.229864201526749)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2059 = 0.7062472326455022)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2059 = 0.2939902369301627)
=>WM: (14456: S1 ^operator O2062 +)
=>WM: (14455: S1 ^operator O2061 +)
=>WM: (14454: I3 ^dir L)
=>WM: (14453: O2062 ^name predict-no)
=>WM: (14452: O2061 ^name predict-yes)
=>WM: (14451: R1034 ^value 1)
=>WM: (14450: R1 ^reward R1034)
<=WM: (14441: S1 ^operator O2059 +)
<=WM: (14443: S1 ^operator O2059)
<=WM: (14442: S1 ^operator O2060 +)
<=WM: (14440: I3 ^dir R)
<=WM: (14436: R1 ^reward R1033)
<=WM: (14439: O2060 ^name predict-no)
<=WM: (14438: O2059 ^name predict-yes)
<=WM: (14437: R1033 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2061 = 0.380412116919439)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2061 = 0.6196017333792301)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2062 = 0.313998974224576)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2062 = -0.1479504104026684)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2060 = 0.313998974224576)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2060 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2059 = 0.380412116919439)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2059 = 0.6196017333792301)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501065 -0.207075 0.29399 -> 0.501047 -0.207077 0.293971(R,m,v=1,0.851852,0.126984)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499149 0.207098 0.706247 -> 0.499129 0.207096 0.706224(R,m,v=1,1,0)
=>WM: (14457: S1 ^operator O2061)

  1031:    O: O2061 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1031 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1030 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14458: I3 ^predict-yes N1031)
<=WM: (14445: N1030 ^status complete)
<=WM: (14444: I3 ^predict-yes N1030)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\--- Input Phase --- 
=>WM: (14462: I2 ^dir L)
=>WM: (14461: I2 ^reward 1)
=>WM: (14460: I2 ^see 1)
=>WM: (14459: N1031 ^status complete)
<=WM: (14448: I2 ^dir L)
<=WM: (14447: I2 ^reward 1)
<=WM: (14446: I2 ^see 1)
=>WM: (14463: I2 ^level-1 L1-root)
<=WM: (14449: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2061 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2062 = 0.6860933424731377)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1035 ^value 1 +)
 (R1 ^reward R1035 +)
Firing propose*predict-yes
 -->
 (O2063 ^name predict-yes +)
 (S1 ^operator O2063 +)
Firing propose*predict-no
 -->
 (O2064 ^name predict-no +)
 (S1 ^operator O2064 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2062 = 0.313998974224576)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2061 = 0.380412116919439)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2062 ^name predict-no +)
 (S1 ^operator O2062 +)
Retracting propose*predict-yes
 -->
 (O2061 ^name predict-yes +)
 (S1 ^operator O2061 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1034 ^value 1 +)
 (R1 ^reward R1034 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2062 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2062 = 0.313998974224576)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2061 = 0.6196017333792301)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2061 = 0.380412116919439)
=>WM: (14469: S1 ^operator O2064 +)
=>WM: (14468: S1 ^operator O2063 +)
=>WM: (14467: O2064 ^name predict-no)
=>WM: (14466: O2063 ^name predict-yes)
=>WM: (14465: R1035 ^value 1)
=>WM: (14464: R1 ^reward R1035)
<=WM: (14455: S1 ^operator O2061 +)
<=WM: (14457: S1 ^operator O2061)
<=WM: (14456: S1 ^operator O2062 +)
<=WM: (14450: R1 ^reward R1034)
<=WM: (14453: O2062 ^name predict-no)
<=WM: (14452: O2061 ^name predict-yes)
<=WM: (14451: R1034 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2063 = 0.380412116919439)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2063 = -0.3470159027404986)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2064 = 0.313998974224576)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2064 = 0.6860933424731377)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2062 = 0.313998974224576)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2062 = 0.6860933424731377)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2061 = 0.380412116919439)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2061 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521342 -0.14093 0.380412 -> 0.521341 -0.14093 0.380411(R,m,v=1,0.837209,0.137087)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478673 0.140928 0.619602 -> 0.478672 0.140929 0.6196(R,m,v=1,1,0)
=>WM: (14470: S1 ^operator O2064)

  1032:    O: O2064 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1032 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1031 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14471: I3 ^predict-no N1032)
<=WM: (14459: N1031 ^status complete)
<=WM: (14458: I3 ^predict-yes N1031)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14475: I2 ^dir L)
=>WM: (14474: I2 ^reward 1)
=>WM: (14473: I2 ^see 0)
=>WM: (14472: N1032 ^status complete)
<=WM: (14462: I2 ^dir L)
<=WM: (14461: I2 ^reward 1)
<=WM: (14460: I2 ^see 1)
=>WM: (14476: I2 ^level-1 L0-root)
<=WM: (14463: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2063 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2064 = 0.6857963029033564)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1036 ^value 1 +)
 (R1 ^reward R1036 +)
Firing propose*predict-yes
 -->
 (O2065 ^name predict-yes +)
 (S1 ^operator O2065 +)
Firing propose*predict-no
 -->
 (O2066 ^name predict-no +)
 (S1 ^operator O2066 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2064 = 0.313998974224576)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2063 = 0.3804109904199586)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2064 ^name predict-no +)
 (S1 ^operator O2064 +)
Retracting propose*predict-yes
 -->
 (O2063 ^name predict-yes +)
 (S1 ^operator O2063 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1035 ^value 1 +)
 (R1 ^reward R1035 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2064 = 0.6860933424731377)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2064 = 0.313998974224576)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2063 = -0.3470159027404986)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2063 = 0.3804109904199586)
=>WM: (14483: S1 ^operator O2066 +)
=>WM: (14482: S1 ^operator O2065 +)
=>WM: (14481: O2066 ^name predict-no)
=>WM: (14480: O2065 ^name predict-yes)
=>WM: (14479: R1036 ^value 1)
=>WM: (14478: R1 ^reward R1036)
=>WM: (14477: I3 ^see 0)
<=WM: (14468: S1 ^operator O2063 +)
<=WM: (14469: S1 ^operator O2064 +)
<=WM: (14470: S1 ^operator O2064)
<=WM: (14464: R1 ^reward R1035)
<=WM: (14435: I3 ^see 1)
<=WM: (14467: O2064 ^name predict-no)
<=WM: (14466: O2063 ^name predict-yes)
<=WM: (14465: R1035 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2065 = 0.3804109904199586)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2065 = -0.3332708974800781)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2066 = 0.313998974224576)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2066 = 0.6857963029033564)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2064 = 0.313998974224576)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2064 = 0.6857963029033564)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2063 = 0.3804109904199586)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2063 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485014 -0.171015 0.313999 -> 0.485008 -0.171016 0.313991(R,m,v=1,0.865385,0.117246)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515059 0.171034 0.686093 -> 0.515052 0.171032 0.686084(R,m,v=1,1,0)
=>WM: (14484: S1 ^operator O2066)

  1033:    O: O2066 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1033 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1032 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14485: I3 ^predict-no N1033)
<=WM: (14472: N1032 ^status complete)
<=WM: (14471: I3 ^predict-no N1032)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14489: I2 ^dir L)
=>WM: (14488: I2 ^reward 1)
=>WM: (14487: I2 ^see 0)
=>WM: (14486: N1033 ^status complete)
<=WM: (14475: I2 ^dir L)
<=WM: (14474: I2 ^reward 1)
<=WM: (14473: I2 ^see 0)
=>WM: (14490: I2 ^level-1 L0-root)
<=WM: (14476: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2065 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2066 = 0.6857963029033564)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1037 ^value 1 +)
 (R1 ^reward R1037 +)
Firing propose*predict-yes
 -->
 (O2067 ^name predict-yes +)
 (S1 ^operator O2067 +)
Firing propose*predict-no
 -->
 (O2068 ^name predict-no +)
 (S1 ^operator O2068 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2066 = 0.3139913445638368)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2065 = 0.3804109904199586)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2066 ^name predict-no +)
 (S1 ^operator O2066 +)
Retracting propose*predict-yes
 -->
 (O2065 ^name predict-yes +)
 (S1 ^operator O2065 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1036 ^value 1 +)
 (R1 ^reward R1036 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2066 = 0.6857963029033564)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2066 = 0.3139913445638368)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2065 = -0.3332708974800781)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2065 = 0.3804109904199586)
=>WM: (14496: S1 ^operator O2068 +)
=>WM: (14495: S1 ^operator O2067 +)
=>WM: (14494: O2068 ^name predict-no)
=>WM: (14493: O2067 ^name predict-yes)
=>WM: (14492: R1037 ^value 1)
=>WM: (14491: R1 ^reward R1037)
<=WM: (14482: S1 ^operator O2065 +)
<=WM: (14483: S1 ^operator O2066 +)
<=WM: (14484: S1 ^operator O2066)
<=WM: (14478: R1 ^reward R1036)
<=WM: (14481: O2066 ^name predict-no)
<=WM: (14480: O2065 ^name predict-yes)
<=WM: (14479: R1036 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2067 = 0.3804109904199586)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2067 = -0.3332708974800781)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2068 = 0.3139913445638368)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2068 = 0.6857963029033564)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2066 = 0.3139913445638368)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2066 = 0.6857963029033564)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2065 = 0.3804109904199586)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2065 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485008 -0.171016 0.313991 -> 0.485021 -0.171012 0.314009(R,m,v=1,0.866242,0.11661)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514825 0.170972 0.685796 -> 0.514841 0.170976 0.685817(R,m,v=1,1,0)
=>WM: (14497: S1 ^operator O2068)

  1034:    O: O2068 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1034 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1033 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14498: I3 ^predict-no N1034)
<=WM: (14486: N1033 ^status complete)
<=WM: (14485: I3 ^predict-no N1033)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14502: I2 ^dir L)
=>WM: (14501: I2 ^reward 1)
=>WM: (14500: I2 ^see 0)
=>WM: (14499: N1034 ^status complete)
<=WM: (14489: I2 ^dir L)
<=WM: (14488: I2 ^reward 1)
<=WM: (14487: I2 ^see 0)
=>WM: (14503: I2 ^level-1 L0-root)
<=WM: (14490: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2067 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2068 = 0.6858169471742246)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1038 ^value 1 +)
 (R1 ^reward R1038 +)
Firing propose*predict-yes
 -->
 (O2069 ^name predict-yes +)
 (S1 ^operator O2069 +)
Firing propose*predict-no
 -->
 (O2070 ^name predict-no +)
 (S1 ^operator O2070 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2068 = 0.3140088762608346)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2067 = 0.3804109904199586)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2068 ^name predict-no +)
 (S1 ^operator O2068 +)
Retracting propose*predict-yes
 -->
 (O2067 ^name predict-yes +)
 (S1 ^operator O2067 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1037 ^value 1 +)
 (R1 ^reward R1037 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2068 = 0.6858169471742246)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2068 = 0.3140088762608346)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2067 = -0.3332708974800781)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2067 = 0.3804109904199586)
=>WM: (14509: S1 ^operator O2070 +)
=>WM: (14508: S1 ^operator O2069 +)
=>WM: (14507: O2070 ^name predict-no)
=>WM: (14506: O2069 ^name predict-yes)
=>WM: (14505: R1038 ^value 1)
=>WM: (14504: R1 ^reward R1038)
<=WM: (14495: S1 ^operator O2067 +)
<=WM: (14496: S1 ^operator O2068 +)
<=WM: (14497: S1 ^operator O2068)
<=WM: (14491: R1 ^reward R1037)
<=WM: (14494: O2068 ^name predict-no)
<=WM: (14493: O2067 ^name predict-yes)
<=WM: (14492: R1037 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2069 = 0.3804109904199586)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2069 = -0.3332708974800781)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2070 = 0.3140088762608346)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2070 = 0.6858169471742246)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2068 = 0.3140088762608346)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2068 = 0.6858169471742246)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2067 = 0.3804109904199586)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2067 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485021 -0.171012 0.314009 -> 0.485033 -0.171009 0.314023(R,m,v=1,0.867089,0.11598)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514841 0.170976 0.685817 -> 0.514854 0.170979 0.685834(R,m,v=1,1,0)
=>WM: (14510: S1 ^operator O2070)

  1035:    O: O2070 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1035 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1034 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14511: I3 ^predict-no N1035)
<=WM: (14499: N1034 ^status complete)
<=WM: (14498: I3 ^predict-no N1034)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (14515: I2 ^dir R)
=>WM: (14514: I2 ^reward 1)
=>WM: (14513: I2 ^see 0)
=>WM: (14512: N1035 ^status complete)
<=WM: (14502: I2 ^dir L)
<=WM: (14501: I2 ^reward 1)
<=WM: (14500: I2 ^see 0)
=>WM: (14516: I2 ^level-1 L0-root)
<=WM: (14503: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2069 = 0.7056959425110291)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2070 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1039 ^value 1 +)
 (R1 ^reward R1039 +)
Firing propose*predict-yes
 -->
 (O2071 ^name predict-yes +)
 (S1 ^operator O2071 +)
Firing propose*predict-no
 -->
 (O2072 ^name predict-no +)
 (S1 ^operator O2072 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2070 = 0.229864201526749)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2069 = 0.2939707325508816)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2070 ^name predict-no +)
 (S1 ^operator O2070 +)
Retracting propose*predict-yes
 -->
 (O2069 ^name predict-yes +)
 (S1 ^operator O2069 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1038 ^value 1 +)
 (R1 ^reward R1038 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2070 = 0.6858338284024019)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2070 = 0.3140232411131785)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2069 = -0.3332708974800781)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2069 = 0.3804109904199586)
=>WM: (14523: S1 ^operator O2072 +)
=>WM: (14522: S1 ^operator O2071 +)
=>WM: (14521: I3 ^dir R)
=>WM: (14520: O2072 ^name predict-no)
=>WM: (14519: O2071 ^name predict-yes)
=>WM: (14518: R1039 ^value 1)
=>WM: (14517: R1 ^reward R1039)
<=WM: (14508: S1 ^operator O2069 +)
<=WM: (14509: S1 ^operator O2070 +)
<=WM: (14510: S1 ^operator O2070)
<=WM: (14454: I3 ^dir L)
<=WM: (14504: R1 ^reward R1038)
<=WM: (14507: O2070 ^name predict-no)
<=WM: (14506: O2069 ^name predict-yes)
<=WM: (14505: R1038 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2071 = 0.7056959425110291)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2071 = 0.2939707325508816)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2072 = -0.2023211881870005)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2072 = 0.229864201526749)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2070 = 0.229864201526749)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2070 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2069 = 0.2939707325508816)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2069 = 0.7056959425110291)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485033 -0.171009 0.314023 -> 0.485042 -0.171007 0.314035(R,m,v=1,0.867925,0.115357)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514854 0.170979 0.685834 -> 0.514865 0.170982 0.685848(R,m,v=1,1,0)
=>WM: (14524: S1 ^operator O2071)

  1036:    O: O2071 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1036 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1035 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14525: I3 ^predict-yes N1036)
<=WM: (14512: N1035 ^status complete)
<=WM: (14511: I3 ^predict-no N1035)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14529: I2 ^dir U)
=>WM: (14528: I2 ^reward 1)
=>WM: (14527: I2 ^see 1)
=>WM: (14526: N1036 ^status complete)
<=WM: (14515: I2 ^dir R)
<=WM: (14514: I2 ^reward 1)
<=WM: (14513: I2 ^see 0)
=>WM: (14530: I2 ^level-1 R1-root)
<=WM: (14516: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1040 ^value 1 +)
 (R1 ^reward R1040 +)
Firing propose*predict-yes
 -->
 (O2073 ^name predict-yes +)
 (S1 ^operator O2073 +)
Firing propose*predict-no
 -->
 (O2074 ^name predict-no +)
 (S1 ^operator O2074 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2072 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2071 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2072 ^name predict-no +)
 (S1 ^operator O2072 +)
Retracting propose*predict-yes
 -->
 (O2071 ^name predict-yes +)
 (S1 ^operator O2071 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1039 ^value 1 +)
 (R1 ^reward R1039 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2072 = 0.229864201526749)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2072 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2071 = 0.2939707325508816)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2071 = 0.7056959425110291)
=>WM: (14538: S1 ^operator O2074 +)
=>WM: (14537: S1 ^operator O2073 +)
=>WM: (14536: I3 ^dir U)
=>WM: (14535: O2074 ^name predict-no)
=>WM: (14534: O2073 ^name predict-yes)
=>WM: (14533: R1040 ^value 1)
=>WM: (14532: R1 ^reward R1040)
=>WM: (14531: I3 ^see 1)
<=WM: (14522: S1 ^operator O2071 +)
<=WM: (14524: S1 ^operator O2071)
<=WM: (14523: S1 ^operator O2072 +)
<=WM: (14521: I3 ^dir R)
<=WM: (14517: R1 ^reward R1039)
<=WM: (14477: I3 ^see 0)
<=WM: (14520: O2072 ^name predict-no)
<=WM: (14519: O2071 ^name predict-yes)
<=WM: (14518: R1039 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2073 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2074 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2072 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2071 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501047 -0.207077 0.293971 -> 0.501072 -0.207074 0.293998(R,m,v=1,0.852761,0.126335)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498651 0.207045 0.705696 -> 0.49868 0.207048 0.705728(R,m,v=1,1,0)
=>WM: (14539: S1 ^operator O2074)

  1037:    O: O2074 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1037 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1036 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14540: I3 ^predict-no N1037)
<=WM: (14526: N1036 ^status complete)
<=WM: (14525: I3 ^predict-yes N1036)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14544: I2 ^dir R)
=>WM: (14543: I2 ^reward 1)
=>WM: (14542: I2 ^see 0)
=>WM: (14541: N1037 ^status complete)
<=WM: (14529: I2 ^dir U)
<=WM: (14528: I2 ^reward 1)
<=WM: (14527: I2 ^see 1)
=>WM: (14545: I2 ^level-1 R1-root)
<=WM: (14530: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2073 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2074 = 0.7701697371568763)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1041 ^value 1 +)
 (R1 ^reward R1041 +)
Firing propose*predict-yes
 -->
 (O2075 ^name predict-yes +)
 (S1 ^operator O2075 +)
Firing propose*predict-no
 -->
 (O2076 ^name predict-no +)
 (S1 ^operator O2076 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2074 = 0.229864201526749)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2073 = 0.2939980822884902)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2074 ^name predict-no +)
 (S1 ^operator O2074 +)
Retracting propose*predict-yes
 -->
 (O2073 ^name predict-yes +)
 (S1 ^operator O2073 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1040 ^value 1 +)
 (R1 ^reward R1040 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2074 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2073 = 0.)
=>WM: (14553: S1 ^operator O2076 +)
=>WM: (14552: S1 ^operator O2075 +)
=>WM: (14551: I3 ^dir R)
=>WM: (14550: O2076 ^name predict-no)
=>WM: (14549: O2075 ^name predict-yes)
=>WM: (14548: R1041 ^value 1)
=>WM: (14547: R1 ^reward R1041)
=>WM: (14546: I3 ^see 0)
<=WM: (14537: S1 ^operator O2073 +)
<=WM: (14538: S1 ^operator O2074 +)
<=WM: (14539: S1 ^operator O2074)
<=WM: (14536: I3 ^dir U)
<=WM: (14532: R1 ^reward R1040)
<=WM: (14531: I3 ^see 1)
<=WM: (14535: O2074 ^name predict-no)
<=WM: (14534: O2073 ^name predict-yes)
<=WM: (14533: R1040 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2075 = -0.252585164213872)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2075 = 0.2939980822884902)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2076 = 0.7701697371568763)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2076 = 0.229864201526749)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2074 = 0.229864201526749)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2074 = 0.7701697371568763)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2073 = 0.2939980822884902)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2073 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14554: S1 ^operator O2076)

  1038:    O: O2076 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1038 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1037 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14555: I3 ^predict-no N1038)
<=WM: (14541: N1037 ^status complete)
<=WM: (14540: I3 ^predict-no N1037)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (14559: I2 ^dir U)
=>WM: (14558: I2 ^reward 1)
=>WM: (14557: I2 ^see 0)
=>WM: (14556: N1038 ^status complete)
<=WM: (14544: I2 ^dir R)
<=WM: (14543: I2 ^reward 1)
<=WM: (14542: I2 ^see 0)
=>WM: (14560: I2 ^level-1 R0-root)
<=WM: (14545: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1042 ^value 1 +)
 (R1 ^reward R1042 +)
Firing propose*predict-yes
 -->
 (O2077 ^name predict-yes +)
 (S1 ^operator O2077 +)
Firing propose*predict-no
 -->
 (O2078 ^name predict-no +)
 (S1 ^operator O2078 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2076 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2075 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2076 ^name predict-no +)
 (S1 ^operator O2076 +)
Retracting propose*predict-yes
 -->
 (O2075 ^name predict-yes +)
 (S1 ^operator O2075 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1041 ^value 1 +)
 (R1 ^reward R1041 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2076 = 0.229864201526749)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2076 = 0.7701697371568763)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2075 = 0.2939980822884902)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2075 = -0.252585164213872)
=>WM: (14567: S1 ^operator O2078 +)
=>WM: (14566: S1 ^operator O2077 +)
=>WM: (14565: I3 ^dir U)
=>WM: (14564: O2078 ^name predict-no)
=>WM: (14563: O2077 ^name predict-yes)
=>WM: (14562: R1042 ^value 1)
=>WM: (14561: R1 ^reward R1042)
<=WM: (14552: S1 ^operator O2075 +)
<=WM: (14553: S1 ^operator O2076 +)
<=WM: (14554: S1 ^operator O2076)
<=WM: (14551: I3 ^dir R)
<=WM: (14547: R1 ^reward R1041)
<=WM: (14550: O2076 ^name predict-no)
<=WM: (14549: O2075 ^name predict-yes)
<=WM: (14548: R1041 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2077 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2078 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2076 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2075 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611915 -0.382051 0.229864 -> 0.611913 -0.382052 0.229861(R,m,v=1,0.851648,0.127041)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388112 0.382058 0.77017 -> 0.388109 0.382057 0.770166(R,m,v=1,1,0)
=>WM: (14568: S1 ^operator O2078)

  1039:    O: O2078 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1039 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1038 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14569: I3 ^predict-no N1039)
<=WM: (14556: N1038 ^status complete)
<=WM: (14555: I3 ^predict-no N1038)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14573: I2 ^dir L)
=>WM: (14572: I2 ^reward 1)
=>WM: (14571: I2 ^see 0)
=>WM: (14570: N1039 ^status complete)
<=WM: (14559: I2 ^dir U)
<=WM: (14558: I2 ^reward 1)
<=WM: (14557: I2 ^see 0)
=>WM: (14574: I2 ^level-1 R0-root)
<=WM: (14560: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2077 = 0.6195718054949008)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2078 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1043 ^value 1 +)
 (R1 ^reward R1043 +)
Firing propose*predict-yes
 -->
 (O2079 ^name predict-yes +)
 (S1 ^operator O2079 +)
Firing propose*predict-no
 -->
 (O2080 ^name predict-no +)
 (S1 ^operator O2080 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2078 = 0.3140350167550124)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2077 = 0.3804109904199586)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2078 ^name predict-no +)
 (S1 ^operator O2078 +)
Retracting propose*predict-yes
 -->
 (O2077 ^name predict-yes +)
 (S1 ^operator O2077 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1042 ^value 1 +)
 (R1 ^reward R1042 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2078 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2077 = 0.)
=>WM: (14581: S1 ^operator O2080 +)
=>WM: (14580: S1 ^operator O2079 +)
=>WM: (14579: I3 ^dir L)
=>WM: (14578: O2080 ^name predict-no)
=>WM: (14577: O2079 ^name predict-yes)
=>WM: (14576: R1043 ^value 1)
=>WM: (14575: R1 ^reward R1043)
<=WM: (14566: S1 ^operator O2077 +)
<=WM: (14567: S1 ^operator O2078 +)
<=WM: (14568: S1 ^operator O2078)
<=WM: (14565: I3 ^dir U)
<=WM: (14561: R1 ^reward R1042)
<=WM: (14564: O2078 ^name predict-no)
<=WM: (14563: O2077 ^name predict-yes)
<=WM: (14562: R1042 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2079 = 0.6195718054949008)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2079 = 0.3804109904199586)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2080 = -0.2190661556260421)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2080 = 0.3140350167550124)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2078 = 0.3140350167550124)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2078 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2077 = 0.3804109904199586)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2077 = 0.6195718054949008)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14582: S1 ^operator O2079)

  1040:    O: O2079 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1040 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1039 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14583: I3 ^predict-yes N1040)
<=WM: (14570: N1039 ^status complete)
<=WM: (14569: I3 ^predict-no N1039)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (14587: I2 ^dir L)
=>WM: (14586: I2 ^reward 1)
=>WM: (14585: I2 ^see 1)
=>WM: (14584: N1040 ^status complete)
<=WM: (14573: I2 ^dir L)
<=WM: (14572: I2 ^reward 1)
<=WM: (14571: I2 ^see 0)
=>WM: (14588: I2 ^level-1 L1-root)
<=WM: (14574: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2079 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2080 = 0.686084421929226)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1044 ^value 1 +)
 (R1 ^reward R1044 +)
Firing propose*predict-yes
 -->
 (O2081 ^name predict-yes +)
 (S1 ^operator O2081 +)
Firing propose*predict-no
 -->
 (O2082 ^name predict-no +)
 (S1 ^operator O2082 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2080 = 0.3140350167550124)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2079 = 0.3804109904199586)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2080 ^name predict-no +)
 (S1 ^operator O2080 +)
Retracting propose*predict-yes
 -->
 (O2079 ^name predict-yes +)
 (S1 ^operator O2079 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1043 ^value 1 +)
 (R1 ^reward R1043 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2080 = 0.3140350167550124)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2080 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2079 = 0.3804109904199586)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2079 = 0.6195718054949008)
=>WM: (14595: S1 ^operator O2082 +)
=>WM: (14594: S1 ^operator O2081 +)
=>WM: (14593: O2082 ^name predict-no)
=>WM: (14592: O2081 ^name predict-yes)
=>WM: (14591: R1044 ^value 1)
=>WM: (14590: R1 ^reward R1044)
=>WM: (14589: I3 ^see 1)
<=WM: (14580: S1 ^operator O2079 +)
<=WM: (14582: S1 ^operator O2079)
<=WM: (14581: S1 ^operator O2080 +)
<=WM: (14575: R1 ^reward R1043)
<=WM: (14546: I3 ^see 0)
<=WM: (14578: O2080 ^name predict-no)
<=WM: (14577: O2079 ^name predict-yes)
<=WM: (14576: R1043 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2081 = 0.3804109904199586)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2081 = -0.3470159027404986)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2082 = 0.3140350167550124)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2082 = 0.686084421929226)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2080 = 0.3140350167550124)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2080 = 0.686084421929226)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2079 = 0.3804109904199586)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2079 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521341 -0.14093 0.380411 -> 0.521342 -0.14093 0.380412(R,m,v=1,0.83815,0.136443)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478641 0.140931 0.619572 -> 0.478642 0.140931 0.619573(R,m,v=1,1,0)
=>WM: (14596: S1 ^operator O2082)

  1041:    O: O2082 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1041 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1040 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14597: I3 ^predict-no N1041)
<=WM: (14584: N1040 ^status complete)
<=WM: (14583: I3 ^predict-yes N1040)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
---- Input Phase --- 
=>WM: (14601: I2 ^dir R)
=>WM: (14600: I2 ^reward 1)
=>WM: (14599: I2 ^see 0)
=>WM: (14598: N1041 ^status complete)
<=WM: (14587: I2 ^dir L)
<=WM: (14586: I2 ^reward 1)
<=WM: (14585: I2 ^see 1)
=>WM: (14602: I2 ^level-1 L0-root)
<=WM: (14588: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2081 = 0.7057283473531946)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2082 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1045 ^value 1 +)
 (R1 ^reward R1045 +)
Firing propose*predict-yes
 -->
 (O2083 ^name predict-yes +)
 (S1 ^operator O2083 +)
Firing propose*predict-no
 -->
 (O2084 ^name predict-no +)
 (S1 ^operator O2084 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2082 = 0.2298614663037441)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2081 = 0.2939980822884902)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2082 ^name predict-no +)
 (S1 ^operator O2082 +)
Retracting propose*predict-yes
 -->
 (O2081 ^name predict-yes +)
 (S1 ^operator O2081 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1044 ^value 1 +)
 (R1 ^reward R1044 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2082 = 0.686084421929226)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2082 = 0.3140350167550124)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2081 = -0.3470159027404986)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2081 = 0.3804123883778544)
=>WM: (14610: S1 ^operator O2084 +)
=>WM: (14609: S1 ^operator O2083 +)
=>WM: (14608: I3 ^dir R)
=>WM: (14607: O2084 ^name predict-no)
=>WM: (14606: O2083 ^name predict-yes)
=>WM: (14605: R1045 ^value 1)
=>WM: (14604: R1 ^reward R1045)
=>WM: (14603: I3 ^see 0)
<=WM: (14594: S1 ^operator O2081 +)
<=WM: (14595: S1 ^operator O2082 +)
<=WM: (14596: S1 ^operator O2082)
<=WM: (14579: I3 ^dir L)
<=WM: (14590: R1 ^reward R1044)
<=WM: (14589: I3 ^see 1)
<=WM: (14593: O2082 ^name predict-no)
<=WM: (14592: O2081 ^name predict-yes)
<=WM: (14591: R1044 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2083 = 0.2939980822884902)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2083 = 0.7057283473531946)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2084 = 0.2298614663037441)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2084 = -0.2023211881870005)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2082 = 0.2298614663037441)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2082 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2081 = 0.2939980822884902)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2081 = 0.7057283473531946)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485042 -0.171007 0.314035 -> 0.485034 -0.171009 0.314025(R,m,v=1,0.86875,0.114741)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515052 0.171032 0.686084 -> 0.515043 0.17103 0.686073(R,m,v=1,1,0)
=>WM: (14611: S1 ^operator O2083)

  1042:    O: O2083 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1042 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1041 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14612: I3 ^predict-yes N1042)
<=WM: (14598: N1041 ^status complete)
<=WM: (14597: I3 ^predict-no N1041)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (14616: I2 ^dir U)
=>WM: (14615: I2 ^reward 1)
=>WM: (14614: I2 ^see 1)
=>WM: (14613: N1042 ^status complete)
<=WM: (14601: I2 ^dir R)
<=WM: (14600: I2 ^reward 1)
<=WM: (14599: I2 ^see 0)
=>WM: (14617: I2 ^level-1 R1-root)
<=WM: (14602: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1046 ^value 1 +)
 (R1 ^reward R1046 +)
Firing propose*predict-yes
 -->
 (O2085 ^name predict-yes +)
 (S1 ^operator O2085 +)
Firing propose*predict-no
 -->
 (O2086 ^name predict-no +)
 (S1 ^operator O2086 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2084 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2083 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2084 ^name predict-no +)
 (S1 ^operator O2084 +)
Retracting propose*predict-yes
 -->
 (O2083 ^name predict-yes +)
 (S1 ^operator O2083 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1045 ^value 1 +)
 (R1 ^reward R1045 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2084 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2084 = 0.2298614663037441)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2083 = 0.7057283473531946)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2083 = 0.2939980822884902)
=>WM: (14625: S1 ^operator O2086 +)
=>WM: (14624: S1 ^operator O2085 +)
=>WM: (14623: I3 ^dir U)
=>WM: (14622: O2086 ^name predict-no)
=>WM: (14621: O2085 ^name predict-yes)
=>WM: (14620: R1046 ^value 1)
=>WM: (14619: R1 ^reward R1046)
=>WM: (14618: I3 ^see 1)
<=WM: (14609: S1 ^operator O2083 +)
<=WM: (14611: S1 ^operator O2083)
<=WM: (14610: S1 ^operator O2084 +)
<=WM: (14608: I3 ^dir R)
<=WM: (14604: R1 ^reward R1045)
<=WM: (14603: I3 ^see 0)
<=WM: (14607: O2084 ^name predict-no)
<=WM: (14606: O2083 ^name predict-yes)
<=WM: (14605: R1045 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2085 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2086 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2084 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2083 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501072 -0.207074 0.293998 -> 0.501092 -0.207072 0.294021(R,m,v=1,0.853659,0.125692)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.49868 0.207048 0.705728 -> 0.498704 0.20705 0.705755(R,m,v=1,1,0)
=>WM: (14626: S1 ^operator O2086)

  1043:    O: O2086 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1043 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1042 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14627: I3 ^predict-no N1043)
<=WM: (14613: N1042 ^status complete)
<=WM: (14612: I3 ^predict-yes N1042)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14631: I2 ^dir R)
=>WM: (14630: I2 ^reward 1)
=>WM: (14629: I2 ^see 0)
=>WM: (14628: N1043 ^status complete)
<=WM: (14616: I2 ^dir U)
<=WM: (14615: I2 ^reward 1)
<=WM: (14614: I2 ^see 1)
=>WM: (14632: I2 ^level-1 R1-root)
<=WM: (14617: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2085 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2086 = 0.7701664478127415)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1047 ^value 1 +)
 (R1 ^reward R1047 +)
Firing propose*predict-yes
 -->
 (O2087 ^name predict-yes +)
 (S1 ^operator O2087 +)
Firing propose*predict-no
 -->
 (O2088 ^name predict-no +)
 (S1 ^operator O2088 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2086 = 0.2298614663037441)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2085 = 0.2940205065793785)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2086 ^name predict-no +)
 (S1 ^operator O2086 +)
Retracting propose*predict-yes
 -->
 (O2085 ^name predict-yes +)
 (S1 ^operator O2085 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1046 ^value 1 +)
 (R1 ^reward R1046 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2086 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2085 = 0.)
=>WM: (14640: S1 ^operator O2088 +)
=>WM: (14639: S1 ^operator O2087 +)
=>WM: (14638: I3 ^dir R)
=>WM: (14637: O2088 ^name predict-no)
=>WM: (14636: O2087 ^name predict-yes)
=>WM: (14635: R1047 ^value 1)
=>WM: (14634: R1 ^reward R1047)
=>WM: (14633: I3 ^see 0)
<=WM: (14624: S1 ^operator O2085 +)
<=WM: (14625: S1 ^operator O2086 +)
<=WM: (14626: S1 ^operator O2086)
<=WM: (14623: I3 ^dir U)
<=WM: (14619: R1 ^reward R1046)
<=WM: (14618: I3 ^see 1)
<=WM: (14622: O2086 ^name predict-no)
<=WM: (14621: O2085 ^name predict-yes)
<=WM: (14620: R1046 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2087 = -0.252585164213872)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2087 = 0.2940205065793785)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2088 = 0.7701664478127415)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2088 = 0.2298614663037441)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2086 = 0.2298614663037441)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2086 = 0.7701664478127415)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2085 = 0.2940205065793785)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2085 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14641: S1 ^operator O2088)

  1044:    O: O2088 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1044 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1043 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14642: I3 ^predict-no N1044)
<=WM: (14628: N1043 ^status complete)
<=WM: (14627: I3 ^predict-no N1043)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14646: I2 ^dir L)
=>WM: (14645: I2 ^reward 1)
=>WM: (14644: I2 ^see 0)
=>WM: (14643: N1044 ^status complete)
<=WM: (14631: I2 ^dir R)
<=WM: (14630: I2 ^reward 1)
<=WM: (14629: I2 ^see 0)
=>WM: (14647: I2 ^level-1 R0-root)
<=WM: (14632: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2087 = 0.6195734444489578)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2088 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1048 ^value 1 +)
 (R1 ^reward R1048 +)
Firing propose*predict-yes
 -->
 (O2089 ^name predict-yes +)
 (S1 ^operator O2089 +)
Firing propose*predict-no
 -->
 (O2090 ^name predict-no +)
 (S1 ^operator O2090 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2088 = 0.3140251866918842)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2087 = 0.3804123883778544)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2088 ^name predict-no +)
 (S1 ^operator O2088 +)
Retracting propose*predict-yes
 -->
 (O2087 ^name predict-yes +)
 (S1 ^operator O2087 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1047 ^value 1 +)
 (R1 ^reward R1047 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2088 = 0.2298614663037441)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2088 = 0.7701664478127415)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2087 = 0.2940205065793785)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2087 = -0.252585164213872)
=>WM: (14654: S1 ^operator O2090 +)
=>WM: (14653: S1 ^operator O2089 +)
=>WM: (14652: I3 ^dir L)
=>WM: (14651: O2090 ^name predict-no)
=>WM: (14650: O2089 ^name predict-yes)
=>WM: (14649: R1048 ^value 1)
=>WM: (14648: R1 ^reward R1048)
<=WM: (14639: S1 ^operator O2087 +)
<=WM: (14640: S1 ^operator O2088 +)
<=WM: (14641: S1 ^operator O2088)
<=WM: (14638: I3 ^dir R)
<=WM: (14634: R1 ^reward R1047)
<=WM: (14637: O2088 ^name predict-no)
<=WM: (14636: O2087 ^name predict-yes)
<=WM: (14635: R1047 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2089 = 0.3804123883778544)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2089 = 0.6195734444489578)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2090 = 0.3140251866918842)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2090 = -0.2190661556260421)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2088 = 0.3140251866918842)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2088 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2087 = 0.3804123883778544)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2087 = 0.6195734444489578)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611913 -0.382052 0.229861 -> 0.611911 -0.382052 0.229859(R,m,v=1,0.852459,0.126464)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388109 0.382057 0.770166 -> 0.388107 0.382057 0.770164(R,m,v=1,1,0)
=>WM: (14655: S1 ^operator O2089)

  1045:    O: O2089 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1045 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1044 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14656: I3 ^predict-yes N1045)
<=WM: (14643: N1044 ^status complete)
<=WM: (14642: I3 ^predict-no N1044)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14660: I2 ^dir L)
=>WM: (14659: I2 ^reward 1)
=>WM: (14658: I2 ^see 1)
=>WM: (14657: N1045 ^status complete)
<=WM: (14646: I2 ^dir L)
<=WM: (14645: I2 ^reward 1)
<=WM: (14644: I2 ^see 0)
=>WM: (14661: I2 ^level-1 L1-root)
<=WM: (14647: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2089 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2090 = 0.6860729145467337)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1049 ^value 1 +)
 (R1 ^reward R1049 +)
Firing propose*predict-yes
 -->
 (O2091 ^name predict-yes +)
 (S1 ^operator O2091 +)
Firing propose*predict-no
 -->
 (O2092 ^name predict-no +)
 (S1 ^operator O2092 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2090 = 0.3140251866918842)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2089 = 0.3804123883778544)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2090 ^name predict-no +)
 (S1 ^operator O2090 +)
Retracting propose*predict-yes
 -->
 (O2089 ^name predict-yes +)
 (S1 ^operator O2089 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1048 ^value 1 +)
 (R1 ^reward R1048 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2090 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2090 = 0.3140251866918842)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2089 = 0.6195734444489578)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2089 = 0.3804123883778544)
=>WM: (14668: S1 ^operator O2092 +)
=>WM: (14667: S1 ^operator O2091 +)
=>WM: (14666: O2092 ^name predict-no)
=>WM: (14665: O2091 ^name predict-yes)
=>WM: (14664: R1049 ^value 1)
=>WM: (14663: R1 ^reward R1049)
=>WM: (14662: I3 ^see 1)
<=WM: (14653: S1 ^operator O2089 +)
<=WM: (14655: S1 ^operator O2089)
<=WM: (14654: S1 ^operator O2090 +)
<=WM: (14648: R1 ^reward R1048)
<=WM: (14633: I3 ^see 0)
<=WM: (14651: O2090 ^name predict-no)
<=WM: (14650: O2089 ^name predict-yes)
<=WM: (14649: R1048 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2091 = 0.3804123883778544)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2091 = -0.3470159027404986)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2092 = 0.3140251866918842)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2092 = 0.6860729145467337)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2090 = 0.3140251866918842)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2090 = 0.6860729145467337)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2089 = 0.3804123883778544)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2089 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521342 -0.14093 0.380412 -> 0.521343 -0.14093 0.380414(R,m,v=1,0.83908,0.135805)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478642 0.140931 0.619573 -> 0.478644 0.140931 0.619575(R,m,v=1,1,0)
=>WM: (14669: S1 ^operator O2092)

  1046:    O: O2092 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1046 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1045 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14670: I3 ^predict-no N1046)
<=WM: (14657: N1045 ^status complete)
<=WM: (14656: I3 ^predict-yes N1045)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14674: I2 ^dir L)
=>WM: (14673: I2 ^reward 1)
=>WM: (14672: I2 ^see 0)
=>WM: (14671: N1046 ^status complete)
<=WM: (14660: I2 ^dir L)
<=WM: (14659: I2 ^reward 1)
<=WM: (14658: I2 ^see 1)
=>WM: (14675: I2 ^level-1 L0-root)
<=WM: (14661: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2091 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2092 = 0.6858476397463316)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1050 ^value 1 +)
 (R1 ^reward R1050 +)
Firing propose*predict-yes
 -->
 (O2093 ^name predict-yes +)
 (S1 ^operator O2093 +)
Firing propose*predict-no
 -->
 (O2094 ^name predict-no +)
 (S1 ^operator O2094 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2092 = 0.3140251866918842)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2091 = 0.3804135384871243)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2092 ^name predict-no +)
 (S1 ^operator O2092 +)
Retracting propose*predict-yes
 -->
 (O2091 ^name predict-yes +)
 (S1 ^operator O2091 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1049 ^value 1 +)
 (R1 ^reward R1049 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2092 = 0.6860729145467337)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2092 = 0.3140251866918842)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2091 = -0.3470159027404986)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2091 = 0.3804135384871243)
=>WM: (14682: S1 ^operator O2094 +)
=>WM: (14681: S1 ^operator O2093 +)
=>WM: (14680: O2094 ^name predict-no)
=>WM: (14679: O2093 ^name predict-yes)
=>WM: (14678: R1050 ^value 1)
=>WM: (14677: R1 ^reward R1050)
=>WM: (14676: I3 ^see 0)
<=WM: (14667: S1 ^operator O2091 +)
<=WM: (14668: S1 ^operator O2092 +)
<=WM: (14669: S1 ^operator O2092)
<=WM: (14663: R1 ^reward R1049)
<=WM: (14662: I3 ^see 1)
<=WM: (14666: O2092 ^name predict-no)
<=WM: (14665: O2091 ^name predict-yes)
<=WM: (14664: R1049 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2093 = 0.3804135384871243)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2093 = -0.3332708974800781)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2094 = 0.3140251866918842)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2094 = 0.6858476397463316)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2092 = 0.3140251866918842)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2092 = 0.6858476397463316)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2091 = 0.3804135384871243)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2091 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485034 -0.171009 0.314025 -> 0.485028 -0.171011 0.314017(R,m,v=1,0.869565,0.11413)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515043 0.17103 0.686073 -> 0.515036 0.171028 0.686063(R,m,v=1,1,0)
=>WM: (14683: S1 ^operator O2094)

  1047:    O: O2094 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1047 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1046 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14684: I3 ^predict-no N1047)
<=WM: (14671: N1046 ^status complete)
<=WM: (14670: I3 ^predict-no N1046)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14688: I2 ^dir R)
=>WM: (14687: I2 ^reward 1)
=>WM: (14686: I2 ^see 0)
=>WM: (14685: N1047 ^status complete)
<=WM: (14674: I2 ^dir L)
<=WM: (14673: I2 ^reward 1)
<=WM: (14672: I2 ^see 0)
=>WM: (14689: I2 ^level-1 L0-root)
<=WM: (14675: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2093 = 0.7057548618480857)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2094 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1051 ^value 1 +)
 (R1 ^reward R1051 +)
Firing propose*predict-yes
 -->
 (O2095 ^name predict-yes +)
 (S1 ^operator O2095 +)
Firing propose*predict-no
 -->
 (O2096 ^name predict-no +)
 (S1 ^operator O2096 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2094 = 0.2298592186043533)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2093 = 0.2940205065793785)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2094 ^name predict-no +)
 (S1 ^operator O2094 +)
Retracting propose*predict-yes
 -->
 (O2093 ^name predict-yes +)
 (S1 ^operator O2093 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1050 ^value 1 +)
 (R1 ^reward R1050 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2094 = 0.6858476397463316)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2094 = 0.3140171210188315)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2093 = -0.3332708974800781)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2093 = 0.3804135384871243)
=>WM: (14696: S1 ^operator O2096 +)
=>WM: (14695: S1 ^operator O2095 +)
=>WM: (14694: I3 ^dir R)
=>WM: (14693: O2096 ^name predict-no)
=>WM: (14692: O2095 ^name predict-yes)
=>WM: (14691: R1051 ^value 1)
=>WM: (14690: R1 ^reward R1051)
<=WM: (14681: S1 ^operator O2093 +)
<=WM: (14682: S1 ^operator O2094 +)
<=WM: (14683: S1 ^operator O2094)
<=WM: (14652: I3 ^dir L)
<=WM: (14677: R1 ^reward R1050)
<=WM: (14680: O2094 ^name predict-no)
<=WM: (14679: O2093 ^name predict-yes)
<=WM: (14678: R1050 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2095 = 0.7057548618480857)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2095 = 0.2940205065793785)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2096 = -0.2023211881870005)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2096 = 0.2298592186043533)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2094 = 0.2298592186043533)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2094 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2093 = 0.2940205065793785)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2093 = 0.7057548618480857)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485028 -0.171011 0.314017 -> 0.485037 -0.171008 0.314028(R,m,v=1,0.87037,0.113527)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514865 0.170982 0.685848 -> 0.514876 0.170985 0.685861(R,m,v=1,1,0)
=>WM: (14697: S1 ^operator O2095)

  1048:    O: O2095 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1048 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1047 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14698: I3 ^predict-yes N1048)
<=WM: (14685: N1047 ^status complete)
<=WM: (14684: I3 ^predict-no N1047)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14702: I2 ^dir L)
=>WM: (14701: I2 ^reward 1)
=>WM: (14700: I2 ^see 1)
=>WM: (14699: N1048 ^status complete)
<=WM: (14688: I2 ^dir R)
<=WM: (14687: I2 ^reward 1)
<=WM: (14686: I2 ^see 0)
=>WM: (14703: I2 ^level-1 R1-root)
<=WM: (14689: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2095 = 0.619600420969239)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2096 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1052 ^value 1 +)
 (R1 ^reward R1052 +)
Firing propose*predict-yes
 -->
 (O2097 ^name predict-yes +)
 (S1 ^operator O2097 +)
Firing propose*predict-no
 -->
 (O2098 ^name predict-no +)
 (S1 ^operator O2098 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2096 = 0.3140282287884166)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2095 = 0.3804135384871243)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2096 ^name predict-no +)
 (S1 ^operator O2096 +)
Retracting propose*predict-yes
 -->
 (O2095 ^name predict-yes +)
 (S1 ^operator O2095 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1051 ^value 1 +)
 (R1 ^reward R1051 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2096 = 0.2298592186043533)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2096 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2095 = 0.2940205065793785)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2095 = 0.7057548618480857)
=>WM: (14711: S1 ^operator O2098 +)
=>WM: (14710: S1 ^operator O2097 +)
=>WM: (14709: I3 ^dir L)
=>WM: (14708: O2098 ^name predict-no)
=>WM: (14707: O2097 ^name predict-yes)
=>WM: (14706: R1052 ^value 1)
=>WM: (14705: R1 ^reward R1052)
=>WM: (14704: I3 ^see 1)
<=WM: (14695: S1 ^operator O2095 +)
<=WM: (14697: S1 ^operator O2095)
<=WM: (14696: S1 ^operator O2096 +)
<=WM: (14694: I3 ^dir R)
<=WM: (14690: R1 ^reward R1051)
<=WM: (14676: I3 ^see 0)
<=WM: (14693: O2096 ^name predict-no)
<=WM: (14692: O2095 ^name predict-yes)
<=WM: (14691: R1051 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2097 = 0.3804135384871243)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2097 = 0.619600420969239)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2098 = 0.3140282287884166)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2098 = -0.1479504104026684)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2096 = 0.3140282287884166)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2096 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2095 = 0.3804135384871243)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2095 = 0.619600420969239)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501092 -0.207072 0.294021 -> 0.501109 -0.20707 0.294039(R,m,v=1,0.854545,0.125055)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498704 0.20705 0.705755 -> 0.498724 0.207053 0.705777(R,m,v=1,1,0)
=>WM: (14712: S1 ^operator O2097)

  1049:    O: O2097 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1049 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1048 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14713: I3 ^predict-yes N1049)
<=WM: (14699: N1048 ^status complete)
<=WM: (14698: I3 ^predict-yes N1048)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14717: I2 ^dir L)
=>WM: (14716: I2 ^reward 1)
=>WM: (14715: I2 ^see 1)
=>WM: (14714: N1049 ^status complete)
<=WM: (14702: I2 ^dir L)
<=WM: (14701: I2 ^reward 1)
<=WM: (14700: I2 ^see 1)
=>WM: (14718: I2 ^level-1 L1-root)
<=WM: (14703: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2097 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2098 = 0.6860634902400752)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1053 ^value 1 +)
 (R1 ^reward R1053 +)
Firing propose*predict-yes
 -->
 (O2099 ^name predict-yes +)
 (S1 ^operator O2099 +)
Firing propose*predict-no
 -->
 (O2100 ^name predict-no +)
 (S1 ^operator O2100 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2098 = 0.3140282287884166)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2097 = 0.3804135384871243)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2098 ^name predict-no +)
 (S1 ^operator O2098 +)
Retracting propose*predict-yes
 -->
 (O2097 ^name predict-yes +)
 (S1 ^operator O2097 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1052 ^value 1 +)
 (R1 ^reward R1052 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2098 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2098 = 0.3140282287884166)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2097 = 0.619600420969239)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2097 = 0.3804135384871243)
=>WM: (14724: S1 ^operator O2100 +)
=>WM: (14723: S1 ^operator O2099 +)
=>WM: (14722: O2100 ^name predict-no)
=>WM: (14721: O2099 ^name predict-yes)
=>WM: (14720: R1053 ^value 1)
=>WM: (14719: R1 ^reward R1053)
<=WM: (14710: S1 ^operator O2097 +)
<=WM: (14712: S1 ^operator O2097)
<=WM: (14711: S1 ^operator O2098 +)
<=WM: (14705: R1 ^reward R1052)
<=WM: (14708: O2098 ^name predict-no)
<=WM: (14707: O2097 ^name predict-yes)
<=WM: (14706: R1052 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2099 = 0.3804135384871243)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2099 = -0.3470159027404986)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2100 = 0.3140282287884166)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2100 = 0.6860634902400752)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2098 = 0.3140282287884166)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2098 = 0.6860634902400752)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2097 = 0.3804135384871243)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2097 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521343 -0.14093 0.380414 -> 0.521342 -0.14093 0.380412(R,m,v=1,0.84,0.135172)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478672 0.140929 0.6196 -> 0.47867 0.140929 0.619599(R,m,v=1,1,0)
=>WM: (14725: S1 ^operator O2100)

  1050:    O: O2100 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1050 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1049 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14726: I3 ^predict-no N1050)
<=WM: (14714: N1049 ^status complete)
<=WM: (14713: I3 ^predict-yes N1049)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14730: I2 ^dir U)
=>WM: (14729: I2 ^reward 1)
=>WM: (14728: I2 ^see 0)
=>WM: (14727: N1050 ^status complete)
<=WM: (14717: I2 ^dir L)
<=WM: (14716: I2 ^reward 1)
<=WM: (14715: I2 ^see 1)
=>WM: (14731: I2 ^level-1 L0-root)
<=WM: (14718: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1054 ^value 1 +)
 (R1 ^reward R1054 +)
Firing propose*predict-yes
 -->
 (O2101 ^name predict-yes +)
 (S1 ^operator O2101 +)
Firing propose*predict-no
 -->
 (O2102 ^name predict-no +)
 (S1 ^operator O2102 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2100 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2099 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2100 ^name predict-no +)
 (S1 ^operator O2100 +)
Retracting propose*predict-yes
 -->
 (O2099 ^name predict-yes +)
 (S1 ^operator O2099 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1053 ^value 1 +)
 (R1 ^reward R1053 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2100 = 0.6860634902400752)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2100 = 0.3140282287884166)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2099 = -0.3470159027404986)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2099 = 0.3804124062940181)
=>WM: (14739: S1 ^operator O2102 +)
=>WM: (14738: S1 ^operator O2101 +)
=>WM: (14737: I3 ^dir U)
=>WM: (14736: O2102 ^name predict-no)
=>WM: (14735: O2101 ^name predict-yes)
=>WM: (14734: R1054 ^value 1)
=>WM: (14733: R1 ^reward R1054)
=>WM: (14732: I3 ^see 0)
<=WM: (14723: S1 ^operator O2099 +)
<=WM: (14724: S1 ^operator O2100 +)
<=WM: (14725: S1 ^operator O2100)
<=WM: (14709: I3 ^dir L)
<=WM: (14719: R1 ^reward R1053)
<=WM: (14704: I3 ^see 1)
<=WM: (14722: O2100 ^name predict-no)
<=WM: (14721: O2099 ^name predict-yes)
<=WM: (14720: R1053 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2101 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2102 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2100 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2099 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485037 -0.171008 0.314028 -> 0.485031 -0.17101 0.314021(R,m,v=1,0.871166,0.112929)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515036 0.171028 0.686063 -> 0.515029 0.171026 0.686055(R,m,v=1,1,0)
=>WM: (14740: S1 ^operator O2102)

  1051:    O: O2102 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1051 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1050 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14741: I3 ^predict-no N1051)
<=WM: (14727: N1050 ^status complete)
<=WM: (14726: I3 ^predict-no N1050)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\--- Input Phase --- 
=>WM: (14745: I2 ^dir U)
=>WM: (14744: I2 ^reward 1)
=>WM: (14743: I2 ^see 0)
=>WM: (14742: N1051 ^status complete)
<=WM: (14730: I2 ^dir U)
<=WM: (14729: I2 ^reward 1)
<=WM: (14728: I2 ^see 0)
=>WM: (14746: I2 ^level-1 L0-root)
<=WM: (14731: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1055 ^value 1 +)
 (R1 ^reward R1055 +)
Firing propose*predict-yes
 -->
 (O2103 ^name predict-yes +)
 (S1 ^operator O2103 +)
Firing propose*predict-no
 -->
 (O2104 ^name predict-no +)
 (S1 ^operator O2104 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2102 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2101 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2102 ^name predict-no +)
 (S1 ^operator O2102 +)
Retracting propose*predict-yes
 -->
 (O2101 ^name predict-yes +)
 (S1 ^operator O2101 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1054 ^value 1 +)
 (R1 ^reward R1054 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2102 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2101 = 0.)
=>WM: (14752: S1 ^operator O2104 +)
=>WM: (14751: S1 ^operator O2103 +)
=>WM: (14750: O2104 ^name predict-no)
=>WM: (14749: O2103 ^name predict-yes)
=>WM: (14748: R1055 ^value 1)
=>WM: (14747: R1 ^reward R1055)
<=WM: (14738: S1 ^operator O2101 +)
<=WM: (14739: S1 ^operator O2102 +)
<=WM: (14740: S1 ^operator O2102)
<=WM: (14733: R1 ^reward R1054)
<=WM: (14736: O2102 ^name predict-no)
<=WM: (14735: O2101 ^name predict-yes)
<=WM: (14734: R1054 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2103 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2104 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2102 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2101 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14753: S1 ^operator O2104)

  1052:    O: O2104 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1052 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1051 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14754: I3 ^predict-no N1052)
<=WM: (14742: N1051 ^status complete)
<=WM: (14741: I3 ^predict-no N1051)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14758: I2 ^dir R)
=>WM: (14757: I2 ^reward 1)
=>WM: (14756: I2 ^see 0)
=>WM: (14755: N1052 ^status complete)
<=WM: (14745: I2 ^dir U)
<=WM: (14744: I2 ^reward 1)
<=WM: (14743: I2 ^see 0)
=>WM: (14759: I2 ^level-1 L0-root)
<=WM: (14746: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2103 = 0.7057765679517091)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2104 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1056 ^value 1 +)
 (R1 ^reward R1056 +)
Firing propose*predict-yes
 -->
 (O2105 ^name predict-yes +)
 (S1 ^operator O2105 +)
Firing propose*predict-no
 -->
 (O2106 ^name predict-no +)
 (S1 ^operator O2106 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2104 = 0.2298592186043533)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2103 = 0.2940389010748334)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2104 ^name predict-no +)
 (S1 ^operator O2104 +)
Retracting propose*predict-yes
 -->
 (O2103 ^name predict-yes +)
 (S1 ^operator O2103 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1055 ^value 1 +)
 (R1 ^reward R1055 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2104 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2103 = 0.)
=>WM: (14766: S1 ^operator O2106 +)
=>WM: (14765: S1 ^operator O2105 +)
=>WM: (14764: I3 ^dir R)
=>WM: (14763: O2106 ^name predict-no)
=>WM: (14762: O2105 ^name predict-yes)
=>WM: (14761: R1056 ^value 1)
=>WM: (14760: R1 ^reward R1056)
<=WM: (14751: S1 ^operator O2103 +)
<=WM: (14752: S1 ^operator O2104 +)
<=WM: (14753: S1 ^operator O2104)
<=WM: (14737: I3 ^dir U)
<=WM: (14747: R1 ^reward R1055)
<=WM: (14750: O2104 ^name predict-no)
<=WM: (14749: O2103 ^name predict-yes)
<=WM: (14748: R1055 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2105 = 0.7057765679517091)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2105 = 0.2940389010748334)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2106 = -0.2023211881870005)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2106 = 0.2298592186043533)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2104 = 0.2298592186043533)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2104 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2103 = 0.2940389010748334)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2103 = 0.7057765679517091)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14767: S1 ^operator O2105)

  1053:    O: O2105 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1053 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1052 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14768: I3 ^predict-yes N1053)
<=WM: (14755: N1052 ^status complete)
<=WM: (14754: I3 ^predict-no N1052)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14772: I2 ^dir R)
=>WM: (14771: I2 ^reward 1)
=>WM: (14770: I2 ^see 1)
=>WM: (14769: N1053 ^status complete)
<=WM: (14758: I2 ^dir R)
<=WM: (14757: I2 ^reward 1)
<=WM: (14756: I2 ^see 0)
=>WM: (14773: I2 ^level-1 R1-root)
<=WM: (14759: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2105 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2106 = 0.770163750477286)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1057 ^value 1 +)
 (R1 ^reward R1057 +)
Firing propose*predict-yes
 -->
 (O2107 ^name predict-yes +)
 (S1 ^operator O2107 +)
Firing propose*predict-no
 -->
 (O2108 ^name predict-no +)
 (S1 ^operator O2108 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2106 = 0.2298592186043533)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2105 = 0.2940389010748334)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2106 ^name predict-no +)
 (S1 ^operator O2106 +)
Retracting propose*predict-yes
 -->
 (O2105 ^name predict-yes +)
 (S1 ^operator O2105 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1056 ^value 1 +)
 (R1 ^reward R1056 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2106 = 0.2298592186043533)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2106 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2105 = 0.2940389010748334)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2105 = 0.7057765679517091)
=>WM: (14780: S1 ^operator O2108 +)
=>WM: (14779: S1 ^operator O2107 +)
=>WM: (14778: O2108 ^name predict-no)
=>WM: (14777: O2107 ^name predict-yes)
=>WM: (14776: R1057 ^value 1)
=>WM: (14775: R1 ^reward R1057)
=>WM: (14774: I3 ^see 1)
<=WM: (14765: S1 ^operator O2105 +)
<=WM: (14767: S1 ^operator O2105)
<=WM: (14766: S1 ^operator O2106 +)
<=WM: (14760: R1 ^reward R1056)
<=WM: (14732: I3 ^see 0)
<=WM: (14763: O2106 ^name predict-no)
<=WM: (14762: O2105 ^name predict-yes)
<=WM: (14761: R1056 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2107 = 0.2940389010748334)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2107 = -0.252585164213872)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2108 = 0.2298592186043533)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2108 = 0.770163750477286)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2106 = 0.2298592186043533)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2106 = 0.770163750477286)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2105 = 0.2940389010748334)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2105 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501109 -0.20707 0.294039 -> 0.501123 -0.207069 0.294054(R,m,v=1,0.855422,0.124425)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498724 0.207053 0.705777 -> 0.49874 0.207054 0.705794(R,m,v=1,1,0)
=>WM: (14781: S1 ^operator O2108)

  1054:    O: O2108 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1054 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1053 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14782: I3 ^predict-no N1054)
<=WM: (14769: N1053 ^status complete)
<=WM: (14768: I3 ^predict-yes N1053)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14786: I2 ^dir R)
=>WM: (14785: I2 ^reward 1)
=>WM: (14784: I2 ^see 0)
=>WM: (14783: N1054 ^status complete)
<=WM: (14772: I2 ^dir R)
<=WM: (14771: I2 ^reward 1)
<=WM: (14770: I2 ^see 1)
=>WM: (14787: I2 ^level-1 R0-root)
<=WM: (14773: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2107 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2108 = 0.7701073432202794)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1058 ^value 1 +)
 (R1 ^reward R1058 +)
Firing propose*predict-yes
 -->
 (O2109 ^name predict-yes +)
 (S1 ^operator O2109 +)
Firing propose*predict-no
 -->
 (O2110 ^name predict-no +)
 (S1 ^operator O2110 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2108 = 0.2298592186043533)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2107 = 0.2940539968979803)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2108 ^name predict-no +)
 (S1 ^operator O2108 +)
Retracting propose*predict-yes
 -->
 (O2107 ^name predict-yes +)
 (S1 ^operator O2107 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1057 ^value 1 +)
 (R1 ^reward R1057 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2108 = 0.770163750477286)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2108 = 0.2298592186043533)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2107 = -0.252585164213872)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2107 = 0.2940539968979803)
=>WM: (14794: S1 ^operator O2110 +)
=>WM: (14793: S1 ^operator O2109 +)
=>WM: (14792: O2110 ^name predict-no)
=>WM: (14791: O2109 ^name predict-yes)
=>WM: (14790: R1058 ^value 1)
=>WM: (14789: R1 ^reward R1058)
=>WM: (14788: I3 ^see 0)
<=WM: (14779: S1 ^operator O2107 +)
<=WM: (14780: S1 ^operator O2108 +)
<=WM: (14781: S1 ^operator O2108)
<=WM: (14775: R1 ^reward R1057)
<=WM: (14774: I3 ^see 1)
<=WM: (14778: O2108 ^name predict-no)
<=WM: (14777: O2107 ^name predict-yes)
<=WM: (14776: R1057 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2109 = 0.2940539968979803)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2109 = -0.1254042659579056)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2110 = 0.2298592186043533)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2110 = 0.7701073432202794)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2108 = 0.2298592186043533)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2108 = 0.7701073432202794)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2107 = 0.2940539968979803)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2107 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611911 -0.382052 0.229859 -> 0.61191 -0.382053 0.229857(R,m,v=1,0.853261,0.125891)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388107 0.382057 0.770164 -> 0.388105 0.382056 0.770162(R,m,v=1,1,0)
=>WM: (14795: S1 ^operator O2110)

  1055:    O: O2110 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1055 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1054 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14796: I3 ^predict-no N1055)
<=WM: (14783: N1054 ^status complete)
<=WM: (14782: I3 ^predict-no N1054)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14800: I2 ^dir L)
=>WM: (14799: I2 ^reward 1)
=>WM: (14798: I2 ^see 0)
=>WM: (14797: N1055 ^status complete)
<=WM: (14786: I2 ^dir R)
<=WM: (14785: I2 ^reward 1)
<=WM: (14784: I2 ^see 0)
=>WM: (14801: I2 ^level-1 R0-root)
<=WM: (14787: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2109 = 0.6195747904526593)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2110 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1059 ^value 1 +)
 (R1 ^reward R1059 +)
Firing propose*predict-yes
 -->
 (O2111 ^name predict-yes +)
 (S1 ^operator O2111 +)
Firing propose*predict-no
 -->
 (O2112 ^name predict-no +)
 (S1 ^operator O2112 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2110 = 0.3140207031247883)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2109 = 0.3804124062940181)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2110 ^name predict-no +)
 (S1 ^operator O2110 +)
Retracting propose*predict-yes
 -->
 (O2109 ^name predict-yes +)
 (S1 ^operator O2109 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1058 ^value 1 +)
 (R1 ^reward R1058 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2110 = 0.7701073432202794)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2110 = 0.2298573707106232)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2109 = -0.1254042659579056)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2109 = 0.2940539968979803)
=>WM: (14808: S1 ^operator O2112 +)
=>WM: (14807: S1 ^operator O2111 +)
=>WM: (14806: I3 ^dir L)
=>WM: (14805: O2112 ^name predict-no)
=>WM: (14804: O2111 ^name predict-yes)
=>WM: (14803: R1059 ^value 1)
=>WM: (14802: R1 ^reward R1059)
<=WM: (14793: S1 ^operator O2109 +)
<=WM: (14794: S1 ^operator O2110 +)
<=WM: (14795: S1 ^operator O2110)
<=WM: (14764: I3 ^dir R)
<=WM: (14789: R1 ^reward R1058)
<=WM: (14792: O2110 ^name predict-no)
<=WM: (14791: O2109 ^name predict-yes)
<=WM: (14790: R1058 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2111 = 0.6195747904526593)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2111 = 0.3804124062940181)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2112 = -0.2190661556260421)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2112 = 0.3140207031247883)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2110 = 0.3140207031247883)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2110 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2109 = 0.3804124062940181)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2109 = 0.6195747904526593)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.61191 -0.382053 0.229857 -> 0.611912 -0.382052 0.22986(R,m,v=1,0.854054,0.125323)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388061 0.382046 0.770107 -> 0.388064 0.382047 0.770111(R,m,v=1,1,0)
=>WM: (14809: S1 ^operator O2111)

  1056:    O: O2111 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1056 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1055 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14810: I3 ^predict-yes N1056)
<=WM: (14797: N1055 ^status complete)
<=WM: (14796: I3 ^predict-no N1055)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (14814: I2 ^dir L)
=>WM: (14813: I2 ^reward 1)
=>WM: (14812: I2 ^see 1)
=>WM: (14811: N1056 ^status complete)
<=WM: (14800: I2 ^dir L)
<=WM: (14799: I2 ^reward 1)
<=WM: (14798: I2 ^see 0)
=>WM: (14815: I2 ^level-1 L1-root)
<=WM: (14801: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2111 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2112 = 0.6860547040638999)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1060 ^value 1 +)
 (R1 ^reward R1060 +)
Firing propose*predict-yes
 -->
 (O2113 ^name predict-yes +)
 (S1 ^operator O2113 +)
Firing propose*predict-no
 -->
 (O2114 ^name predict-no +)
 (S1 ^operator O2114 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2112 = 0.3140207031247883)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2111 = 0.3804124062940181)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2112 ^name predict-no +)
 (S1 ^operator O2112 +)
Retracting propose*predict-yes
 -->
 (O2111 ^name predict-yes +)
 (S1 ^operator O2111 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1059 ^value 1 +)
 (R1 ^reward R1059 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2112 = 0.3140207031247883)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2112 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2111 = 0.3804124062940181)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2111 = 0.6195747904526593)
=>WM: (14822: S1 ^operator O2114 +)
=>WM: (14821: S1 ^operator O2113 +)
=>WM: (14820: O2114 ^name predict-no)
=>WM: (14819: O2113 ^name predict-yes)
=>WM: (14818: R1060 ^value 1)
=>WM: (14817: R1 ^reward R1060)
=>WM: (14816: I3 ^see 1)
<=WM: (14807: S1 ^operator O2111 +)
<=WM: (14809: S1 ^operator O2111)
<=WM: (14808: S1 ^operator O2112 +)
<=WM: (14802: R1 ^reward R1059)
<=WM: (14788: I3 ^see 0)
<=WM: (14805: O2112 ^name predict-no)
<=WM: (14804: O2111 ^name predict-yes)
<=WM: (14803: R1059 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2113 = 0.3804124062940181)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2113 = -0.3470159027404986)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2114 = 0.3140207031247883)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2114 = 0.6860547040638999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2112 = 0.3140207031247883)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2112 = 0.6860547040638999)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2111 = 0.3804124062940181)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2111 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521342 -0.14093 0.380412 -> 0.521343 -0.14093 0.380413(R,m,v=1,0.840909,0.134545)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478644 0.140931 0.619575 -> 0.478645 0.140931 0.619576(R,m,v=1,1,0)
=>WM: (14823: S1 ^operator O2114)

  1057:    O: O2114 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1057 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1056 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14824: I3 ^predict-no N1057)
<=WM: (14811: N1056 ^status complete)
<=WM: (14810: I3 ^predict-yes N1056)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\-/--- Input Phase --- 
=>WM: (14828: I2 ^dir L)
=>WM: (14827: I2 ^reward 1)
=>WM: (14826: I2 ^see 0)
=>WM: (14825: N1057 ^status complete)
<=WM: (14814: I2 ^dir L)
<=WM: (14813: I2 ^reward 1)
<=WM: (14812: I2 ^see 1)
=>WM: (14829: I2 ^level-1 L0-root)
<=WM: (14815: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2113 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2114 = 0.685860669441134)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1061 ^value 1 +)
 (R1 ^reward R1061 +)
Firing propose*predict-yes
 -->
 (O2115 ^name predict-yes +)
 (S1 ^operator O2115 +)
Firing propose*predict-no
 -->
 (O2116 ^name predict-no +)
 (S1 ^operator O2116 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2114 = 0.3140207031247883)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2113 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2114 ^name predict-no +)
 (S1 ^operator O2114 +)
Retracting propose*predict-yes
 -->
 (O2113 ^name predict-yes +)
 (S1 ^operator O2113 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1060 ^value 1 +)
 (R1 ^reward R1060 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2114 = 0.6860547040638999)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2114 = 0.3140207031247883)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2113 = -0.3470159027404986)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2113 = 0.3804134437534242)
=>WM: (14836: S1 ^operator O2116 +)
=>WM: (14835: S1 ^operator O2115 +)
=>WM: (14834: O2116 ^name predict-no)
=>WM: (14833: O2115 ^name predict-yes)
=>WM: (14832: R1061 ^value 1)
=>WM: (14831: R1 ^reward R1061)
=>WM: (14830: I3 ^see 0)
<=WM: (14821: S1 ^operator O2113 +)
<=WM: (14822: S1 ^operator O2114 +)
<=WM: (14823: S1 ^operator O2114)
<=WM: (14817: R1 ^reward R1060)
<=WM: (14816: I3 ^see 1)
<=WM: (14820: O2114 ^name predict-no)
<=WM: (14819: O2113 ^name predict-yes)
<=WM: (14818: R1060 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2115 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2115 = -0.3332708974800781)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2116 = 0.3140207031247883)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2116 = 0.685860669441134)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2114 = 0.3140207031247883)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2114 = 0.685860669441134)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2113 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2113 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485031 -0.17101 0.314021 -> 0.485026 -0.171011 0.314015(R,m,v=1,0.871951,0.112337)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515029 0.171026 0.686055 -> 0.515023 0.171024 0.686048(R,m,v=1,1,0)
=>WM: (14837: S1 ^operator O2116)

  1058:    O: O2116 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1058 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1057 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14838: I3 ^predict-no N1058)
<=WM: (14825: N1057 ^status complete)
<=WM: (14824: I3 ^predict-no N1057)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14842: I2 ^dir L)
=>WM: (14841: I2 ^reward 1)
=>WM: (14840: I2 ^see 0)
=>WM: (14839: N1058 ^status complete)
<=WM: (14828: I2 ^dir L)
<=WM: (14827: I2 ^reward 1)
<=WM: (14826: I2 ^see 0)
=>WM: (14843: I2 ^level-1 L0-root)
<=WM: (14829: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2115 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2116 = 0.685860669441134)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1062 ^value 1 +)
 (R1 ^reward R1062 +)
Firing propose*predict-yes
 -->
 (O2117 ^name predict-yes +)
 (S1 ^operator O2117 +)
Firing propose*predict-no
 -->
 (O2118 ^name predict-no +)
 (S1 ^operator O2118 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2116 = 0.3140145220723357)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2115 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2116 ^name predict-no +)
 (S1 ^operator O2116 +)
Retracting propose*predict-yes
 -->
 (O2115 ^name predict-yes +)
 (S1 ^operator O2115 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1061 ^value 1 +)
 (R1 ^reward R1061 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2116 = 0.685860669441134)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2116 = 0.3140145220723357)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2115 = -0.3332708974800781)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2115 = 0.3804134437534242)
=>WM: (14849: S1 ^operator O2118 +)
=>WM: (14848: S1 ^operator O2117 +)
=>WM: (14847: O2118 ^name predict-no)
=>WM: (14846: O2117 ^name predict-yes)
=>WM: (14845: R1062 ^value 1)
=>WM: (14844: R1 ^reward R1062)
<=WM: (14835: S1 ^operator O2115 +)
<=WM: (14836: S1 ^operator O2116 +)
<=WM: (14837: S1 ^operator O2116)
<=WM: (14831: R1 ^reward R1061)
<=WM: (14834: O2116 ^name predict-no)
<=WM: (14833: O2115 ^name predict-yes)
<=WM: (14832: R1061 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2117 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2117 = -0.3332708974800781)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2118 = 0.3140145220723357)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2118 = 0.685860669441134)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2116 = 0.3140145220723357)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2116 = 0.685860669441134)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2115 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2115 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485026 -0.171011 0.314015 -> 0.485034 -0.171009 0.314025(R,m,v=1,0.872727,0.111752)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514876 0.170985 0.685861 -> 0.514885 0.170988 0.685873(R,m,v=1,1,0)
=>WM: (14850: S1 ^operator O2118)

  1059:    O: O2118 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1059 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1058 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14851: I3 ^predict-no N1059)
<=WM: (14839: N1058 ^status complete)
<=WM: (14838: I3 ^predict-no N1058)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14855: I2 ^dir U)
=>WM: (14854: I2 ^reward 1)
=>WM: (14853: I2 ^see 0)
=>WM: (14852: N1059 ^status complete)
<=WM: (14842: I2 ^dir L)
<=WM: (14841: I2 ^reward 1)
<=WM: (14840: I2 ^see 0)
=>WM: (14856: I2 ^level-1 L0-root)
<=WM: (14843: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1063 ^value 1 +)
 (R1 ^reward R1063 +)
Firing propose*predict-yes
 -->
 (O2119 ^name predict-yes +)
 (S1 ^operator O2119 +)
Firing propose*predict-no
 -->
 (O2120 ^name predict-no +)
 (S1 ^operator O2120 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2118 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2117 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2118 ^name predict-no +)
 (S1 ^operator O2118 +)
Retracting propose*predict-yes
 -->
 (O2117 ^name predict-yes +)
 (S1 ^operator O2117 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1062 ^value 1 +)
 (R1 ^reward R1062 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2118 = 0.6858726594370528)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2118 = 0.3140247423148079)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2117 = -0.3332708974800781)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2117 = 0.3804134437534242)
=>WM: (14863: S1 ^operator O2120 +)
=>WM: (14862: S1 ^operator O2119 +)
=>WM: (14861: I3 ^dir U)
=>WM: (14860: O2120 ^name predict-no)
=>WM: (14859: O2119 ^name predict-yes)
=>WM: (14858: R1063 ^value 1)
=>WM: (14857: R1 ^reward R1063)
<=WM: (14848: S1 ^operator O2117 +)
<=WM: (14849: S1 ^operator O2118 +)
<=WM: (14850: S1 ^operator O2118)
<=WM: (14806: I3 ^dir L)
<=WM: (14844: R1 ^reward R1062)
<=WM: (14847: O2118 ^name predict-no)
<=WM: (14846: O2117 ^name predict-yes)
<=WM: (14845: R1062 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2119 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2120 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2118 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2117 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485034 -0.171009 0.314025 -> 0.485041 -0.171007 0.314033(R,m,v=1,0.873494,0.111172)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514885 0.170988 0.685873 -> 0.514893 0.17099 0.685882(R,m,v=1,1,0)
=>WM: (14864: S1 ^operator O2120)

  1060:    O: O2120 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1060 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1059 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14865: I3 ^predict-no N1060)
<=WM: (14852: N1059 ^status complete)
<=WM: (14851: I3 ^predict-no N1059)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14869: I2 ^dir U)
=>WM: (14868: I2 ^reward 1)
=>WM: (14867: I2 ^see 0)
=>WM: (14866: N1060 ^status complete)
<=WM: (14855: I2 ^dir U)
<=WM: (14854: I2 ^reward 1)
<=WM: (14853: I2 ^see 0)
=>WM: (14870: I2 ^level-1 L0-root)
<=WM: (14856: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1064 ^value 1 +)
 (R1 ^reward R1064 +)
Firing propose*predict-yes
 -->
 (O2121 ^name predict-yes +)
 (S1 ^operator O2121 +)
Firing propose*predict-no
 -->
 (O2122 ^name predict-no +)
 (S1 ^operator O2122 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2120 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2119 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2120 ^name predict-no +)
 (S1 ^operator O2120 +)
Retracting propose*predict-yes
 -->
 (O2119 ^name predict-yes +)
 (S1 ^operator O2119 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1063 ^value 1 +)
 (R1 ^reward R1063 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2120 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2119 = 0.)
=>WM: (14876: S1 ^operator O2122 +)
=>WM: (14875: S1 ^operator O2121 +)
=>WM: (14874: O2122 ^name predict-no)
=>WM: (14873: O2121 ^name predict-yes)
=>WM: (14872: R1064 ^value 1)
=>WM: (14871: R1 ^reward R1064)
<=WM: (14862: S1 ^operator O2119 +)
<=WM: (14863: S1 ^operator O2120 +)
<=WM: (14864: S1 ^operator O2120)
<=WM: (14857: R1 ^reward R1063)
<=WM: (14860: O2120 ^name predict-no)
<=WM: (14859: O2119 ^name predict-yes)
<=WM: (14858: R1063 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2121 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2122 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2120 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2119 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14877: S1 ^operator O2122)

  1061:    O: O2122 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1061 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1060 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14878: I3 ^predict-no N1061)
<=WM: (14866: N1060 ^status complete)
<=WM: (14865: I3 ^predict-no N1060)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\--- Input Phase --- 
=>WM: (14882: I2 ^dir L)
=>WM: (14881: I2 ^reward 1)
=>WM: (14880: I2 ^see 0)
=>WM: (14879: N1061 ^status complete)
<=WM: (14869: I2 ^dir U)
<=WM: (14868: I2 ^reward 1)
<=WM: (14867: I2 ^see 0)
=>WM: (14883: I2 ^level-1 L0-root)
<=WM: (14870: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2121 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2122 = 0.6858824877823619)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1065 ^value 1 +)
 (R1 ^reward R1065 +)
Firing propose*predict-yes
 -->
 (O2123 ^name predict-yes +)
 (S1 ^operator O2123 +)
Firing propose*predict-no
 -->
 (O2124 ^name predict-no +)
 (S1 ^operator O2124 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2122 = 0.3140331355128715)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2121 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2122 ^name predict-no +)
 (S1 ^operator O2122 +)
Retracting propose*predict-yes
 -->
 (O2121 ^name predict-yes +)
 (S1 ^operator O2121 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1064 ^value 1 +)
 (R1 ^reward R1064 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2122 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2121 = 0.)
=>WM: (14890: S1 ^operator O2124 +)
=>WM: (14889: S1 ^operator O2123 +)
=>WM: (14888: I3 ^dir L)
=>WM: (14887: O2124 ^name predict-no)
=>WM: (14886: O2123 ^name predict-yes)
=>WM: (14885: R1065 ^value 1)
=>WM: (14884: R1 ^reward R1065)
<=WM: (14875: S1 ^operator O2121 +)
<=WM: (14876: S1 ^operator O2122 +)
<=WM: (14877: S1 ^operator O2122)
<=WM: (14861: I3 ^dir U)
<=WM: (14871: R1 ^reward R1064)
<=WM: (14874: O2122 ^name predict-no)
<=WM: (14873: O2121 ^name predict-yes)
<=WM: (14872: R1064 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2123 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2123 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2124 = 0.6858824877823619)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2124 = 0.3140331355128715)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2122 = 0.3140331355128715)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2122 = 0.6858824877823619)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2121 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2121 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14891: S1 ^operator O2124)

  1062:    O: O2124 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1062 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1061 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14892: I3 ^predict-no N1062)
<=WM: (14879: N1061 ^status complete)
<=WM: (14878: I3 ^predict-no N1061)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (14896: I2 ^dir L)
=>WM: (14895: I2 ^reward 1)
=>WM: (14894: I2 ^see 0)
=>WM: (14893: N1062 ^status complete)
<=WM: (14882: I2 ^dir L)
<=WM: (14881: I2 ^reward 1)
<=WM: (14880: I2 ^see 0)
=>WM: (14897: I2 ^level-1 L0-root)
<=WM: (14883: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2123 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2124 = 0.6858824877823619)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1066 ^value 1 +)
 (R1 ^reward R1066 +)
Firing propose*predict-yes
 -->
 (O2125 ^name predict-yes +)
 (S1 ^operator O2125 +)
Firing propose*predict-no
 -->
 (O2126 ^name predict-no +)
 (S1 ^operator O2126 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2124 = 0.3140331355128715)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2123 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2124 ^name predict-no +)
 (S1 ^operator O2124 +)
Retracting propose*predict-yes
 -->
 (O2123 ^name predict-yes +)
 (S1 ^operator O2123 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1065 ^value 1 +)
 (R1 ^reward R1065 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2124 = 0.3140331355128715)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2124 = 0.6858824877823619)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2123 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2123 = -0.3332708974800781)
=>WM: (14903: S1 ^operator O2126 +)
=>WM: (14902: S1 ^operator O2125 +)
=>WM: (14901: O2126 ^name predict-no)
=>WM: (14900: O2125 ^name predict-yes)
=>WM: (14899: R1066 ^value 1)
=>WM: (14898: R1 ^reward R1066)
<=WM: (14889: S1 ^operator O2123 +)
<=WM: (14890: S1 ^operator O2124 +)
<=WM: (14891: S1 ^operator O2124)
<=WM: (14884: R1 ^reward R1065)
<=WM: (14887: O2124 ^name predict-no)
<=WM: (14886: O2123 ^name predict-yes)
<=WM: (14885: R1065 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2125 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2125 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2126 = 0.6858824877823619)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2126 = 0.3140331355128715)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2124 = 0.3140331355128715)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2124 = 0.6858824877823619)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2123 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2123 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485041 -0.171007 0.314033 -> 0.485046 -0.171006 0.31404(R,m,v=1,0.874251,0.110598)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514893 0.17099 0.685882 -> 0.514899 0.170991 0.685891(R,m,v=1,1,0)
=>WM: (14904: S1 ^operator O2126)

  1063:    O: O2126 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1063 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1062 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14905: I3 ^predict-no N1063)
<=WM: (14893: N1062 ^status complete)
<=WM: (14892: I3 ^predict-no N1062)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (14909: I2 ^dir U)
=>WM: (14908: I2 ^reward 1)
=>WM: (14907: I2 ^see 0)
=>WM: (14906: N1063 ^status complete)
<=WM: (14896: I2 ^dir L)
<=WM: (14895: I2 ^reward 1)
<=WM: (14894: I2 ^see 0)
=>WM: (14910: I2 ^level-1 L0-root)
<=WM: (14897: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1067 ^value 1 +)
 (R1 ^reward R1067 +)
Firing propose*predict-yes
 -->
 (O2127 ^name predict-yes +)
 (S1 ^operator O2127 +)
Firing propose*predict-no
 -->
 (O2128 ^name predict-no +)
 (S1 ^operator O2128 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2126 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2125 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2126 ^name predict-no +)
 (S1 ^operator O2126 +)
Retracting propose*predict-yes
 -->
 (O2125 ^name predict-yes +)
 (S1 ^operator O2125 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1066 ^value 1 +)
 (R1 ^reward R1066 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2126 = 0.3140400312949982)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2126 = 0.6858905480601469)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2125 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2125 = -0.3332708974800781)
=>WM: (14917: S1 ^operator O2128 +)
=>WM: (14916: S1 ^operator O2127 +)
=>WM: (14915: I3 ^dir U)
=>WM: (14914: O2128 ^name predict-no)
=>WM: (14913: O2127 ^name predict-yes)
=>WM: (14912: R1067 ^value 1)
=>WM: (14911: R1 ^reward R1067)
<=WM: (14902: S1 ^operator O2125 +)
<=WM: (14903: S1 ^operator O2126 +)
<=WM: (14904: S1 ^operator O2126)
<=WM: (14888: I3 ^dir L)
<=WM: (14898: R1 ^reward R1066)
<=WM: (14901: O2126 ^name predict-no)
<=WM: (14900: O2125 ^name predict-yes)
<=WM: (14899: R1066 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2127 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2128 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2126 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2125 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485046 -0.171006 0.31404 -> 0.48505 -0.171005 0.314046(R,m,v=1,0.875,0.11003)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514899 0.170991 0.685891 -> 0.514904 0.170993 0.685897(R,m,v=1,1,0)
=>WM: (14918: S1 ^operator O2128)

  1064:    O: O2128 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1064 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1063 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14919: I3 ^predict-no N1064)
<=WM: (14906: N1063 ^status complete)
<=WM: (14905: I3 ^predict-no N1063)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14923: I2 ^dir L)
=>WM: (14922: I2 ^reward 1)
=>WM: (14921: I2 ^see 0)
=>WM: (14920: N1064 ^status complete)
<=WM: (14909: I2 ^dir U)
<=WM: (14908: I2 ^reward 1)
<=WM: (14907: I2 ^see 0)
=>WM: (14924: I2 ^level-1 L0-root)
<=WM: (14910: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2127 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2128 = 0.6858971614456655)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1068 ^value 1 +)
 (R1 ^reward R1068 +)
Firing propose*predict-yes
 -->
 (O2129 ^name predict-yes +)
 (S1 ^operator O2129 +)
Firing propose*predict-no
 -->
 (O2130 ^name predict-no +)
 (S1 ^operator O2130 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2128 = 0.3140456992451273)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2127 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2128 ^name predict-no +)
 (S1 ^operator O2128 +)
Retracting propose*predict-yes
 -->
 (O2127 ^name predict-yes +)
 (S1 ^operator O2127 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1067 ^value 1 +)
 (R1 ^reward R1067 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2128 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2127 = 0.)
=>WM: (14931: S1 ^operator O2130 +)
=>WM: (14930: S1 ^operator O2129 +)
=>WM: (14929: I3 ^dir L)
=>WM: (14928: O2130 ^name predict-no)
=>WM: (14927: O2129 ^name predict-yes)
=>WM: (14926: R1068 ^value 1)
=>WM: (14925: R1 ^reward R1068)
<=WM: (14916: S1 ^operator O2127 +)
<=WM: (14917: S1 ^operator O2128 +)
<=WM: (14918: S1 ^operator O2128)
<=WM: (14915: I3 ^dir U)
<=WM: (14911: R1 ^reward R1067)
<=WM: (14914: O2128 ^name predict-no)
<=WM: (14913: O2127 ^name predict-yes)
<=WM: (14912: R1067 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2129 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2129 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2130 = 0.6858971614456655)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2130 = 0.3140456992451273)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2128 = 0.3140456992451273)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2128 = 0.6858971614456655)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2127 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2127 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14932: S1 ^operator O2130)

  1065:    O: O2130 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1065 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1064 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14933: I3 ^predict-no N1065)
<=WM: (14920: N1064 ^status complete)
<=WM: (14919: I3 ^predict-no N1064)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14937: I2 ^dir L)
=>WM: (14936: I2 ^reward 1)
=>WM: (14935: I2 ^see 0)
=>WM: (14934: N1065 ^status complete)
<=WM: (14923: I2 ^dir L)
<=WM: (14922: I2 ^reward 1)
<=WM: (14921: I2 ^see 0)
=>WM: (14938: I2 ^level-1 L0-root)
<=WM: (14924: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2129 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2130 = 0.6858971614456655)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1069 ^value 1 +)
 (R1 ^reward R1069 +)
Firing propose*predict-yes
 -->
 (O2131 ^name predict-yes +)
 (S1 ^operator O2131 +)
Firing propose*predict-no
 -->
 (O2132 ^name predict-no +)
 (S1 ^operator O2132 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2130 = 0.3140456992451273)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2129 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2130 ^name predict-no +)
 (S1 ^operator O2130 +)
Retracting propose*predict-yes
 -->
 (O2129 ^name predict-yes +)
 (S1 ^operator O2129 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1068 ^value 1 +)
 (R1 ^reward R1068 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2130 = 0.3140456992451273)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2130 = 0.6858971614456655)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2129 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2129 = -0.3332708974800781)
=>WM: (14944: S1 ^operator O2132 +)
=>WM: (14943: S1 ^operator O2131 +)
=>WM: (14942: O2132 ^name predict-no)
=>WM: (14941: O2131 ^name predict-yes)
=>WM: (14940: R1069 ^value 1)
=>WM: (14939: R1 ^reward R1069)
<=WM: (14930: S1 ^operator O2129 +)
<=WM: (14931: S1 ^operator O2130 +)
<=WM: (14932: S1 ^operator O2130)
<=WM: (14925: R1 ^reward R1068)
<=WM: (14928: O2130 ^name predict-no)
<=WM: (14927: O2129 ^name predict-yes)
<=WM: (14926: R1068 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2131 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2131 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2132 = 0.6858971614456655)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2132 = 0.3140456992451273)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2130 = 0.3140456992451273)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2130 = 0.6858971614456655)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2129 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2129 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.48505 -0.171005 0.314046 -> 0.485054 -0.171004 0.31405(R,m,v=1,0.87574,0.109467)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514904 0.170993 0.685897 -> 0.514909 0.170994 0.685903(R,m,v=1,1,0)
=>WM: (14945: S1 ^operator O2132)

  1066:    O: O2132 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1066 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1065 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14946: I3 ^predict-no N1066)
<=WM: (14934: N1065 ^status complete)
<=WM: (14933: I3 ^predict-no N1065)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14950: I2 ^dir L)
=>WM: (14949: I2 ^reward 1)
=>WM: (14948: I2 ^see 0)
=>WM: (14947: N1066 ^status complete)
<=WM: (14937: I2 ^dir L)
<=WM: (14936: I2 ^reward 1)
<=WM: (14935: I2 ^see 0)
=>WM: (14951: I2 ^level-1 L0-root)
<=WM: (14938: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2131 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2132 = 0.6859025901730954)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1070 ^value 1 +)
 (R1 ^reward R1070 +)
Firing propose*predict-yes
 -->
 (O2133 ^name predict-yes +)
 (S1 ^operator O2133 +)
Firing propose*predict-no
 -->
 (O2134 ^name predict-no +)
 (S1 ^operator O2134 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2132 = 0.3140503599509452)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2131 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2132 ^name predict-no +)
 (S1 ^operator O2132 +)
Retracting propose*predict-yes
 -->
 (O2131 ^name predict-yes +)
 (S1 ^operator O2131 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1069 ^value 1 +)
 (R1 ^reward R1069 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2132 = 0.3140503599509452)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2132 = 0.6859025901730954)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2131 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2131 = -0.3332708974800781)
=>WM: (14957: S1 ^operator O2134 +)
=>WM: (14956: S1 ^operator O2133 +)
=>WM: (14955: O2134 ^name predict-no)
=>WM: (14954: O2133 ^name predict-yes)
=>WM: (14953: R1070 ^value 1)
=>WM: (14952: R1 ^reward R1070)
<=WM: (14943: S1 ^operator O2131 +)
<=WM: (14944: S1 ^operator O2132 +)
<=WM: (14945: S1 ^operator O2132)
<=WM: (14939: R1 ^reward R1069)
<=WM: (14942: O2132 ^name predict-no)
<=WM: (14941: O2131 ^name predict-yes)
<=WM: (14940: R1069 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2133 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2133 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2134 = 0.6859025901730954)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2134 = 0.3140503599509452)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2132 = 0.3140503599509452)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2132 = 0.6859025901730954)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2131 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2131 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485054 -0.171004 0.31405 -> 0.485057 -0.171003 0.314054(R,m,v=1,0.876471,0.108911)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514909 0.170994 0.685903 -> 0.514912 0.170995 0.685907(R,m,v=1,1,0)
=>WM: (14958: S1 ^operator O2134)

  1067:    O: O2134 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1067 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1066 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14959: I3 ^predict-no N1067)
<=WM: (14947: N1066 ^status complete)
<=WM: (14946: I3 ^predict-no N1066)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14963: I2 ^dir L)
=>WM: (14962: I2 ^reward 1)
=>WM: (14961: I2 ^see 0)
=>WM: (14960: N1067 ^status complete)
<=WM: (14950: I2 ^dir L)
<=WM: (14949: I2 ^reward 1)
<=WM: (14948: I2 ^see 0)
=>WM: (14964: I2 ^level-1 L0-root)
<=WM: (14951: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2133 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2134 = 0.6859070484688164)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1071 ^value 1 +)
 (R1 ^reward R1071 +)
Firing propose*predict-yes
 -->
 (O2135 ^name predict-yes +)
 (S1 ^operator O2135 +)
Firing propose*predict-no
 -->
 (O2136 ^name predict-no +)
 (S1 ^operator O2136 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2134 = 0.3140541939976826)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2133 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2134 ^name predict-no +)
 (S1 ^operator O2134 +)
Retracting propose*predict-yes
 -->
 (O2133 ^name predict-yes +)
 (S1 ^operator O2133 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1070 ^value 1 +)
 (R1 ^reward R1070 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2134 = 0.3140541939976826)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2134 = 0.6859070484688164)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2133 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2133 = -0.3332708974800781)
=>WM: (14970: S1 ^operator O2136 +)
=>WM: (14969: S1 ^operator O2135 +)
=>WM: (14968: O2136 ^name predict-no)
=>WM: (14967: O2135 ^name predict-yes)
=>WM: (14966: R1071 ^value 1)
=>WM: (14965: R1 ^reward R1071)
<=WM: (14956: S1 ^operator O2133 +)
<=WM: (14957: S1 ^operator O2134 +)
<=WM: (14958: S1 ^operator O2134)
<=WM: (14952: R1 ^reward R1070)
<=WM: (14955: O2134 ^name predict-no)
<=WM: (14954: O2133 ^name predict-yes)
<=WM: (14953: R1070 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2135 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2135 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2136 = 0.6859070484688164)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2136 = 0.3140541939976826)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2134 = 0.3140541939976826)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2134 = 0.6859070484688164)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2133 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2133 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485057 -0.171003 0.314054 -> 0.48506 -0.171002 0.314057(R,m,v=1,0.877193,0.108359)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514912 0.170995 0.685907 -> 0.514915 0.170996 0.685911(R,m,v=1,1,0)
=>WM: (14971: S1 ^operator O2136)

  1068:    O: O2136 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1068 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1067 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14972: I3 ^predict-no N1068)
<=WM: (14960: N1067 ^status complete)
<=WM: (14959: I3 ^predict-no N1067)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (14976: I2 ^dir L)
=>WM: (14975: I2 ^reward 1)
=>WM: (14974: I2 ^see 0)
=>WM: (14973: N1068 ^status complete)
<=WM: (14963: I2 ^dir L)
<=WM: (14962: I2 ^reward 1)
<=WM: (14961: I2 ^see 0)
=>WM: (14977: I2 ^level-1 L0-root)
<=WM: (14964: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2135 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2136 = 0.6859107114336244)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1072 ^value 1 +)
 (R1 ^reward R1072 +)
Firing propose*predict-yes
 -->
 (O2137 ^name predict-yes +)
 (S1 ^operator O2137 +)
Firing propose*predict-no
 -->
 (O2138 ^name predict-no +)
 (S1 ^operator O2138 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2136 = 0.3140573492937311)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2135 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2136 ^name predict-no +)
 (S1 ^operator O2136 +)
Retracting propose*predict-yes
 -->
 (O2135 ^name predict-yes +)
 (S1 ^operator O2135 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1071 ^value 1 +)
 (R1 ^reward R1071 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2136 = 0.3140573492937311)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2136 = 0.6859107114336244)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2135 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2135 = -0.3332708974800781)
=>WM: (14983: S1 ^operator O2138 +)
=>WM: (14982: S1 ^operator O2137 +)
=>WM: (14981: O2138 ^name predict-no)
=>WM: (14980: O2137 ^name predict-yes)
=>WM: (14979: R1072 ^value 1)
=>WM: (14978: R1 ^reward R1072)
<=WM: (14969: S1 ^operator O2135 +)
<=WM: (14970: S1 ^operator O2136 +)
<=WM: (14971: S1 ^operator O2136)
<=WM: (14965: R1 ^reward R1071)
<=WM: (14968: O2136 ^name predict-no)
<=WM: (14967: O2135 ^name predict-yes)
<=WM: (14966: R1071 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2137 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2137 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2138 = 0.6859107114336244)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2138 = 0.3140573492937311)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2136 = 0.3140573492937311)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2136 = 0.6859107114336244)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2135 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2135 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.48506 -0.171002 0.314057 -> 0.485062 -0.171002 0.31406(R,m,v=1,0.877907,0.107813)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514915 0.170996 0.685911 -> 0.514917 0.170996 0.685914(R,m,v=1,1,0)
=>WM: (14984: S1 ^operator O2138)

  1069:    O: O2138 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1069 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1068 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14985: I3 ^predict-no N1069)
<=WM: (14973: N1068 ^status complete)
<=WM: (14972: I3 ^predict-no N1068)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14989: I2 ^dir L)
=>WM: (14988: I2 ^reward 1)
=>WM: (14987: I2 ^see 0)
=>WM: (14986: N1069 ^status complete)
<=WM: (14976: I2 ^dir L)
<=WM: (14975: I2 ^reward 1)
<=WM: (14974: I2 ^see 0)
=>WM: (14990: I2 ^level-1 L0-root)
<=WM: (14977: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2137 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2138 = 0.6859137222632506)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1073 ^value 1 +)
 (R1 ^reward R1073 +)
Firing propose*predict-yes
 -->
 (O2139 ^name predict-yes +)
 (S1 ^operator O2139 +)
Firing propose*predict-no
 -->
 (O2140 ^name predict-no +)
 (S1 ^operator O2140 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2138 = 0.3140599470408917)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2137 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2138 ^name predict-no +)
 (S1 ^operator O2138 +)
Retracting propose*predict-yes
 -->
 (O2137 ^name predict-yes +)
 (S1 ^operator O2137 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1072 ^value 1 +)
 (R1 ^reward R1072 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2138 = 0.3140599470408917)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2138 = 0.6859137222632506)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2137 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2137 = -0.3332708974800781)
=>WM: (14996: S1 ^operator O2140 +)
=>WM: (14995: S1 ^operator O2139 +)
=>WM: (14994: O2140 ^name predict-no)
=>WM: (14993: O2139 ^name predict-yes)
=>WM: (14992: R1073 ^value 1)
=>WM: (14991: R1 ^reward R1073)
<=WM: (14982: S1 ^operator O2137 +)
<=WM: (14983: S1 ^operator O2138 +)
<=WM: (14984: S1 ^operator O2138)
<=WM: (14978: R1 ^reward R1072)
<=WM: (14981: O2138 ^name predict-no)
<=WM: (14980: O2137 ^name predict-yes)
<=WM: (14979: R1072 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2139 = -0.3332708974800781)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2139 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2140 = 0.6859137222632506)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2140 = 0.3140599470408917)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2138 = 0.3140599470408917)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2138 = 0.6859137222632506)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2137 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2137 = -0.3332708974800781)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485062 -0.171002 0.31406 -> 0.485063 -0.171001 0.314062(R,m,v=1,0.878613,0.107272)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514917 0.170996 0.685914 -> 0.514919 0.170997 0.685916(R,m,v=1,1,0)
=>WM: (14997: S1 ^operator O2140)

  1070:    O: O2140 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1070 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1069 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14998: I3 ^predict-no N1070)
<=WM: (14986: N1069 ^status complete)
<=WM: (14985: I3 ^predict-no N1069)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15002: I2 ^dir R)
=>WM: (15001: I2 ^reward 1)
=>WM: (15000: I2 ^see 0)
=>WM: (14999: N1070 ^status complete)
<=WM: (14989: I2 ^dir L)
<=WM: (14988: I2 ^reward 1)
<=WM: (14987: I2 ^see 0)
=>WM: (15003: I2 ^level-1 L0-root)
<=WM: (14990: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2139 = 0.7057943466848455)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2140 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1074 ^value 1 +)
 (R1 ^reward R1074 +)
Firing propose*predict-yes
 -->
 (O2141 ^name predict-yes +)
 (S1 ^operator O2141 +)
Firing propose*predict-no
 -->
 (O2142 ^name predict-no +)
 (S1 ^operator O2142 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2140 = 0.2298602070490972)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2139 = 0.2940539968979803)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2140 ^name predict-no +)
 (S1 ^operator O2140 +)
Retracting propose*predict-yes
 -->
 (O2139 ^name predict-yes +)
 (S1 ^operator O2139 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1073 ^value 1 +)
 (R1 ^reward R1073 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2140 = 0.3140620866027386)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
 -->
 (S1 ^operator O2140 = 0.6859161981217101)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2139 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
 -->
 (S1 ^operator O2139 = -0.3332708974800781)
=>WM: (15010: S1 ^operator O2142 +)
=>WM: (15009: S1 ^operator O2141 +)
=>WM: (15008: I3 ^dir R)
=>WM: (15007: O2142 ^name predict-no)
=>WM: (15006: O2141 ^name predict-yes)
=>WM: (15005: R1074 ^value 1)
=>WM: (15004: R1 ^reward R1074)
<=WM: (14995: S1 ^operator O2139 +)
<=WM: (14996: S1 ^operator O2140 +)
<=WM: (14997: S1 ^operator O2140)
<=WM: (14929: I3 ^dir L)
<=WM: (14991: R1 ^reward R1073)
<=WM: (14994: O2140 ^name predict-no)
<=WM: (14993: O2139 ^name predict-yes)
<=WM: (14992: R1073 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2141 = 0.7057943466848455)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2141 = 0.2940539968979803)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2142 = -0.2023211881870005)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2142 = 0.2298602070490972)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2140 = 0.2298602070490972)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2140 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2139 = 0.2940539968979803)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2139 = 0.7057943466848455)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485063 -0.171001 0.314062 -> 0.485065 -0.171001 0.314064(R,m,v=1,0.87931,0.106737)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514919 0.170997 0.685916 -> 0.514921 0.170997 0.685918(R,m,v=1,1,0)
=>WM: (15011: S1 ^operator O2141)

  1071:    O: O2141 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1071 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1070 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15012: I3 ^predict-yes N1071)
<=WM: (14999: N1070 ^status complete)
<=WM: (14998: I3 ^predict-no N1070)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
---- Input Phase --- 
=>WM: (15016: I2 ^dir U)
=>WM: (15015: I2 ^reward 1)
=>WM: (15014: I2 ^see 1)
=>WM: (15013: N1071 ^status complete)
<=WM: (15002: I2 ^dir R)
<=WM: (15001: I2 ^reward 1)
<=WM: (15000: I2 ^see 0)
=>WM: (15017: I2 ^level-1 R1-root)
<=WM: (15003: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1075 ^value 1 +)
 (R1 ^reward R1075 +)
Firing propose*predict-yes
 -->
 (O2143 ^name predict-yes +)
 (S1 ^operator O2143 +)
Firing propose*predict-no
 -->
 (O2144 ^name predict-no +)
 (S1 ^operator O2144 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2142 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2141 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2142 ^name predict-no +)
 (S1 ^operator O2142 +)
Retracting propose*predict-yes
 -->
 (O2141 ^name predict-yes +)
 (S1 ^operator O2141 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1074 ^value 1 +)
 (R1 ^reward R1074 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2142 = 0.2298602070490972)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2142 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2141 = 0.2940539968979803)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2141 = 0.7057943466848455)
=>WM: (15025: S1 ^operator O2144 +)
=>WM: (15024: S1 ^operator O2143 +)
=>WM: (15023: I3 ^dir U)
=>WM: (15022: O2144 ^name predict-no)
=>WM: (15021: O2143 ^name predict-yes)
=>WM: (15020: R1075 ^value 1)
=>WM: (15019: R1 ^reward R1075)
=>WM: (15018: I3 ^see 1)
<=WM: (15009: S1 ^operator O2141 +)
<=WM: (15011: S1 ^operator O2141)
<=WM: (15010: S1 ^operator O2142 +)
<=WM: (15008: I3 ^dir R)
<=WM: (15004: R1 ^reward R1074)
<=WM: (14830: I3 ^see 0)
<=WM: (15007: O2142 ^name predict-no)
<=WM: (15006: O2141 ^name predict-yes)
<=WM: (15005: R1074 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2143 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2144 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2142 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2141 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501123 -0.207069 0.294054 -> 0.501134 -0.207068 0.294066(R,m,v=1,0.856287,0.123801)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.49874 0.207054 0.705794 -> 0.498753 0.207056 0.705809(R,m,v=1,1,0)
=>WM: (15026: S1 ^operator O2144)

  1072:    O: O2144 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1072 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1071 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15027: I3 ^predict-no N1072)
<=WM: (15013: N1071 ^status complete)
<=WM: (15012: I3 ^predict-yes N1071)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15031: I2 ^dir R)
=>WM: (15030: I2 ^reward 1)
=>WM: (15029: I2 ^see 0)
=>WM: (15028: N1072 ^status complete)
<=WM: (15016: I2 ^dir U)
<=WM: (15015: I2 ^reward 1)
<=WM: (15014: I2 ^see 1)
=>WM: (15032: I2 ^level-1 R1-root)
<=WM: (15017: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2143 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2144 = 0.770161537509104)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1076 ^value 1 +)
 (R1 ^reward R1076 +)
Firing propose*predict-yes
 -->
 (O2145 ^name predict-yes +)
 (S1 ^operator O2145 +)
Firing propose*predict-no
 -->
 (O2146 ^name predict-no +)
 (S1 ^operator O2146 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2144 = 0.2298602070490972)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2143 = 0.2940663911910953)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2144 ^name predict-no +)
 (S1 ^operator O2144 +)
Retracting propose*predict-yes
 -->
 (O2143 ^name predict-yes +)
 (S1 ^operator O2143 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1075 ^value 1 +)
 (R1 ^reward R1075 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2144 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2143 = 0.)
=>WM: (15040: S1 ^operator O2146 +)
=>WM: (15039: S1 ^operator O2145 +)
=>WM: (15038: I3 ^dir R)
=>WM: (15037: O2146 ^name predict-no)
=>WM: (15036: O2145 ^name predict-yes)
=>WM: (15035: R1076 ^value 1)
=>WM: (15034: R1 ^reward R1076)
=>WM: (15033: I3 ^see 0)
<=WM: (15024: S1 ^operator O2143 +)
<=WM: (15025: S1 ^operator O2144 +)
<=WM: (15026: S1 ^operator O2144)
<=WM: (15023: I3 ^dir U)
<=WM: (15019: R1 ^reward R1075)
<=WM: (15018: I3 ^see 1)
<=WM: (15022: O2144 ^name predict-no)
<=WM: (15021: O2143 ^name predict-yes)
<=WM: (15020: R1075 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2145 = -0.252585164213872)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2145 = 0.2940663911910953)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2146 = 0.770161537509104)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2146 = 0.2298602070490972)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2144 = 0.2298602070490972)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2144 = 0.770161537509104)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2143 = 0.2940663911910953)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2143 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15041: S1 ^operator O2146)

  1073:    O: O2146 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1073 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1072 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15042: I3 ^predict-no N1073)
<=WM: (15028: N1072 ^status complete)
<=WM: (15027: I3 ^predict-no N1072)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15046: I2 ^dir L)
=>WM: (15045: I2 ^reward 1)
=>WM: (15044: I2 ^see 0)
=>WM: (15043: N1073 ^status complete)
<=WM: (15031: I2 ^dir R)
<=WM: (15030: I2 ^reward 1)
<=WM: (15029: I2 ^see 0)
=>WM: (15047: I2 ^level-1 R0-root)
<=WM: (15032: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2145 = 0.6195760036479832)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2146 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1077 ^value 1 +)
 (R1 ^reward R1077 +)
Firing propose*predict-yes
 -->
 (O2147 ^name predict-yes +)
 (S1 ^operator O2147 +)
Firing propose*predict-no
 -->
 (O2148 ^name predict-no +)
 (S1 ^operator O2148 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2146 = 0.3140638494766289)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2145 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2146 ^name predict-no +)
 (S1 ^operator O2146 +)
Retracting propose*predict-yes
 -->
 (O2145 ^name predict-yes +)
 (S1 ^operator O2145 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1076 ^value 1 +)
 (R1 ^reward R1076 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2146 = 0.2298602070490972)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2146 = 0.770161537509104)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2145 = 0.2940663911910953)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2145 = -0.252585164213872)
=>WM: (15054: S1 ^operator O2148 +)
=>WM: (15053: S1 ^operator O2147 +)
=>WM: (15052: I3 ^dir L)
=>WM: (15051: O2148 ^name predict-no)
=>WM: (15050: O2147 ^name predict-yes)
=>WM: (15049: R1077 ^value 1)
=>WM: (15048: R1 ^reward R1077)
<=WM: (15039: S1 ^operator O2145 +)
<=WM: (15040: S1 ^operator O2146 +)
<=WM: (15041: S1 ^operator O2146)
<=WM: (15038: I3 ^dir R)
<=WM: (15034: R1 ^reward R1076)
<=WM: (15037: O2146 ^name predict-no)
<=WM: (15036: O2145 ^name predict-yes)
<=WM: (15035: R1076 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2147 = 0.3804134437534242)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2147 = 0.6195760036479832)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2148 = 0.3140638494766289)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2148 = -0.2190661556260421)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2146 = 0.3140638494766289)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2146 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2145 = 0.3804134437534242)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2145 = 0.6195760036479832)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611912 -0.382052 0.22986 -> 0.611911 -0.382052 0.229858(R,m,v=1,0.854839,0.12476)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388105 0.382056 0.770162 -> 0.388104 0.382056 0.770159(R,m,v=1,1,0)
=>WM: (15055: S1 ^operator O2147)

  1074:    O: O2147 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1074 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1073 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15056: I3 ^predict-yes N1074)
<=WM: (15043: N1073 ^status complete)
<=WM: (15042: I3 ^predict-no N1073)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15060: I2 ^dir U)
=>WM: (15059: I2 ^reward 1)
=>WM: (15058: I2 ^see 1)
=>WM: (15057: N1074 ^status complete)
<=WM: (15046: I2 ^dir L)
<=WM: (15045: I2 ^reward 1)
<=WM: (15044: I2 ^see 0)
=>WM: (15061: I2 ^level-1 L1-root)
<=WM: (15047: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1078 ^value 1 +)
 (R1 ^reward R1078 +)
Firing propose*predict-yes
 -->
 (O2149 ^name predict-yes +)
 (S1 ^operator O2149 +)
Firing propose*predict-no
 -->
 (O2150 ^name predict-no +)
 (S1 ^operator O2150 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2148 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2147 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2148 ^name predict-no +)
 (S1 ^operator O2148 +)
Retracting propose*predict-yes
 -->
 (O2147 ^name predict-yes +)
 (S1 ^operator O2147 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1077 ^value 1 +)
 (R1 ^reward R1077 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2148 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2148 = 0.3140638494766289)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2147 = 0.6195760036479832)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2147 = 0.3804134437534242)
=>WM: (15069: S1 ^operator O2150 +)
=>WM: (15068: S1 ^operator O2149 +)
=>WM: (15067: I3 ^dir U)
=>WM: (15066: O2150 ^name predict-no)
=>WM: (15065: O2149 ^name predict-yes)
=>WM: (15064: R1078 ^value 1)
=>WM: (15063: R1 ^reward R1078)
=>WM: (15062: I3 ^see 1)
<=WM: (15053: S1 ^operator O2147 +)
<=WM: (15055: S1 ^operator O2147)
<=WM: (15054: S1 ^operator O2148 +)
<=WM: (15052: I3 ^dir L)
<=WM: (15048: R1 ^reward R1077)
<=WM: (15033: I3 ^see 0)
<=WM: (15051: O2148 ^name predict-no)
<=WM: (15050: O2147 ^name predict-yes)
<=WM: (15049: R1077 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2149 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2150 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2148 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2147 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521343 -0.14093 0.380413 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.841808,0.133924)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478645 0.140931 0.619576 -> 0.478646 0.140931 0.619577(R,m,v=1,1,0)
=>WM: (15070: S1 ^operator O2150)

  1075:    O: O2150 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1075 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1074 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15071: I3 ^predict-no N1075)
<=WM: (15057: N1074 ^status complete)
<=WM: (15056: I3 ^predict-yes N1074)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15075: I2 ^dir L)
=>WM: (15074: I2 ^reward 1)
=>WM: (15073: I2 ^see 0)
=>WM: (15072: N1075 ^status complete)
<=WM: (15060: I2 ^dir U)
<=WM: (15059: I2 ^reward 1)
<=WM: (15058: I2 ^see 1)
=>WM: (15076: I2 ^level-1 L1-root)
<=WM: (15061: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2149 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2150 = 0.6860475006196615)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1079 ^value 1 +)
 (R1 ^reward R1079 +)
Firing propose*predict-yes
 -->
 (O2151 ^name predict-yes +)
 (S1 ^operator O2151 +)
Firing propose*predict-no
 -->
 (O2152 ^name predict-no +)
 (S1 ^operator O2152 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2150 = 0.3140638494766289)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2149 = 0.3804142980557849)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2150 ^name predict-no +)
 (S1 ^operator O2150 +)
Retracting propose*predict-yes
 -->
 (O2149 ^name predict-yes +)
 (S1 ^operator O2149 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1078 ^value 1 +)
 (R1 ^reward R1078 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2150 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2149 = 0.)
=>WM: (15084: S1 ^operator O2152 +)
=>WM: (15083: S1 ^operator O2151 +)
=>WM: (15082: I3 ^dir L)
=>WM: (15081: O2152 ^name predict-no)
=>WM: (15080: O2151 ^name predict-yes)
=>WM: (15079: R1079 ^value 1)
=>WM: (15078: R1 ^reward R1079)
=>WM: (15077: I3 ^see 0)
<=WM: (15068: S1 ^operator O2149 +)
<=WM: (15069: S1 ^operator O2150 +)
<=WM: (15070: S1 ^operator O2150)
<=WM: (15067: I3 ^dir U)
<=WM: (15063: R1 ^reward R1078)
<=WM: (15062: I3 ^see 1)
<=WM: (15066: O2150 ^name predict-no)
<=WM: (15065: O2149 ^name predict-yes)
<=WM: (15064: R1078 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2151 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2151 = 0.3804142980557849)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2152 = 0.6860475006196615)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2152 = 0.3140638494766289)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2150 = 0.3140638494766289)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2150 = 0.6860475006196615)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2149 = 0.3804142980557849)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2149 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15085: S1 ^operator O2152)

  1076:    O: O2152 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1076 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1075 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15086: I3 ^predict-no N1076)
<=WM: (15072: N1075 ^status complete)
<=WM: (15071: I3 ^predict-no N1075)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (15090: I2 ^dir R)
=>WM: (15089: I2 ^reward 1)
=>WM: (15088: I2 ^see 0)
=>WM: (15087: N1076 ^status complete)
<=WM: (15075: I2 ^dir L)
<=WM: (15074: I2 ^reward 1)
<=WM: (15073: I2 ^see 0)
=>WM: (15091: I2 ^level-1 L0-root)
<=WM: (15076: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2151 = 0.7058089158850139)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2152 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1080 ^value 1 +)
 (R1 ^reward R1080 +)
Firing propose*predict-yes
 -->
 (O2153 ^name predict-yes +)
 (S1 ^operator O2153 +)
Firing propose*predict-no
 -->
 (O2154 ^name predict-no +)
 (S1 ^operator O2154 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2152 = 0.229858460707707)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2151 = 0.2940663911910953)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2152 ^name predict-no +)
 (S1 ^operator O2152 +)
Retracting propose*predict-yes
 -->
 (O2151 ^name predict-yes +)
 (S1 ^operator O2151 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1079 ^value 1 +)
 (R1 ^reward R1079 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2152 = 0.3140638494766289)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2152 = 0.6860475006196615)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2151 = 0.3804142980557849)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2151 = -0.3470159027404986)
=>WM: (15098: S1 ^operator O2154 +)
=>WM: (15097: S1 ^operator O2153 +)
=>WM: (15096: I3 ^dir R)
=>WM: (15095: O2154 ^name predict-no)
=>WM: (15094: O2153 ^name predict-yes)
=>WM: (15093: R1080 ^value 1)
=>WM: (15092: R1 ^reward R1080)
<=WM: (15083: S1 ^operator O2151 +)
<=WM: (15084: S1 ^operator O2152 +)
<=WM: (15085: S1 ^operator O2152)
<=WM: (15082: I3 ^dir L)
<=WM: (15078: R1 ^reward R1079)
<=WM: (15081: O2152 ^name predict-no)
<=WM: (15080: O2151 ^name predict-yes)
<=WM: (15079: R1079 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2153 = 0.2940663911910953)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2153 = 0.7058089158850139)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2154 = 0.229858460707707)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2154 = -0.2023211881870005)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2152 = 0.229858460707707)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2152 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2151 = 0.2940663911910953)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2151 = 0.7058089158850139)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485065 -0.171001 0.314064 -> 0.485058 -0.171003 0.314055(R,m,v=1,0.88,0.106207)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515023 0.171024 0.686048 -> 0.515015 0.171022 0.686037(R,m,v=1,1,0)
=>WM: (15099: S1 ^operator O2153)

  1077:    O: O2153 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1077 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1076 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15100: I3 ^predict-yes N1077)
<=WM: (15087: N1076 ^status complete)
<=WM: (15086: I3 ^predict-no N1076)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15104: I2 ^dir L)
=>WM: (15103: I2 ^reward 1)
=>WM: (15102: I2 ^see 1)
=>WM: (15101: N1077 ^status complete)
<=WM: (15090: I2 ^dir R)
<=WM: (15089: I2 ^reward 1)
<=WM: (15088: I2 ^see 0)
=>WM: (15105: I2 ^level-1 R1-root)
<=WM: (15091: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2153 = 0.6195991016645057)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2154 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1081 ^value 1 +)
 (R1 ^reward R1081 +)
Firing propose*predict-yes
 -->
 (O2155 ^name predict-yes +)
 (S1 ^operator O2155 +)
Firing propose*predict-no
 -->
 (O2156 ^name predict-no +)
 (S1 ^operator O2156 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2154 = 0.3140548183361512)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2153 = 0.3804142980557849)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2154 ^name predict-no +)
 (S1 ^operator O2154 +)
Retracting propose*predict-yes
 -->
 (O2153 ^name predict-yes +)
 (S1 ^operator O2153 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1080 ^value 1 +)
 (R1 ^reward R1080 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2154 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2154 = 0.229858460707707)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2153 = 0.7058089158850139)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2153 = 0.2940663911910953)
=>WM: (15113: S1 ^operator O2156 +)
=>WM: (15112: S1 ^operator O2155 +)
=>WM: (15111: I3 ^dir L)
=>WM: (15110: O2156 ^name predict-no)
=>WM: (15109: O2155 ^name predict-yes)
=>WM: (15108: R1081 ^value 1)
=>WM: (15107: R1 ^reward R1081)
=>WM: (15106: I3 ^see 1)
<=WM: (15097: S1 ^operator O2153 +)
<=WM: (15099: S1 ^operator O2153)
<=WM: (15098: S1 ^operator O2154 +)
<=WM: (15096: I3 ^dir R)
<=WM: (15092: R1 ^reward R1080)
<=WM: (15077: I3 ^see 0)
<=WM: (15095: O2154 ^name predict-no)
<=WM: (15094: O2153 ^name predict-yes)
<=WM: (15093: R1080 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2155 = 0.3804142980557849)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2155 = 0.6195991016645057)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2156 = 0.3140548183361512)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2156 = -0.1479504104026684)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2154 = 0.3140548183361512)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2154 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2153 = 0.3804142980557849)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2153 = 0.6195991016645057)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501134 -0.207068 0.294066 -> 0.501143 -0.207067 0.294077(R,m,v=1,0.857143,0.123182)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498753 0.207056 0.705809 -> 0.498764 0.207057 0.705821(R,m,v=1,1,0)
=>WM: (15114: S1 ^operator O2155)

  1078:    O: O2155 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1078 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1077 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15115: I3 ^predict-yes N1078)
<=WM: (15101: N1077 ^status complete)
<=WM: (15100: I3 ^predict-yes N1077)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15119: I2 ^dir U)
=>WM: (15118: I2 ^reward 1)
=>WM: (15117: I2 ^see 1)
=>WM: (15116: N1078 ^status complete)
<=WM: (15104: I2 ^dir L)
<=WM: (15103: I2 ^reward 1)
<=WM: (15102: I2 ^see 1)
=>WM: (15120: I2 ^level-1 L1-root)
<=WM: (15105: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1082 ^value 1 +)
 (R1 ^reward R1082 +)
Firing propose*predict-yes
 -->
 (O2157 ^name predict-yes +)
 (S1 ^operator O2157 +)
Firing propose*predict-no
 -->
 (O2158 ^name predict-no +)
 (S1 ^operator O2158 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2156 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2155 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2156 ^name predict-no +)
 (S1 ^operator O2156 +)
Retracting propose*predict-yes
 -->
 (O2155 ^name predict-yes +)
 (S1 ^operator O2155 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1081 ^value 1 +)
 (R1 ^reward R1081 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2156 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2156 = 0.3140548183361512)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2155 = 0.6195991016645057)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2155 = 0.3804142980557849)
=>WM: (15127: S1 ^operator O2158 +)
=>WM: (15126: S1 ^operator O2157 +)
=>WM: (15125: I3 ^dir U)
=>WM: (15124: O2158 ^name predict-no)
=>WM: (15123: O2157 ^name predict-yes)
=>WM: (15122: R1082 ^value 1)
=>WM: (15121: R1 ^reward R1082)
<=WM: (15112: S1 ^operator O2155 +)
<=WM: (15114: S1 ^operator O2155)
<=WM: (15113: S1 ^operator O2156 +)
<=WM: (15111: I3 ^dir L)
<=WM: (15107: R1 ^reward R1081)
<=WM: (15110: O2156 ^name predict-no)
<=WM: (15109: O2155 ^name predict-yes)
<=WM: (15108: R1081 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2157 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2158 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2156 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2155 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521343 -0.14093 0.380413(R,m,v=1,0.842697,0.133308)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.47867 0.140929 0.619599 -> 0.478669 0.140929 0.619598(R,m,v=1,1,0)
=>WM: (15128: S1 ^operator O2158)

  1079:    O: O2158 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1079 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1078 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15129: I3 ^predict-no N1079)
<=WM: (15116: N1078 ^status complete)
<=WM: (15115: I3 ^predict-yes N1078)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15133: I2 ^dir R)
=>WM: (15132: I2 ^reward 1)
=>WM: (15131: I2 ^see 0)
=>WM: (15130: N1079 ^status complete)
<=WM: (15119: I2 ^dir U)
<=WM: (15118: I2 ^reward 1)
<=WM: (15117: I2 ^see 1)
=>WM: (15134: I2 ^level-1 L1-root)
<=WM: (15120: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2157 = 0.70622448437219)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2158 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1083 ^value 1 +)
 (R1 ^reward R1083 +)
Firing propose*predict-yes
 -->
 (O2159 ^name predict-yes +)
 (S1 ^operator O2159 +)
Firing propose*predict-no
 -->
 (O2160 ^name predict-no +)
 (S1 ^operator O2160 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2158 = 0.229858460707707)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2157 = 0.2940765719273235)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2158 ^name predict-no +)
 (S1 ^operator O2158 +)
Retracting propose*predict-yes
 -->
 (O2157 ^name predict-yes +)
 (S1 ^operator O2157 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1082 ^value 1 +)
 (R1 ^reward R1082 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2158 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2157 = 0.)
=>WM: (15142: S1 ^operator O2160 +)
=>WM: (15141: S1 ^operator O2159 +)
=>WM: (15140: I3 ^dir R)
=>WM: (15139: O2160 ^name predict-no)
=>WM: (15138: O2159 ^name predict-yes)
=>WM: (15137: R1083 ^value 1)
=>WM: (15136: R1 ^reward R1083)
=>WM: (15135: I3 ^see 0)
<=WM: (15126: S1 ^operator O2157 +)
<=WM: (15127: S1 ^operator O2158 +)
<=WM: (15128: S1 ^operator O2158)
<=WM: (15125: I3 ^dir U)
<=WM: (15121: R1 ^reward R1082)
<=WM: (15106: I3 ^see 1)
<=WM: (15124: O2158 ^name predict-no)
<=WM: (15123: O2157 ^name predict-yes)
<=WM: (15122: R1082 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2159 = 0.70622448437219)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2159 = 0.2940765719273235)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2160 = -0.1937987592593187)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2160 = 0.229858460707707)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2158 = 0.229858460707707)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2158 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2157 = 0.2940765719273235)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2157 = 0.70622448437219)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15143: S1 ^operator O2159)

  1080:    O: O2159 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1080 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1079 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15144: I3 ^predict-yes N1080)
<=WM: (15130: N1079 ^status complete)
<=WM: (15129: I3 ^predict-no N1079)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15148: I2 ^dir U)
=>WM: (15147: I2 ^reward 1)
=>WM: (15146: I2 ^see 1)
=>WM: (15145: N1080 ^status complete)
<=WM: (15133: I2 ^dir R)
<=WM: (15132: I2 ^reward 1)
<=WM: (15131: I2 ^see 0)
=>WM: (15149: I2 ^level-1 R1-root)
<=WM: (15134: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1084 ^value 1 +)
 (R1 ^reward R1084 +)
Firing propose*predict-yes
 -->
 (O2161 ^name predict-yes +)
 (S1 ^operator O2161 +)
Firing propose*predict-no
 -->
 (O2162 ^name predict-no +)
 (S1 ^operator O2162 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2160 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2159 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2160 ^name predict-no +)
 (S1 ^operator O2160 +)
Retracting propose*predict-yes
 -->
 (O2159 ^name predict-yes +)
 (S1 ^operator O2159 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1083 ^value 1 +)
 (R1 ^reward R1083 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2160 = 0.229858460707707)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2160 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2159 = 0.2940765719273235)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2159 = 0.70622448437219)
=>WM: (15157: S1 ^operator O2162 +)
=>WM: (15156: S1 ^operator O2161 +)
=>WM: (15155: I3 ^dir U)
=>WM: (15154: O2162 ^name predict-no)
=>WM: (15153: O2161 ^name predict-yes)
=>WM: (15152: R1084 ^value 1)
=>WM: (15151: R1 ^reward R1084)
=>WM: (15150: I3 ^see 1)
<=WM: (15141: S1 ^operator O2159 +)
<=WM: (15143: S1 ^operator O2159)
<=WM: (15142: S1 ^operator O2160 +)
<=WM: (15140: I3 ^dir R)
<=WM: (15136: R1 ^reward R1083)
<=WM: (15135: I3 ^see 0)
<=WM: (15139: O2160 ^name predict-no)
<=WM: (15138: O2159 ^name predict-yes)
<=WM: (15137: R1083 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2161 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2162 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2160 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2159 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501143 -0.207067 0.294077 -> 0.501121 -0.207069 0.294052(R,m,v=1,0.857988,0.12257)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499129 0.207096 0.706224 -> 0.499103 0.207093 0.706196(R,m,v=1,1,0)
=>WM: (15158: S1 ^operator O2162)

  1081:    O: O2162 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1081 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1080 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15159: I3 ^predict-no N1081)
<=WM: (15145: N1080 ^status complete)
<=WM: (15144: I3 ^predict-yes N1080)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\--- Input Phase --- 
=>WM: (15163: I2 ^dir R)
=>WM: (15162: I2 ^reward 1)
=>WM: (15161: I2 ^see 0)
=>WM: (15160: N1081 ^status complete)
<=WM: (15148: I2 ^dir U)
<=WM: (15147: I2 ^reward 1)
<=WM: (15146: I2 ^see 1)
=>WM: (15164: I2 ^level-1 R1-root)
<=WM: (15149: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2161 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2162 = 0.7701594485713136)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1085 ^value 1 +)
 (R1 ^reward R1085 +)
Firing propose*predict-yes
 -->
 (O2163 ^name predict-yes +)
 (S1 ^operator O2163 +)
Firing propose*predict-no
 -->
 (O2164 ^name predict-no +)
 (S1 ^operator O2164 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2162 = 0.229858460707707)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2161 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2162 ^name predict-no +)
 (S1 ^operator O2162 +)
Retracting propose*predict-yes
 -->
 (O2161 ^name predict-yes +)
 (S1 ^operator O2161 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1084 ^value 1 +)
 (R1 ^reward R1084 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2162 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2161 = 0.)
=>WM: (15172: S1 ^operator O2164 +)
=>WM: (15171: S1 ^operator O2163 +)
=>WM: (15170: I3 ^dir R)
=>WM: (15169: O2164 ^name predict-no)
=>WM: (15168: O2163 ^name predict-yes)
=>WM: (15167: R1085 ^value 1)
=>WM: (15166: R1 ^reward R1085)
=>WM: (15165: I3 ^see 0)
<=WM: (15156: S1 ^operator O2161 +)
<=WM: (15157: S1 ^operator O2162 +)
<=WM: (15158: S1 ^operator O2162)
<=WM: (15155: I3 ^dir U)
<=WM: (15151: R1 ^reward R1084)
<=WM: (15150: I3 ^see 1)
<=WM: (15154: O2162 ^name predict-no)
<=WM: (15153: O2161 ^name predict-yes)
<=WM: (15152: R1084 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2163 = -0.252585164213872)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2163 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2164 = 0.7701594485713136)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2164 = 0.229858460707707)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2162 = 0.229858460707707)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2162 = 0.7701594485713136)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2161 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2161 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15173: S1 ^operator O2164)

  1082:    O: O2164 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1082 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1081 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15174: I3 ^predict-no N1082)
<=WM: (15160: N1081 ^status complete)
<=WM: (15159: I3 ^predict-no N1081)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15178: I2 ^dir U)
=>WM: (15177: I2 ^reward 1)
=>WM: (15176: I2 ^see 0)
=>WM: (15175: N1082 ^status complete)
<=WM: (15163: I2 ^dir R)
<=WM: (15162: I2 ^reward 1)
<=WM: (15161: I2 ^see 0)
=>WM: (15179: I2 ^level-1 R0-root)
<=WM: (15164: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1086 ^value 1 +)
 (R1 ^reward R1086 +)
Firing propose*predict-yes
 -->
 (O2165 ^name predict-yes +)
 (S1 ^operator O2165 +)
Firing propose*predict-no
 -->
 (O2166 ^name predict-no +)
 (S1 ^operator O2166 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2164 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2163 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2164 ^name predict-no +)
 (S1 ^operator O2164 +)
Retracting propose*predict-yes
 -->
 (O2163 ^name predict-yes +)
 (S1 ^operator O2163 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1085 ^value 1 +)
 (R1 ^reward R1085 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2164 = 0.229858460707707)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2164 = 0.7701594485713136)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2163 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2163 = -0.252585164213872)
=>WM: (15186: S1 ^operator O2166 +)
=>WM: (15185: S1 ^operator O2165 +)
=>WM: (15184: I3 ^dir U)
=>WM: (15183: O2166 ^name predict-no)
=>WM: (15182: O2165 ^name predict-yes)
=>WM: (15181: R1086 ^value 1)
=>WM: (15180: R1 ^reward R1086)
<=WM: (15171: S1 ^operator O2163 +)
<=WM: (15172: S1 ^operator O2164 +)
<=WM: (15173: S1 ^operator O2164)
<=WM: (15170: I3 ^dir R)
<=WM: (15166: R1 ^reward R1085)
<=WM: (15169: O2164 ^name predict-no)
<=WM: (15168: O2163 ^name predict-yes)
<=WM: (15167: R1085 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2165 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2166 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2164 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2163 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611911 -0.382052 0.229858 -> 0.61191 -0.382053 0.229857(R,m,v=1,0.855615,0.124202)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388104 0.382056 0.770159 -> 0.388102 0.382055 0.770158(R,m,v=1,1,0)
=>WM: (15187: S1 ^operator O2166)

  1083:    O: O2166 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1083 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1082 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15188: I3 ^predict-no N1083)
<=WM: (15175: N1082 ^status complete)
<=WM: (15174: I3 ^predict-no N1082)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15192: I2 ^dir R)
=>WM: (15191: I2 ^reward 1)
=>WM: (15190: I2 ^see 0)
=>WM: (15189: N1083 ^status complete)
<=WM: (15178: I2 ^dir U)
<=WM: (15177: I2 ^reward 1)
<=WM: (15176: I2 ^see 0)
=>WM: (15193: I2 ^level-1 R0-root)
<=WM: (15179: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2165 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2166 = 0.7701105848453105)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1087 ^value 1 +)
 (R1 ^reward R1087 +)
Firing propose*predict-yes
 -->
 (O2167 ^name predict-yes +)
 (S1 ^operator O2167 +)
Firing propose*predict-no
 -->
 (O2168 ^name predict-no +)
 (S1 ^operator O2168 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2166 = 0.2298570236216184)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2165 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2166 ^name predict-no +)
 (S1 ^operator O2166 +)
Retracting propose*predict-yes
 -->
 (O2165 ^name predict-yes +)
 (S1 ^operator O2165 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1086 ^value 1 +)
 (R1 ^reward R1086 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2166 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2165 = 0.)
=>WM: (15200: S1 ^operator O2168 +)
=>WM: (15199: S1 ^operator O2167 +)
=>WM: (15198: I3 ^dir R)
=>WM: (15197: O2168 ^name predict-no)
=>WM: (15196: O2167 ^name predict-yes)
=>WM: (15195: R1087 ^value 1)
=>WM: (15194: R1 ^reward R1087)
<=WM: (15185: S1 ^operator O2165 +)
<=WM: (15186: S1 ^operator O2166 +)
<=WM: (15187: S1 ^operator O2166)
<=WM: (15184: I3 ^dir U)
<=WM: (15180: R1 ^reward R1086)
<=WM: (15183: O2166 ^name predict-no)
<=WM: (15182: O2165 ^name predict-yes)
<=WM: (15181: R1086 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2167 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2167 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2168 = 0.7701105848453105)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2168 = 0.2298570236216184)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2166 = 0.2298570236216184)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2166 = 0.7701105848453105)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2165 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2165 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15201: S1 ^operator O2168)

  1084:    O: O2168 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1084 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1083 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15202: I3 ^predict-no N1084)
<=WM: (15189: N1083 ^status complete)
<=WM: (15188: I3 ^predict-no N1083)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15206: I2 ^dir U)
=>WM: (15205: I2 ^reward 1)
=>WM: (15204: I2 ^see 0)
=>WM: (15203: N1084 ^status complete)
<=WM: (15192: I2 ^dir R)
<=WM: (15191: I2 ^reward 1)
<=WM: (15190: I2 ^see 0)
=>WM: (15207: I2 ^level-1 R0-root)
<=WM: (15193: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1088 ^value 1 +)
 (R1 ^reward R1088 +)
Firing propose*predict-yes
 -->
 (O2169 ^name predict-yes +)
 (S1 ^operator O2169 +)
Firing propose*predict-no
 -->
 (O2170 ^name predict-no +)
 (S1 ^operator O2170 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2168 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2167 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2168 ^name predict-no +)
 (S1 ^operator O2168 +)
Retracting propose*predict-yes
 -->
 (O2167 ^name predict-yes +)
 (S1 ^operator O2167 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1087 ^value 1 +)
 (R1 ^reward R1087 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2168 = 0.2298570236216184)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2168 = 0.7701105848453105)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2167 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2167 = -0.1254042659579056)
=>WM: (15214: S1 ^operator O2170 +)
=>WM: (15213: S1 ^operator O2169 +)
=>WM: (15212: I3 ^dir U)
=>WM: (15211: O2170 ^name predict-no)
=>WM: (15210: O2169 ^name predict-yes)
=>WM: (15209: R1088 ^value 1)
=>WM: (15208: R1 ^reward R1088)
<=WM: (15199: S1 ^operator O2167 +)
<=WM: (15200: S1 ^operator O2168 +)
<=WM: (15201: S1 ^operator O2168)
<=WM: (15198: I3 ^dir R)
<=WM: (15194: R1 ^reward R1087)
<=WM: (15197: O2168 ^name predict-no)
<=WM: (15196: O2167 ^name predict-yes)
<=WM: (15195: R1087 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2169 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2170 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2168 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2167 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.61191 -0.382053 0.229857 -> 0.611912 -0.382052 0.22986(R,m,v=1,0.856383,0.123649)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388064 0.382047 0.770111 -> 0.388066 0.382047 0.770114(R,m,v=1,1,0)
=>WM: (15215: S1 ^operator O2170)

  1085:    O: O2170 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1085 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1084 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15216: I3 ^predict-no N1085)
<=WM: (15203: N1084 ^status complete)
<=WM: (15202: I3 ^predict-no N1084)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15220: I2 ^dir R)
=>WM: (15219: I2 ^reward 1)
=>WM: (15218: I2 ^see 0)
=>WM: (15217: N1085 ^status complete)
<=WM: (15206: I2 ^dir U)
<=WM: (15205: I2 ^reward 1)
<=WM: (15204: I2 ^see 0)
=>WM: (15221: I2 ^level-1 R0-root)
<=WM: (15207: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2169 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2170 = 0.7701135541770483)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1089 ^value 1 +)
 (R1 ^reward R1089 +)
Firing propose*predict-yes
 -->
 (O2171 ^name predict-yes +)
 (S1 ^operator O2171 +)
Firing propose*predict-no
 -->
 (O2172 ^name predict-no +)
 (S1 ^operator O2172 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2170 = 0.2298596205778046)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2169 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2170 ^name predict-no +)
 (S1 ^operator O2170 +)
Retracting propose*predict-yes
 -->
 (O2169 ^name predict-yes +)
 (S1 ^operator O2169 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1088 ^value 1 +)
 (R1 ^reward R1088 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2170 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2169 = 0.)
=>WM: (15228: S1 ^operator O2172 +)
=>WM: (15227: S1 ^operator O2171 +)
=>WM: (15226: I3 ^dir R)
=>WM: (15225: O2172 ^name predict-no)
=>WM: (15224: O2171 ^name predict-yes)
=>WM: (15223: R1089 ^value 1)
=>WM: (15222: R1 ^reward R1089)
<=WM: (15213: S1 ^operator O2169 +)
<=WM: (15214: S1 ^operator O2170 +)
<=WM: (15215: S1 ^operator O2170)
<=WM: (15212: I3 ^dir U)
<=WM: (15208: R1 ^reward R1088)
<=WM: (15211: O2170 ^name predict-no)
<=WM: (15210: O2169 ^name predict-yes)
<=WM: (15209: R1088 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2171 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2171 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2172 = 0.7701135541770483)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2172 = 0.2298596205778046)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2170 = 0.2298596205778046)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2170 = 0.7701135541770483)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2169 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2169 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15229: S1 ^operator O2172)

  1086:    O: O2172 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1086 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1085 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15230: I3 ^predict-no N1086)
<=WM: (15217: N1085 ^status complete)
<=WM: (15216: I3 ^predict-no N1085)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (15234: I2 ^dir U)
=>WM: (15233: I2 ^reward 1)
=>WM: (15232: I2 ^see 0)
=>WM: (15231: N1086 ^status complete)
<=WM: (15220: I2 ^dir R)
<=WM: (15219: I2 ^reward 1)
<=WM: (15218: I2 ^see 0)
=>WM: (15235: I2 ^level-1 R0-root)
<=WM: (15221: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1090 ^value 1 +)
 (R1 ^reward R1090 +)
Firing propose*predict-yes
 -->
 (O2173 ^name predict-yes +)
 (S1 ^operator O2173 +)
Firing propose*predict-no
 -->
 (O2174 ^name predict-no +)
 (S1 ^operator O2174 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2172 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2171 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2172 ^name predict-no +)
 (S1 ^operator O2172 +)
Retracting propose*predict-yes
 -->
 (O2171 ^name predict-yes +)
 (S1 ^operator O2171 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1089 ^value 1 +)
 (R1 ^reward R1089 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2172 = 0.2298596205778046)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2172 = 0.7701135541770483)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2171 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2171 = -0.1254042659579056)
=>WM: (15242: S1 ^operator O2174 +)
=>WM: (15241: S1 ^operator O2173 +)
=>WM: (15240: I3 ^dir U)
=>WM: (15239: O2174 ^name predict-no)
=>WM: (15238: O2173 ^name predict-yes)
=>WM: (15237: R1090 ^value 1)
=>WM: (15236: R1 ^reward R1090)
<=WM: (15227: S1 ^operator O2171 +)
<=WM: (15228: S1 ^operator O2172 +)
<=WM: (15229: S1 ^operator O2172)
<=WM: (15226: I3 ^dir R)
<=WM: (15222: R1 ^reward R1089)
<=WM: (15225: O2172 ^name predict-no)
<=WM: (15224: O2171 ^name predict-yes)
<=WM: (15223: R1089 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2173 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2174 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2172 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2171 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611912 -0.382052 0.22986 -> 0.611914 -0.382052 0.229862(R,m,v=1,0.857143,0.1231)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388066 0.382047 0.770114 -> 0.388068 0.382048 0.770116(R,m,v=1,1,0)
=>WM: (15243: S1 ^operator O2174)

  1087:    O: O2174 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1087 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1086 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15244: I3 ^predict-no N1087)
<=WM: (15231: N1086 ^status complete)
<=WM: (15230: I3 ^predict-no N1086)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (15248: I2 ^dir R)
=>WM: (15247: I2 ^reward 1)
=>WM: (15246: I2 ^see 0)
=>WM: (15245: N1087 ^status complete)
<=WM: (15234: I2 ^dir U)
<=WM: (15233: I2 ^reward 1)
<=WM: (15232: I2 ^see 0)
=>WM: (15249: I2 ^level-1 R0-root)
<=WM: (15235: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2173 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2174 = 0.7701160080460637)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1091 ^value 1 +)
 (R1 ^reward R1091 +)
Firing propose*predict-yes
 -->
 (O2175 ^name predict-yes +)
 (S1 ^operator O2175 +)
Firing propose*predict-no
 -->
 (O2176 ^name predict-no +)
 (S1 ^operator O2176 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2174 = 0.229861769434934)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2173 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2174 ^name predict-no +)
 (S1 ^operator O2174 +)
Retracting propose*predict-yes
 -->
 (O2173 ^name predict-yes +)
 (S1 ^operator O2173 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1090 ^value 1 +)
 (R1 ^reward R1090 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2174 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2173 = 0.)
=>WM: (15256: S1 ^operator O2176 +)
=>WM: (15255: S1 ^operator O2175 +)
=>WM: (15254: I3 ^dir R)
=>WM: (15253: O2176 ^name predict-no)
=>WM: (15252: O2175 ^name predict-yes)
=>WM: (15251: R1091 ^value 1)
=>WM: (15250: R1 ^reward R1091)
<=WM: (15241: S1 ^operator O2173 +)
<=WM: (15242: S1 ^operator O2174 +)
<=WM: (15243: S1 ^operator O2174)
<=WM: (15240: I3 ^dir U)
<=WM: (15236: R1 ^reward R1090)
<=WM: (15239: O2174 ^name predict-no)
<=WM: (15238: O2173 ^name predict-yes)
<=WM: (15237: R1090 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2175 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2175 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2176 = 0.7701160080460637)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2176 = 0.229861769434934)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2174 = 0.229861769434934)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2174 = 0.7701160080460637)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2173 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2173 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15257: S1 ^operator O2176)

  1088:    O: O2176 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1088 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1087 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15258: I3 ^predict-no N1088)
<=WM: (15245: N1087 ^status complete)
<=WM: (15244: I3 ^predict-no N1087)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15262: I2 ^dir R)
=>WM: (15261: I2 ^reward 1)
=>WM: (15260: I2 ^see 0)
=>WM: (15259: N1088 ^status complete)
<=WM: (15248: I2 ^dir R)
<=WM: (15247: I2 ^reward 1)
<=WM: (15246: I2 ^see 0)
=>WM: (15263: I2 ^level-1 R0-root)
<=WM: (15249: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2175 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2176 = 0.7701160080460637)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1092 ^value 1 +)
 (R1 ^reward R1092 +)
Firing propose*predict-yes
 -->
 (O2177 ^name predict-yes +)
 (S1 ^operator O2177 +)
Firing propose*predict-no
 -->
 (O2178 ^name predict-no +)
 (S1 ^operator O2178 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2176 = 0.229861769434934)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2175 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2176 ^name predict-no +)
 (S1 ^operator O2176 +)
Retracting propose*predict-yes
 -->
 (O2175 ^name predict-yes +)
 (S1 ^operator O2175 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1091 ^value 1 +)
 (R1 ^reward R1091 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2176 = 0.229861769434934)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2176 = 0.7701160080460637)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2175 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2175 = -0.1254042659579056)
=>WM: (15269: S1 ^operator O2178 +)
=>WM: (15268: S1 ^operator O2177 +)
=>WM: (15267: O2178 ^name predict-no)
=>WM: (15266: O2177 ^name predict-yes)
=>WM: (15265: R1092 ^value 1)
=>WM: (15264: R1 ^reward R1092)
<=WM: (15255: S1 ^operator O2175 +)
<=WM: (15256: S1 ^operator O2176 +)
<=WM: (15257: S1 ^operator O2176)
<=WM: (15250: R1 ^reward R1091)
<=WM: (15253: O2176 ^name predict-no)
<=WM: (15252: O2175 ^name predict-yes)
<=WM: (15251: R1091 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2177 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2177 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2178 = 0.7701160080460637)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2178 = 0.229861769434934)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2176 = 0.229861769434934)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2176 = 0.7701160080460637)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2175 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2175 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611914 -0.382052 0.229862 -> 0.611915 -0.382051 0.229864(R,m,v=1,0.857895,0.122556)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388068 0.382048 0.770116 -> 0.38807 0.382048 0.770118(R,m,v=1,1,0)
=>WM: (15270: S1 ^operator O2178)

  1089:    O: O2178 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1089 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1088 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15271: I3 ^predict-no N1089)
<=WM: (15259: N1088 ^status complete)
<=WM: (15258: I3 ^predict-no N1088)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (15275: I2 ^dir R)
=>WM: (15274: I2 ^reward 1)
=>WM: (15273: I2 ^see 0)
=>WM: (15272: N1089 ^status complete)
<=WM: (15262: I2 ^dir R)
<=WM: (15261: I2 ^reward 1)
<=WM: (15260: I2 ^see 0)
=>WM: (15276: I2 ^level-1 R0-root)
<=WM: (15263: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2177 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2178 = 0.7701180366340212)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1093 ^value 1 +)
 (R1 ^reward R1093 +)
Firing propose*predict-yes
 -->
 (O2179 ^name predict-yes +)
 (S1 ^operator O2179 +)
Firing propose*predict-no
 -->
 (O2180 ^name predict-no +)
 (S1 ^operator O2180 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2178 = 0.229863548083355)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2177 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2178 ^name predict-no +)
 (S1 ^operator O2178 +)
Retracting propose*predict-yes
 -->
 (O2177 ^name predict-yes +)
 (S1 ^operator O2177 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1092 ^value 1 +)
 (R1 ^reward R1092 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2178 = 0.229863548083355)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2178 = 0.7701180366340212)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2177 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2177 = -0.1254042659579056)
=>WM: (15282: S1 ^operator O2180 +)
=>WM: (15281: S1 ^operator O2179 +)
=>WM: (15280: O2180 ^name predict-no)
=>WM: (15279: O2179 ^name predict-yes)
=>WM: (15278: R1093 ^value 1)
=>WM: (15277: R1 ^reward R1093)
<=WM: (15268: S1 ^operator O2177 +)
<=WM: (15269: S1 ^operator O2178 +)
<=WM: (15270: S1 ^operator O2178)
<=WM: (15264: R1 ^reward R1092)
<=WM: (15267: O2178 ^name predict-no)
<=WM: (15266: O2177 ^name predict-yes)
<=WM: (15265: R1092 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2179 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2179 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2180 = 0.7701180366340212)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2180 = 0.229863548083355)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2178 = 0.229863548083355)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2178 = 0.7701180366340212)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2177 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2177 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611915 -0.382051 0.229864 -> 0.611916 -0.382051 0.229865(R,m,v=1,0.858639,0.122017)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.38807 0.382048 0.770118 -> 0.388071 0.382048 0.77012(R,m,v=1,1,0)
=>WM: (15283: S1 ^operator O2180)

  1090:    O: O2180 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1090 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1089 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15284: I3 ^predict-no N1090)
<=WM: (15272: N1089 ^status complete)
<=WM: (15271: I3 ^predict-no N1089)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15288: I2 ^dir R)
=>WM: (15287: I2 ^reward 1)
=>WM: (15286: I2 ^see 0)
=>WM: (15285: N1090 ^status complete)
<=WM: (15275: I2 ^dir R)
<=WM: (15274: I2 ^reward 1)
<=WM: (15273: I2 ^see 0)
=>WM: (15289: I2 ^level-1 R0-root)
<=WM: (15276: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2179 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2180 = 0.7701197142167014)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1094 ^value 1 +)
 (R1 ^reward R1094 +)
Firing propose*predict-yes
 -->
 (O2181 ^name predict-yes +)
 (S1 ^operator O2181 +)
Firing propose*predict-no
 -->
 (O2182 ^name predict-no +)
 (S1 ^operator O2182 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2180 = 0.2298650207702772)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2179 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2180 ^name predict-no +)
 (S1 ^operator O2180 +)
Retracting propose*predict-yes
 -->
 (O2179 ^name predict-yes +)
 (S1 ^operator O2179 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1093 ^value 1 +)
 (R1 ^reward R1093 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2180 = 0.2298650207702772)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2180 = 0.7701197142167014)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2179 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2179 = -0.1254042659579056)
=>WM: (15295: S1 ^operator O2182 +)
=>WM: (15294: S1 ^operator O2181 +)
=>WM: (15293: O2182 ^name predict-no)
=>WM: (15292: O2181 ^name predict-yes)
=>WM: (15291: R1094 ^value 1)
=>WM: (15290: R1 ^reward R1094)
<=WM: (15281: S1 ^operator O2179 +)
<=WM: (15282: S1 ^operator O2180 +)
<=WM: (15283: S1 ^operator O2180)
<=WM: (15277: R1 ^reward R1093)
<=WM: (15280: O2180 ^name predict-no)
<=WM: (15279: O2179 ^name predict-yes)
<=WM: (15278: R1093 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2181 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2181 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2182 = 0.7701197142167014)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2182 = 0.2298650207702772)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2180 = 0.2298650207702772)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2180 = 0.7701197142167014)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2179 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2179 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611916 -0.382051 0.229865 -> 0.611917 -0.382051 0.229866(R,m,v=1,0.859375,0.121482)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388071 0.382048 0.77012 -> 0.388073 0.382049 0.770121(R,m,v=1,1,0)
=>WM: (15296: S1 ^operator O2182)

  1091:    O: O2182 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1091 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1090 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15297: I3 ^predict-no N1091)
<=WM: (15285: N1090 ^status complete)
<=WM: (15284: I3 ^predict-no N1090)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
---- Input Phase --- 
=>WM: (15301: I2 ^dir R)
=>WM: (15300: I2 ^reward 1)
=>WM: (15299: I2 ^see 0)
=>WM: (15298: N1091 ^status complete)
<=WM: (15288: I2 ^dir R)
<=WM: (15287: I2 ^reward 1)
<=WM: (15286: I2 ^see 0)
=>WM: (15302: I2 ^level-1 R0-root)
<=WM: (15289: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2181 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2182 = 0.7701211019931825)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1095 ^value 1 +)
 (R1 ^reward R1095 +)
Firing propose*predict-yes
 -->
 (O2183 ^name predict-yes +)
 (S1 ^operator O2183 +)
Firing propose*predict-no
 -->
 (O2184 ^name predict-no +)
 (S1 ^operator O2184 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2182 = 0.2298662405085362)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2181 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2182 ^name predict-no +)
 (S1 ^operator O2182 +)
Retracting propose*predict-yes
 -->
 (O2181 ^name predict-yes +)
 (S1 ^operator O2181 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1094 ^value 1 +)
 (R1 ^reward R1094 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2182 = 0.2298662405085362)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2182 = 0.7701211019931825)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2181 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2181 = -0.1254042659579056)
=>WM: (15308: S1 ^operator O2184 +)
=>WM: (15307: S1 ^operator O2183 +)
=>WM: (15306: O2184 ^name predict-no)
=>WM: (15305: O2183 ^name predict-yes)
=>WM: (15304: R1095 ^value 1)
=>WM: (15303: R1 ^reward R1095)
<=WM: (15294: S1 ^operator O2181 +)
<=WM: (15295: S1 ^operator O2182 +)
<=WM: (15296: S1 ^operator O2182)
<=WM: (15290: R1 ^reward R1094)
<=WM: (15293: O2182 ^name predict-no)
<=WM: (15292: O2181 ^name predict-yes)
<=WM: (15291: R1094 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2183 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2183 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2184 = 0.7701211019931825)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2184 = 0.2298662405085362)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2182 = 0.2298662405085362)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2182 = 0.7701211019931825)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2181 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2181 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611917 -0.382051 0.229866 -> 0.611918 -0.382051 0.229867(R,m,v=1,0.860104,0.120952)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388073 0.382049 0.770121 -> 0.388073 0.382049 0.770122(R,m,v=1,1,0)
=>WM: (15309: S1 ^operator O2184)

  1092:    O: O2184 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1092 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1091 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15310: I3 ^predict-no N1092)
<=WM: (15298: N1091 ^status complete)
<=WM: (15297: I3 ^predict-no N1091)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15314: I2 ^dir U)
=>WM: (15313: I2 ^reward 1)
=>WM: (15312: I2 ^see 0)
=>WM: (15311: N1092 ^status complete)
<=WM: (15301: I2 ^dir R)
<=WM: (15300: I2 ^reward 1)
<=WM: (15299: I2 ^see 0)
=>WM: (15315: I2 ^level-1 R0-root)
<=WM: (15302: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1096 ^value 1 +)
 (R1 ^reward R1096 +)
Firing propose*predict-yes
 -->
 (O2185 ^name predict-yes +)
 (S1 ^operator O2185 +)
Firing propose*predict-no
 -->
 (O2186 ^name predict-no +)
 (S1 ^operator O2186 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2184 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2183 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2184 ^name predict-no +)
 (S1 ^operator O2184 +)
Retracting propose*predict-yes
 -->
 (O2183 ^name predict-yes +)
 (S1 ^operator O2183 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1095 ^value 1 +)
 (R1 ^reward R1095 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2184 = 0.2298672510565515)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2184 = 0.7701222504073515)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2183 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2183 = -0.1254042659579056)
=>WM: (15322: S1 ^operator O2186 +)
=>WM: (15321: S1 ^operator O2185 +)
=>WM: (15320: I3 ^dir U)
=>WM: (15319: O2186 ^name predict-no)
=>WM: (15318: O2185 ^name predict-yes)
=>WM: (15317: R1096 ^value 1)
=>WM: (15316: R1 ^reward R1096)
<=WM: (15307: S1 ^operator O2183 +)
<=WM: (15308: S1 ^operator O2184 +)
<=WM: (15309: S1 ^operator O2184)
<=WM: (15254: I3 ^dir R)
<=WM: (15303: R1 ^reward R1095)
<=WM: (15306: O2184 ^name predict-no)
<=WM: (15305: O2183 ^name predict-yes)
<=WM: (15304: R1095 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2185 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2186 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2184 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2183 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611918 -0.382051 0.229867 -> 0.611919 -0.382051 0.229868(R,m,v=1,0.860825,0.120426)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388073 0.382049 0.770122 -> 0.388074 0.382049 0.770123(R,m,v=1,1,0)
=>WM: (15323: S1 ^operator O2186)

  1093:    O: O2186 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1093 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1092 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15324: I3 ^predict-no N1093)
<=WM: (15311: N1092 ^status complete)
<=WM: (15310: I3 ^predict-no N1092)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (15328: I2 ^dir R)
=>WM: (15327: I2 ^reward 1)
=>WM: (15326: I2 ^see 0)
=>WM: (15325: N1093 ^status complete)
<=WM: (15314: I2 ^dir U)
<=WM: (15313: I2 ^reward 1)
<=WM: (15312: I2 ^see 0)
=>WM: (15329: I2 ^level-1 R0-root)
<=WM: (15315: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2185 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2186 = 0.770123201053682)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1097 ^value 1 +)
 (R1 ^reward R1097 +)
Firing propose*predict-yes
 -->
 (O2187 ^name predict-yes +)
 (S1 ^operator O2187 +)
Firing propose*predict-no
 -->
 (O2188 ^name predict-no +)
 (S1 ^operator O2188 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2186 = 0.2298680885464747)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2185 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2186 ^name predict-no +)
 (S1 ^operator O2186 +)
Retracting propose*predict-yes
 -->
 (O2185 ^name predict-yes +)
 (S1 ^operator O2185 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1096 ^value 1 +)
 (R1 ^reward R1096 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2186 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2185 = 0.)
=>WM: (15336: S1 ^operator O2188 +)
=>WM: (15335: S1 ^operator O2187 +)
=>WM: (15334: I3 ^dir R)
=>WM: (15333: O2188 ^name predict-no)
=>WM: (15332: O2187 ^name predict-yes)
=>WM: (15331: R1097 ^value 1)
=>WM: (15330: R1 ^reward R1097)
<=WM: (15321: S1 ^operator O2185 +)
<=WM: (15322: S1 ^operator O2186 +)
<=WM: (15323: S1 ^operator O2186)
<=WM: (15320: I3 ^dir U)
<=WM: (15316: R1 ^reward R1096)
<=WM: (15319: O2186 ^name predict-no)
<=WM: (15318: O2185 ^name predict-yes)
<=WM: (15317: R1096 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2187 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2187 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2188 = 0.770123201053682)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2188 = 0.2298680885464747)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2186 = 0.2298680885464747)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2186 = 0.770123201053682)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2185 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2185 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15337: S1 ^operator O2188)

  1094:    O: O2188 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1094 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1093 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15338: I3 ^predict-no N1094)
<=WM: (15325: N1093 ^status complete)
<=WM: (15324: I3 ^predict-no N1093)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15342: I2 ^dir R)
=>WM: (15341: I2 ^reward 1)
=>WM: (15340: I2 ^see 0)
=>WM: (15339: N1094 ^status complete)
<=WM: (15328: I2 ^dir R)
<=WM: (15327: I2 ^reward 1)
<=WM: (15326: I2 ^see 0)
=>WM: (15343: I2 ^level-1 R0-root)
<=WM: (15329: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2187 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2188 = 0.770123201053682)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1098 ^value 1 +)
 (R1 ^reward R1098 +)
Firing propose*predict-yes
 -->
 (O2189 ^name predict-yes +)
 (S1 ^operator O2189 +)
Firing propose*predict-no
 -->
 (O2190 ^name predict-no +)
 (S1 ^operator O2190 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2188 = 0.2298680885464747)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2187 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2188 ^name predict-no +)
 (S1 ^operator O2188 +)
Retracting propose*predict-yes
 -->
 (O2187 ^name predict-yes +)
 (S1 ^operator O2187 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1097 ^value 1 +)
 (R1 ^reward R1097 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2188 = 0.2298680885464747)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2188 = 0.770123201053682)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2187 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2187 = -0.1254042659579056)
=>WM: (15349: S1 ^operator O2190 +)
=>WM: (15348: S1 ^operator O2189 +)
=>WM: (15347: O2190 ^name predict-no)
=>WM: (15346: O2189 ^name predict-yes)
=>WM: (15345: R1098 ^value 1)
=>WM: (15344: R1 ^reward R1098)
<=WM: (15335: S1 ^operator O2187 +)
<=WM: (15336: S1 ^operator O2188 +)
<=WM: (15337: S1 ^operator O2188)
<=WM: (15330: R1 ^reward R1097)
<=WM: (15333: O2188 ^name predict-no)
<=WM: (15332: O2187 ^name predict-yes)
<=WM: (15331: R1097 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2189 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2189 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2190 = 0.770123201053682)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2190 = 0.2298680885464747)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2188 = 0.2298680885464747)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2188 = 0.770123201053682)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2187 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2187 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611919 -0.382051 0.229868 -> 0.611919 -0.38205 0.229869(R,m,v=1,0.861538,0.119905)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388074 0.382049 0.770123 -> 0.388075 0.382049 0.770124(R,m,v=1,1,0)
=>WM: (15350: S1 ^operator O2190)

  1095:    O: O2190 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1095 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1094 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15351: I3 ^predict-no N1095)
<=WM: (15339: N1094 ^status complete)
<=WM: (15338: I3 ^predict-no N1094)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (15355: I2 ^dir L)
=>WM: (15354: I2 ^reward 1)
=>WM: (15353: I2 ^see 0)
=>WM: (15352: N1095 ^status complete)
<=WM: (15342: I2 ^dir R)
<=WM: (15341: I2 ^reward 1)
<=WM: (15340: I2 ^see 0)
=>WM: (15356: I2 ^level-1 R0-root)
<=WM: (15343: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2189 = 0.6195770009714396)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2190 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1099 ^value 1 +)
 (R1 ^reward R1099 +)
Firing propose*predict-yes
 -->
 (O2191 ^name predict-yes +)
 (S1 ^operator O2191 +)
Firing propose*predict-no
 -->
 (O2192 ^name predict-no +)
 (S1 ^operator O2192 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2190 = 0.3140548183361512)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2189 = 0.3804132142488074)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2190 ^name predict-no +)
 (S1 ^operator O2190 +)
Retracting propose*predict-yes
 -->
 (O2189 ^name predict-yes +)
 (S1 ^operator O2189 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1098 ^value 1 +)
 (R1 ^reward R1098 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2190 = 0.2298687828235715)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2190 = 0.7701239882424035)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2189 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2189 = -0.1254042659579056)
=>WM: (15363: S1 ^operator O2192 +)
=>WM: (15362: S1 ^operator O2191 +)
=>WM: (15361: I3 ^dir L)
=>WM: (15360: O2192 ^name predict-no)
=>WM: (15359: O2191 ^name predict-yes)
=>WM: (15358: R1099 ^value 1)
=>WM: (15357: R1 ^reward R1099)
<=WM: (15348: S1 ^operator O2189 +)
<=WM: (15349: S1 ^operator O2190 +)
<=WM: (15350: S1 ^operator O2190)
<=WM: (15334: I3 ^dir R)
<=WM: (15344: R1 ^reward R1098)
<=WM: (15347: O2190 ^name predict-no)
<=WM: (15346: O2189 ^name predict-yes)
<=WM: (15345: R1098 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2191 = 0.6195770009714396)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2191 = 0.3804132142488074)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2192 = -0.2190661556260421)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2192 = 0.3140548183361512)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2190 = 0.3140548183361512)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2190 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2189 = 0.3804132142488074)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2189 = 0.6195770009714396)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611919 -0.38205 0.229869 -> 0.61192 -0.38205 0.229869(R,m,v=1,0.862245,0.119388)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388075 0.382049 0.770124 -> 0.388075 0.382049 0.770125(R,m,v=1,1,0)
=>WM: (15364: S1 ^operator O2191)

  1096:    O: O2191 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1096 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1095 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15365: I3 ^predict-yes N1096)
<=WM: (15352: N1095 ^status complete)
<=WM: (15351: I3 ^predict-no N1095)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15369: I2 ^dir R)
=>WM: (15368: I2 ^reward 1)
=>WM: (15367: I2 ^see 1)
=>WM: (15366: N1096 ^status complete)
<=WM: (15355: I2 ^dir L)
<=WM: (15354: I2 ^reward 1)
<=WM: (15353: I2 ^see 0)
=>WM: (15370: I2 ^level-1 L1-root)
<=WM: (15356: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2191 = 0.7061957252803326)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2192 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1100 ^value 1 +)
 (R1 ^reward R1100 +)
Firing propose*predict-yes
 -->
 (O2193 ^name predict-yes +)
 (S1 ^operator O2193 +)
Firing propose*predict-no
 -->
 (O2194 ^name predict-no +)
 (S1 ^operator O2194 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2192 = 0.2298693585484839)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2191 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2192 ^name predict-no +)
 (S1 ^operator O2192 +)
Retracting propose*predict-yes
 -->
 (O2191 ^name predict-yes +)
 (S1 ^operator O2191 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1099 ^value 1 +)
 (R1 ^reward R1099 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2192 = 0.3140548183361512)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2192 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2191 = 0.3804132142488074)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2191 = 0.6195770009714396)
=>WM: (15378: S1 ^operator O2194 +)
=>WM: (15377: S1 ^operator O2193 +)
=>WM: (15376: I3 ^dir R)
=>WM: (15375: O2194 ^name predict-no)
=>WM: (15374: O2193 ^name predict-yes)
=>WM: (15373: R1100 ^value 1)
=>WM: (15372: R1 ^reward R1100)
=>WM: (15371: I3 ^see 1)
<=WM: (15362: S1 ^operator O2191 +)
<=WM: (15364: S1 ^operator O2191)
<=WM: (15363: S1 ^operator O2192 +)
<=WM: (15361: I3 ^dir L)
<=WM: (15357: R1 ^reward R1099)
<=WM: (15165: I3 ^see 0)
<=WM: (15360: O2192 ^name predict-no)
<=WM: (15359: O2191 ^name predict-yes)
<=WM: (15358: R1099 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2193 = 0.2940520155428289)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2193 = 0.7061957252803326)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2194 = 0.2298693585484839)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2194 = -0.1937987592593187)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2192 = 0.2298693585484839)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2192 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2191 = 0.2940520155428289)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2191 = 0.7061957252803326)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521343 -0.14093 0.380413 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.843575,0.132697)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478646 0.140931 0.619577 -> 0.478647 0.140931 0.619578(R,m,v=1,1,0)
=>WM: (15379: S1 ^operator O2193)

  1097:    O: O2193 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1097 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1096 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15380: I3 ^predict-yes N1097)
<=WM: (15366: N1096 ^status complete)
<=WM: (15365: I3 ^predict-yes N1096)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (15384: I2 ^dir L)
=>WM: (15383: I2 ^reward 1)
=>WM: (15382: I2 ^see 1)
=>WM: (15381: N1097 ^status complete)
<=WM: (15369: I2 ^dir R)
<=WM: (15368: I2 ^reward 1)
<=WM: (15367: I2 ^see 1)
=>WM: (15385: I2 ^level-1 R1-root)
<=WM: (15370: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2193 = 0.6195978385087889)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2194 = -0.1479504104026684)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1101 ^value 1 +)
 (R1 ^reward R1101 +)
Firing propose*predict-yes
 -->
 (O2195 ^name predict-yes +)
 (S1 ^operator O2195 +)
Firing propose*predict-no
 -->
 (O2196 ^name predict-no +)
 (S1 ^operator O2196 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2194 = 0.3140548183361512)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2193 = 0.3804140049526733)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2194 ^name predict-no +)
 (S1 ^operator O2194 +)
Retracting propose*predict-yes
 -->
 (O2193 ^name predict-yes +)
 (S1 ^operator O2193 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1100 ^value 1 +)
 (R1 ^reward R1100 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2194 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2194 = 0.2298693585484839)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2193 = 0.7061957252803326)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2193 = 0.2940520155428289)
=>WM: (15392: S1 ^operator O2196 +)
=>WM: (15391: S1 ^operator O2195 +)
=>WM: (15390: I3 ^dir L)
=>WM: (15389: O2196 ^name predict-no)
=>WM: (15388: O2195 ^name predict-yes)
=>WM: (15387: R1101 ^value 1)
=>WM: (15386: R1 ^reward R1101)
<=WM: (15377: S1 ^operator O2193 +)
<=WM: (15379: S1 ^operator O2193)
<=WM: (15378: S1 ^operator O2194 +)
<=WM: (15376: I3 ^dir R)
<=WM: (15372: R1 ^reward R1100)
<=WM: (15375: O2194 ^name predict-no)
<=WM: (15374: O2193 ^name predict-yes)
<=WM: (15373: R1100 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2195 = 0.3804140049526733)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2195 = 0.6195978385087889)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2196 = 0.3140548183361512)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2196 = -0.1479504104026684)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2194 = 0.3140548183361512)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2194 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2193 = 0.3804140049526733)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2193 = 0.6195978385087889)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501121 -0.207069 0.294052 -> 0.501103 -0.207071 0.294032(R,m,v=1,0.858824,0.121963)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499103 0.207093 0.706196 -> 0.499081 0.207091 0.706172(R,m,v=1,1,0)
=>WM: (15393: S1 ^operator O2195)

  1098:    O: O2195 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1098 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1097 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15394: I3 ^predict-yes N1098)
<=WM: (15381: N1097 ^status complete)
<=WM: (15380: I3 ^predict-yes N1097)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15398: I2 ^dir L)
=>WM: (15397: I2 ^reward 1)
=>WM: (15396: I2 ^see 1)
=>WM: (15395: N1098 ^status complete)
<=WM: (15384: I2 ^dir L)
<=WM: (15383: I2 ^reward 1)
<=WM: (15382: I2 ^see 1)
=>WM: (15399: I2 ^level-1 L1-root)
<=WM: (15385: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2195 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2196 = 0.6860368928081693)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1102 ^value 1 +)
 (R1 ^reward R1102 +)
Firing propose*predict-yes
 -->
 (O2197 ^name predict-yes +)
 (S1 ^operator O2197 +)
Firing propose*predict-no
 -->
 (O2198 ^name predict-no +)
 (S1 ^operator O2198 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2196 = 0.3140548183361512)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2195 = 0.3804140049526733)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2196 ^name predict-no +)
 (S1 ^operator O2196 +)
Retracting propose*predict-yes
 -->
 (O2195 ^name predict-yes +)
 (S1 ^operator O2195 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1101 ^value 1 +)
 (R1 ^reward R1101 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
 -->
 (S1 ^operator O2196 = -0.1479504104026684)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2196 = 0.3140548183361512)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
 -->
 (S1 ^operator O2195 = 0.6195978385087889)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2195 = 0.3804140049526733)
=>WM: (15405: S1 ^operator O2198 +)
=>WM: (15404: S1 ^operator O2197 +)
=>WM: (15403: O2198 ^name predict-no)
=>WM: (15402: O2197 ^name predict-yes)
=>WM: (15401: R1102 ^value 1)
=>WM: (15400: R1 ^reward R1102)
<=WM: (15391: S1 ^operator O2195 +)
<=WM: (15393: S1 ^operator O2195)
<=WM: (15392: S1 ^operator O2196 +)
<=WM: (15386: R1 ^reward R1101)
<=WM: (15389: O2196 ^name predict-no)
<=WM: (15388: O2195 ^name predict-yes)
<=WM: (15387: R1101 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2197 = 0.3804140049526733)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2197 = -0.3470159027404986)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2198 = 0.3140548183361512)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2198 = 0.6860368928081693)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2196 = 0.3140548183361512)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2196 = 0.6860368928081693)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2195 = 0.3804140049526733)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2195 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521343 -0.14093 0.380413(R,m,v=1,0.844444,0.132092)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478669 0.140929 0.619598 -> 0.478668 0.140929 0.619597(R,m,v=1,1,0)
=>WM: (15406: S1 ^operator O2198)

  1099:    O: O2198 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1099 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1098 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15407: I3 ^predict-no N1099)
<=WM: (15395: N1098 ^status complete)
<=WM: (15394: I3 ^predict-yes N1098)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15411: I2 ^dir R)
=>WM: (15410: I2 ^reward 1)
=>WM: (15409: I2 ^see 0)
=>WM: (15408: N1099 ^status complete)
<=WM: (15398: I2 ^dir L)
<=WM: (15397: I2 ^reward 1)
<=WM: (15396: I2 ^see 1)
=>WM: (15412: I2 ^level-1 L0-root)
<=WM: (15399: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2197 = 0.7058208607781853)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2198 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1103 ^value 1 +)
 (R1 ^reward R1103 +)
Firing propose*predict-yes
 -->
 (O2199 ^name predict-yes +)
 (S1 ^operator O2199 +)
Firing propose*predict-no
 -->
 (O2200 ^name predict-no +)
 (S1 ^operator O2200 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2198 = 0.2298693585484839)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2197 = 0.2940318273940734)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2198 ^name predict-no +)
 (S1 ^operator O2198 +)
Retracting propose*predict-yes
 -->
 (O2197 ^name predict-yes +)
 (S1 ^operator O2197 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1102 ^value 1 +)
 (R1 ^reward R1102 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2198 = 0.6860368928081693)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2198 = 0.3140548183361512)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2197 = -0.3470159027404986)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2197 = 0.3804130487485735)
=>WM: (15420: S1 ^operator O2200 +)
=>WM: (15419: S1 ^operator O2199 +)
=>WM: (15418: I3 ^dir R)
=>WM: (15417: O2200 ^name predict-no)
=>WM: (15416: O2199 ^name predict-yes)
=>WM: (15415: R1103 ^value 1)
=>WM: (15414: R1 ^reward R1103)
=>WM: (15413: I3 ^see 0)
<=WM: (15404: S1 ^operator O2197 +)
<=WM: (15405: S1 ^operator O2198 +)
<=WM: (15406: S1 ^operator O2198)
<=WM: (15390: I3 ^dir L)
<=WM: (15400: R1 ^reward R1102)
<=WM: (15371: I3 ^see 1)
<=WM: (15403: O2198 ^name predict-no)
<=WM: (15402: O2197 ^name predict-yes)
<=WM: (15401: R1102 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2199 = 0.2940318273940734)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2199 = 0.7058208607781853)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2200 = 0.2298693585484839)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2200 = -0.2023211881870005)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2198 = 0.2298693585484839)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2198 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2197 = 0.2940318273940734)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2197 = 0.7058208607781853)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485058 -0.171003 0.314055 -> 0.485052 -0.171004 0.314047(R,m,v=1,0.880682,0.105682)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515015 0.171022 0.686037 -> 0.515008 0.17102 0.686028(R,m,v=1,1,0)
=>WM: (15421: S1 ^operator O2199)

  1100:    O: O2199 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1100 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1099 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15422: I3 ^predict-yes N1100)
<=WM: (15408: N1099 ^status complete)
<=WM: (15407: I3 ^predict-no N1099)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15426: I2 ^dir R)
=>WM: (15425: I2 ^reward 1)
=>WM: (15424: I2 ^see 1)
=>WM: (15423: N1100 ^status complete)
<=WM: (15411: I2 ^dir R)
<=WM: (15410: I2 ^reward 1)
<=WM: (15409: I2 ^see 0)
=>WM: (15427: I2 ^level-1 R1-root)
<=WM: (15412: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2199 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2200 = 0.7701577329613335)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1104 ^value 1 +)
 (R1 ^reward R1104 +)
Firing propose*predict-yes
 -->
 (O2201 ^name predict-yes +)
 (S1 ^operator O2201 +)
Firing propose*predict-no
 -->
 (O2202 ^name predict-no +)
 (S1 ^operator O2202 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2200 = 0.2298693585484839)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2199 = 0.2940318273940734)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2200 ^name predict-no +)
 (S1 ^operator O2200 +)
Retracting propose*predict-yes
 -->
 (O2199 ^name predict-yes +)
 (S1 ^operator O2199 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1103 ^value 1 +)
 (R1 ^reward R1103 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2200 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2200 = 0.2298693585484839)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2199 = 0.7058208607781853)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2199 = 0.2940318273940734)
=>WM: (15434: S1 ^operator O2202 +)
=>WM: (15433: S1 ^operator O2201 +)
=>WM: (15432: O2202 ^name predict-no)
=>WM: (15431: O2201 ^name predict-yes)
=>WM: (15430: R1104 ^value 1)
=>WM: (15429: R1 ^reward R1104)
=>WM: (15428: I3 ^see 1)
<=WM: (15419: S1 ^operator O2199 +)
<=WM: (15421: S1 ^operator O2199)
<=WM: (15420: S1 ^operator O2200 +)
<=WM: (15414: R1 ^reward R1103)
<=WM: (15413: I3 ^see 0)
<=WM: (15417: O2200 ^name predict-no)
<=WM: (15416: O2199 ^name predict-yes)
<=WM: (15415: R1103 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2201 = 0.2940318273940734)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2201 = -0.252585164213872)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2202 = 0.2298693585484839)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2202 = 0.7701577329613335)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2200 = 0.2298693585484839)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2200 = 0.7701577329613335)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2199 = 0.2940318273940734)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2199 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501103 -0.207071 0.294032 -> 0.501114 -0.20707 0.294044(R,m,v=1,0.859649,0.121362)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498764 0.207057 0.705821 -> 0.498777 0.207058 0.705835(R,m,v=1,1,0)
=>WM: (15435: S1 ^operator O2202)

  1101:    O: O2202 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1101 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1100 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15436: I3 ^predict-no N1101)
<=WM: (15423: N1100 ^status complete)
<=WM: (15422: I3 ^predict-yes N1100)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/--- Input Phase --- 
=>WM: (15440: I2 ^dir L)
=>WM: (15439: I2 ^reward 1)
=>WM: (15438: I2 ^see 0)
=>WM: (15437: N1101 ^status complete)
<=WM: (15426: I2 ^dir R)
<=WM: (15425: I2 ^reward 1)
<=WM: (15424: I2 ^see 1)
=>WM: (15441: I2 ^level-1 R0-root)
<=WM: (15427: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2201 = 0.6195779233564012)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2202 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1105 ^value 1 +)
 (R1 ^reward R1105 +)
Firing propose*predict-yes
 -->
 (O2203 ^name predict-yes +)
 (S1 ^operator O2203 +)
Firing propose*predict-no
 -->
 (O2204 ^name predict-no +)
 (S1 ^operator O2204 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2202 = 0.3140473868976779)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2201 = 0.3804130487485735)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2202 ^name predict-no +)
 (S1 ^operator O2202 +)
Retracting propose*predict-yes
 -->
 (O2201 ^name predict-yes +)
 (S1 ^operator O2201 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1104 ^value 1 +)
 (R1 ^reward R1104 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2202 = 0.7701577329613335)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2202 = 0.2298693585484839)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2201 = -0.252585164213872)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2201 = 0.2940438202219438)
=>WM: (15449: S1 ^operator O2204 +)
=>WM: (15448: S1 ^operator O2203 +)
=>WM: (15447: I3 ^dir L)
=>WM: (15446: O2204 ^name predict-no)
=>WM: (15445: O2203 ^name predict-yes)
=>WM: (15444: R1105 ^value 1)
=>WM: (15443: R1 ^reward R1105)
=>WM: (15442: I3 ^see 0)
<=WM: (15433: S1 ^operator O2201 +)
<=WM: (15434: S1 ^operator O2202 +)
<=WM: (15435: S1 ^operator O2202)
<=WM: (15418: I3 ^dir R)
<=WM: (15429: R1 ^reward R1104)
<=WM: (15428: I3 ^see 1)
<=WM: (15432: O2202 ^name predict-no)
<=WM: (15431: O2201 ^name predict-yes)
<=WM: (15430: R1104 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2203 = 0.3804130487485735)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2203 = 0.6195779233564012)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2204 = 0.3140473868976779)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2204 = -0.2190661556260421)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2202 = 0.3140473868976779)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2202 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2201 = 0.3804130487485735)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2201 = 0.6195779233564012)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.61192 -0.38205 0.229869 -> 0.611918 -0.382051 0.229867(R,m,v=1,0.862944,0.118875)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388102 0.382055 0.770158 -> 0.3881 0.382055 0.770155(R,m,v=1,1,0)
=>WM: (15450: S1 ^operator O2203)

  1102:    O: O2203 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1102 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1101 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15451: I3 ^predict-yes N1102)
<=WM: (15437: N1101 ^status complete)
<=WM: (15436: I3 ^predict-no N1101)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15455: I2 ^dir L)
=>WM: (15454: I2 ^reward 1)
=>WM: (15453: I2 ^see 1)
=>WM: (15452: N1102 ^status complete)
<=WM: (15440: I2 ^dir L)
<=WM: (15439: I2 ^reward 1)
<=WM: (15438: I2 ^see 0)
=>WM: (15456: I2 ^level-1 L1-root)
<=WM: (15441: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2203 = -0.3470159027404986)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2204 = 0.686028179458083)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1106 ^value 1 +)
 (R1 ^reward R1106 +)
Firing propose*predict-yes
 -->
 (O2205 ^name predict-yes +)
 (S1 ^operator O2205 +)
Firing propose*predict-no
 -->
 (O2206 ^name predict-no +)
 (S1 ^operator O2206 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2204 = 0.3140473868976779)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2203 = 0.3804130487485735)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2204 ^name predict-no +)
 (S1 ^operator O2204 +)
Retracting propose*predict-yes
 -->
 (O2203 ^name predict-yes +)
 (S1 ^operator O2203 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1105 ^value 1 +)
 (R1 ^reward R1105 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2204 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2204 = 0.3140473868976779)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2203 = 0.6195779233564012)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2203 = 0.3804130487485735)
=>WM: (15463: S1 ^operator O2206 +)
=>WM: (15462: S1 ^operator O2205 +)
=>WM: (15461: O2206 ^name predict-no)
=>WM: (15460: O2205 ^name predict-yes)
=>WM: (15459: R1106 ^value 1)
=>WM: (15458: R1 ^reward R1106)
=>WM: (15457: I3 ^see 1)
<=WM: (15448: S1 ^operator O2203 +)
<=WM: (15450: S1 ^operator O2203)
<=WM: (15449: S1 ^operator O2204 +)
<=WM: (15443: R1 ^reward R1105)
<=WM: (15442: I3 ^see 0)
<=WM: (15446: O2204 ^name predict-no)
<=WM: (15445: O2203 ^name predict-yes)
<=WM: (15444: R1105 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2205 = 0.3804130487485735)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2205 = -0.3470159027404986)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2206 = 0.3140473868976779)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2206 = 0.686028179458083)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2204 = 0.3140473868976779)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2204 = 0.686028179458083)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2203 = 0.3804130487485735)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2203 = -0.3470159027404986)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521343 -0.14093 0.380413 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.845304,0.131492)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478647 0.140931 0.619578 -> 0.478648 0.140931 0.619579(R,m,v=1,1,0)
=>WM: (15464: S1 ^operator O2206)

  1103:    O: O2206 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1103 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1102 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15465: I3 ^predict-no N1103)
<=WM: (15452: N1102 ^status complete)
<=WM: (15451: I3 ^predict-yes N1102)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15469: I2 ^dir R)
=>WM: (15468: I2 ^reward 1)
=>WM: (15467: I2 ^see 0)
=>WM: (15466: N1103 ^status complete)
<=WM: (15455: I2 ^dir L)
<=WM: (15454: I2 ^reward 1)
<=WM: (15453: I2 ^see 1)
=>WM: (15470: I2 ^level-1 L0-root)
<=WM: (15456: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2205 = 0.7058349330775942)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2206 = -0.2023211881870005)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1107 ^value 1 +)
 (R1 ^reward R1107 +)
Firing propose*predict-yes
 -->
 (O2207 ^name predict-yes +)
 (S1 ^operator O2207 +)
Firing propose*predict-no
 -->
 (O2208 ^name predict-no +)
 (S1 ^operator O2208 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2206 = 0.2298672026809531)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2205 = 0.2940438202219438)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2206 ^name predict-no +)
 (S1 ^operator O2206 +)
Retracting propose*predict-yes
 -->
 (O2205 ^name predict-yes +)
 (S1 ^operator O2205 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1106 ^value 1 +)
 (R1 ^reward R1106 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
 -->
 (S1 ^operator O2206 = 0.686028179458083)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2206 = 0.3140473868976779)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
 -->
 (S1 ^operator O2205 = -0.3470159027404986)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2205 = 0.3804137769811579)
=>WM: (15478: S1 ^operator O2208 +)
=>WM: (15477: S1 ^operator O2207 +)
=>WM: (15476: I3 ^dir R)
=>WM: (15475: O2208 ^name predict-no)
=>WM: (15474: O2207 ^name predict-yes)
=>WM: (15473: R1107 ^value 1)
=>WM: (15472: R1 ^reward R1107)
=>WM: (15471: I3 ^see 0)
<=WM: (15462: S1 ^operator O2205 +)
<=WM: (15463: S1 ^operator O2206 +)
<=WM: (15464: S1 ^operator O2206)
<=WM: (15447: I3 ^dir L)
<=WM: (15458: R1 ^reward R1106)
<=WM: (15457: I3 ^see 1)
<=WM: (15461: O2206 ^name predict-no)
<=WM: (15460: O2205 ^name predict-yes)
<=WM: (15459: R1106 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2207 = 0.2940438202219438)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2207 = 0.7058349330775942)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2208 = 0.2298672026809531)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2208 = -0.2023211881870005)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2206 = 0.2298672026809531)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2206 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2205 = 0.2940438202219438)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2205 = 0.7058349330775942)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.485052 -0.171004 0.314047 -> 0.485047 -0.171006 0.314041(R,m,v=1,0.881356,0.105162)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515008 0.17102 0.686028 -> 0.515002 0.171019 0.686021(R,m,v=1,1,0)
=>WM: (15479: S1 ^operator O2207)

  1104:    O: O2207 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1104 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1103 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15480: I3 ^predict-yes N1104)
<=WM: (15466: N1103 ^status complete)
<=WM: (15465: I3 ^predict-no N1103)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15484: I2 ^dir R)
=>WM: (15483: I2 ^reward 1)
=>WM: (15482: I2 ^see 1)
=>WM: (15481: N1104 ^status complete)
<=WM: (15469: I2 ^dir R)
<=WM: (15468: I2 ^reward 1)
<=WM: (15467: I2 ^see 0)
=>WM: (15485: I2 ^level-1 R1-root)
<=WM: (15470: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2207 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2208 = 0.7701551449828702)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1108 ^value 1 +)
 (R1 ^reward R1108 +)
Firing propose*predict-yes
 -->
 (O2209 ^name predict-yes +)
 (S1 ^operator O2209 +)
Firing propose*predict-no
 -->
 (O2210 ^name predict-no +)
 (S1 ^operator O2210 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2208 = 0.2298672026809531)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2207 = 0.2940438202219438)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2208 ^name predict-no +)
 (S1 ^operator O2208 +)
Retracting propose*predict-yes
 -->
 (O2207 ^name predict-yes +)
 (S1 ^operator O2207 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1107 ^value 1 +)
 (R1 ^reward R1107 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
 -->
 (S1 ^operator O2208 = -0.2023211881870005)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2208 = 0.2298672026809531)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
 -->
 (S1 ^operator O2207 = 0.7058349330775942)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2207 = 0.2940438202219438)
=>WM: (15492: S1 ^operator O2210 +)
=>WM: (15491: S1 ^operator O2209 +)
=>WM: (15490: O2210 ^name predict-no)
=>WM: (15489: O2209 ^name predict-yes)
=>WM: (15488: R1108 ^value 1)
=>WM: (15487: R1 ^reward R1108)
=>WM: (15486: I3 ^see 1)
<=WM: (15477: S1 ^operator O2207 +)
<=WM: (15479: S1 ^operator O2207)
<=WM: (15478: S1 ^operator O2208 +)
<=WM: (15472: R1 ^reward R1107)
<=WM: (15471: I3 ^see 0)
<=WM: (15475: O2208 ^name predict-no)
<=WM: (15474: O2207 ^name predict-yes)
<=WM: (15473: R1107 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2209 = 0.2940438202219438)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2209 = -0.252585164213872)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2210 = 0.2298672026809531)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2210 = 0.7701551449828702)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2208 = 0.2298672026809531)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2208 = 0.7701551449828702)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2207 = 0.2940438202219438)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2207 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501114 -0.20707 0.294044 -> 0.501123 -0.207069 0.294054(R,m,v=1,0.860465,0.120767)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498777 0.207058 0.705835 -> 0.498787 0.207059 0.705846(R,m,v=1,1,0)
=>WM: (15493: S1 ^operator O2210)

  1105:    O: O2210 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1105 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1104 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15494: I3 ^predict-no N1105)
<=WM: (15481: N1104 ^status complete)
<=WM: (15480: I3 ^predict-yes N1104)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15498: I2 ^dir R)
=>WM: (15497: I2 ^reward 1)
=>WM: (15496: I2 ^see 0)
=>WM: (15495: N1105 ^status complete)
<=WM: (15484: I2 ^dir R)
<=WM: (15483: I2 ^reward 1)
<=WM: (15482: I2 ^see 1)
=>WM: (15499: I2 ^level-1 R0-root)
<=WM: (15485: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2209 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2210 = 0.7701246402854851)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1109 ^value 1 +)
 (R1 ^reward R1109 +)
Firing propose*predict-yes
 -->
 (O2211 ^name predict-yes +)
 (S1 ^operator O2211 +)
Firing propose*predict-no
 -->
 (O2212 ^name predict-no +)
 (S1 ^operator O2212 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2210 = 0.2298672026809531)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2209 = 0.2940536816948511)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2210 ^name predict-no +)
 (S1 ^operator O2210 +)
Retracting propose*predict-yes
 -->
 (O2209 ^name predict-yes +)
 (S1 ^operator O2209 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1108 ^value 1 +)
 (R1 ^reward R1108 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2210 = 0.7701551449828702)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2210 = 0.2298672026809531)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2209 = -0.252585164213872)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2209 = 0.2940536816948511)
=>WM: (15506: S1 ^operator O2212 +)
=>WM: (15505: S1 ^operator O2211 +)
=>WM: (15504: O2212 ^name predict-no)
=>WM: (15503: O2211 ^name predict-yes)
=>WM: (15502: R1109 ^value 1)
=>WM: (15501: R1 ^reward R1109)
=>WM: (15500: I3 ^see 0)
<=WM: (15491: S1 ^operator O2209 +)
<=WM: (15492: S1 ^operator O2210 +)
<=WM: (15493: S1 ^operator O2210)
<=WM: (15487: R1 ^reward R1108)
<=WM: (15486: I3 ^see 1)
<=WM: (15490: O2210 ^name predict-no)
<=WM: (15489: O2209 ^name predict-yes)
<=WM: (15488: R1108 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2211 = 0.2940536816948511)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2211 = -0.1254042659579056)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2212 = 0.2298672026809531)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2212 = 0.7701246402854851)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2210 = 0.2298672026809531)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2210 = 0.7701246402854851)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2209 = 0.2940536816948511)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2209 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611918 -0.382051 0.229867 -> 0.611917 -0.382051 0.229865(R,m,v=1,0.863636,0.118366)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.3881 0.382055 0.770155 -> 0.388098 0.382055 0.770153(R,m,v=1,1,0)
=>WM: (15507: S1 ^operator O2212)

  1106:    O: O2212 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1106 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1105 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15508: I3 ^predict-no N1106)
<=WM: (15495: N1105 ^status complete)
<=WM: (15494: I3 ^predict-no N1105)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (15512: I2 ^dir L)
=>WM: (15511: I2 ^reward 1)
=>WM: (15510: I2 ^see 0)
=>WM: (15509: N1106 ^status complete)
<=WM: (15498: I2 ^dir R)
<=WM: (15497: I2 ^reward 1)
<=WM: (15496: I2 ^see 0)
=>WM: (15513: I2 ^level-1 R0-root)
<=WM: (15499: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2211 = 0.6195787722435855)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2212 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1110 ^value 1 +)
 (R1 ^reward R1110 +)
Firing propose*predict-yes
 -->
 (O2213 ^name predict-yes +)
 (S1 ^operator O2213 +)
Firing propose*predict-no
 -->
 (O2214 ^name predict-no +)
 (S1 ^operator O2214 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2212 = 0.314041269303462)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2211 = 0.3804137769811579)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2212 ^name predict-no +)
 (S1 ^operator O2212 +)
Retracting propose*predict-yes
 -->
 (O2211 ^name predict-yes +)
 (S1 ^operator O2211 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1109 ^value 1 +)
 (R1 ^reward R1109 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2212 = 0.7701246402854851)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2212 = 0.2298654257475218)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2211 = -0.1254042659579056)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2211 = 0.2940536816948511)
=>WM: (15520: S1 ^operator O2214 +)
=>WM: (15519: S1 ^operator O2213 +)
=>WM: (15518: I3 ^dir L)
=>WM: (15517: O2214 ^name predict-no)
=>WM: (15516: O2213 ^name predict-yes)
=>WM: (15515: R1110 ^value 1)
=>WM: (15514: R1 ^reward R1110)
<=WM: (15505: S1 ^operator O2211 +)
<=WM: (15506: S1 ^operator O2212 +)
<=WM: (15507: S1 ^operator O2212)
<=WM: (15476: I3 ^dir R)
<=WM: (15501: R1 ^reward R1109)
<=WM: (15504: O2212 ^name predict-no)
<=WM: (15503: O2211 ^name predict-yes)
<=WM: (15502: R1109 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2213 = 0.6195787722435855)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2213 = 0.3804137769811579)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2214 = -0.2190661556260421)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2214 = 0.314041269303462)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2212 = 0.314041269303462)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2212 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2211 = 0.3804137769811579)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2211 = 0.6195787722435855)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611917 -0.382051 0.229865 -> 0.611917 -0.382051 0.229866(R,m,v=1,0.864322,0.117862)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388075 0.382049 0.770125 -> 0.388076 0.382049 0.770126(R,m,v=1,1,0)
=>WM: (15521: S1 ^operator O2213)

  1107:    O: O2213 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1107 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1106 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15522: I3 ^predict-yes N1107)
<=WM: (15509: N1106 ^status complete)
<=WM: (15508: I3 ^predict-no N1106)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15526: I2 ^dir R)
=>WM: (15525: I2 ^reward 1)
=>WM: (15524: I2 ^see 1)
=>WM: (15523: N1107 ^status complete)
<=WM: (15512: I2 ^dir L)
<=WM: (15511: I2 ^reward 1)
<=WM: (15510: I2 ^see 0)
=>WM: (15527: I2 ^level-1 L1-root)
<=WM: (15513: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2213 = 0.7061721241516533)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2214 = -0.1937987592593187)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1111 ^value 1 +)
 (R1 ^reward R1111 +)
Firing propose*predict-yes
 -->
 (O2215 ^name predict-yes +)
 (S1 ^operator O2215 +)
Firing propose*predict-no
 -->
 (O2216 ^name predict-no +)
 (S1 ^operator O2216 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2214 = 0.2298662149963561)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2213 = 0.2940536816948511)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2214 ^name predict-no +)
 (S1 ^operator O2214 +)
Retracting propose*predict-yes
 -->
 (O2213 ^name predict-yes +)
 (S1 ^operator O2213 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1110 ^value 1 +)
 (R1 ^reward R1110 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2214 = 0.314041269303462)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2214 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2213 = 0.3804137769811579)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2213 = 0.6195787722435855)
=>WM: (15535: S1 ^operator O2216 +)
=>WM: (15534: S1 ^operator O2215 +)
=>WM: (15533: I3 ^dir R)
=>WM: (15532: O2216 ^name predict-no)
=>WM: (15531: O2215 ^name predict-yes)
=>WM: (15530: R1111 ^value 1)
=>WM: (15529: R1 ^reward R1111)
=>WM: (15528: I3 ^see 1)
<=WM: (15519: S1 ^operator O2213 +)
<=WM: (15521: S1 ^operator O2213)
<=WM: (15520: S1 ^operator O2214 +)
<=WM: (15518: I3 ^dir L)
<=WM: (15514: R1 ^reward R1110)
<=WM: (15500: I3 ^see 0)
<=WM: (15517: O2214 ^name predict-no)
<=WM: (15516: O2213 ^name predict-yes)
<=WM: (15515: R1110 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2215 = 0.2940536816948511)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2215 = 0.7061721241516533)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2216 = 0.2298662149963561)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2216 = -0.1937987592593187)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2214 = 0.2298662149963561)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2214 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2213 = 0.2940536816948511)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2213 = 0.7061721241516533)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.846154,0.130897)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478648 0.140931 0.619579 -> 0.478649 0.140931 0.619579(R,m,v=1,1,0)
=>WM: (15536: S1 ^operator O2215)

  1108:    O: O2215 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1108 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1107 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15537: I3 ^predict-yes N1108)
<=WM: (15523: N1107 ^status complete)
<=WM: (15522: I3 ^predict-yes N1107)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15541: I2 ^dir R)
=>WM: (15540: I2 ^reward 1)
=>WM: (15539: I2 ^see 1)
=>WM: (15538: N1108 ^status complete)
<=WM: (15526: I2 ^dir R)
<=WM: (15525: I2 ^reward 1)
<=WM: (15524: I2 ^see 1)
=>WM: (15542: I2 ^level-1 R1-root)
<=WM: (15527: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2215 = -0.252585164213872)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2216 = 0.7701530160237312)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1112 ^value 1 +)
 (R1 ^reward R1112 +)
Firing propose*predict-yes
 -->
 (O2217 ^name predict-yes +)
 (S1 ^operator O2217 +)
Firing propose*predict-no
 -->
 (O2218 ^name predict-no +)
 (S1 ^operator O2218 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2216 = 0.2298662149963561)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2215 = 0.2940536816948511)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2216 ^name predict-no +)
 (S1 ^operator O2216 +)
Retracting propose*predict-yes
 -->
 (O2215 ^name predict-yes +)
 (S1 ^operator O2215 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1111 ^value 1 +)
 (R1 ^reward R1111 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
 -->
 (S1 ^operator O2216 = -0.1937987592593187)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2216 = 0.2298662149963561)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
 -->
 (S1 ^operator O2215 = 0.7061721241516533)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2215 = 0.2940536816948511)
=>WM: (15548: S1 ^operator O2218 +)
=>WM: (15547: S1 ^operator O2217 +)
=>WM: (15546: O2218 ^name predict-no)
=>WM: (15545: O2217 ^name predict-yes)
=>WM: (15544: R1112 ^value 1)
=>WM: (15543: R1 ^reward R1112)
<=WM: (15534: S1 ^operator O2215 +)
<=WM: (15536: S1 ^operator O2215)
<=WM: (15535: S1 ^operator O2216 +)
<=WM: (15529: R1 ^reward R1111)
<=WM: (15532: O2216 ^name predict-no)
<=WM: (15531: O2215 ^name predict-yes)
<=WM: (15530: R1111 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2217 = 0.2940536816948511)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2217 = -0.252585164213872)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2218 = 0.2298662149963561)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2218 = 0.7701530160237312)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2216 = 0.2298662149963561)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2216 = 0.7701530160237312)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2215 = 0.2940536816948511)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2215 = -0.252585164213872)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.501123 -0.207069 0.294054 -> 0.501106 -0.207071 0.294035(R,m,v=1,0.861272,0.120177)
RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499081 0.207091 0.706172 -> 0.499062 0.207089 0.706151(R,m,v=1,1,0)
=>WM: (15549: S1 ^operator O2218)

  1109:    O: O2218 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1109 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1108 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15550: I3 ^predict-no N1109)
<=WM: (15538: N1108 ^status complete)
<=WM: (15537: I3 ^predict-yes N1108)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15554: I2 ^dir R)
=>WM: (15553: I2 ^reward 1)
=>WM: (15552: I2 ^see 0)
=>WM: (15551: N1109 ^status complete)
<=WM: (15541: I2 ^dir R)
<=WM: (15540: I2 ^reward 1)
<=WM: (15539: I2 ^see 1)
=>WM: (15555: I2 ^level-1 R0-root)
<=WM: (15542: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2217 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2218 = 0.770125534612744)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1113 ^value 1 +)
 (R1 ^reward R1113 +)
Firing propose*predict-yes
 -->
 (O2219 ^name predict-yes +)
 (S1 ^operator O2219 +)
Firing propose*predict-no
 -->
 (O2220 ^name predict-no +)
 (S1 ^operator O2220 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2218 = 0.2298662149963561)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2217 = 0.2940353333163421)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2218 ^name predict-no +)
 (S1 ^operator O2218 +)
Retracting propose*predict-yes
 -->
 (O2217 ^name predict-yes +)
 (S1 ^operator O2217 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1112 ^value 1 +)
 (R1 ^reward R1112 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
 -->
 (S1 ^operator O2218 = 0.7701530160237312)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2218 = 0.2298662149963561)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
 -->
 (S1 ^operator O2217 = -0.252585164213872)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2217 = 0.2940353333163421)
=>WM: (15562: S1 ^operator O2220 +)
=>WM: (15561: S1 ^operator O2219 +)
=>WM: (15560: O2220 ^name predict-no)
=>WM: (15559: O2219 ^name predict-yes)
=>WM: (15558: R1113 ^value 1)
=>WM: (15557: R1 ^reward R1113)
=>WM: (15556: I3 ^see 0)
<=WM: (15547: S1 ^operator O2217 +)
<=WM: (15548: S1 ^operator O2218 +)
<=WM: (15549: S1 ^operator O2218)
<=WM: (15543: R1 ^reward R1112)
<=WM: (15528: I3 ^see 1)
<=WM: (15546: O2218 ^name predict-no)
<=WM: (15545: O2217 ^name predict-yes)
<=WM: (15544: R1112 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2219 = 0.2940353333163421)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2219 = -0.1254042659579056)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2220 = 0.2298662149963561)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2220 = 0.770125534612744)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2218 = 0.2298662149963561)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2218 = 0.770125534612744)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2217 = 0.2940353333163421)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2217 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611917 -0.382051 0.229866 -> 0.611916 -0.382051 0.229865(R,m,v=1,0.865,0.117362)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388098 0.382055 0.770153 -> 0.388097 0.382054 0.770151(R,m,v=1,1,0)
=>WM: (15563: S1 ^operator O2220)

  1110:    O: O2220 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1110 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1109 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15564: I3 ^predict-no N1110)
<=WM: (15551: N1109 ^status complete)
<=WM: (15550: I3 ^predict-no N1109)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15568: I2 ^dir R)
=>WM: (15567: I2 ^reward 1)
=>WM: (15566: I2 ^see 0)
=>WM: (15565: N1110 ^status complete)
<=WM: (15554: I2 ^dir R)
<=WM: (15553: I2 ^reward 1)
<=WM: (15552: I2 ^see 0)
=>WM: (15569: I2 ^level-1 R0-root)
<=WM: (15555: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2219 = -0.1254042659579056)
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2220 = 0.770125534612744)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1114 ^value 1 +)
 (R1 ^reward R1114 +)
Firing propose*predict-yes
 -->
 (O2221 ^name predict-yes +)
 (S1 ^operator O2221 +)
Firing propose*predict-no
 -->
 (O2222 ^name predict-no +)
 (S1 ^operator O2222 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2220 = 0.2298646883171679)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2219 = 0.2940353333163421)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2220 ^name predict-no +)
 (S1 ^operator O2220 +)
Retracting propose*predict-yes
 -->
 (O2219 ^name predict-yes +)
 (S1 ^operator O2219 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1113 ^value 1 +)
 (R1 ^reward R1113 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2220 = 0.770125534612744)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2220 = 0.2298646883171679)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2219 = -0.1254042659579056)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2219 = 0.2940353333163421)
=>WM: (15575: S1 ^operator O2222 +)
=>WM: (15574: S1 ^operator O2221 +)
=>WM: (15573: O2222 ^name predict-no)
=>WM: (15572: O2221 ^name predict-yes)
=>WM: (15571: R1114 ^value 1)
=>WM: (15570: R1 ^reward R1114)
<=WM: (15561: S1 ^operator O2219 +)
<=WM: (15562: S1 ^operator O2220 +)
<=WM: (15563: S1 ^operator O2220)
<=WM: (15557: R1 ^reward R1113)
<=WM: (15560: O2220 ^name predict-no)
<=WM: (15559: O2219 ^name predict-yes)
<=WM: (15558: R1113 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2221 = 0.2940353333163421)
Firing prefer*rvt*predict-yes*H0*5*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2221 = -0.1254042659579056)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2222 = 0.2298646883171679)
Firing prefer*rvt*predict-no*H0*6*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2222 = 0.770125534612744)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2220 = 0.2298646883171679)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2220 = 0.770125534612744)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2219 = 0.2940353333163421)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2219 = -0.1254042659579056)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611916 -0.382051 0.229865 -> 0.611917 -0.382051 0.229865(R,m,v=1,0.865672,0.116866)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388076 0.382049 0.770126 -> 0.388077 0.38205 0.770126(R,m,v=1,1,0)
=>WM: (15576: S1 ^operator O2222)

  1111:    O: O2222 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1111 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1110 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15577: I3 ^predict-no N1111)
<=WM: (15565: N1110 ^status complete)
<=WM: (15564: I3 ^predict-no N1110)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
---- Input Phase --- 
=>WM: (15581: I2 ^dir U)
=>WM: (15580: I2 ^reward 1)
=>WM: (15579: I2 ^see 0)
=>WM: (15578: N1111 ^status complete)
<=WM: (15568: I2 ^dir R)
<=WM: (15567: I2 ^reward 1)
<=WM: (15566: I2 ^see 0)
=>WM: (15582: I2 ^level-1 R0-root)
<=WM: (15569: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1115 ^value 1 +)
 (R1 ^reward R1115 +)
Firing propose*predict-yes
 -->
 (O2223 ^name predict-yes +)
 (S1 ^operator O2223 +)
Firing propose*predict-no
 -->
 (O2224 ^name predict-no +)
 (S1 ^operator O2224 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2222 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2221 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2222 ^name predict-no +)
 (S1 ^operator O2222 +)
Retracting propose*predict-yes
 -->
 (O2221 ^name predict-yes +)
 (S1 ^operator O2221 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1114 ^value 1 +)
 (R1 ^reward R1114 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
 -->
 (S1 ^operator O2222 = 0.7701264131585999)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2222 = 0.2298654638682661)
Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
 -->
 (S1 ^operator O2221 = -0.1254042659579056)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2221 = 0.2940353333163421)
=>WM: (15589: S1 ^operator O2224 +)
=>WM: (15588: S1 ^operator O2223 +)
=>WM: (15587: I3 ^dir U)
=>WM: (15586: O2224 ^name predict-no)
=>WM: (15585: O2223 ^name predict-yes)
=>WM: (15584: R1115 ^value 1)
=>WM: (15583: R1 ^reward R1115)
<=WM: (15574: S1 ^operator O2221 +)
<=WM: (15575: S1 ^operator O2222 +)
<=WM: (15576: S1 ^operator O2222)
<=WM: (15533: I3 ^dir R)
<=WM: (15570: R1 ^reward R1114)
<=WM: (15573: O2222 ^name predict-no)
<=WM: (15572: O2221 ^name predict-yes)
<=WM: (15571: R1114 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2223 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2224 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2222 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2221 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.611917 -0.382051 0.229865 -> 0.611917 -0.382051 0.229866(R,m,v=1,0.866337,0.116374)
RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388077 0.38205 0.770126 -> 0.388077 0.38205 0.770127(R,m,v=1,1,0)
=>WM: (15590: S1 ^operator O2224)

  1112:    O: O2224 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1112 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1111 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15591: I3 ^predict-no N1112)
<=WM: (15578: N1111 ^status complete)
<=WM: (15577: I3 ^predict-no N1111)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (15595: I2 ^dir U)
=>WM: (15594: I2 ^reward 1)
=>WM: (15593: I2 ^see 0)
=>WM: (15592: N1112 ^status complete)
<=WM: (15581: I2 ^dir U)
<=WM: (15580: I2 ^reward 1)
<=WM: (15579: I2 ^see 0)
=>WM: (15596: I2 ^level-1 R0-root)
<=WM: (15582: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1116 ^value 1 +)
 (R1 ^reward R1116 +)
Firing propose*predict-yes
 -->
 (O2225 ^name predict-yes +)
 (S1 ^operator O2225 +)
Firing propose*predict-no
 -->
 (O2226 ^name predict-no +)
 (S1 ^operator O2226 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2224 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2223 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2224 ^name predict-no +)
 (S1 ^operator O2224 +)
Retracting propose*predict-yes
 -->
 (O2223 ^name predict-yes +)
 (S1 ^operator O2223 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1115 ^value 1 +)
 (R1 ^reward R1115 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2224 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2223 = 0.)
=>WM: (15602: S1 ^operator O2226 +)
=>WM: (15601: S1 ^operator O2225 +)
=>WM: (15600: O2226 ^name predict-no)
=>WM: (15599: O2225 ^name predict-yes)
=>WM: (15598: R1116 ^value 1)
=>WM: (15597: R1 ^reward R1116)
<=WM: (15588: S1 ^operator O2223 +)
<=WM: (15589: S1 ^operator O2224 +)
<=WM: (15590: S1 ^operator O2224)
<=WM: (15583: R1 ^reward R1115)
<=WM: (15586: O2224 ^name predict-no)
<=WM: (15585: O2223 ^name predict-yes)
<=WM: (15584: R1115 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2225 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2226 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2224 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2223 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15603: S1 ^operator O2226)

  1113:    O: O2226 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1113 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1112 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15604: I3 ^predict-no N1113)
<=WM: (15592: N1112 ^status complete)
<=WM: (15591: I3 ^predict-no N1112)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15608: I2 ^dir U)
=>WM: (15607: I2 ^reward 1)
=>WM: (15606: I2 ^see 0)
=>WM: (15605: N1113 ^status complete)
<=WM: (15595: I2 ^dir U)
<=WM: (15594: I2 ^reward 1)
<=WM: (15593: I2 ^see 0)
=>WM: (15609: I2 ^level-1 R0-root)
<=WM: (15596: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1117 ^value 1 +)
 (R1 ^reward R1117 +)
Firing propose*predict-yes
 -->
 (O2227 ^name predict-yes +)
 (S1 ^operator O2227 +)
Firing propose*predict-no
 -->
 (O2228 ^name predict-no +)
 (S1 ^operator O2228 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2226 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2225 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2226 ^name predict-no +)
 (S1 ^operator O2226 +)
Retracting propose*predict-yes
 -->
 (O2225 ^name predict-yes +)
 (S1 ^operator O2225 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1116 ^value 1 +)
 (R1 ^reward R1116 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2226 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2225 = 0.)
=>WM: (15615: S1 ^operator O2228 +)
=>WM: (15614: S1 ^operator O2227 +)
=>WM: (15613: O2228 ^name predict-no)
=>WM: (15612: O2227 ^name predict-yes)
=>WM: (15611: R1117 ^value 1)
=>WM: (15610: R1 ^reward R1117)
<=WM: (15601: S1 ^operator O2225 +)
<=WM: (15602: S1 ^operator O2226 +)
<=WM: (15603: S1 ^operator O2226)
<=WM: (15597: R1 ^reward R1116)
<=WM: (15600: O2226 ^name predict-no)
<=WM: (15599: O2225 ^name predict-yes)
<=WM: (15598: R1116 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2227 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2228 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2226 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2225 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15616: S1 ^operator O2228)

  1114:    O: O2228 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1114 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1113 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15617: I3 ^predict-no N1114)
<=WM: (15605: N1113 ^status complete)
<=WM: (15604: I3 ^predict-no N1113)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15621: I2 ^dir L)
=>WM: (15620: I2 ^reward 1)
=>WM: (15619: I2 ^see 0)
=>WM: (15618: N1114 ^status complete)
<=WM: (15608: I2 ^dir U)
<=WM: (15607: I2 ^reward 1)
<=WM: (15606: I2 ^see 0)
=>WM: (15622: I2 ^level-1 R0-root)
<=WM: (15609: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2227 = 0.6195794710944548)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2228 = -0.2190661556260421)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1118 ^value 1 +)
 (R1 ^reward R1118 +)
Firing propose*predict-yes
 -->
 (O2229 ^name predict-yes +)
 (S1 ^operator O2229 +)
Firing propose*predict-no
 -->
 (O2230 ^name predict-no +)
 (S1 ^operator O2230 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2228 = 0.314041269303462)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2227 = 0.3804143774620755)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2228 ^name predict-no +)
 (S1 ^operator O2228 +)
Retracting propose*predict-yes
 -->
 (O2227 ^name predict-yes +)
 (S1 ^operator O2227 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1117 ^value 1 +)
 (R1 ^reward R1117 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2228 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2227 = 0.)
=>WM: (15629: S1 ^operator O2230 +)
=>WM: (15628: S1 ^operator O2229 +)
=>WM: (15627: I3 ^dir L)
=>WM: (15626: O2230 ^name predict-no)
=>WM: (15625: O2229 ^name predict-yes)
=>WM: (15624: R1118 ^value 1)
=>WM: (15623: R1 ^reward R1118)
<=WM: (15614: S1 ^operator O2227 +)
<=WM: (15615: S1 ^operator O2228 +)
<=WM: (15616: S1 ^operator O2228)
<=WM: (15587: I3 ^dir U)
<=WM: (15610: R1 ^reward R1117)
<=WM: (15613: O2228 ^name predict-no)
<=WM: (15612: O2227 ^name predict-yes)
<=WM: (15611: R1117 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2229 = 0.6195794710944548)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2229 = 0.3804143774620755)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2230 = -0.2190661556260421)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2230 = 0.314041269303462)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2228 = 0.314041269303462)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
 -->
 (S1 ^operator O2228 = -0.2190661556260421)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2227 = 0.3804143774620755)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
 -->
 (S1 ^operator O2227 = 0.6195794710944548)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15630: S1 ^operator O2229)

  1115:    O: O2229 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1115 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1114 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15631: I3 ^predict-yes N1115)
<=WM: (15618: N1114 ^status complete)
<=WM: (15617: I3 ^predict-no N1114)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (15635: I2 ^dir L)
=>WM: (15634: I2 ^reward 1)
=>WM: (15633: I2 ^see 1)
=>WM: (15632: N1115 ^status complete)
<=WM: (15621: I2 ^dir L)
<=WM: (15620: I2 ^reward 1)
<=WM: (15619: I2 ^see 0)
=>WM: (15636: I2 ^level-1 L1-root)
<=WM: (15622: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, ac