stdout-flip-2.5K_1.txt

/flipv2/20121113-091142-2.5K-ReLST-asneeded_epochs_50_5runs_noalphadecay/stdout-flip-2.5K_1.txt

https://bitbucket.org/evan13579b/soar-ziggurat

Seeding... 1
dir: dir isL
Python-Soar Flip environment.
To accept commands from an external sml process, you'll need to
type 'slave <log file> <n decisons>' at the prompt...
sourcing 'flip_predict.soar'
***********
Total: 11 productions sourced.

seeding Soar with 1 ...

soar> Entering slave mode:
  - log file 'rl-slave-2.5K_1.log'....
  - will exit slave mode after 2500 decisions
  waiting for commands from an externally connected sml process...
-/|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\-/|\-/|\-sleeping...
/|\-/|sleeping...
\1:    O: O1 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
rule alias: '*'

rule alias: '*'

-/|\-/|2:    O: O4 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-3:    O: O5 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|\4:    O: O7 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
-/|5:    O: O10 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
\-/6:    O: O11 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\7:    O: O13 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-/|8:    O: O16 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/9:    O: O18 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
|\-10:    O: O20 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\11:    O: O22 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

-12:    O: O23 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\13:    O: O26 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|14:    O: O28 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
\15:    O: O30 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
-/16:    O: O32 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-17:    O: O33 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\18:    O: O36 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/19:    O: O38 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-/20:    O: O39 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
|\-21:    O: O41 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
/22:    O: O43 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|23:    O: O46 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-24:    O: O47 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\25:    O: O50 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
-/|26:    O: O52 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
\-27:    O: O53 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|28:    O: O55 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
\-/29:    O: O57 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\30:    O: O60 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|\sleeping...
-31:    O: O61 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/32:    O: O63 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
|\-33:    O: O65 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
/|\34:    O: O67 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/35:    O: O69 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-36:    O: O72 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|37:    O: O73 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
\-/38:    O: O76 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
|\-39:    O: O77 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|40:    O: O79 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/41:    O: O81 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
|42:    O: O84 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/43:    O: O85 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
|\-44:    O: O87 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/|\45:    O: O90 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/46:    O: O92 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\47:    O: O94 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/48:    O: O95 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-49:    O: O98 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\50:    O: O100 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|\-/sleeping...
|51:    O: O102 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
rule alias: '*'

rule alias: '*'

\52:    O: O103 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/53:    O: O106 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-54:    O: O107 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
/|\55:    O: O109 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
-/|56:    O: O111 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
\-/57:    O: O114 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
|\-58:    O: O116 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\59:    O: O118 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|60:    O: O119 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-61:    O: O121 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

/62:    O: O123 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-63:    O: O126 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\64:    O: O128 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/65:    O: O130 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-66:    O: O131 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
/|\67:    O: O133 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-/|68:    O: O135 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
\-/69:    O: O138 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-70:    O: O140 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\-71:    O: O142 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/72:    O: O143 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-73:    O: O145 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/|74:    O: O148 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-75:    O: O150 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\76:    O: O152 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
-/|77:    O: O154 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/78:    O: O155 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-/79:    O: O157 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|80:    O: O160 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/81:    O: O161 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|82:    O: O164 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/83:    O: O166 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-84:    O: O168 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
/|\85:    O: O169 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/86:    O: O172 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\87:    O: O174 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|88:    O: O176 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/89:    O: O177 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\90:    O: O179 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/91:    O: O182 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|92:    O: O183 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/93:    O: O186 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-94:    O: O188 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|95:    O: O190 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\96:    O: O191 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
-/|97:    O: O194 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-98:    O: O196 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
/|\99:    O: O198 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|100:    O: O200 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/101:    O: O201 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\102:    O: O203 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
-/|103:    O: O205 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-104:    O: O207 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\105:    O: O209 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|106:    O: O211 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
\-107:    O: O213 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
/|108:    O: O216 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-109:    O: O218 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\110:    O: O220 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/111:    O: O222 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|112:    O: O223 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
\-/113:    O: O225 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-114:    O: O227 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|\-115:    O: O229 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
/|\-sleeping...
/116:    O: O231 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\117:    O: O234 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/118:    O: O236 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-119:    O: O238 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\120:    O: O239 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
-/|121:    O: O241 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
rule alias: '*'

rule alias: '*'

\122:    O: O244 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|123:    O: O246 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-124:    O: O248 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\125:    O: O250 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/126:    O: O252 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-127:    O: O254 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\128:    O: O256 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/129:    O: O258 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-130:    O: O259 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|131:    O: O262 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\132:    O: O263 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|133:    O: O266 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/134:    O: O267 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-135:    O: O270 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
/|\136:    O: O272 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|137:    O: O274 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/138:    O: O276 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
|\-139:    O: O277 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|140:    O: O279 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-141:    O: O282 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
/142:    O: O283 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\143:    O: O286 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|144:    O: O287 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/145:    O: O290 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-146:    O: O292 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\147:    O: O293 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|148:    O: O295 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/149:    O: O297 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\150:    O: O300 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/151:    O: O301 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|152:    O: O304 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/153:    O: O305 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
|\154:    O: O308 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/155:    O: O310 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\156:    O: O312 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|157:    O: O314 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/158:    O: O315 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-/159:    O: O317 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\160:    O: O320 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/161:    O: O322 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|162:    O: O323 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/163:    O: O325 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\164:    O: O327 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|165:    O: O330 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/166:    O: O331 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
|\-167:    O: O333 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|168:    O: O335 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/|169:    O: O337 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-170:    O: O340 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|171:    O: O342 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\172:    O: O343 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
-/|173:    O: O346 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/174:    O: O347 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
|\-175:    O: O350 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|176:    O: O352 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/177:    O: O354 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-178:    O: O356 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
/|\179:    O: O357 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|180:    O: O360 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-181:    O: O362 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/182:    O: O364 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-183:    O: O366 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|184:    O: O367 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
\-185:    O: O369 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\186:    O: O371 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|187:    O: O374 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/188:    O: O376 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-189:    O: O378 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\190:    O: O379 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|191:    O: O381 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
\192:    O: O384 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|193:    O: O385 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\194:    O: O388 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|195:    O: O389 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/196:    O: O391 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-197:    O: O394 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\198:    O: O395 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|199:    O: O398 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
\-200:    O: O399 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\-/|201:    O: O401 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\202:    O: O404 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/203:    O: O406 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-204:    O: O408 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\205:    O: O410 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/206:    O: O412 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-/207:    O: O413 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
|\208:    O: O416 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|209:    O: O417 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-210:    O: O419 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|211:    O: O422 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\212:    O: O424 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/213:    O: O426 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-214:    O: O427 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\-sleeping...
/215:    O: O430 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-/216:    O: O432 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-217:    O: O434 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\218:    O: O436 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/219:    O: O437 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-220:    O: O440 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\221:    O: O442 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-222:    O: O443 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/|\-sleeping...
/223:    O: O446 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\224:    O: O448 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|225:    O: O449 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
\-/226:    O: O452 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
|\227:    O: O454 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|228:    O: O456 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-229:    O: O458 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\230:    O: O459 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|231:    O: O462 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\232:    O: O463 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/233:    O: O465 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
|\234:    O: O468 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|235:    O: O469 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/236:    O: O471 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\237:    O: O473 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|238:    O: O476 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-239:    O: O478 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\240:    O: O479 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
-/|241:    O: O482 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\242:    O: O484 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/243:    O: O485 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-244:    O: O488 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\245:    O: O490 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/246:    O: O492 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-247:    O: O493 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\248:    O: O495 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|249:    O: O498 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/250:    O: O500 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-/251:    O: O502 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|252:    O: O503 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-253:    O: O506 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\-sleeping...
/254:    O: O508 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-255:    O: O509 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\256:    O: O512 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|257:    O: O514 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/258:    O: O516 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-259:    O: O518 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\260:    O: O519 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
-/|\261:    O: O522 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-262:    O: O524 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\263:    O: O526 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|264:    O: O528 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/265:    O: O530 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\266:    O: O531 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/267:    O: O533 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-268:    O: O536 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\269:    O: O538 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|270:    O: O540 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-271:    O: O542 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/272:    O: O544 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\273:    O: O545 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|274:    O: O548 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-275:    O: O550 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|276:    O: O551 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-277:    O: O554 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\278:    O: O555 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/279:    O: O557 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-280:    O: O559 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|281:    O: O561 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\282:    O: O564 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|283:    O: O566 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-284:    O: O568 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|285:    O: O569 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/286:    O: O571 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
|\-/287:    O: O573 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-288:    O: O576 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\289:    O: O578 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/290:    O: O580 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\291:    O: O582 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-292:    O: O584 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\293:    O: O586 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|294:    O: O587 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-295:    O: O590 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|296:    O: O592 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/297:    O: O594 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-298:    O: O596 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|299:    O: O597 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/300:    O: O599 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-/|\301:    O: O601 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-302:    O: O604 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\303:    O: O606 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|304:    O: O608 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/305:    O: O610 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-306:    O: O611 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/307:    O: O614 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-308:    O: O616 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|309:    O: O618 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/310:    O: O620 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-311:    O: O621 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/312:    O: O624 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-313:    O: O626 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\314:    O: O628 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/315:    O: O630 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\316:    O: O632 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|317:    O: O633 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-318:    O: O636 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|319:    O: O638 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-320:    O: O640 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\321:    O: O641 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-322:    O: O644 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\323:    O: O646 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/324:    O: O648 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\325:    O: O649 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|326:    O: O651 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/327:    O: O654 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-328:    O: O655 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\329:    O: O657 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|330:    O: O660 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\331:    O: O661 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-332:    O: O664 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\333:    O: O665 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|334:    O: O667 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/|335:    O: O670 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-336:    O: O671 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\337:    O: O674 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|338:    O: O676 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-339:    O: O678 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\340:    O: O679 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
-/341:    O: O682 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|342:    O: O684 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/343:    O: O685 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
|\344:    O: O687 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|345:    O: O690 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/|346:    O: O691 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-347:    O: O694 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|348:    O: O696 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-349:    O: O698 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\-sleeping...
/350:    O: O700 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-351:    O: O702 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/352:    O: O704 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-353:    O: O706 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|354:    O: O708 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/355:    O: O710 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-356:    O: O712 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\357:    O: O714 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/358:    O: O716 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-359:    O: O717 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\360:    O: O719 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|361:    O: O722 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\362:    O: O724 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|363:    O: O726 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/364:    O: O728 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-365:    O: O730 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\366:    O: O732 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|\367:    O: O733 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/368:    O: O736 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-369:    O: O738 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\370:    O: O740 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-371:    O: O742 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/372:    O: O744 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-373:    O: O745 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|374:    O: O748 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-375:    O: O749 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\376:    O: O752 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|377:    O: O754 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/378:    O: O755 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-379:    O: O757 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\380:    O: O759 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|381:    O: O762 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\382:    O: O764 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|\383:    O: O766 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/384:    O: O768 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
|\-385:    O: O770 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\386:    O: O772 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|387:    O: O773 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/388:    O: O776 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-389:    O: O777 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|390:    O: O780 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/391:    O: O782 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|392:    O: O784 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/|393:    O: O786 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/394:    O: O788 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-395:    O: O789 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|396:    O: O791 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/397:    O: O794 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-398:    O: O795 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\399:    O: O797 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|\400:    O: O800 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|401:    O: O802 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\402:    O: O803 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
-/|403:    O: O806 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
\404:    O: O808 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
-405:    O: O809 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|406:    O: O812 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/407:    O: O813 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-408:    O: O815 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|409:    O: O817 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/410:    O: O820 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-411:    O: O822 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/412:    O: O824 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-413:    O: O826 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\414:    O: O828 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|415:    O: O829 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/416:    O: O832 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\417:    O: O834 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/418:    O: O836 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-419:    O: O838 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|420:    O: O840 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/421:    O: O842 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|422:    O: O844 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\423:    O: O845 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|424:    O: O848 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-425:    O: O850 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|426:    O: O851 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/427:    O: O854 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-428:    O: O855 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\429:    O: O858 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|430:    O: O859 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
\-/431:    O: O861 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|432:    O: O864 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/433:    O: O865 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-434:    O: O868 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\435:    O: O869 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
-/|436:    O: O872 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/437:    O: O874 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-438:    O: O875 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|439:    O: O877 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/440:    O: O880 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-441:    O: O882 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/442:    O: O884 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-443:    O: O886 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\444:    O: O888 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|\445:    O: O889 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/446:    O: O892 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-/447:    O: O894 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-448:    O: O896 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\449:    O: O898 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|450:    O: O900 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/451:    O: O902 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|452:    O: O904 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/453:    O: O905 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-454:    O: O908 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|455:    O: O910 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-456:    O: O912 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\-457:    O: O914 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\458:    O: O916 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/459:    O: O918 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
|\-460:    O: O920 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|461:    O: O921 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\462:    O: O924 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|463:    O: O926 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/464:    O: O928 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-465:    O: O930 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\466:    O: O932 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|467:    O: O933 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/468:    O: O935 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-469:    O: O937 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\470:    O: O939 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-/|471:    O: O942 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\472:    O: O943 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|473:    O: O946 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/474:    O: O947 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-475:    O: O949 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\476:    O: O951 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/477:    O: O953 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-478:    O: O956 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\479:    O: O958 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|480:    O: O960 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/481:    O: O962 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|482:    O: O963 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/483:    O: O966 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-484:    O: O968 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|485:    O: O970 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\486:    O: O972 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/487:    O: O974 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-488:    O: O975 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/489:    O: O978 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-490:    O: O980 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\491:    O: O982 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-492:    O: O984 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\493:    O: O985 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/494:    O: O988 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\495:    O: O990 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/496:    O: O992 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\497:    O: O993 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-498:    O: O995 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\499:    O: O998 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|500:    O: O999 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/|\-501:    O: O1001 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/502:    O: O1004 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\503:    O: O1006 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|504:    O: O1007 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/505:    O: O1009 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\506:    O: O1012 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-507:    O: O1013 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|508:    O: O1015 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-509:    O: O1018 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|510:    O: O1020 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/511:    O: O1022 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|512:    O: O1024 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-513:    O: O1026 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\514:    O: O1027 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|515:    O: O1030 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/516:    O: O1032 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-517:    O: O1033 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\518:    O: O1036 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|519:    O: O1038 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-520:    O: O1040 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|521:    O: O1042 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\522:    O: O1044 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|523:    O: O1046 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/524:    O: O1048 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-525:    O: O1050 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|526:    O: O1052 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/527:    O: O1053 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-/528:    O: O1056 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-529:    O: O1057 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\530:    O: O1060 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|531:    O: O1062 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\532:    O: O1063 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/533:    O: O1065 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\534:    O: O1068 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|535:    O: O1070 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/536:    O: O1072 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\537:    O: O1074 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-538:    O: O1076 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\539:    O: O1078 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|540:    O: O1080 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/|541:    O: O1082 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\542:    O: O1084 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/543:    O: O1086 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-544:    O: O1088 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\545:    O: O1090 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/546:    O: O1092 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-547:    O: O1094 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\548:    O: O1096 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|549:    O: O1098 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-550:    O: O1100 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\551:    O: O1102 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-552:    O: O1104 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\553:    O: O1106 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|554:    O: O1108 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/555:    O: O1110 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-556:    O: O1111 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|557:    O: O1114 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/558:    O: O1116 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-559:    O: O1117 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\-560:    O: O1119 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\561:    O: O1122 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-562:    O: O1123 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|563:    O: O1126 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/564:    O: O1127 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-565:    O: O1129 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|566:    O: O1132 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/567:    O: O1134 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-568:    O: O1136 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\569:    O: O1138 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/570:    O: O1139 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-571:    O: O1141 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/572:    O: O1144 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\573:    O: O1146 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|574:    O: O1148 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-575:    O: O1150 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\576:    O: O1152 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|577:    O: O1153 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/578:    O: O1156 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-579:    O: O1158 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\580:    O: O1160 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-581:    O: O1162 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/582:    O: O1164 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-583:    O: O1165 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|584:    O: O1168 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/585:    O: O1170 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-586:    O: O1172 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|587:    O: O1173 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-588:    O: O1175 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\589:    O: O1178 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|590:    O: O1180 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-591:    O: O1181 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/592:    O: O1183 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\593:    O: O1185 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|594:    O: O1187 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/595:    O: O1189 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-596:    O: O1192 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\597:    O: O1194 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/598:    O: O1196 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-599:    O: O1198 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\-600:    O: O1200 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\601:    O: O1202 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-602:    O: O1204 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\603:    O: O1206 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/604:    O: O1208 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|605:    O: O1209 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/606:    O: O1211 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-607:    O: O1213 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\608:    O: O1216 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|609:    O: O1218 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/|610:    O: O1219 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-611:    O: O1221 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/612:    O: O1223 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\613:    O: O1226 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/614:    O: O1227 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-615:    O: O1230 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\616:    O: O1232 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|617:    O: O1234 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/618:    O: O1235 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\619:    O: O1237 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|620:    O: O1239 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/|621:    O: O1242 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\622:    O: O1244 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/623:    O: O1245 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\624:    O: O1248 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|625:    O: O1249 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/626:    O: O1252 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\627:    O: O1254 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/628:    O: O1256 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|629:    O: O1258 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/630:    O: O1259 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-631:    O: O1262 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/632:    O: O1263 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-633:    O: O1266 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|634:    O: O1268 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-635:    O: O1269 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/636:    O: O1272 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\637:    O: O1273 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|638:    O: O1276 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-639:    O: O1278 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\640:    O: O1280 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|641:    O: O1282 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\642:    O: O1283 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|643:    O: O1286 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-644:    O: O1288 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\645:    O: O1289 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|646:    O: O1292 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\647:    O: O1294 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|648:    O: O1295 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-649:    O: O1298 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\-650:    O: O1300 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\-651:    O: O1301 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/652:    O: O1304 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\653:    O: O1306 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|654:    O: O1307 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-655:    O: O1310 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\656:    O: O1311 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|657:    O: O1314 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/|sleeping...
\658:    O: O1316 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|659:    O: O1317 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/660:    O: O1320 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-661:    O: O1322 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/662:    O: O1323 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-663:    O: O1326 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|664:    O: O1328 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/665:    O: O1330 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-666:    O: O1331 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\667:    O: O1334 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|668:    O: O1336 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\669:    O: O1338 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/670:    O: O1340 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-671:    O: O1342 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/672:    O: O1343 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\673:    O: O1346 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|674:    O: O1348 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/675:    O: O1350 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-676:    O: O1351 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\677:    O: O1353 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/678:    O: O1355 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-679:    O: O1357 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\680:    O: O1359 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|681:    O: O1362 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\682:    O: O1364 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|683:    O: O1365 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-684:    O: O1368 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\685:    O: O1370 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|686:    O: O1372 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-687:    O: O1374 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\688:    O: O1376 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/689:    O: O1378 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-690:    O: O1380 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\691:    O: O1381 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-692:    O: O1384 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\693:    O: O1386 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|694:    O: O1388 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/695:    O: O1390 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-696:    O: O1392 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\697:    O: O1394 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|698:    O: O1396 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/699:    O: O1398 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\700:    O: O1399 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/701:    O: O1402 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|702:    O: O1404 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/|703:    O: O1405 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/704:    O: O1408 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\705:    O: O1410 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|706:    O: O1412 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/707:    O: O1414 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-708:    O: O1415 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|709:    O: O1417 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-710:    O: O1420 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\711:    O: O1421 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-712:    O: O1424 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|713:    O: O1425 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/714:    O: O1428 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\715:    O: O1430 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|716:    O: O1432 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-717:    O: O1434 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\718:    O: O1436 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|719:    O: O1437 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/720:    O: O1440 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-721:    O: O1442 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/722:    O: O1444 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-723:    O: O1446 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\724:    O: O1448 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|725:    O: O1450 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/726:    O: O1452 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\727:    O: O1454 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|728:    O: O1455 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-729:    O: O1458 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\-730:    O: O1460 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\731:    O: O1461 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-732:    O: O1463 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\733:    O: O1466 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|734:    O: O1467 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-735:    O: O1469 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\736:    O: O1472 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|737:    O: O1474 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\738:    O: O1475 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/739:    O: O1477 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-740:    O: O1479 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\741:    O: O1482 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-742:    O: O1484 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|743:    O: O1486 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/744:    O: O1487 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\745:    O: O1490 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/746:    O: O1491 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\747:    O: O1494 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|748:    O: O1496 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/749:    O: O1498 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\750:    O: O1500 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|751:    O: O1502 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\752:    O: O1503 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|753:    O: O1505 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/754:    O: O1507 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-755:    O: O1509 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\-sleeping...
/756:    O: O1511 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-757:    O: O1514 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\758:    O: O1516 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|759:    O: O1518 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/760:    O: O1519 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-761:    O: O1521 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/762:    O: O1524 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-763:    O: O1526 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\764:    O: O1528 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|765:    O: O1530 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\766:    O: O1532 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|767:    O: O1534 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-768:    O: O1536 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\769:    O: O1538 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|770:    O: O1539 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-771:    O: O1542 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/772:    O: O1543 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-773:    O: O1546 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|774:    O: O1547 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/|775:    O: O1549 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/776:    O: O1552 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-777:    O: O1553 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\-sleeping...
/778:    O: O1556 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-779:    O: O1557 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\780:    O: O1559 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|781:    O: O1562 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\782:    O: O1563 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|783:    O: O1565 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/784:    O: O1568 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\785:    O: O1569 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|786:    O: O1572 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/|787:    O: O1573 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/788:    O: O1576 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-/789:    O: O1578 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\790:    O: O1580 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|791:    O: O1582 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\792:    O: O1584 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/793:    O: O1585 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-794:    O: O1588 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\795:    O: O1590 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|796:    O: O1592 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/797:    O: O1594 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\798:    O: O1596 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|799:    O: O1598 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/800:    O: O1600 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-801:    O: O1601 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/802:    O: O1603 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-803:    O: O1606 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\804:    O: O1608 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|805:    O: O1610 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/806:    O: O1612 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\807:    O: O1614 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-808:    O: O1616 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\809:    O: O1618 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|\sleeping...
-810:    O: O1620 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\811:    O: O1622 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-812:    O: O1624 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\813:    O: O1626 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|814:    O: O1628 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/815:    O: O1629 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-816:    O: O1632 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|817:    O: O1634 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/818:    O: O1635 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|819:    O: O1638 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/820:    O: O1639 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-821:    O: O1641 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/822:    O: O1644 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-823:    O: O1645 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\-824:    O: O1648 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|825:    O: O1649 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/826:    O: O1651 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-827:    O: O1654 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\828:    O: O1656 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-829:    O: O1657 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\830:    O: O1660 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/831:    O: O1661 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|832:    O: O1664 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/833:    O: O1666 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\834:    O: O1667 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/835:    O: O1669 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-836:    O: O1672 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\837:    O: O1674 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-838:    O: O1675 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\839:    O: O1678 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/840:    O: O1680 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-841:    O: O1681 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/842:    O: O1684 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-843:    O: O1685 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|844:    O: O1688 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-845:    O: O1690 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|846:    O: O1692 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\847:    O: O1694 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|848:    O: O1696 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-849:    O: O1698 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|850:    O: O1700 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/851:    O: O1702 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|852:    O: O1704 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-853:    O: O1706 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\854:    O: O1707 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|855:    O: O1710 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/856:    O: O1712 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\857:    O: O1714 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|858:    O: O1715 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-859:    O: O1718 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\860:    O: O1720 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|861:    O: O1722 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\862:    O: O1724 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|863:    O: O1726 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/864:    O: O1727 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\865:    O: O1730 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|866:    O: O1731 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/867:    O: O1733 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\868:    O: O1736 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|869:    O: O1738 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/870:    O: O1740 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\871:    O: O1742 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-872:    O: O1744 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\873:    O: O1746 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|\874:    O: O1748 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/875:    O: O1750 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-876:    O: O1751 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/877:    O: O1754 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-878:    O: O1756 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\879:    O: O1758 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-880:    O: O1760 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\881:    O: O1762 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-882:    O: O1764 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\883:    O: O1766 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/884:    O: O1768 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-885:    O: O1769 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\-886:    O: O1772 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|887:    O: O1773 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/888:    O: O1776 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-889:    O: O1778 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|890:    O: O1780 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/891:    O: O1781 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|892:    O: O1783 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/893:    O: O1786 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-894:    O: O1788 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\895:    O: O1790 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/896:    O: O1792 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\897:    O: O1794 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-898:    O: O1796 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\899:    O: O1798 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|900:    O: O1800 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-901:    O: O1802 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/902:    O: O1804 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|903:    O: O1806 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/904:    O: O1808 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-905:    O: O1810 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\906:    O: O1812 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|907:    O: O1814 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/908:    O: O1816 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-909:    O: O1818 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|910:    O: O1820 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/911:    O: O1822 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|912:    O: O1823 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\913:    O: O1825 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|914:    O: O1828 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/915:    O: O1829 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\916:    O: O1832 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/917:    O: O1834 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-918:    O: O1836 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\919:    O: O1837 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|920:    O: O1839 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/921:    O: O1842 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|922:    O: O1844 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-923:    O: O1845 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\924:    O: O1848 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/925:    O: O1850 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\926:    O: O1852 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|927:    O: O1854 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-928:    O: O1856 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\929:    O: O1858 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|930:    O: O1860 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-931:    O: O1862 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/932:    O: O1864 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-933:    O: O1865 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|934:    O: O1868 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/935:    O: O1870 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\936:    O: O1872 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|937:    O: O1874 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/938:    O: O1876 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\939:    O: O1877 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/940:    O: O1880 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\941:    O: O1882 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-942:    O: O1884 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\943:    O: O1886 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/944:    O: O1887 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\945:    O: O1889 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|946:    O: O1892 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/947:    O: O1894 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-948:    O: O1896 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\949:    O: O1898 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|950:    O: O1900 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/|\-/|\-/--- Input Phase --- 
=>WM: (13307: I2 ^dir U)
=>WM: (13306: I2 ^reward 1)
=>WM: (13305: I2 ^see 0)
=>WM: (13304: N950 ^status complete)
<=WM: (13293: I2 ^dir U)
<=WM: (13292: I2 ^reward 1)
<=WM: (13291: I2 ^see 0)
=>WM: (13308: I2 ^level-1 R0-root)
<=WM: (13294: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R954 ^value 1 +)
 (R1 ^reward R954 +)
Firing propose*predict-yes
 -->
 (O1901 ^name predict-yes +)
 (S1 ^operator O1901 +)
Firing propose*predict-no
 -->
 (O1902 ^name predict-no +)
 (S1 ^operator O1902 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1900 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1899 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1900 ^name predict-no +)
 (S1 ^operator O1900 +)
Retracting propose*predict-yes
 -->
 (O1899 ^name predict-yes +)
 (S1 ^operator O1899 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R953 ^value 1 +)
 (R1 ^reward R953 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1900 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1899 = 0.)
=>WM: (13314: S1 ^operator O1902 +)
=>WM: (13313: S1 ^operator O1901 +)
=>WM: (13312: O1902 ^name predict-no)
=>WM: (13311: O1901 ^name predict-yes)
=>WM: (13310: R954 ^value 1)
=>WM: (13309: R1 ^reward R954)
<=WM: (13300: S1 ^operator O1899 +)
<=WM: (13301: S1 ^operator O1900 +)
<=WM: (13302: S1 ^operator O1900)
<=WM: (13295: R1 ^reward R953)
<=WM: (13298: O1900 ^name predict-no)
<=WM: (13297: O1899 ^name predict-yes)
<=WM: (13296: R953 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1901 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1902 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1900 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1899 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13315: S1 ^operator O1902)

   951:    O: O1902 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N951 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N950 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13316: I3 ^predict-no N951)
<=WM: (13304: N950 ^status complete)
<=WM: (13303: I3 ^predict-no N950)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|--- Input Phase --- 
=>WM: (13320: I2 ^dir L)
=>WM: (13319: I2 ^reward 1)
=>WM: (13318: I2 ^see 0)
=>WM: (13317: N951 ^status complete)
<=WM: (13307: I2 ^dir U)
<=WM: (13306: I2 ^reward 1)
<=WM: (13305: I2 ^see 0)
=>WM: (13321: I2 ^level-1 R0-root)
<=WM: (13308: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1901 = 0.6597530378637458)
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1902 = 0.133561435542329)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R955 ^value 1 +)
 (R1 ^reward R955 +)
Firing propose*predict-yes
 -->
 (O1903 ^name predict-yes +)
 (S1 ^operator O1903 +)
Firing propose*predict-no
 -->
 (O1904 ^name predict-no +)
 (S1 ^operator O1904 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1902 = 0.3212981720332201)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1901 = 0.3402462579366619)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1902 ^name predict-no +)
 (S1 ^operator O1902 +)
Retracting propose*predict-yes
 -->
 (O1901 ^name predict-yes +)
 (S1 ^operator O1901 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R954 ^value 1 +)
 (R1 ^reward R954 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1902 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1901 = 0.)
=>WM: (13328: S1 ^operator O1904 +)
=>WM: (13327: S1 ^operator O1903 +)
=>WM: (13326: I3 ^dir L)
=>WM: (13325: O1904 ^name predict-no)
=>WM: (13324: O1903 ^name predict-yes)
=>WM: (13323: R955 ^value 1)
=>WM: (13322: R1 ^reward R955)
<=WM: (13313: S1 ^operator O1901 +)
<=WM: (13314: S1 ^operator O1902 +)
<=WM: (13315: S1 ^operator O1902)
<=WM: (13299: I3 ^dir U)
<=WM: (13309: R1 ^reward R954)
<=WM: (13312: O1902 ^name predict-no)
<=WM: (13311: O1901 ^name predict-yes)
<=WM: (13310: R954 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1903 = 0.6597530378637458)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1903 = 0.3402462579366619)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1904 = 0.133561435542329)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1904 = 0.3212981720332201)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1902 = 0.3212981720332201)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1902 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1901 = 0.3402462579366619)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1901 = 0.6597530378637458)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13329: S1 ^operator O1903)

   952:    O: O1903 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N952 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N951 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13330: I3 ^predict-yes N952)
<=WM: (13317: N951 ^status complete)
<=WM: (13316: I3 ^predict-no N951)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13334: I2 ^dir R)
=>WM: (13333: I2 ^reward 1)
=>WM: (13332: I2 ^see 1)
=>WM: (13331: N952 ^status complete)
<=WM: (13320: I2 ^dir L)
<=WM: (13319: I2 ^reward 1)
<=WM: (13318: I2 ^see 0)
=>WM: (13335: I2 ^level-1 L1-root)
<=WM: (13321: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1903 = 0.8879101996662896)
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1904 = 0.02370016355578053)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R956 ^value 1 +)
 (R1 ^reward R956 +)
Firing propose*predict-yes
 -->
 (O1905 ^name predict-yes +)
 (S1 ^operator O1905 +)
Firing propose*predict-no
 -->
 (O1906 ^name predict-no +)
 (S1 ^operator O1906 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1904 = 0.3993329903418046)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1903 = 0.1121099638010357)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1904 ^name predict-no +)
 (S1 ^operator O1904 +)
Retracting propose*predict-yes
 -->
 (O1903 ^name predict-yes +)
 (S1 ^operator O1903 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R955 ^value 1 +)
 (R1 ^reward R955 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1904 = 0.3212981720332201)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1904 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1903 = 0.3402462579366619)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1903 = 0.6597530378637458)
=>WM: (13343: S1 ^operator O1906 +)
=>WM: (13342: S1 ^operator O1905 +)
=>WM: (13341: I3 ^dir R)
=>WM: (13340: O1906 ^name predict-no)
=>WM: (13339: O1905 ^name predict-yes)
=>WM: (13338: R956 ^value 1)
=>WM: (13337: R1 ^reward R956)
=>WM: (13336: I3 ^see 1)
<=WM: (13327: S1 ^operator O1903 +)
<=WM: (13329: S1 ^operator O1903)
<=WM: (13328: S1 ^operator O1904 +)
<=WM: (13326: I3 ^dir L)
<=WM: (13322: R1 ^reward R955)
<=WM: (13254: I3 ^see 0)
<=WM: (13325: O1904 ^name predict-no)
<=WM: (13324: O1903 ^name predict-yes)
<=WM: (13323: R955 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.1121099638010357)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1905 = 0.8879101996662896)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.3993329903418046)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1906 = 0.02370016355578053)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1904 = 0.3993329903418046)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1904 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1903 = 0.1121099638010357)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1903 = 0.8879101996662896)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236932 0.340246 -> 0.577178 -0.236932 0.340246(R,m,v=1,0.890323,0.0982824)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422821 0.236932 0.659753 -> 0.422821 0.236932 0.659753(R,m,v=1,1,0)
=>WM: (13344: S1 ^operator O1905)

   953:    O: O1905 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N953 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N952 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13345: I3 ^predict-yes N953)
<=WM: (13331: N952 ^status complete)
<=WM: (13330: I3 ^predict-yes N952)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13349: I2 ^dir R)
=>WM: (13348: I2 ^reward 1)
=>WM: (13347: I2 ^see 1)
=>WM: (13346: N953 ^status complete)
<=WM: (13334: I2 ^dir R)
<=WM: (13333: I2 ^reward 1)
<=WM: (13332: I2 ^see 1)
=>WM: (13350: I2 ^level-1 R1-root)
<=WM: (13335: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1906 = 0.6006773674757838)
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1905 = 0.1602187148382515)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R957 ^value 1 +)
 (R1 ^reward R957 +)
Firing propose*predict-yes
 -->
 (O1907 ^name predict-yes +)
 (S1 ^operator O1907 +)
Firing propose*predict-no
 -->
 (O1908 ^name predict-no +)
 (S1 ^operator O1908 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.3993329903418046)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.1121099638010357)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1906 ^name predict-no +)
 (S1 ^operator O1906 +)
Retracting propose*predict-yes
 -->
 (O1905 ^name predict-yes +)
 (S1 ^operator O1905 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R956 ^value 1 +)
 (R1 ^reward R956 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1906 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.3993329903418046)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1905 = 0.8879101996662896)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.1121099638010357)
=>WM: (13356: S1 ^operator O1908 +)
=>WM: (13355: S1 ^operator O1907 +)
=>WM: (13354: O1908 ^name predict-no)
=>WM: (13353: O1907 ^name predict-yes)
=>WM: (13352: R957 ^value 1)
=>WM: (13351: R1 ^reward R957)
<=WM: (13342: S1 ^operator O1905 +)
<=WM: (13344: S1 ^operator O1905)
<=WM: (13343: S1 ^operator O1906 +)
<=WM: (13337: R1 ^reward R956)
<=WM: (13340: O1906 ^name predict-no)
<=WM: (13339: O1905 ^name predict-yes)
<=WM: (13338: R956 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1907 = 0.1121099638010357)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1907 = 0.1602187148382515)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1908 = 0.3993329903418046)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1908 = 0.6006773674757838)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.3993329903418046)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1906 = 0.6006773674757838)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.1121099638010357)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1905 = 0.1602187148382515)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.619034 -0.506924 0.11211 -> 0.61903 -0.506923 0.112107(R,m,v=1,0.895425,0.0942552)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.38099 0.50692 0.88791 -> 0.380987 0.506921 0.887907(R,m,v=1,1,0)
=>WM: (13357: S1 ^operator O1908)

   954:    O: O1908 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N954 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N953 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13358: I3 ^predict-no N954)
<=WM: (13346: N953 ^status complete)
<=WM: (13345: I3 ^predict-yes N953)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13362: I2 ^dir U)
=>WM: (13361: I2 ^reward 1)
=>WM: (13360: I2 ^see 0)
=>WM: (13359: N954 ^status complete)
<=WM: (13349: I2 ^dir R)
<=WM: (13348: I2 ^reward 1)
<=WM: (13347: I2 ^see 1)
=>WM: (13363: I2 ^level-1 R0-root)
<=WM: (13350: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R958 ^value 1 +)
 (R1 ^reward R958 +)
Firing propose*predict-yes
 -->
 (O1909 ^name predict-yes +)
 (S1 ^operator O1909 +)
Firing propose*predict-no
 -->
 (O1910 ^name predict-no +)
 (S1 ^operator O1910 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1908 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1907 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1908 ^name predict-no +)
 (S1 ^operator O1908 +)
Retracting propose*predict-yes
 -->
 (O1907 ^name predict-yes +)
 (S1 ^operator O1907 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R957 ^value 1 +)
 (R1 ^reward R957 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1908 = 0.6006773674757838)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1908 = 0.3993329903418046)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1907 = 0.1602187148382515)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1907 = 0.112106939280937)
=>WM: (13371: S1 ^operator O1910 +)
=>WM: (13370: S1 ^operator O1909 +)
=>WM: (13369: I3 ^dir U)
=>WM: (13368: O1910 ^name predict-no)
=>WM: (13367: O1909 ^name predict-yes)
=>WM: (13366: R958 ^value 1)
=>WM: (13365: R1 ^reward R958)
=>WM: (13364: I3 ^see 0)
<=WM: (13355: S1 ^operator O1907 +)
<=WM: (13356: S1 ^operator O1908 +)
<=WM: (13357: S1 ^operator O1908)
<=WM: (13341: I3 ^dir R)
<=WM: (13351: R1 ^reward R957)
<=WM: (13336: I3 ^see 1)
<=WM: (13354: O1908 ^name predict-no)
<=WM: (13353: O1907 ^name predict-yes)
<=WM: (13352: R957 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1909 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1910 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1908 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1907 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.558041 -0.158708 0.399333 -> 0.55804 -0.158708 0.399331(R,m,v=1,0.926829,0.0682328)
RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441968 0.158709 0.600677 -> 0.441967 0.158709 0.600676(R,m,v=1,1,0)
=>WM: (13372: S1 ^operator O1910)

   955:    O: O1910 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N955 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N954 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13373: I3 ^predict-no N955)
<=WM: (13359: N954 ^status complete)
<=WM: (13358: I3 ^predict-no N954)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (13377: I2 ^dir L)
=>WM: (13376: I2 ^reward 1)
=>WM: (13375: I2 ^see 0)
=>WM: (13374: N955 ^status complete)
<=WM: (13362: I2 ^dir U)
<=WM: (13361: I2 ^reward 1)
<=WM: (13360: I2 ^see 0)
=>WM: (13378: I2 ^level-1 R0-root)
<=WM: (13363: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1909 = 0.6597531434936846)
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1910 = 0.133561435542329)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R959 ^value 1 +)
 (R1 ^reward R959 +)
Firing propose*predict-yes
 -->
 (O1911 ^name predict-yes +)
 (S1 ^operator O1911 +)
Firing propose*predict-no
 -->
 (O1912 ^name predict-no +)
 (S1 ^operator O1912 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1910 = 0.3212981720332201)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1909 = 0.3402463635666008)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1910 ^name predict-no +)
 (S1 ^operator O1910 +)
Retracting propose*predict-yes
 -->
 (O1909 ^name predict-yes +)
 (S1 ^operator O1909 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R958 ^value 1 +)
 (R1 ^reward R958 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1910 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1909 = 0.)
=>WM: (13385: S1 ^operator O1912 +)
=>WM: (13384: S1 ^operator O1911 +)
=>WM: (13383: I3 ^dir L)
=>WM: (13382: O1912 ^name predict-no)
=>WM: (13381: O1911 ^name predict-yes)
=>WM: (13380: R959 ^value 1)
=>WM: (13379: R1 ^reward R959)
<=WM: (13370: S1 ^operator O1909 +)
<=WM: (13371: S1 ^operator O1910 +)
<=WM: (13372: S1 ^operator O1910)
<=WM: (13369: I3 ^dir U)
<=WM: (13365: R1 ^reward R958)
<=WM: (13368: O1910 ^name predict-no)
<=WM: (13367: O1909 ^name predict-yes)
<=WM: (13366: R958 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1911 = 0.6597531434936846)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.3402463635666008)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1912 = 0.133561435542329)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 0.3212981720332201)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1910 = 0.3212981720332201)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1910 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1909 = 0.3402463635666008)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1909 = 0.6597531434936846)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13386: S1 ^operator O1911)

   956:    O: O1911 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N956 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N955 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13387: I3 ^predict-yes N956)
<=WM: (13374: N955 ^status complete)
<=WM: (13373: I3 ^predict-no N955)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (13391: I2 ^dir L)
=>WM: (13390: I2 ^reward 1)
=>WM: (13389: I2 ^see 1)
=>WM: (13388: N956 ^status complete)
<=WM: (13377: I2 ^dir L)
<=WM: (13376: I2 ^reward 1)
<=WM: (13375: I2 ^see 0)
=>WM: (13392: I2 ^level-1 L1-root)
<=WM: (13378: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1911 = 0.02884852834965246)
Firing rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1912 = 0.6787497288432303)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R960 ^value 1 +)
 (R1 ^reward R960 +)
Firing propose*predict-yes
 -->
 (O1913 ^name predict-yes +)
 (S1 ^operator O1913 +)
Firing propose*predict-no
 -->
 (O1914 ^name predict-no +)
 (S1 ^operator O1914 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 0.3212981720332201)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.3402463635666008)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1912 ^name predict-no +)
 (S1 ^operator O1912 +)
Retracting propose*predict-yes
 -->
 (O1911 ^name predict-yes +)
 (S1 ^operator O1911 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R959 ^value 1 +)
 (R1 ^reward R959 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 0.3212981720332201)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1912 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.3402463635666008)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1911 = 0.6597531434936846)
=>WM: (13399: S1 ^operator O1914 +)
=>WM: (13398: S1 ^operator O1913 +)
=>WM: (13397: O1914 ^name predict-no)
=>WM: (13396: O1913 ^name predict-yes)
=>WM: (13395: R960 ^value 1)
=>WM: (13394: R1 ^reward R960)
=>WM: (13393: I3 ^see 1)
<=WM: (13384: S1 ^operator O1911 +)
<=WM: (13386: S1 ^operator O1911)
<=WM: (13385: S1 ^operator O1912 +)
<=WM: (13379: R1 ^reward R959)
<=WM: (13364: I3 ^see 0)
<=WM: (13382: O1912 ^name predict-no)
<=WM: (13381: O1911 ^name predict-yes)
<=WM: (13380: R959 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1913 = 0.3402463635666008)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1913 = 0.02884852834965246)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1914 = 0.3212981720332201)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1914 = 0.6787497288432303)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 0.3212981720332201)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1912 = 0.6787497288432303)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.3402463635666008)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1911 = 0.02884852834965246)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236932 0.340246 -> 0.577179 -0.236932 0.340246(R,m,v=1,0.891026,0.0977254)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422821 0.236932 0.659753 -> 0.422821 0.236932 0.659753(R,m,v=1,1,0)
=>WM: (13400: S1 ^operator O1914)

   957:    O: O1914 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N957 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N956 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13401: I3 ^predict-no N957)
<=WM: (13388: N956 ^status complete)
<=WM: (13387: I3 ^predict-yes N956)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (13405: I2 ^dir L)
=>WM: (13404: I2 ^reward 1)
=>WM: (13403: I2 ^see 0)
=>WM: (13402: N957 ^status complete)
<=WM: (13391: I2 ^dir L)
<=WM: (13390: I2 ^reward 1)
<=WM: (13389: I2 ^see 1)
=>WM: (13406: I2 ^level-1 L0-root)
<=WM: (13392: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
 -->
 (S1 ^operator O1913 = -0.08284880498582387)
Firing rl*prefer*rvt*predict-no*H0*2*H1*21
 -->
 (S1 ^operator O1914 = 0.6786780143478275)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R961 ^value 1 +)
 (R1 ^reward R961 +)
Firing propose*predict-yes
 -->
 (O1915 ^name predict-yes +)
 (S1 ^operator O1915 +)
Firing propose*predict-no
 -->
 (O1916 ^name predict-no +)
 (S1 ^operator O1916 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1914 = 0.3212981720332201)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1913 = 0.3402464375075579)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1914 ^name predict-no +)
 (S1 ^operator O1914 +)
Retracting propose*predict-yes
 -->
 (O1913 ^name predict-yes +)
 (S1 ^operator O1913 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R960 ^value 1 +)
 (R1 ^reward R960 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1914 = 0.6787497288432303)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1914 = 0.3212981720332201)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1913 = 0.02884852834965246)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1913 = 0.3402464375075579)
=>WM: (13413: S1 ^operator O1916 +)
=>WM: (13412: S1 ^operator O1915 +)
=>WM: (13411: O1916 ^name predict-no)
=>WM: (13410: O1915 ^name predict-yes)
=>WM: (13409: R961 ^value 1)
=>WM: (13408: R1 ^reward R961)
=>WM: (13407: I3 ^see 0)
<=WM: (13398: S1 ^operator O1913 +)
<=WM: (13399: S1 ^operator O1914 +)
<=WM: (13400: S1 ^operator O1914)
<=WM: (13394: R1 ^reward R960)
<=WM: (13393: I3 ^see 1)
<=WM: (13397: O1914 ^name predict-no)
<=WM: (13396: O1913 ^name predict-yes)
<=WM: (13395: R960 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1915 = 0.3402464375075579)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
 -->
 (S1 ^operator O1915 = -0.08284880498582387)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1916 = 0.3212981720332201)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*21
 -->
 (S1 ^operator O1916 = 0.6786780143478275)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1914 = 0.3212981720332201)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*21
 -->
 (S1 ^operator O1914 = 0.6786780143478275)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1913 = 0.3402464375075579)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
 -->
 (S1 ^operator O1913 = -0.08284880498582387)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.641776 -0.320478 0.321298 -> 0.641768 -0.320477 0.321291(R,m,v=1,0.932432,0.0634308)
RL update rl*prefer*rvt*predict-no*H0*2*H1*17 0.358272 0.320477 0.67875 -> 0.358265 0.320477 0.678743(R,m,v=1,1,0)
=>WM: (13414: S1 ^operator O1916)

   958:    O: O1916 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N958 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N957 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13415: I3 ^predict-no N958)
<=WM: (13402: N957 ^status complete)
<=WM: (13401: I3 ^predict-no N957)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase ---
=>WM: (13419: I2 ^dir R)
=>WM: (13418: I2 ^reward 1)
=>WM: (13417: I2 ^see 0)
=>WM: (13416: N958 ^status complete)
<=WM: (13405: I2 ^dir L)
<=WM: (13404: I2 ^reward 1)
<=WM: (13403: I2 ^see 0)
=>WM: (13420: I2 ^level-1 L0-root)
<=WM: (13406: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O1915 = 0.8878774738146793)
Firing rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O1916 = -0.1957074416057287)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R962 ^value 1 +)
 (R1 ^reward R962 +)
Firing propose*predict-yes
 -->
 (O1917 ^name predict-yes +)
 (S1 ^operator O1917 +)
Firing propose*predict-no
 -->
 (O1918 ^name predict-no +)
 (S1 ^operator O1918 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1916 = 0.3993314366691663)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1915 = 0.112106939280937)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1916 ^name predict-no +)
 (S1 ^operator O1916 +)
Retracting propose*predict-yes
 -->
 (O1915 ^name predict-yes +)
 (S1 ^operator O1915 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R961 ^value 1 +)
 (R1 ^reward R961 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*21
 -->
 (S1 ^operator O1916 = 0.6786780143478275)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1916 = 0.3212909869017525)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
 -->
 (S1 ^operator O1915 = -0.08284880498582387)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1915 = 0.3402464375075579)
=>WM: (13427: S1 ^operator O1918 +)
=>WM: (13426: S1 ^operator O1917 +)
=>WM: (13425: I3 ^dir R)
=>WM: (13424: O1918 ^name predict-no)
=>WM: (13423: O1917 ^name predict-yes)
=>WM: (13422: R962 ^value 1)
=>WM: (13421: R1 ^reward R962)
<=WM: (13412: S1 ^operator O1915 +)
<=WM: (13413: S1 ^operator O1916 +)
<=WM: (13414: S1 ^operator O1916)
<=WM: (13383: I3 ^dir L)
<=WM: (13408: R1 ^reward R961)
<=WM: (13411: O1916 ^name predict-no)
<=WM: (13410: O1915 ^name predict-yes)
<=WM: (13409: R961 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O1917 = 0.8878774738146793)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1917 = 0.112106939280937)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O1918 = -0.1957074416057287)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1918 = 0.3993314366691663)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1916 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O1916 = -0.1957074416057287)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1915 = 0.112106939280937)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O1915 = 0.8878774738146793)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.641768 -0.320477 0.321291 -> 0.641773 -0.320478 0.321296(R,m,v=1,0.932886,0.0630328)
RL update rl*prefer*rvt*predict-no*H0*2*H1*21 0.3582 0.320478 0.678678 -> 0.358205 0.320478 0.678683(R,m,v=1,1,0)
=>WM: (13428: S1 ^operator O1917)

   959:    O: O1917 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N959 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N958 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13429: I3 ^predict-yes N959)
<=WM: (13416: N958 ^status complete)
<=WM: (13415: I3 ^predict-no N958)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (13433: I2 ^dir U)
=>WM: (13432: I2 ^reward 1)
=>WM: (13431: I2 ^see 1)
=>WM: (13430: N959 ^status complete)
<=WM: (13419: I2 ^dir R)
<=WM: (13418: I2 ^reward 1)
<=WM: (13417: I2 ^see 0)
=>WM: (13434: I2 ^level-1 R1-root)
<=WM: (13420: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R963 ^value 1 +)
 (R1 ^reward R963 +)
Firing propose*predict-yes
 -->
 (O1919 ^name predict-yes +)
 (S1 ^operator O1919 +)
Firing propose*predict-no
 -->
 (O1920 ^name predict-no +)
 (S1 ^operator O1920 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1918 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1917 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1918 ^name predict-no +)
 (S1 ^operator O1918 +)
Retracting propose*predict-yes
 -->
 (O1917 ^name predict-yes +)
 (S1 ^operator O1917 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R962 ^value 1 +)
 (R1 ^reward R962 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1918 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O1918 = -0.1957074416057287)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1917 = 0.112106939280937)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O1917 = 0.8878774738146793)
=>WM: (13442: S1 ^operator O1920 +)
=>WM: (13441: S1 ^operator O1919 +)
=>WM: (13440: I3 ^dir U)
=>WM: (13439: O1920 ^name predict-no)
=>WM: (13438: O1919 ^name predict-yes)
=>WM: (13437: R963 ^value 1)
=>WM: (13436: R1 ^reward R963)
=>WM: (13435: I3 ^see 1)
<=WM: (13426: S1 ^operator O1917 +)
<=WM: (13428: S1 ^operator O1917)
<=WM: (13427: S1 ^operator O1918 +)
<=WM: (13425: I3 ^dir R)
<=WM: (13421: R1 ^reward R962)
<=WM: (13407: I3 ^see 0)
<=WM: (13424: O1918 ^name predict-no)
<=WM: (13423: O1917 ^name predict-yes)
<=WM: (13422: R962 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1919 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1920 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1918 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1917 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.61903 -0.506923 0.112107 -> 0.619033 -0.506924 0.112109(R,m,v=1,0.896104,0.0937102)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*20 0.380951 0.506926 0.887877 -> 0.380954 0.506926 0.88788(R,m,v=1,1,0)
=>WM: (13443: S1 ^operator O1920)

   960:    O: O1920 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N960 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N959 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13444: I3 ^predict-no N960)
<=WM: (13430: N959 ^status complete)
<=WM: (13429: I3 ^predict-yes N959)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (13448: I2 ^dir U)
=>WM: (13447: I2 ^reward 1)
=>WM: (13446: I2 ^see 0)
=>WM: (13445: N960 ^status complete)
<=WM: (13433: I2 ^dir U)
<=WM: (13432: I2 ^reward 1)
<=WM: (13431: I2 ^see 1)
=>WM: (13449: I2 ^level-1 R1-root)
<=WM: (13434: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R964 ^value 1 +)
 (R1 ^reward R964 +)
Firing propose*predict-yes
 -->
 (O1921 ^name predict-yes +)
 (S1 ^operator O1921 +)
Firing propose*predict-no
 -->
 (O1922 ^name predict-no +)
 (S1 ^operator O1922 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1920 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1919 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1920 ^name predict-no +)
 (S1 ^operator O1920 +)
Retracting propose*predict-yes
 -->
 (O1919 ^name predict-yes +)
 (S1 ^operator O1919 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R963 ^value 1 +)
 (R1 ^reward R963 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1920 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1919 = 0.)
=>WM: (13456: S1 ^operator O1922 +)
=>WM: (13455: S1 ^operator O1921 +)
=>WM: (13454: O1922 ^name predict-no)
=>WM: (13453: O1921 ^name predict-yes)
=>WM: (13452: R964 ^value 1)
=>WM: (13451: R1 ^reward R964)
=>WM: (13450: I3 ^see 0)
<=WM: (13441: S1 ^operator O1919 +)
<=WM: (13442: S1 ^operator O1920 +)
<=WM: (13443: S1 ^operator O1920)
<=WM: (13436: R1 ^reward R963)
<=WM: (13435: I3 ^see 1)
<=WM: (13439: O1920 ^name predict-no)
<=WM: (13438: O1919 ^name predict-yes)
<=WM: (13437: R963 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1921 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1922 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1920 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1919 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13457: S1 ^operator O1922)

   961:    O: O1922 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N961 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N960 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13458: I3 ^predict-no N961)
<=WM: (13445: N960 ^status complete)
<=WM: (13444: I3 ^predict-no N960)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (13462: I2 ^dir U)
=>WM: (13461: I2 ^reward 1)
=>WM: (13460: I2 ^see 0)
=>WM: (13459: N961 ^status complete)
<=WM: (13448: I2 ^dir U)
<=WM: (13447: I2 ^reward 1)
<=WM: (13446: I2 ^see 0)
=>WM: (13463: I2 ^level-1 R1-root)
<=WM: (13449: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R965 ^value 1 +)
 (R1 ^reward R965 +)
Firing propose*predict-yes
 -->
 (O1923 ^name predict-yes +)
 (S1 ^operator O1923 +)
Firing propose*predict-no
 -->
 (O1924 ^name predict-no +)
 (S1 ^operator O1924 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1922 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1921 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1922 ^name predict-no +)
 (S1 ^operator O1922 +)
Retracting propose*predict-yes
 -->
 (O1921 ^name predict-yes +)
 (S1 ^operator O1921 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R964 ^value 1 +)
 (R1 ^reward R964 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1922 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1921 = 0.)
=>WM: (13469: S1 ^operator O1924 +)
=>WM: (13468: S1 ^operator O1923 +)
=>WM: (13467: O1924 ^name predict-no)
=>WM: (13466: O1923 ^name predict-yes)
=>WM: (13465: R965 ^value 1)
=>WM: (13464: R1 ^reward R965)
<=WM: (13455: S1 ^operator O1921 +)
<=WM: (13456: S1 ^operator O1922 +)
<=WM: (13457: S1 ^operator O1922)
<=WM: (13451: R1 ^reward R964)
<=WM: (13454: O1922 ^name predict-no)
<=WM: (13453: O1921 ^name predict-yes)
<=WM: (13452: R964 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1922 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1921 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13470: S1 ^operator O1924)

   962:    O: O1924 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N962 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N961 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13471: I3 ^predict-no N962)
<=WM: (13459: N961 ^status complete)
<=WM: (13458: I3 ^predict-no N961)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase --- 
=>WM: (13475: I2 ^dir U)
=>WM: (13474: I2 ^reward 1)
=>WM: (13473: I2 ^see 0)
=>WM: (13472: N962 ^status complete)
<=WM: (13462: I2 ^dir U)
<=WM: (13461: I2 ^reward 1)
<=WM: (13460: I2 ^see 0)
=>WM: (13476: I2 ^level-1 R1-root)
<=WM: (13463: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R966 ^value 1 +)
 (R1 ^reward R966 +)
Firing propose*predict-yes
 -->
 (O1925 ^name predict-yes +)
 (S1 ^operator O1925 +)
Firing propose*predict-no
 -->
 (O1926 ^name predict-no +)
 (S1 ^operator O1926 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1924 ^name predict-no +)
 (S1 ^operator O1924 +)
Retracting propose*predict-yes
 -->
 (O1923 ^name predict-yes +)
 (S1 ^operator O1923 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R965 ^value 1 +)
 (R1 ^reward R965 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.)
=>WM: (13482: S1 ^operator O1926 +)
=>WM: (13481: S1 ^operator O1925 +)
=>WM: (13480: O1926 ^name predict-no)
=>WM: (13479: O1925 ^name predict-yes)
=>WM: (13478: R966 ^value 1)
=>WM: (13477: R1 ^reward R966)
<=WM: (13468: S1 ^operator O1923 +)
<=WM: (13469: S1 ^operator O1924 +)
<=WM: (13470: S1 ^operator O1924)
<=WM: (13464: R1 ^reward R965)
<=WM: (13467: O1924 ^name predict-no)
<=WM: (13466: O1923 ^name predict-yes)
<=WM: (13465: R965 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1925 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1926 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13483: S1 ^operator O1926)

   963:    O: O1926 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N963 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N962 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13484: I3 ^predict-no N963)
<=WM: (13472: N962 ^status complete)
<=WM: (13471: I3 ^predict-no N962)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase --- 
=>WM: (13488: I2 ^dir L)
=>WM: (13487: I2 ^reward 1)
=>WM: (13486: I2 ^see 0)
=>WM: (13485: N963 ^status complete)
<=WM: (13475: I2 ^dir U)
<=WM: (13474: I2 ^reward 1)
<=WM: (13473: I2 ^see 0)
=>WM: (13489: I2 ^level-1 R1-root)
<=WM: (13476: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1926 = 0.03900899329983293)
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1925 = 0.6597567463960877)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R967 ^value 1 +)
 (R1 ^reward R967 +)
Firing propose*predict-yes
 -->
 (O1927 ^name predict-yes +)
 (S1 ^operator O1927 +)
Firing propose*predict-no
 -->
 (O1928 ^name predict-no +)
 (S1 ^operator O1928 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1926 = 0.3212956367143155)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1925 = 0.3402464375075579)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1926 ^name predict-no +)
 (S1 ^operator O1926 +)
Retracting propose*predict-yes
 -->
 (O1925 ^name predict-yes +)
 (S1 ^operator O1925 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R966 ^value 1 +)
 (R1 ^reward R966 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1926 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1925 = 0.)
=>WM: (13496: S1 ^operator O1928 +)
=>WM: (13495: S1 ^operator O1927 +)
=>WM: (13494: I3 ^dir L)
=>WM: (13493: O1928 ^name predict-no)
=>WM: (13492: O1927 ^name predict-yes)
=>WM: (13491: R967 ^value 1)
=>WM: (13490: R1 ^reward R967)
<=WM: (13481: S1 ^operator O1925 +)
<=WM: (13482: S1 ^operator O1926 +)
<=WM: (13483: S1 ^operator O1926)
<=WM: (13440: I3 ^dir U)
<=WM: (13477: R1 ^reward R966)
<=WM: (13480: O1926 ^name predict-no)
<=WM: (13479: O1925 ^name predict-yes)
<=WM: (13478: R966 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1927 = 0.6597567463960877)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1927 = 0.3402464375075579)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1928 = 0.03900899329983293)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1928 = 0.3212956367143155)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1926 = 0.3212956367143155)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1926 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1925 = 0.3402464375075579)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1925 = 0.6597567463960877)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13497: S1 ^operator O1927)

   964:    O: O1927 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N964 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N963 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13498: I3 ^predict-yes N964)
<=WM: (13485: N963 ^status complete)
<=WM: (13484: I3 ^predict-no N963)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase --- 
=>WM: (13502: I2 ^dir R)
=>WM: (13501: I2 ^reward 1)
=>WM: (13500: I2 ^see 1)
=>WM: (13499: N964 ^status complete)
<=WM: (13488: I2 ^dir L)
<=WM: (13487: I2 ^reward 1)
<=WM: (13486: I2 ^see 0)
=>WM: (13503: I2 ^level-1 L1-root)
<=WM: (13489: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1927 = 0.8879071751461909)
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1928 = 0.02370016355578053)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R968 ^value 1 +)
 (R1 ^reward R968 +)
Firing propose*predict-yes
 -->
 (O1929 ^name predict-yes +)
 (S1 ^operator O1929 +)
Firing propose*predict-no
 -->
 (O1930 ^name predict-no +)
 (S1 ^operator O1930 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1928 = 0.3993314366691663)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1927 = 0.1121092773165946)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1928 ^name predict-no +)
 (S1 ^operator O1928 +)
Retracting propose*predict-yes
 -->
 (O1927 ^name predict-yes +)
 (S1 ^operator O1927 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R967 ^value 1 +)
 (R1 ^reward R967 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1928 = 0.3212956367143155)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1928 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1927 = 0.3402464375075579)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1927 = 0.6597567463960877)
=>WM: (13511: S1 ^operator O1930 +)
=>WM: (13510: S1 ^operator O1929 +)
=>WM: (13509: I3 ^dir R)
=>WM: (13508: O1930 ^name predict-no)
=>WM: (13507: O1929 ^name predict-yes)
=>WM: (13506: R968 ^value 1)
=>WM: (13505: R1 ^reward R968)
=>WM: (13504: I3 ^see 1)
<=WM: (13495: S1 ^operator O1927 +)
<=WM: (13497: S1 ^operator O1927)
<=WM: (13496: S1 ^operator O1928 +)
<=WM: (13494: I3 ^dir L)
<=WM: (13490: R1 ^reward R967)
<=WM: (13450: I3 ^see 0)
<=WM: (13493: O1928 ^name predict-no)
<=WM: (13492: O1927 ^name predict-yes)
<=WM: (13491: R967 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1929 = 0.1121092773165946)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1929 = 0.8879071751461909)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1930 = 0.3993314366691663)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1930 = 0.02370016355578053)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1928 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1928 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1927 = 0.1121092773165946)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1927 = 0.8879071751461909)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577179 -0.236932 0.340246 -> 0.577178 -0.236932 0.340246(R,m,v=1,0.89172,0.0971746)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422823 0.236934 0.659757 -> 0.422823 0.236934 0.659756(R,m,v=1,1,0)
=>WM: (13512: S1 ^operator O1929)

   965:    O: O1929 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N965 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N964 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13513: I3 ^predict-yes N965)
<=WM: (13499: N964 ^status complete)
<=WM: (13498: I3 ^predict-yes N964)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
sleeping...
--- Input Phase --- 
=>WM: (13517: I2 ^dir U)
=>WM: (13516: I2 ^reward 1)
=>WM: (13515: I2 ^see 1)
=>WM: (13514: N965 ^status complete)
<=WM: (13502: I2 ^dir R)
<=WM: (13501: I2 ^reward 1)
<=WM: (13500: I2 ^see 1)
=>WM: (13518: I2 ^level-1 R1-root)
<=WM: (13503: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R969 ^value 1 +)
 (R1 ^reward R969 +)
Firing propose*predict-yes
 -->
 (O1931 ^name predict-yes +)
 (S1 ^operator O1931 +)
Firing propose*predict-no
 -->
 (O1932 ^name predict-no +)
 (S1 ^operator O1932 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1930 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1929 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1930 ^name predict-no +)
 (S1 ^operator O1930 +)
Retracting propose*predict-yes
 -->
 (O1929 ^name predict-yes +)
 (S1 ^operator O1929 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R968 ^value 1 +)
 (R1 ^reward R968 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1930 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1930 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1929 = 0.8879071751461909)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1929 = 0.1121092773165946)
=>WM: (13525: S1 ^operator O1932 +)
=>WM: (13524: S1 ^operator O1931 +)
=>WM: (13523: I3 ^dir U)
=>WM: (13522: O1932 ^name predict-no)
=>WM: (13521: O1931 ^name predict-yes)
=>WM: (13520: R969 ^value 1)
=>WM: (13519: R1 ^reward R969)
<=WM: (13510: S1 ^operator O1929 +)
<=WM: (13512: S1 ^operator O1929)
<=WM: (13511: S1 ^operator O1930 +)
<=WM: (13509: I3 ^dir R)
<=WM: (13505: R1 ^reward R968)
<=WM: (13508: O1930 ^name predict-no)
<=WM: (13507: O1929 ^name predict-yes)
<=WM: (13506: R968 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1931 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1932 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1930 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1929 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.619033 -0.506924 0.112109 -> 0.61903 -0.506923 0.112107(R,m,v=1,0.896774,0.0931713)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380987 0.506921 0.887907 -> 0.380984 0.506921 0.887905(R,m,v=1,1,0)
=>WM: (13526: S1 ^operator O1932)

   966:    O: O1932 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N966 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N965 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13527: I3 ^predict-no N966)
<=WM: (13514: N965 ^status complete)
<=WM: (13513: I3 ^predict-yes N965)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase --- 
=>WM: (13531: I2 ^dir L)
=>WM: (13530: I2 ^reward 1)
=>WM: (13529: I2 ^see 0)
=>WM: (13528: N966 ^status complete)
<=WM: (13517: I2 ^dir U)
<=WM: (13516: I2 ^reward 1)
<=WM: (13515: I2 ^see 1)
=>WM: (13532: I2 ^level-1 R1-root)
<=WM: (13518: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1932 = 0.03900899329983293)
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1931 = 0.6597562688105409)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R970 ^value 1 +)
 (R1 ^reward R970 +)
Firing propose*predict-yes
 -->
 (O1933 ^name predict-yes +)
 (S1 ^operator O1933 +)
Firing propose*predict-no
 -->
 (O1934 ^name predict-no +)
 (S1 ^operator O1934 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1932 = 0.3212956367143155)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1931 = 0.3402459599220111)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1932 ^name predict-no +)
 (S1 ^operator O1932 +)
Retracting propose*predict-yes
 -->
 (O1931 ^name predict-yes +)
 (S1 ^operator O1931 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R969 ^value 1 +)
 (R1 ^reward R969 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1932 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1931 = 0.)
=>WM: (13540: S1 ^operator O1934 +)
=>WM: (13539: S1 ^operator O1933 +)
=>WM: (13538: I3 ^dir L)
=>WM: (13537: O1934 ^name predict-no)
=>WM: (13536: O1933 ^name predict-yes)
=>WM: (13535: R970 ^value 1)
=>WM: (13534: R1 ^reward R970)
=>WM: (13533: I3 ^see 0)
<=WM: (13524: S1 ^operator O1931 +)
<=WM: (13525: S1 ^operator O1932 +)
<=WM: (13526: S1 ^operator O1932)
<=WM: (13523: I3 ^dir U)
<=WM: (13519: R1 ^reward R969)
<=WM: (13504: I3 ^see 1)
<=WM: (13522: O1932 ^name predict-no)
<=WM: (13521: O1931 ^name predict-yes)
<=WM: (13520: R969 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1933 = 0.6597562688105409)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1933 = 0.3402459599220111)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1934 = 0.03900899329983293)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1934 = 0.3212956367143155)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1932 = 0.3212956367143155)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1932 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1931 = 0.3402459599220111)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1931 = 0.6597562688105409)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13541: S1 ^operator O1933)

   967:    O: O1933 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N967 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N966 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13542: I3 ^predict-yes N967)
<=WM: (13528: N966 ^status complete)
<=WM: (13527: I3 ^predict-no N966)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase --- 
=>WM: (13546: I2 ^dir R)
=>WM: (13545: I2 ^reward 1)
=>WM: (13544: I2 ^see 1)
=>WM: (13543: N967 ^status complete)
<=WM: (13531: I2 ^dir L)
<=WM: (13530: I2 ^reward 1)
<=WM: (13529: I2 ^see 0)
=>WM: (13547: I2 ^level-1 L1-root)
<=WM: (13532: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1933 = 0.887904707276773)
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1934 = 0.02370016355578053)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R971 ^value 1 +)
 (R1 ^reward R971 +)
Firing propose*predict-yes
 -->
 (O1935 ^name predict-yes +)
 (S1 ^operator O1935 +)
Firing propose*predict-no
 -->
 (O1936 ^name predict-no +)
 (S1 ^operator O1936 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1934 = 0.3993314366691663)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1933 = 0.1121068094471768)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1934 ^name predict-no +)
 (S1 ^operator O1934 +)
Retracting propose*predict-yes
 -->
 (O1933 ^name predict-yes +)
 (S1 ^operator O1933 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R970 ^value 1 +)
 (R1 ^reward R970 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1934 = 0.3212956367143155)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1934 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1933 = 0.3402459599220111)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1933 = 0.6597562688105409)
=>WM: (13555: S1 ^operator O1936 +)
=>WM: (13554: S1 ^operator O1935 +)
=>WM: (13553: I3 ^dir R)
=>WM: (13552: O1936 ^name predict-no)
=>WM: (13551: O1935 ^name predict-yes)
=>WM: (13550: R971 ^value 1)
=>WM: (13549: R1 ^reward R971)
=>WM: (13548: I3 ^see 1)
<=WM: (13539: S1 ^operator O1933 +)
<=WM: (13541: S1 ^operator O1933)
<=WM: (13540: S1 ^operator O1934 +)
<=WM: (13538: I3 ^dir L)
<=WM: (13534: R1 ^reward R970)
<=WM: (13533: I3 ^see 0)
<=WM: (13537: O1934 ^name predict-no)
<=WM: (13536: O1933 ^name predict-yes)
<=WM: (13535: R970 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1935 = 0.1121068094471768)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1935 = 0.887904707276773)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1936 = 0.3993314366691663)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1936 = 0.02370016355578053)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1934 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1934 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1933 = 0.1121068094471768)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1933 = 0.887904707276773)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236932 0.340246 -> 0.577178 -0.236933 0.340246(R,m,v=1,0.892405,0.0966298)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422823 0.236934 0.659756 -> 0.422823 0.236933 0.659756(R,m,v=1,1,0)
=>WM: (13556: S1 ^operator O1935)

   968:    O: O1935 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N968 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N967 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13557: I3 ^predict-yes N968)
<=WM: (13543: N967 ^status complete)
<=WM: (13542: I3 ^predict-yes N967)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase --- 
=>WM: (13561: I2 ^dir U)
=>WM: (13560: I2 ^reward 1)
=>WM: (13559: I2 ^see 1)
=>WM: (13558: N968 ^status complete)
<=WM: (13546: I2 ^dir R)
<=WM: (13545: I2 ^reward 1)
<=WM: (13544: I2 ^see 1)
=>WM: (13562: I2 ^level-1 R1-root)
<=WM: (13547: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R972 ^value 1 +)
 (R1 ^reward R972 +)
Firing propose*predict-yes
 -->
 (O1937 ^name predict-yes +)
 (S1 ^operator O1937 +)
Firing propose*predict-no
 -->
 (O1938 ^name predict-no +)
 (S1 ^operator O1938 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1936 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1935 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1936 ^name predict-no +)
 (S1 ^operator O1936 +)
Retracting propose*predict-yes
 -->
 (O1935 ^name predict-yes +)
 (S1 ^operator O1935 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R971 ^value 1 +)
 (R1 ^reward R971 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1936 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1936 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1935 = 0.887904707276773)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1935 = 0.1121068094471768)
=>WM: (13569: S1 ^operator O1938 +)
=>WM: (13568: S1 ^operator O1937 +)
=>WM: (13567: I3 ^dir U)
=>WM: (13566: O1938 ^name predict-no)
=>WM: (13565: O1937 ^name predict-yes)
=>WM: (13564: R972 ^value 1)
=>WM: (13563: R1 ^reward R972)
<=WM: (13554: S1 ^operator O1935 +)
<=WM: (13556: S1 ^operator O1935)
<=WM: (13555: S1 ^operator O1936 +)
<=WM: (13553: I3 ^dir R)
<=WM: (13549: R1 ^reward R971)
<=WM: (13552: O1936 ^name predict-no)
<=WM: (13551: O1935 ^name predict-yes)
<=WM: (13550: R971 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1937 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1938 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1936 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1935 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.61903 -0.506923 0.112107 -> 0.619028 -0.506923 0.112105(R,m,v=1,0.897436,0.0926385)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380984 0.506921 0.887905 -> 0.380982 0.506921 0.887903(R,m,v=1,1,0)
=>WM: (13570: S1 ^operator O1938)

   969:    O: O1938 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N969 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N968 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13571: I3 ^predict-no N969)
<=WM: (13558: N968 ^status complete)
<=WM: (13557: I3 ^predict-yes N968)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (13575: I2 ^dir L)
=>WM: (13574: I2 ^reward 1)
=>WM: (13573: I2 ^see 0)
=>WM: (13572: N969 ^status complete)
<=WM: (13561: I2 ^dir U)
<=WM: (13560: I2 ^reward 1)
<=WM: (13559: I2 ^see 1)
=>WM: (13576: I2 ^level-1 R1-root)
<=WM: (13562: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1938 = 0.03900899329983293)
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1937 = 0.6597559345006581)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R973 ^value 1 +)
 (R1 ^reward R973 +)
Firing propose*predict-yes
 -->
 (O1939 ^name predict-yes +)
 (S1 ^operator O1939 +)
Firing propose*predict-no
 -->
 (O1940 ^name predict-no +)
 (S1 ^operator O1940 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1938 = 0.3212956367143155)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1937 = 0.3402456256121283)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1938 ^name predict-no +)
 (S1 ^operator O1938 +)
Retracting propose*predict-yes
 -->
 (O1937 ^name predict-yes +)
 (S1 ^operator O1937 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R972 ^value 1 +)
 (R1 ^reward R972 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1938 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1937 = 0.)
=>WM: (13584: S1 ^operator O1940 +)
=>WM: (13583: S1 ^operator O1939 +)
=>WM: (13582: I3 ^dir L)
=>WM: (13581: O1940 ^name predict-no)
=>WM: (13580: O1939 ^name predict-yes)
=>WM: (13579: R973 ^value 1)
=>WM: (13578: R1 ^reward R973)
=>WM: (13577: I3 ^see 0)
<=WM: (13568: S1 ^operator O1937 +)
<=WM: (13569: S1 ^operator O1938 +)
<=WM: (13570: S1 ^operator O1938)
<=WM: (13567: I3 ^dir U)
<=WM: (13563: R1 ^reward R972)
<=WM: (13548: I3 ^see 1)
<=WM: (13566: O1938 ^name predict-no)
<=WM: (13565: O1937 ^name predict-yes)
<=WM: (13564: R972 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1939 = 0.6597559345006581)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1939 = 0.3402456256121283)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1940 = 0.03900899329983293)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1940 = 0.3212956367143155)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1938 = 0.3212956367143155)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1938 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1937 = 0.3402456256121283)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1937 = 0.6597559345006581)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13585: S1 ^operator O1939)

   970:    O: O1939 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N970 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N969 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13586: I3 ^predict-yes N970)
<=WM: (13572: N969 ^status complete)
<=WM: (13571: I3 ^predict-no N969)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (13590: I2 ^dir U)
=>WM: (13589: I2 ^reward 1)
=>WM: (13588: I2 ^see 1)
=>WM: (13587: N970 ^status complete)
<=WM: (13575: I2 ^dir L)
<=WM: (13574: I2 ^reward 1)
<=WM: (13573: I2 ^see 0)
=>WM: (13591: I2 ^level-1 L1-root)
<=WM: (13576: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R974 ^value 1 +)
 (R1 ^reward R974 +)
Firing propose*predict-yes
 -->
 (O1941 ^name predict-yes +)
 (S1 ^operator O1941 +)
Firing propose*predict-no
 -->
 (O1942 ^name predict-no +)
 (S1 ^operator O1942 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1940 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1939 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1940 ^name predict-no +)
 (S1 ^operator O1940 +)
Retracting propose*predict-yes
 -->
 (O1939 ^name predict-yes +)
 (S1 ^operator O1939 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R973 ^value 1 +)
 (R1 ^reward R973 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1940 = 0.3212956367143155)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1940 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1939 = 0.3402456256121283)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1939 = 0.6597559345006581)
=>WM: (13599: S1 ^operator O1942 +)
=>WM: (13598: S1 ^operator O1941 +)
=>WM: (13597: I3 ^dir U)
=>WM: (13596: O1942 ^name predict-no)
=>WM: (13595: O1941 ^name predict-yes)
=>WM: (13594: R974 ^value 1)
=>WM: (13593: R1 ^reward R974)
=>WM: (13592: I3 ^see 1)
<=WM: (13583: S1 ^operator O1939 +)
<=WM: (13585: S1 ^operator O1939)
<=WM: (13584: S1 ^operator O1940 +)
<=WM: (13582: I3 ^dir L)
<=WM: (13578: R1 ^reward R973)
<=WM: (13577: I3 ^see 0)
<=WM: (13581: O1940 ^name predict-no)
<=WM: (13580: O1939 ^name predict-yes)
<=WM: (13579: R973 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1941 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1942 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1940 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1939 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340246 -> 0.577178 -0.236933 0.340245(R,m,v=1,0.893082,0.0960911)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422823 0.236933 0.659756 -> 0.422822 0.236933 0.659756(R,m,v=1,1,0)
=>WM: (13600: S1 ^operator O1942)

   971:    O: O1942 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N971 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N970 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13601: I3 ^predict-no N971)
<=WM: (13587: N970 ^status complete)
<=WM: (13586: I3 ^predict-yes N970)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (13605: I2 ^dir L)
=>WM: (13604: I2 ^reward 1)
=>WM: (13603: I2 ^see 0)
=>WM: (13602: N971 ^status complete)
<=WM: (13590: I2 ^dir U)
<=WM: (13589: I2 ^reward 1)
<=WM: (13588: I2 ^see 1)
=>WM: (13606: I2 ^level-1 L1-root)
<=WM: (13591: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1941 = 0.02884852834965246)
Firing rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1942 = 0.6787425437117627)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R975 ^value 1 +)
 (R1 ^reward R975 +)
Firing propose*predict-yes
 -->
 (O1943 ^name predict-yes +)
 (S1 ^operator O1943 +)
Firing propose*predict-no
 -->
 (O1944 ^name predict-no +)
 (S1 ^operator O1944 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1942 = 0.3212956367143155)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1941 = 0.3402453915952103)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1942 ^name predict-no +)
 (S1 ^operator O1942 +)
Retracting propose*predict-yes
 -->
 (O1941 ^name predict-yes +)
 (S1 ^operator O1941 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R974 ^value 1 +)
 (R1 ^reward R974 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1942 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1941 = 0.)
=>WM: (13614: S1 ^operator O1944 +)
=>WM: (13613: S1 ^operator O1943 +)
=>WM: (13612: I3 ^dir L)
=>WM: (13611: O1944 ^name predict-no)
=>WM: (13610: O1943 ^name predict-yes)
=>WM: (13609: R975 ^value 1)
=>WM: (13608: R1 ^reward R975)
=>WM: (13607: I3 ^see 0)
<=WM: (13598: S1 ^operator O1941 +)
<=WM: (13599: S1 ^operator O1942 +)
<=WM: (13600: S1 ^operator O1942)
<=WM: (13597: I3 ^dir U)
<=WM: (13593: R1 ^reward R974)
<=WM: (13592: I3 ^see 1)
<=WM: (13596: O1942 ^name predict-no)
<=WM: (13595: O1941 ^name predict-yes)
<=WM: (13594: R974 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1943 = 0.02884852834965246)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1943 = 0.3402453915952103)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1944 = 0.6787425437117627)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1944 = 0.3212956367143155)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1942 = 0.3212956367143155)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1942 = 0.6787425437117627)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1941 = 0.3402453915952103)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1941 = 0.02884852834965246)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13615: S1 ^operator O1944)

   972:    O: O1944 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N972 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N971 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13616: I3 ^predict-no N972)
<=WM: (13602: N971 ^status complete)
<=WM: (13601: I3 ^predict-no N971)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase ---
=>WM: (13620: I2 ^dir R)
=>WM: (13619: I2 ^reward 1)
=>WM: (13618: I2 ^see 0)
=>WM: (13617: N972 ^status complete)
<=WM: (13605: I2 ^dir L)
<=WM: (13604: I2 ^reward 1)
<=WM: (13603: I2 ^see 0)
=>WM: (13621: I2 ^level-1 L0-root)
<=WM: (13606: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O1943 = 0.8878798118503368)
Firing rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O1944 = -0.1957074416057287)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R976 ^value 1 +)
 (R1 ^reward R976 +)
Firing propose*predict-yes
 -->
 (O1945 ^name predict-yes +)
 (S1 ^operator O1945 +)
Firing propose*predict-no
 -->
 (O1946 ^name predict-no +)
 (S1 ^operator O1946 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1944 = 0.3993314366691663)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1943 = 0.1121050819385843)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1944 ^name predict-no +)
 (S1 ^operator O1944 +)
Retracting propose*predict-yes
 -->
 (O1943 ^name predict-yes +)
 (S1 ^operator O1943 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R975 ^value 1 +)
 (R1 ^reward R975 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1944 = 0.3212956367143155)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1944 = 0.6787425437117627)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1943 = 0.3402453915952103)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1943 = 0.02884852834965246)
=>WM: (13628: S1 ^operator O1946 +)
=>WM: (13627: S1 ^operator O1945 +)
=>WM: (13626: I3 ^dir R)
=>WM: (13625: O1946 ^name predict-no)
=>WM: (13624: O1945 ^name predict-yes)
=>WM: (13623: R976 ^value 1)
=>WM: (13622: R1 ^reward R976)
<=WM: (13613: S1 ^operator O1943 +)
<=WM: (13614: S1 ^operator O1944 +)
<=WM: (13615: S1 ^operator O1944)
<=WM: (13612: I3 ^dir L)
<=WM: (13608: R1 ^reward R975)
<=WM: (13611: O1944 ^name predict-no)
<=WM: (13610: O1943 ^name predict-yes)
<=WM: (13609: R975 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1945 = 0.1121050819385843)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O1945 = 0.8878798118503368)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1946 = 0.3993314366691663)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O1946 = -0.1957074416057287)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1944 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O1944 = -0.1957074416057287)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1943 = 0.1121050819385843)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O1943 = 0.8878798118503368)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.641773 -0.320478 0.321296 -> 0.641767 -0.320477 0.32129(R,m,v=1,0.933333,0.0626398)
RL update rl*prefer*rvt*predict-no*H0*2*H1*17 0.358265 0.320477 0.678743 -> 0.358259 0.320477 0.678737(R,m,v=1,1,0)
=>WM: (13629: S1 ^operator O1945)

   973:    O: O1945 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N973 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N972 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13630: I3 ^predict-yes N973)
<=WM: (13617: N972 ^status complete)
<=WM: (13616: I3 ^predict-no N972)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (13634: I2 ^dir U)
=>WM: (13633: I2 ^reward 1)
=>WM: (13632: I2 ^see 1)
=>WM: (13631: N973 ^status complete)
<=WM: (13620: I2 ^dir R)
<=WM: (13619: I2 ^reward 1)
<=WM: (13618: I2 ^see 0)
=>WM: (13635: I2 ^level-1 R1-root)
<=WM: (13621: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R977 ^value 1 +)
 (R1 ^reward R977 +)
Firing propose*predict-yes
 -->
 (O1947 ^name predict-yes +)
 (S1 ^operator O1947 +)
Firing propose*predict-no
 -->
 (O1948 ^name predict-no +)
 (S1 ^operator O1948 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1946 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1945 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1946 ^name predict-no +)
 (S1 ^operator O1946 +)
Retracting propose*predict-yes
 -->
 (O1945 ^name predict-yes +)
 (S1 ^operator O1945 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R976 ^value 1 +)
 (R1 ^reward R976 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O1946 = -0.1957074416057287)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1946 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O1945 = 0.8878798118503368)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1945 = 0.1121050819385843)
=>WM: (13643: S1 ^operator O1948 +)
=>WM: (13642: S1 ^operator O1947 +)
=>WM: (13641: I3 ^dir U)
=>WM: (13640: O1948 ^name predict-no)
=>WM: (13639: O1947 ^name predict-yes)
=>WM: (13638: R977 ^value 1)
=>WM: (13637: R1 ^reward R977)
=>WM: (13636: I3 ^see 1)
<=WM: (13627: S1 ^operator O1945 +)
<=WM: (13629: S1 ^operator O1945)
<=WM: (13628: S1 ^operator O1946 +)
<=WM: (13626: I3 ^dir R)
<=WM: (13622: R1 ^reward R976)
<=WM: (13607: I3 ^see 0)
<=WM: (13625: O1946 ^name predict-no)
<=WM: (13624: O1945 ^name predict-yes)
<=WM: (13623: R976 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1947 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1948 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1946 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1945 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.619028 -0.506923 0.112105 -> 0.619031 -0.506923 0.112107(R,m,v=1,0.898089,0.0921117)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*20 0.380954 0.506926 0.88788 -> 0.380957 0.506925 0.887882(R,m,v=1,1,0)
=>WM: (13644: S1 ^operator O1948)

   974:    O: O1948 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N974 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N973 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13645: I3 ^predict-no N974)
<=WM: (13631: N973 ^status complete)
<=WM: (13630: I3 ^predict-yes N973)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (13649: I2 ^dir L)
=>WM: (13648: I2 ^reward 1)
=>WM: (13647: I2 ^see 0)
=>WM: (13646: N974 ^status complete)
<=WM: (13634: I2 ^dir U)
<=WM: (13633: I2 ^reward 1)
<=WM: (13632: I2 ^see 1)
=>WM: (13650: I2 ^level-1 R1-root)
<=WM: (13635: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1948 = 0.03900899329983293)
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1947 = 0.6597557004837401)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R978 ^value 1 +)
 (R1 ^reward R978 +)
Firing propose*predict-yes
 -->
 (O1949 ^name predict-yes +)
 (S1 ^operator O1949 +)
Firing propose*predict-no
 -->
 (O1950 ^name predict-no +)
 (S1 ^operator O1950 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1948 = 0.3212899096504038)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1947 = 0.3402453915952103)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1948 ^name predict-no +)
 (S1 ^operator O1948 +)
Retracting propose*predict-yes
 -->
 (O1947 ^name predict-yes +)
 (S1 ^operator O1947 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R977 ^value 1 +)
 (R1 ^reward R977 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1948 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1947 = 0.)
=>WM: (13658: S1 ^operator O1950 +)
=>WM: (13657: S1 ^operator O1949 +)
=>WM: (13656: I3 ^dir L)
=>WM: (13655: O1950 ^name predict-no)
=>WM: (13654: O1949 ^name predict-yes)
=>WM: (13653: R978 ^value 1)
=>WM: (13652: R1 ^reward R978)
=>WM: (13651: I3 ^see 0)
<=WM: (13642: S1 ^operator O1947 +)
<=WM: (13643: S1 ^operator O1948 +)
<=WM: (13644: S1 ^operator O1948)
<=WM: (13641: I3 ^dir U)
<=WM: (13637: R1 ^reward R977)
<=WM: (13636: I3 ^see 1)
<=WM: (13640: O1948 ^name predict-no)
<=WM: (13639: O1947 ^name predict-yes)
<=WM: (13638: R977 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1949 = 0.6597557004837401)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1949 = 0.3402453915952103)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1950 = 0.03900899329983293)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1950 = 0.3212899096504038)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1948 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1948 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1947 = 0.3402453915952103)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1947 = 0.6597557004837401)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13659: S1 ^operator O1949)

   975:    O: O1949 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N975 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N974 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13660: I3 ^predict-yes N975)
<=WM: (13646: N974 ^status complete)
<=WM: (13645: I3 ^predict-no N974)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase ---
=>WM: (13664: I2 ^dir R)
=>WM: (13663: I2 ^reward 1)
=>WM: (13662: I2 ^see 1)
=>WM: (13661: N975 ^status complete)
<=WM: (13649: I2 ^dir L)
<=WM: (13648: I2 ^reward 1)
<=WM: (13647: I2 ^see 0)
=>WM: (13665: I2 ^level-1 L1-root)
<=WM: (13650: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1949 = 0.8879029797681804)
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1950 = 0.02370016355578053)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R979 ^value 1 +)
 (R1 ^reward R979 +)
Firing propose*predict-yes
 -->
 (O1951 ^name predict-yes +)
 (S1 ^operator O1951 +)
Firing propose*predict-no
 -->
 (O1952 ^name predict-no +)
 (S1 ^operator O1952 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1950 = 0.3993314366691663)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1949 = 0.1121073478702461)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1950 ^name predict-no +)
 (S1 ^operator O1950 +)
Retracting propose*predict-yes
 -->
 (O1949 ^name predict-yes +)
 (S1 ^operator O1949 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R978 ^value 1 +)
 (R1 ^reward R978 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1950 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1950 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1949 = 0.3402453915952103)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1949 = 0.6597557004837401)
=>WM: (13673: S1 ^operator O1952 +)
=>WM: (13672: S1 ^operator O1951 +)
=>WM: (13671: I3 ^dir R)
=>WM: (13670: O1952 ^name predict-no)
=>WM: (13669: O1951 ^name predict-yes)
=>WM: (13668: R979 ^value 1)
=>WM: (13667: R1 ^reward R979)
=>WM: (13666: I3 ^see 1)
<=WM: (13657: S1 ^operator O1949 +)
<=WM: (13659: S1 ^operator O1949)
<=WM: (13658: S1 ^operator O1950 +)
<=WM: (13656: I3 ^dir L)
<=WM: (13652: R1 ^reward R978)
<=WM: (13651: I3 ^see 0)
<=WM: (13655: O1950 ^name predict-no)
<=WM: (13654: O1949 ^name predict-yes)
<=WM: (13653: R978 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1951 = 0.1121073478702461)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1951 = 0.8879029797681804)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1952 = 0.3993314366691663)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1952 = 0.02370016355578053)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1950 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1950 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1949 = 0.1121073478702461)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1949 = 0.8879029797681804)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340245 -> 0.577178 -0.236933 0.340245(R,m,v=1,0.89375,0.0955582)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422822 0.236933 0.659756 -> 0.422822 0.236933 0.659756(R,m,v=1,1,0)
=>WM: (13674: S1 ^operator O1951)

   976:    O: O1951 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N976 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N975 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13675: I3 ^predict-yes N976)
<=WM: (13661: N975 ^status complete)
<=WM: (13660: I3 ^predict-yes N975)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase ---
=>WM: (13679: I2 ^dir R)
=>WM: (13678: I2 ^reward 1)
=>WM: (13677: I2 ^see 1)
=>WM: (13676: N976 ^status complete)
<=WM: (13664: I2 ^dir R)
<=WM: (13663: I2 ^reward 1)
<=WM: (13662: I2 ^see 1)
=>WM: (13680: I2 ^level-1 R1-root)
<=WM: (13665: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1952 = 0.6006758138031456)
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1951 = 0.1602187148382515)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R980 ^value 1 +)
 (R1 ^reward R980 +)
Firing propose*predict-yes
 -->
 (O1953 ^name predict-yes +)
 (S1 ^operator O1953 +)
Firing propose*predict-no
 -->
 (O1954 ^name predict-no +)
 (S1 ^operator O1954 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1952 = 0.3993314366691663)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1951 = 0.1121073478702461)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1952 ^name predict-no +)
 (S1 ^operator O1952 +)
Retracting propose*predict-yes
 -->
 (O1951 ^name predict-yes +)
 (S1 ^operator O1951 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R979 ^value 1 +)
 (R1 ^reward R979 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1952 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1952 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1951 = 0.8879029797681804)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1951 = 0.1121073478702461)
=>WM: (13686: S1 ^operator O1954 +)
=>WM: (13685: S1 ^operator O1953 +)
=>WM: (13684: O1954 ^name predict-no)
=>WM: (13683: O1953 ^name predict-yes)
=>WM: (13682: R980 ^value 1)
=>WM: (13681: R1 ^reward R980)
<=WM: (13672: S1 ^operator O1951 +)
<=WM: (13674: S1 ^operator O1951)
<=WM: (13673: S1 ^operator O1952 +)
<=WM: (13667: R1 ^reward R979)
<=WM: (13670: O1952 ^name predict-no)
<=WM: (13669: O1951 ^name predict-yes)
<=WM: (13668: R979 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1953 = 0.1121073478702461)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1953 = 0.1602187148382515)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1954 = 0.3993314366691663)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1954 = 0.6006758138031456)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1952 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1952 = 0.6006758138031456)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1951 = 0.1121073478702461)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1951 = 0.1602187148382515)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.619031 -0.506923 0.112107 -> 0.619029 -0.506923 0.112106(R,m,v=1,0.898734,0.0915907)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380982 0.506921 0.887903 -> 0.38098 0.506922 0.887901(R,m,v=1,1,0)
=>WM: (13687: S1 ^operator O1954)

   977:    O: O1954 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N977 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N976 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13688: I3 ^predict-no N977)
<=WM: (13676: N976 ^status complete)
<=WM: (13675: I3 ^predict-yes N976)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (13692: I2 ^dir U)
=>WM: (13691: I2 ^reward 1)
=>WM: (13690: I2 ^see 0)
=>WM: (13689: N977 ^status complete)
<=WM: (13679: I2 ^dir R)
<=WM: (13678: I2 ^reward 1)
<=WM: (13677: I2 ^see 1)
=>WM: (13693: I2 ^level-1 R0-root)
<=WM: (13680: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R981 ^value 1 +)
 (R1 ^reward R981 +)
Firing propose*predict-yes
 -->
 (O1955 ^name predict-yes +)
 (S1 ^operator O1955 +)
Firing propose*predict-no
 -->
 (O1956 ^name predict-no +)
 (S1 ^operator O1956 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1954 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1953 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1954 ^name predict-no +)
 (S1 ^operator O1954 +)
Retracting propose*predict-yes
 -->
 (O1953 ^name predict-yes +)
 (S1 ^operator O1953 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R980 ^value 1 +)
 (R1 ^reward R980 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1954 = 0.6006758138031456)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1954 = 0.3993314366691663)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1953 = 0.1602187148382515)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1953 = 0.1121057987244822)
=>WM: (13701: S1 ^operator O1956 +)
=>WM: (13700: S1 ^operator O1955 +)
=>WM: (13699: I3 ^dir U)
=>WM: (13698: O1956 ^name predict-no)
=>WM: (13697: O1955 ^name predict-yes)
=>WM: (13696: R981 ^value 1)
=>WM: (13695: R1 ^reward R981)
=>WM: (13694: I3 ^see 0)
<=WM: (13685: S1 ^operator O1953 +)
<=WM: (13686: S1 ^operator O1954 +)
<=WM: (13687: S1 ^operator O1954)
<=WM: (13671: I3 ^dir R)
<=WM: (13681: R1 ^reward R980)
<=WM: (13666: I3 ^see 1)
<=WM: (13684: O1954 ^name predict-no)
<=WM: (13683: O1953 ^name predict-yes)
<=WM: (13682: R980 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1954 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1953 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.55804 -0.158708 0.399331 -> 0.558039 -0.158709 0.39933(R,m,v=1,0.927273,0.0678492)
RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441967 0.158709 0.600676 -> 0.441966 0.158709 0.600675(R,m,v=1,1,0)
=>WM: (13702: S1 ^operator O1956)

   978:    O: O1956 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N978 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N977 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13703: I3 ^predict-no N978)
<=WM: (13689: N977 ^status complete)
<=WM: (13688: I3 ^predict-no N977)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (13707: I2 ^dir U)
=>WM: (13706: I2 ^reward 1)
=>WM: (13705: I2 ^see 0)
=>WM: (13704: N978 ^status complete)
<=WM: (13692: I2 ^dir U)
<=WM: (13691: I2 ^reward 1)
<=WM: (13690: I2 ^see 0)
=>WM: (13708: I2 ^level-1 R0-root)
<=WM: (13693: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R982 ^value 1 +)
 (R1 ^reward R982 +)
Firing propose*predict-yes
 -->
 (O1957 ^name predict-yes +)
 (S1 ^operator O1957 +)
Firing propose*predict-no
 -->
 (O1958 ^name predict-no +)
 (S1 ^operator O1958 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1956 ^name predict-no +)
 (S1 ^operator O1956 +)
Retracting propose*predict-yes
 -->
 (O1955 ^name predict-yes +)
 (S1 ^operator O1955 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R981 ^value 1 +)
 (R1 ^reward R981 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.)
=>WM: (13714: S1 ^operator O1958 +)
=>WM: (13713: S1 ^operator O1957 +)
=>WM: (13712: O1958 ^name predict-no)
=>WM: (13711: O1957 ^name predict-yes)
=>WM: (13710: R982 ^value 1)
=>WM: (13709: R1 ^reward R982)
<=WM: (13700: S1 ^operator O1955 +)
<=WM: (13701: S1 ^operator O1956 +)
<=WM: (13702: S1 ^operator O1956)
<=WM: (13695: R1 ^reward R981)
<=WM: (13698: O1956 ^name predict-no)
<=WM: (13697: O1955 ^name predict-yes)
<=WM: (13696: R981 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1957 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1958 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13715: S1 ^operator O1958)

   979:    O: O1958 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N979 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N978 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13716: I3 ^predict-no N979)
<=WM: (13704: N978 ^status complete)
<=WM: (13703: I3 ^predict-no N978)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (13720: I2 ^dir L)
=>WM: (13719: I2 ^reward 1)
=>WM: (13718: I2 ^see 0)
=>WM: (13717: N979 ^status complete)
<=WM: (13707: I2 ^dir U)
<=WM: (13706: I2 ^reward 1)
<=WM: (13705: I2 ^see 0)
=>WM: (13721: I2 ^level-1 R0-root)
<=WM: (13708: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1957 = 0.6597532174346419)
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1958 = 0.133561435542329)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R983 ^value 1 +)
 (R1 ^reward R983 +)
Firing propose*predict-yes
 -->
 (O1959 ^name predict-yes +)
 (S1 ^operator O1959 +)
Firing propose*predict-no
 -->
 (O1960 ^name predict-no +)
 (S1 ^operator O1960 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1958 = 0.3212899096504038)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1957 = 0.3402452277833678)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1958 ^name predict-no +)
 (S1 ^operator O1958 +)
Retracting propose*predict-yes
 -->
 (O1957 ^name predict-yes +)
 (S1 ^operator O1957 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R982 ^value 1 +)
 (R1 ^reward R982 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1958 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1957 = 0.)
=>WM: (13728: S1 ^operator O1960 +)
=>WM: (13727: S1 ^operator O1959 +)
=>WM: (13726: I3 ^dir L)
=>WM: (13725: O1960 ^name predict-no)
=>WM: (13724: O1959 ^name predict-yes)
=>WM: (13723: R983 ^value 1)
=>WM: (13722: R1 ^reward R983)
<=WM: (13713: S1 ^operator O1957 +)
<=WM: (13714: S1 ^operator O1958 +)
<=WM: (13715: S1 ^operator O1958)
<=WM: (13699: I3 ^dir U)
<=WM: (13709: R1 ^reward R982)
<=WM: (13712: O1958 ^name predict-no)
<=WM: (13711: O1957 ^name predict-yes)
<=WM: (13710: R982 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1959 = 0.6597532174346419)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1959 = 0.3402452277833678)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1960 = 0.133561435542329)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1960 = 0.3212899096504038)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1958 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1958 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1957 = 0.3402452277833678)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1957 = 0.6597532174346419)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13729: S1 ^operator O1959)

   980:    O: O1959 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N980 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N979 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13730: I3 ^predict-yes N980)
<=WM: (13717: N979 ^status complete)
<=WM: (13716: I3 ^predict-no N979)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase ---
=>WM: (13734: I2 ^dir R)
=>WM: (13733: I2 ^reward 1)
=>WM: (13732: I2 ^see 1)
=>WM: (13731: N980 ^status complete)
<=WM: (13720: I2 ^dir L)
<=WM: (13719: I2 ^reward 1)
<=WM: (13718: I2 ^see 0)
=>WM: (13735: I2 ^level-1 L1-root)
<=WM: (13721: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1959 = 0.8879014306224164)
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1960 = 0.02370016355578053)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R984 ^value 1 +)
 (R1 ^reward R984 +)
Firing propose*predict-yes
 -->
 (O1961 ^name predict-yes +)
 (S1 ^operator O1961 +)
Firing propose*predict-no
 -->
 (O1962 ^name predict-no +)
 (S1 ^operator O1962 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.3993303490983195)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.1121057987244822)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1960 ^name predict-no +)
 (S1 ^operator O1960 +)
Retracting propose*predict-yes
 -->
 (O1959 ^name predict-yes +)
 (S1 ^operator O1959 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R983 ^value 1 +)
 (R1 ^reward R983 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1960 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1960 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1959 = 0.3402452277833678)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1959 = 0.6597532174346419)
=>WM: (13743: S1 ^operator O1962 +)
=>WM: (13742: S1 ^operator O1961 +)
=>WM: (13741: I3 ^dir R)
=>WM: (13740: O1962 ^name predict-no)
=>WM: (13739: O1961 ^name predict-yes)
=>WM: (13738: R984 ^value 1)
=>WM: (13737: R1 ^reward R984)
=>WM: (13736: I3 ^see 1)
<=WM: (13727: S1 ^operator O1959 +)
<=WM: (13729: S1 ^operator O1959)
<=WM: (13728: S1 ^operator O1960 +)
<=WM: (13726: I3 ^dir L)
<=WM: (13722: R1 ^reward R983)
<=WM: (13694: I3 ^see 0)
<=WM: (13725: O1960 ^name predict-no)
<=WM: (13724: O1959 ^name predict-yes)
<=WM: (13723: R983 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1961 = 0.1121057987244822)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1961 = 0.8879014306224164)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1962 = 0.3993303490983195)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1962 = 0.02370016355578053)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.3993303490983195)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1960 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.1121057987244822)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1959 = 0.8879014306224164)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340245 -> 0.577178 -0.236933 0.340245(R,m,v=1,0.89441,0.0950311)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422821 0.236932 0.659753 -> 0.422821 0.236932 0.659753(R,m,v=1,1,0)
=>WM: (13744: S1 ^operator O1961)

   981:    O: O1961 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N981 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N980 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13745: I3 ^predict-yes N981)
<=WM: (13731: N980 ^status complete)
<=WM: (13730: I3 ^predict-yes N980)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (13749: I2 ^dir U)
=>WM: (13748: I2 ^reward 1)
=>WM: (13747: I2 ^see 1)
=>WM: (13746: N981 ^status complete)
<=WM: (13734: I2 ^dir R)
<=WM: (13733: I2 ^reward 1)
<=WM: (13732: I2 ^see 1)
=>WM: (13750: I2 ^level-1 R1-root)
<=WM: (13735: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R985 ^value 1 +)
 (R1 ^reward R985 +)
Firing propose*predict-yes
 -->
 (O1963 ^name predict-yes +)
 (S1 ^operator O1963 +)
Firing propose*predict-no
 -->
 (O1964 ^name predict-no +)
 (S1 ^operator O1964 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1962 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1961 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1962 ^name predict-no +)
 (S1 ^operator O1962 +)
Retracting propose*predict-yes
 -->
 (O1961 ^name predict-yes +)
 (S1 ^operator O1961 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R984 ^value 1 +)
 (R1 ^reward R984 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1962 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1962 = 0.3993303490983195)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1961 = 0.8879014306224164)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1961 = 0.1121057987244822)
=>WM: (13757: S1 ^operator O1964 +)
=>WM: (13756: S1 ^operator O1963 +)
=>WM: (13755: I3 ^dir U)
=>WM: (13754: O1964 ^name predict-no)
=>WM: (13753: O1963 ^name predict-yes)
=>WM: (13752: R985 ^value 1)
=>WM: (13751: R1 ^reward R985)
<=WM: (13742: S1 ^operator O1961 +)
<=WM: (13744: S1 ^operator O1961)
<=WM: (13743: S1 ^operator O1962 +)
<=WM: (13741: I3 ^dir R)
<=WM: (13737: R1 ^reward R984)
<=WM: (13740: O1962 ^name predict-no)
<=WM: (13739: O1961 ^name predict-yes)
<=WM: (13738: R984 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1963 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1964 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1962 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1961 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.619029 -0.506923 0.112106 -> 0.619028 -0.506923 0.112105(R,m,v=1,0.899371,0.0910756)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.38098 0.506922 0.887901 -> 0.380978 0.506922 0.8879(R,m,v=1,1,0)
=>WM: (13758: S1 ^operator O1964)

   982:    O: O1964 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N982 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N981 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13759: I3 ^predict-no N982)
<=WM: (13746: N981 ^status complete)
<=WM: (13745: I3 ^predict-yes N981)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase ---
=>WM: (13763: I2 ^dir R)
=>WM: (13762: I2 ^reward 1)
=>WM: (13761: I2 ^see 0)
=>WM: (13760: N982 ^status complete)
<=WM: (13749: I2 ^dir U)
<=WM: (13748: I2 ^reward 1)
<=WM: (13747: I2 ^see 1)
=>WM: (13764: I2 ^level-1 R1-root)
<=WM: (13750: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1964 = 0.6006747262322989)
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1963 = 0.1602187148382515)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R986 ^value 1 +)
 (R1 ^reward R986 +)
Firing propose*predict-yes
 -->
 (O1965 ^name predict-yes +)
 (S1 ^operator O1965 +)
Firing propose*predict-no
 -->
 (O1966 ^name predict-no +)
 (S1 ^operator O1966 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1964 = 0.3993303490983195)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1963 = 0.1121047143224474)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1964 ^name predict-no +)
 (S1 ^operator O1964 +)
Retracting propose*predict-yes
 -->
 (O1963 ^name predict-yes +)
 (S1 ^operator O1963 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R985 ^value 1 +)
 (R1 ^reward R985 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1964 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1963 = 0.)
=>WM: (13772: S1 ^operator O1966 +)
=>WM: (13771: S1 ^operator O1965 +)
=>WM: (13770: I3 ^dir R)
=>WM: (13769: O1966 ^name predict-no)
=>WM: (13768: O1965 ^name predict-yes)
=>WM: (13767: R986 ^value 1)
=>WM: (13766: R1 ^reward R986)
=>WM: (13765: I3 ^see 0)
<=WM: (13756: S1 ^operator O1963 +)
<=WM: (13757: S1 ^operator O1964 +)
<=WM: (13758: S1 ^operator O1964)
<=WM: (13755: I3 ^dir U)
<=WM: (13751: R1 ^reward R985)
<=WM: (13736: I3 ^see 1)
<=WM: (13754: O1964 ^name predict-no)
<=WM: (13753: O1963 ^name predict-yes)
<=WM: (13752: R985 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1965 = 0.1602187148382515)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1965 = 0.1121047143224474)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1966 = 0.6006747262322989)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1966 = 0.3993303490983195)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1964 = 0.3993303490983195)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1964 = 0.6006747262322989)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1963 = 0.1121047143224474)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1963 = 0.1602187148382515)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13773: S1 ^operator O1966)

   983:    O: O1966 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N983 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N982 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13774: I3 ^predict-no N983)
<=WM: (13760: N982 ^status complete)
<=WM: (13759: I3 ^predict-no N982)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (13778: I2 ^dir L)
=>WM: (13777: I2 ^reward 1)
=>WM: (13776: I2 ^see 0)
=>WM: (13775: N983 ^status complete)
<=WM: (13763: I2 ^dir R)
<=WM: (13762: I2 ^reward 1)
<=WM: (13761: I2 ^see 0)
=>WM: (13779: I2 ^level-1 R0-root)
<=WM: (13764: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1965 = 0.6597534506519405)
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1966 = 0.133561435542329)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R987 ^value 1 +)
 (R1 ^reward R987 +)
Firing propose*predict-yes
 -->
 (O1967 ^name predict-yes +)
 (S1 ^operator O1967 +)
Firing propose*predict-no
 -->
 (O1968 ^name predict-no +)
 (S1 ^operator O1968 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1966 = 0.3212899096504038)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1965 = 0.3402454610006663)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1966 ^name predict-no +)
 (S1 ^operator O1966 +)
Retracting propose*predict-yes
 -->
 (O1965 ^name predict-yes +)
 (S1 ^operator O1965 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R986 ^value 1 +)
 (R1 ^reward R986 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1966 = 0.3993303490983195)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1966 = 0.6006747262322989)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1965 = 0.1121047143224474)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1965 = 0.1602187148382515)
=>WM: (13786: S1 ^operator O1968 +)
=>WM: (13785: S1 ^operator O1967 +)
=>WM: (13784: I3 ^dir L)
=>WM: (13783: O1968 ^name predict-no)
=>WM: (13782: O1967 ^name predict-yes)
=>WM: (13781: R987 ^value 1)
=>WM: (13780: R1 ^reward R987)
<=WM: (13771: S1 ^operator O1965 +)
<=WM: (13772: S1 ^operator O1966 +)
<=WM: (13773: S1 ^operator O1966)
<=WM: (13770: I3 ^dir R)
<=WM: (13766: R1 ^reward R986)
<=WM: (13769: O1966 ^name predict-no)
<=WM: (13768: O1965 ^name predict-yes)
<=WM: (13767: R986 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1967 = 0.3402454610006663)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1967 = 0.6597534506519405)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1968 = 0.3212899096504038)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1968 = 0.133561435542329)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1966 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1966 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1965 = 0.3402454610006663)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1965 = 0.6597534506519405)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.558039 -0.158709 0.39933 -> 0.558038 -0.158709 0.39933(R,m,v=1,0.927711,0.0674699)
RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441966 0.158709 0.600675 -> 0.441965 0.158709 0.600674(R,m,v=1,1,0)
=>WM: (13787: S1 ^operator O1967)

   984:    O: O1967 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N984 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N983 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13788: I3 ^predict-yes N984)
<=WM: (13775: N983 ^status complete)
<=WM: (13774: I3 ^predict-no N983)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13792: I2 ^dir U)
=>WM: (13791: I2 ^reward 1)
=>WM: (13790: I2 ^see 1)
=>WM: (13789: N984 ^status complete)
<=WM: (13778: I2 ^dir L)
<=WM: (13777: I2 ^reward 1)
<=WM: (13776: I2 ^see 0)
=>WM: (13793: I2 ^level-1 L1-root)
<=WM: (13779: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R988 ^value 1 +)
 (R1 ^reward R988 +)
Firing propose*predict-yes
 -->
 (O1969 ^name predict-yes +)
 (S1 ^operator O1969 +)
Firing propose*predict-no
 -->
 (O1970 ^name predict-no +)
 (S1 ^operator O1970 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1968 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1967 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1968 ^name predict-no +)
 (S1 ^operator O1968 +)
Retracting propose*predict-yes
 -->
 (O1967 ^name predict-yes +)
 (S1 ^operator O1967 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R987 ^value 1 +)
 (R1 ^reward R987 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1968 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1968 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1967 = 0.6597534506519405)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1967 = 0.3402454610006663)
=>WM: (13801: S1 ^operator O1970 +)
=>WM: (13800: S1 ^operator O1969 +)
=>WM: (13799: I3 ^dir U)
=>WM: (13798: O1970 ^name predict-no)
=>WM: (13797: O1969 ^name predict-yes)
=>WM: (13796: R988 ^value 1)
=>WM: (13795: R1 ^reward R988)
=>WM: (13794: I3 ^see 1)
<=WM: (13785: S1 ^operator O1967 +)
<=WM: (13787: S1 ^operator O1967)
<=WM: (13786: S1 ^operator O1968 +)
<=WM: (13784: I3 ^dir L)
<=WM: (13780: R1 ^reward R987)
<=WM: (13765: I3 ^see 0)
<=WM: (13783: O1968 ^name predict-no)
<=WM: (13782: O1967 ^name predict-yes)
<=WM: (13781: R987 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1969 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1970 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1968 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1967 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340245 -> 0.577178 -0.236933 0.340246(R,m,v=1,0.895062,0.0945096)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422821 0.236932 0.659753 -> 0.422822 0.236932 0.659754(R,m,v=1,1,0)
=>WM: (13802: S1 ^operator O1970)

   985:    O: O1970 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N985 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N984 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13803: I3 ^predict-no N985)
<=WM: (13789: N984 ^status complete)
<=WM: (13788: I3 ^predict-yes N984)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (13807: I2 ^dir R)
=>WM: (13806: I2 ^reward 1)
=>WM: (13805: I2 ^see 0)
=>WM: (13804: N985 ^status complete)
<=WM: (13792: I2 ^dir U)
<=WM: (13791: I2 ^reward 1)
<=WM: (13790: I2 ^see 1)
=>WM: (13808: I2 ^level-1 L1-root)
<=WM: (13793: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1969 = 0.8879003462203817)
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1970 = 0.02370016355578053)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R989 ^value 1 +)
 (R1 ^reward R989 +)
Firing propose*predict-yes
 -->
 (O1971 ^name predict-yes +)
 (S1 ^operator O1971 +)
Firing propose*predict-no
 -->
 (O1972 ^name predict-no +)
 (S1 ^operator O1972 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1970 = 0.3993295877987267)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1969 = 0.1121047143224474)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1970 ^name predict-no +)
 (S1 ^operator O1970 +)
Retracting propose*predict-yes
 -->
 (O1969 ^name predict-yes +)
 (S1 ^operator O1969 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R988 ^value 1 +)
 (R1 ^reward R988 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1970 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1969 = 0.)
=>WM: (13816: S1 ^operator O1972 +)
=>WM: (13815: S1 ^operator O1971 +)
=>WM: (13814: I3 ^dir R)
=>WM: (13813: O1972 ^name predict-no)
=>WM: (13812: O1971 ^name predict-yes)
=>WM: (13811: R989 ^value 1)
=>WM: (13810: R1 ^reward R989)
=>WM: (13809: I3 ^see 0)
<=WM: (13800: S1 ^operator O1969 +)
<=WM: (13801: S1 ^operator O1970 +)
<=WM: (13802: S1 ^operator O1970)
<=WM: (13799: I3 ^dir U)
<=WM: (13795: R1 ^reward R988)
<=WM: (13794: I3 ^see 1)
<=WM: (13798: O1970 ^name predict-no)
<=WM: (13797: O1969 ^name predict-yes)
<=WM: (13796: R988 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1971 = 0.8879003462203817)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1971 = 0.1121047143224474)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1972 = 0.02370016355578053)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1972 = 0.3993295877987267)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1970 = 0.3993295877987267)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1970 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1969 = 0.1121047143224474)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1969 = 0.8879003462203817)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13817: S1 ^operator O1971)

   986:    O: O1971 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N986 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N985 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13818: I3 ^predict-yes N986)
<=WM: (13804: N985 ^status complete)
<=WM: (13803: I3 ^predict-no N985)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13822: I2 ^dir R)
=>WM: (13821: I2 ^reward 1)
=>WM: (13820: I2 ^see 1)
=>WM: (13819: N986 ^status complete)
<=WM: (13807: I2 ^dir R)
<=WM: (13806: I2 ^reward 1)
<=WM: (13805: I2 ^see 0)
=>WM: (13823: I2 ^level-1 R1-root)
<=WM: (13808: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1972 = 0.600673964932706)
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1971 = 0.1602187148382515)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R990 ^value 1 +)
 (R1 ^reward R990 +)
Firing propose*predict-yes
 -->
 (O1973 ^name predict-yes +)
 (S1 ^operator O1973 +)
Firing propose*predict-no
 -->
 (O1974 ^name predict-no +)
 (S1 ^operator O1974 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1972 = 0.3993295877987267)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1971 = 0.1121047143224474)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1972 ^name predict-no +)
 (S1 ^operator O1972 +)
Retracting propose*predict-yes
 -->
 (O1971 ^name predict-yes +)
 (S1 ^operator O1971 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R989 ^value 1 +)
 (R1 ^reward R989 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1972 = 0.3993295877987267)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1972 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1971 = 0.1121047143224474)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1971 = 0.8879003462203817)
=>WM: (13830: S1 ^operator O1974 +)
=>WM: (13829: S1 ^operator O1973 +)
=>WM: (13828: O1974 ^name predict-no)
=>WM: (13827: O1973 ^name predict-yes)
=>WM: (13826: R990 ^value 1)
=>WM: (13825: R1 ^reward R990)
=>WM: (13824: I3 ^see 1)
<=WM: (13815: S1 ^operator O1971 +)
<=WM: (13817: S1 ^operator O1971)
<=WM: (13816: S1 ^operator O1972 +)
<=WM: (13810: R1 ^reward R989)
<=WM: (13809: I3 ^see 0)
<=WM: (13813: O1972 ^name predict-no)
<=WM: (13812: O1971 ^name predict-yes)
<=WM: (13811: R989 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1973 = 0.1121047143224474)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1973 = 0.1602187148382515)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1974 = 0.3993295877987267)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1974 = 0.600673964932706)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1972 = 0.3993295877987267)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1972 = 0.600673964932706)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1971 = 0.1121047143224474)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1971 = 0.1602187148382515)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.619028 -0.506923 0.112105 -> 0.619027 -0.506923 0.112104(R,m,v=1,0.9,0.090566)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380978 0.506922 0.8879 -> 0.380978 0.506922 0.8879(R,m,v=1,1,0)
=>WM: (13831: S1 ^operator O1974)

   987:    O: O1974 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N987 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N986 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13832: I3 ^predict-no N987)
<=WM: (13819: N986 ^status complete)
<=WM: (13818: I3 ^predict-yes N986)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13836: I2 ^dir L)
=>WM: (13835: I2 ^reward 1)
=>WM: (13834: I2 ^see 0)
=>WM: (13833: N987 ^status complete)
<=WM: (13822: I2 ^dir R)
<=WM: (13821: I2 ^reward 1)
<=WM: (13820: I2 ^see 1)
=>WM: (13837: I2 ^level-1 R0-root)
<=WM: (13823: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1973 = 0.6597536139040494)
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1974 = 0.133561435542329)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R991 ^value 1 +)
 (R1 ^reward R991 +)
Firing propose*predict-yes
 -->
 (O1975 ^name predict-yes +)
 (S1 ^operator O1975 +)
Firing propose*predict-no
 -->
 (O1976 ^name predict-no +)
 (S1 ^operator O1976 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1974 = 0.3212899096504038)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1973 = 0.3402456242527754)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1974 ^name predict-no +)
 (S1 ^operator O1974 +)
Retracting propose*predict-yes
 -->
 (O1973 ^name predict-yes +)
 (S1 ^operator O1973 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R990 ^value 1 +)
 (R1 ^reward R990 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1974 = 0.600673964932706)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1974 = 0.3993295877987267)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1973 = 0.1602187148382515)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1973 = 0.112103955241023)
=>WM: (13845: S1 ^operator O1976 +)
=>WM: (13844: S1 ^operator O1975 +)
=>WM: (13843: I3 ^dir L)
=>WM: (13842: O1976 ^name predict-no)
=>WM: (13841: O1975 ^name predict-yes)
=>WM: (13840: R991 ^value 1)
=>WM: (13839: R1 ^reward R991)
=>WM: (13838: I3 ^see 0)
<=WM: (13829: S1 ^operator O1973 +)
<=WM: (13830: S1 ^operator O1974 +)
<=WM: (13831: S1 ^operator O1974)
<=WM: (13814: I3 ^dir R)
<=WM: (13825: R1 ^reward R990)
<=WM: (13824: I3 ^see 1)
<=WM: (13828: O1974 ^name predict-no)
<=WM: (13827: O1973 ^name predict-yes)
<=WM: (13826: R990 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1975 = 0.3402456242527754)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1975 = 0.6597536139040494)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1976 = 0.3212899096504038)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1976 = 0.133561435542329)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1974 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1974 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1973 = 0.3402456242527754)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1973 = 0.6597536139040494)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.558038 -0.158709 0.39933 -> 0.558038 -0.158709 0.399329(R,m,v=1,0.928144,0.0670947)
RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441965 0.158709 0.600674 -> 0.441965 0.158709 0.600673(R,m,v=1,1,0)
=>WM: (13846: S1 ^operator O1975)

   988:    O: O1975 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N988 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N987 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13847: I3 ^predict-yes N988)
<=WM: (13833: N987 ^status complete)
<=WM: (13832: I3 ^predict-no N987)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (13851: I2 ^dir U)
=>WM: (13850: I2 ^reward 1)
=>WM: (13849: I2 ^see 1)
=>WM: (13848: N988 ^status complete)
<=WM: (13836: I2 ^dir L)
<=WM: (13835: I2 ^reward 1)
<=WM: (13834: I2 ^see 0)
=>WM: (13852: I2 ^level-1 L1-root)
<=WM: (13837: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R992 ^value 1 +)
 (R1 ^reward R992 +)
Firing propose*predict-yes
 -->
 (O1977 ^name predict-yes +)
 (S1 ^operator O1977 +)
Firing propose*predict-no
 -->
 (O1978 ^name predict-no +)
 (S1 ^operator O1978 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1976 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1975 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1976 ^name predict-no +)
 (S1 ^operator O1976 +)
Retracting propose*predict-yes
 -->
 (O1975 ^name predict-yes +)
 (S1 ^operator O1975 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R991 ^value 1 +)
 (R1 ^reward R991 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1976 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1976 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1975 = 0.6597536139040494)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1975 = 0.3402456242527754)
=>WM: (13860: S1 ^operator O1978 +)
=>WM: (13859: S1 ^operator O1977 +)
=>WM: (13858: I3 ^dir U)
=>WM: (13857: O1978 ^name predict-no)
=>WM: (13856: O1977 ^name predict-yes)
=>WM: (13855: R992 ^value 1)
=>WM: (13854: R1 ^reward R992)
=>WM: (13853: I3 ^see 1)
<=WM: (13844: S1 ^operator O1975 +)
<=WM: (13846: S1 ^operator O1975)
<=WM: (13845: S1 ^operator O1976 +)
<=WM: (13843: I3 ^dir L)
<=WM: (13839: R1 ^reward R991)
<=WM: (13838: I3 ^see 0)
<=WM: (13842: O1976 ^name predict-no)
<=WM: (13841: O1975 ^name predict-yes)
<=WM: (13840: R991 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1977 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1978 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1976 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1975 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340246 -> 0.577178 -0.236932 0.340246(R,m,v=1,0.895706,0.0939938)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422822 0.236932 0.659754 -> 0.422822 0.236932 0.659754(R,m,v=1,1,0)
=>WM: (13861: S1 ^operator O1978)

   989:    O: O1978 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N989 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N988 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13862: I3 ^predict-no N989)
<=WM: (13848: N988 ^status complete)
<=WM: (13847: I3 ^predict-yes N988)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13866: I2 ^dir R)
=>WM: (13865: I2 ^reward 1)
=>WM: (13864: I2 ^see 0)
=>WM: (13863: N989 ^status complete)
<=WM: (13851: I2 ^dir U)
<=WM: (13850: I2 ^reward 1)
<=WM: (13849: I2 ^see 1)
=>WM: (13867: I2 ^level-1 L1-root)
<=WM: (13852: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1977 = 0.8878995871389573)
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1978 = 0.02370016355578053)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R993 ^value 1 +)
 (R1 ^reward R993 +)
Firing propose*predict-yes
 -->
 (O1979 ^name predict-yes +)
 (S1 ^operator O1979 +)
Firing propose*predict-no
 -->
 (O1980 ^name predict-no +)
 (S1 ^operator O1980 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1978 = 0.3993290548890118)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1977 = 0.112103955241023)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1978 ^name predict-no +)
 (S1 ^operator O1978 +)
Retracting propose*predict-yes
 -->
 (O1977 ^name predict-yes +)
 (S1 ^operator O1977 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R992 ^value 1 +)
 (R1 ^reward R992 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1978 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1977 = 0.)
=>WM: (13875: S1 ^operator O1980 +)
=>WM: (13874: S1 ^operator O1979 +)
=>WM: (13873: I3 ^dir R)
=>WM: (13872: O1980 ^name predict-no)
=>WM: (13871: O1979 ^name predict-yes)
=>WM: (13870: R993 ^value 1)
=>WM: (13869: R1 ^reward R993)
=>WM: (13868: I3 ^see 0)
<=WM: (13859: S1 ^operator O1977 +)
<=WM: (13860: S1 ^operator O1978 +)
<=WM: (13861: S1 ^operator O1978)
<=WM: (13858: I3 ^dir U)
<=WM: (13854: R1 ^reward R992)
<=WM: (13853: I3 ^see 1)
<=WM: (13857: O1978 ^name predict-no)
<=WM: (13856: O1977 ^name predict-yes)
<=WM: (13855: R992 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1979 = 0.8878995871389573)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1979 = 0.112103955241023)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1980 = 0.02370016355578053)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1980 = 0.3993290548890118)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1978 = 0.3993290548890118)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1978 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1977 = 0.112103955241023)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1977 = 0.8878995871389573)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13876: S1 ^operator O1979)

   990:    O: O1979 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N990 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N989 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13877: I3 ^predict-yes N990)
<=WM: (13863: N989 ^status complete)
<=WM: (13862: I3 ^predict-no N989)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13881: I2 ^dir U)
=>WM: (13880: I2 ^reward 1)
=>WM: (13879: I2 ^see 1)
=>WM: (13878: N990 ^status complete)
<=WM: (13866: I2 ^dir R)
<=WM: (13865: I2 ^reward 1)
<=WM: (13864: I2 ^see 0)
=>WM: (13882: I2 ^level-1 R1-root)
<=WM: (13867: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R994 ^value 1 +)
 (R1 ^reward R994 +)
Firing propose*predict-yes
 -->
 (O1981 ^name predict-yes +)
 (S1 ^operator O1981 +)
Firing propose*predict-no
 -->
 (O1982 ^name predict-no +)
 (S1 ^operator O1982 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1980 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1979 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1980 ^name predict-no +)
 (S1 ^operator O1980 +)
Retracting propose*predict-yes
 -->
 (O1979 ^name predict-yes +)
 (S1 ^operator O1979 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R993 ^value 1 +)
 (R1 ^reward R993 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1980 = 0.3993290548890118)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1980 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1979 = 0.112103955241023)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1979 = 0.8878995871389573)
=>WM: (13890: S1 ^operator O1982 +)
=>WM: (13889: S1 ^operator O1981 +)
=>WM: (13888: I3 ^dir U)
=>WM: (13887: O1982 ^name predict-no)
=>WM: (13886: O1981 ^name predict-yes)
=>WM: (13885: R994 ^value 1)
=>WM: (13884: R1 ^reward R994)
=>WM: (13883: I3 ^see 1)
<=WM: (13874: S1 ^operator O1979 +)
<=WM: (13876: S1 ^operator O1979)
<=WM: (13875: S1 ^operator O1980 +)
<=WM: (13873: I3 ^dir R)
<=WM: (13869: R1 ^reward R993)
<=WM: (13868: I3 ^see 0)
<=WM: (13872: O1980 ^name predict-no)
<=WM: (13871: O1979 ^name predict-yes)
<=WM: (13870: R993 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1981 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1982 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1980 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1979 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.619027 -0.506923 0.112104 -> 0.619026 -0.506923 0.112103(R,m,v=1,0.900621,0.0900621)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380978 0.506922 0.8879 -> 0.380977 0.506922 0.887899(R,m,v=1,1,0)
=>WM: (13891: S1 ^operator O1982)

   991:    O: O1982 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N991 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N990 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13892: I3 ^predict-no N991)
<=WM: (13878: N990 ^status complete)
<=WM: (13877: I3 ^predict-yes N990)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (13896: I2 ^dir U)
=>WM: (13895: I2 ^reward 1)
=>WM: (13894: I2 ^see 0)
=>WM: (13893: N991 ^status complete)
<=WM: (13881: I2 ^dir U)
<=WM: (13880: I2 ^reward 1)
<=WM: (13879: I2 ^see 1)
=>WM: (13897: I2 ^level-1 R1-root)
<=WM: (13882: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R995 ^value 1 +)
 (R1 ^reward R995 +)
Firing propose*predict-yes
 -->
 (O1983 ^name predict-yes +)
 (S1 ^operator O1983 +)
Firing propose*predict-no
 -->
 (O1984 ^name predict-no +)
 (S1 ^operator O1984 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1982 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1981 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1982 ^name predict-no +)
 (S1 ^operator O1982 +)
Retracting propose*predict-yes
 -->
 (O1981 ^name predict-yes +)
 (S1 ^operator O1981 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R994 ^value 1 +)
 (R1 ^reward R994 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1982 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1981 = 0.)
=>WM: (13904: S1 ^operator O1984 +)
=>WM: (13903: S1 ^operator O1983 +)
=>WM: (13902: O1984 ^name predict-no)
=>WM: (13901: O1983 ^name predict-yes)
=>WM: (13900: R995 ^value 1)
=>WM: (13899: R1 ^reward R995)
=>WM: (13898: I3 ^see 0)
<=WM: (13889: S1 ^operator O1981 +)
<=WM: (13890: S1 ^operator O1982 +)
<=WM: (13891: S1 ^operator O1982)
<=WM: (13884: R1 ^reward R994)
<=WM: (13883: I3 ^see 1)
<=WM: (13887: O1982 ^name predict-no)
<=WM: (13886: O1981 ^name predict-yes)
<=WM: (13885: R994 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1983 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1984 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1982 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1981 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13905: S1 ^operator O1984)

   992:    O: O1984 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N992 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N991 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13906: I3 ^predict-no N992)
<=WM: (13893: N991 ^status complete)
<=WM: (13892: I3 ^predict-no N991)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (13910: I2 ^dir L)
=>WM: (13909: I2 ^reward 1)
=>WM: (13908: I2 ^see 0)
=>WM: (13907: N992 ^status complete)
<=WM: (13896: I2 ^dir U)
<=WM: (13895: I2 ^reward 1)
<=WM: (13894: I2 ^see 0)
=>WM: (13911: I2 ^level-1 R1-root)
<=WM: (13897: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1984 = 0.03900899329983293)
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1983 = 0.6597555366718975)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R996 ^value 1 +)
 (R1 ^reward R996 +)
Firing propose*predict-yes
 -->
 (O1985 ^name predict-yes +)
 (S1 ^operator O1985 +)
Firing propose*predict-no
 -->
 (O1986 ^name predict-no +)
 (S1 ^operator O1986 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1984 = 0.3212899096504038)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1983 = 0.3402457385292517)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1984 ^name predict-no +)
 (S1 ^operator O1984 +)
Retracting propose*predict-yes
 -->
 (O1983 ^name predict-yes +)
 (S1 ^operator O1983 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R995 ^value 1 +)
 (R1 ^reward R995 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1984 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1983 = 0.)
=>WM: (13918: S1 ^operator O1986 +)
=>WM: (13917: S1 ^operator O1985 +)
=>WM: (13916: I3 ^dir L)
=>WM: (13915: O1986 ^name predict-no)
=>WM: (13914: O1985 ^name predict-yes)
=>WM: (13913: R996 ^value 1)
=>WM: (13912: R1 ^reward R996)
<=WM: (13903: S1 ^operator O1983 +)
<=WM: (13904: S1 ^operator O1984 +)
<=WM: (13905: S1 ^operator O1984)
<=WM: (13888: I3 ^dir U)
<=WM: (13899: R1 ^reward R995)
<=WM: (13902: O1984 ^name predict-no)
<=WM: (13901: O1983 ^name predict-yes)
<=WM: (13900: R995 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1985 = 0.6597555366718975)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1985 = 0.3402457385292517)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1986 = 0.03900899329983293)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1986 = 0.3212899096504038)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1984 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1984 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1983 = 0.3402457385292517)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1983 = 0.6597555366718975)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13919: S1 ^operator O1985)

   993:    O: O1985 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N993 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N992 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13920: I3 ^predict-yes N993)
<=WM: (13907: N992 ^status complete)
<=WM: (13906: I3 ^predict-no N992)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase ---
=>WM: (13924: I2 ^dir R)
=>WM: (13923: I2 ^reward 1)
=>WM: (13922: I2 ^see 1)
=>WM: (13921: N993 ^status complete)
<=WM: (13910: I2 ^dir L)
<=WM: (13909: I2 ^reward 1)
<=WM: (13908: I2 ^see 0)
=>WM: (13925: I2 ^level-1 L1-root)
<=WM: (13911: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1985 = 0.8878990557819602)
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1986 = 0.02370016355578053)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R997 ^value 1 +)
 (R1 ^reward R997 +)
Firing propose*predict-yes
 -->
 (O1987 ^name predict-yes +)
 (S1 ^operator O1987 +)
Firing propose*predict-no
 -->
 (O1988 ^name predict-no +)
 (S1 ^operator O1988 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1986 = 0.3993290548890118)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1985 = 0.1121034238840259)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1986 ^name predict-no +)
 (S1 ^operator O1986 +)
Retracting propose*predict-yes
 -->
 (O1985 ^name predict-yes +)
 (S1 ^operator O1985 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R996 ^value 1 +)
 (R1 ^reward R996 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1986 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O1986 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1985 = 0.3402457385292517)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O1985 = 0.6597555366718975)
=>WM: (13933: S1 ^operator O1988 +)
=>WM: (13932: S1 ^operator O1987 +)
=>WM: (13931: I3 ^dir R)
=>WM: (13930: O1988 ^name predict-no)
=>WM: (13929: O1987 ^name predict-yes)
=>WM: (13928: R997 ^value 1)
=>WM: (13927: R1 ^reward R997)
=>WM: (13926: I3 ^see 1)
<=WM: (13917: S1 ^operator O1985 +)
<=WM: (13919: S1 ^operator O1985)
<=WM: (13918: S1 ^operator O1986 +)
<=WM: (13916: I3 ^dir L)
<=WM: (13912: R1 ^reward R996)
<=WM: (13898: I3 ^see 0)
<=WM: (13915: O1986 ^name predict-no)
<=WM: (13914: O1985 ^name predict-yes)
<=WM: (13913: R996 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1987 = 0.1121034238840259)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1987 = 0.8878990557819602)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1988 = 0.3993290548890118)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1988 = 0.02370016355578053)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1986 = 0.3993290548890118)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1986 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1985 = 0.1121034238840259)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1985 = 0.8878990557819602)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236932 0.340246 -> 0.577178 -0.236933 0.340246(R,m,v=1,0.896341,0.0934835)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422822 0.236933 0.659756 -> 0.422822 0.236933 0.659755(R,m,v=1,1,0)
=>WM: (13934: S1 ^operator O1987)

   994:    O: O1987 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N994 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N993 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13935: I3 ^predict-yes N994)
<=WM: (13921: N993 ^status complete)
<=WM: (13920: I3 ^predict-yes N993)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase ---
=>WM: (13939: I2 ^dir R)
=>WM: (13938: I2 ^reward 1)
=>WM: (13937: I2 ^see 1)
=>WM: (13936: N994 ^status complete)
<=WM: (13924: I2 ^dir R)
<=WM: (13923: I2 ^reward 1)
<=WM: (13922: I2 ^see 1)
=>WM: (13940: I2 ^level-1 R1-root)
<=WM: (13925: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1988 = 0.6006734320229912)
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1987 = 0.1602187148382515)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R998 ^value 1 +)
 (R1 ^reward R998 +)
Firing propose*predict-yes
 -->
 (O1989 ^name predict-yes +)
 (S1 ^operator O1989 +)
Firing propose*predict-no
 -->
 (O1990 ^name predict-no +)
 (S1 ^operator O1990 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1988 = 0.3993290548890118)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1987 = 0.1121034238840259)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1988 ^name predict-no +)
 (S1 ^operator O1988 +)
Retracting propose*predict-yes
 -->
 (O1987 ^name predict-yes +)
 (S1 ^operator O1987 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R997 ^value 1 +)
 (R1 ^reward R997 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
 -->
 (S1 ^operator O1988 = 0.02370016355578053)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1988 = 0.3993290548890118)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
 -->
 (S1 ^operator O1987 = 0.8878990557819602)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1987 = 0.1121034238840259)
=>WM: (13946: S1 ^operator O1990 +)
=>WM: (13945: S1 ^operator O1989 +)
=>WM: (13944: O1990 ^name predict-no)
=>WM: (13943: O1989 ^name predict-yes)
=>WM: (13942: R998 ^value 1)
=>WM: (13941: R1 ^reward R998)
<=WM: (13932: S1 ^operator O1987 +)
<=WM: (13934: S1 ^operator O1987)
<=WM: (13933: S1 ^operator O1988 +)
<=WM: (13927: R1 ^reward R997)
<=WM: (13930: O1988 ^name predict-no)
<=WM: (13929: O1987 ^name predict-yes)
<=WM: (13928: R997 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1989 = 0.1121034238840259)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1989 = 0.1602187148382515)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1990 = 0.3993290548890118)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1990 = 0.6006734320229912)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1988 = 0.3993290548890118)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1988 = 0.6006734320229912)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1987 = 0.1121034238840259)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1987 = 0.1602187148382515)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.619026 -0.506923 0.112103 -> 0.619026 -0.506923 0.112103(R,m,v=1,0.901235,0.0895637)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380977 0.506922 0.887899 -> 0.380976 0.506922 0.887899(R,m,v=1,1,0)
=>WM: (13947: S1 ^operator O1990)

   995:    O: O1990 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N995 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N994 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13948: I3 ^predict-no N995)
<=WM: (13936: N994 ^status complete)
<=WM: (13935: I3 ^predict-yes N994)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (13952: I2 ^dir U)
=>WM: (13951: I2 ^reward 1)
=>WM: (13950: I2 ^see 0)
=>WM: (13949: N995 ^status complete)
<=WM: (13939: I2 ^dir R)
<=WM: (13938: I2 ^reward 1)
<=WM: (13937: I2 ^see 1)
=>WM: (13953: I2 ^level-1 R0-root)
<=WM: (13940: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R999 ^value 1 +)
 (R1 ^reward R999 +)
Firing propose*predict-yes
 -->
 (O1991 ^name predict-yes +)
 (S1 ^operator O1991 +)
Firing propose*predict-no
 -->
 (O1992 ^name predict-no +)
 (S1 ^operator O1992 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1990 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1989 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1990 ^name predict-no +)
 (S1 ^operator O1990 +)
Retracting propose*predict-yes
 -->
 (O1989 ^name predict-yes +)
 (S1 ^operator O1989 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R998 ^value 1 +)
 (R1 ^reward R998 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O1990 = 0.6006734320229912)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1990 = 0.3993290548890118)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O1989 = 0.1602187148382515)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1989 = 0.1121030519341281)
=>WM: (13961: S1 ^operator O1992 +)
=>WM: (13960: S1 ^operator O1991 +)
=>WM: (13959: I3 ^dir U)
=>WM: (13958: O1992 ^name predict-no)
=>WM: (13957: O1991 ^name predict-yes)
=>WM: (13956: R999 ^value 1)
=>WM: (13955: R1 ^reward R999)
=>WM: (13954: I3 ^see 0)
<=WM: (13945: S1 ^operator O1989 +)
<=WM: (13946: S1 ^operator O1990 +)
<=WM: (13947: S1 ^operator O1990)
<=WM: (13931: I3 ^dir R)
<=WM: (13941: R1 ^reward R998)
<=WM: (13926: I3 ^see 1)
<=WM: (13944: O1990 ^name predict-no)
<=WM: (13943: O1989 ^name predict-yes)
<=WM: (13942: R998 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1990 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1989 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.558038 -0.158709 0.399329 -> 0.558037 -0.158709 0.399329(R,m,v=1,0.928571,0.0667237)
RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441965 0.158709 0.600673 -> 0.441964 0.158709 0.600673(R,m,v=1,1,0)
=>WM: (13962: S1 ^operator O1992)

   996:    O: O1992 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N996 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N995 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13963: I3 ^predict-no N996)
<=WM: (13949: N995 ^status complete)
<=WM: (13948: I3 ^predict-no N995)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (13967: I2 ^dir U)
=>WM: (13966: I2 ^reward 1)
=>WM: (13965: I2 ^see 0)
=>WM: (13964: N996 ^status complete)
<=WM: (13952: I2 ^dir U)
<=WM: (13951: I2 ^reward 1)
<=WM: (13950: I2 ^see 0)
=>WM: (13968: I2 ^level-1 R0-root)
<=WM: (13953: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1000 ^value 1 +)
 (R1 ^reward R1000 +)
Firing propose*predict-yes
 -->
 (O1993 ^name predict-yes +)
 (S1 ^operator O1993 +)
Firing propose*predict-no
 -->
 (O1994 ^name predict-no +)
 (S1 ^operator O1994 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1992 ^name predict-no +)
 (S1 ^operator O1992 +)
Retracting propose*predict-yes
 -->
 (O1991 ^name predict-yes +)
 (S1 ^operator O1991 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R999 ^value 1 +)
 (R1 ^reward R999 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.)
=>WM: (13974: S1 ^operator O1994 +)
=>WM: (13973: S1 ^operator O1993 +)
=>WM: (13972: O1994 ^name predict-no)
=>WM: (13971: O1993 ^name predict-yes)
=>WM: (13970: R1000 ^value 1)
=>WM: (13969: R1 ^reward R1000)
<=WM: (13960: S1 ^operator O1991 +)
<=WM: (13961: S1 ^operator O1992 +)
<=WM: (13962: S1 ^operator O1992)
<=WM: (13955: R1 ^reward R999)
<=WM: (13958: O1992 ^name predict-no)
<=WM: (13957: O1991 ^name predict-yes)
<=WM: (13956: R999 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1993 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1994 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13975: S1 ^operator O1994)

   997:    O: O1994 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N997 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N996 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13976: I3 ^predict-no N997)
<=WM: (13964: N996 ^status complete)
<=WM: (13963: I3 ^predict-no N996)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (13980: I2 ^dir L)
=>WM: (13979: I2 ^reward 1)
=>WM: (13978: I2 ^see 0)
=>WM: (13977: N997 ^status complete)
<=WM: (13967: I2 ^dir U)
<=WM: (13966: I2 ^reward 1)
<=WM: (13965: I2 ^see 0)
=>WM: (13981: I2 ^level-1 R0-root)
<=WM: (13968: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1993 = 0.6597537281805257)
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1994 = 0.133561435542329)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1001 ^value 1 +)
 (R1 ^reward R1001 +)
Firing propose*predict-yes
 -->
 (O1995 ^name predict-yes +)
 (S1 ^operator O1995 +)
Firing propose*predict-no
 -->
 (O1996 ^name predict-no +)
 (S1 ^operator O1996 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1994 = 0.3212899096504038)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1993 = 0.3402455472490794)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1994 ^name predict-no +)
 (S1 ^operator O1994 +)
Retracting propose*predict-yes
 -->
 (O1993 ^name predict-yes +)
 (S1 ^operator O1993 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1000 ^value 1 +)
 (R1 ^reward R1000 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1994 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1993 = 0.)
=>WM: (13988: S1 ^operator O1996 +)
=>WM: (13987: S1 ^operator O1995 +)
=>WM: (13986: I3 ^dir L)
=>WM: (13985: O1996 ^name predict-no)
=>WM: (13984: O1995 ^name predict-yes)
=>WM: (13983: R1001 ^value 1)
=>WM: (13982: R1 ^reward R1001)
<=WM: (13973: S1 ^operator O1993 +)
<=WM: (13974: S1 ^operator O1994 +)
<=WM: (13975: S1 ^operator O1994)
<=WM: (13959: I3 ^dir U)
<=WM: (13969: R1 ^reward R1000)
<=WM: (13972: O1994 ^name predict-no)
<=WM: (13971: O1993 ^name predict-yes)
<=WM: (13970: R1000 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1995 = 0.6597537281805257)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.3402455472490794)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1996 = 0.133561435542329)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 0.3212899096504038)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1994 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1994 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1993 = 0.3402455472490794)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1993 = 0.6597537281805257)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13989: S1 ^operator O1995)

   998:    O: O1995 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N998 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N997 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13990: I3 ^predict-yes N998)
<=WM: (13977: N997 ^status complete)
<=WM: (13976: I3 ^predict-no N997)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (13994: I2 ^dir L)
=>WM: (13993: I2 ^reward 1)
=>WM: (13992: I2 ^see 1)
=>WM: (13991: N998 ^status complete)
<=WM: (13980: I2 ^dir L)
<=WM: (13979: I2 ^reward 1)
<=WM: (13978: I2 ^see 0)
=>WM: (13995: I2 ^level-1 L1-root)
<=WM: (13981: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1995 = 0.02884852834965246)
Firing rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1996 = 0.678736816647851)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1002 ^value 1 +)
 (R1 ^reward R1002 +)
Firing propose*predict-yes
 -->
 (O1997 ^name predict-yes +)
 (S1 ^operator O1997 +)
Firing propose*predict-no
 -->
 (O1998 ^name predict-no +)
 (S1 ^operator O1998 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 0.3212899096504038)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.3402455472490794)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1996 ^name predict-no +)
 (S1 ^operator O1996 +)
Retracting propose*predict-yes
 -->
 (O1995 ^name predict-yes +)
 (S1 ^operator O1995 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1001 ^value 1 +)
 (R1 ^reward R1001 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
 -->
 (S1 ^operator O1996 = 0.133561435542329)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.3402455472490794)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
 -->
 (S1 ^operator O1995 = 0.6597537281805257)
=>WM: (14002: S1 ^operator O1998 +)
=>WM: (14001: S1 ^operator O1997 +)
=>WM: (14000: O1998 ^name predict-no)
=>WM: (13999: O1997 ^name predict-yes)
=>WM: (13998: R1002 ^value 1)
=>WM: (13997: R1 ^reward R1002)
=>WM: (13996: I3 ^see 1)
<=WM: (13987: S1 ^operator O1995 +)
<=WM: (13989: S1 ^operator O1995)
<=WM: (13988: S1 ^operator O1996 +)
<=WM: (13982: R1 ^reward R1001)
<=WM: (13954: I3 ^see 0)
<=WM: (13985: O1996 ^name predict-no)
<=WM: (13984: O1995 ^name predict-yes)
<=WM: (13983: R1001 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1997 = 0.3402455472490794)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1997 = 0.02884852834965246)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1998 = 0.3212899096504038)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1998 = 0.678736816647851)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1996 = 0.678736816647851)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.3402455472490794)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1995 = 0.02884852834965246)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340246 -> 0.577178 -0.236932 0.340246(R,m,v=1,0.89697,0.0929786)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422822 0.236932 0.659754 -> 0.422822 0.236932 0.659754(R,m,v=1,1,0)
=>WM: (14003: S1 ^operator O1998)

   999:    O: O1998 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N999 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N998 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14004: I3 ^predict-no N999)
<=WM: (13991: N998 ^status complete)
<=WM: (13990: I3 ^predict-yes N998)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
--- Input Phase ---
=>WM: (14008: I2 ^dir U)
=>WM: (14007: I2 ^reward 1)
=>WM: (14006: I2 ^see 0)
=>WM: (14005: N999 ^status complete)
<=WM: (13994: I2 ^dir L)
<=WM: (13993: I2 ^reward 1)
<=WM: (13992: I2 ^see 1)
=>WM: (14009: I2 ^level-1 L0-root)
<=WM: (13995: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1003 ^value 1 +)
 (R1 ^reward R1003 +)
Firing propose*predict-yes
 -->
 (O1999 ^name predict-yes +)
 (S1 ^operator O1999 +)
Firing propose*predict-no
 -->
 (O2000 ^name predict-no +)
 (S1 ^operator O2000 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1998 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1997 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1998 ^name predict-no +)
 (S1 ^operator O1998 +)
Retracting propose*predict-yes
 -->
 (O1997 ^name predict-yes +)
 (S1 ^operator O1997 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1002 ^value 1 +)
 (R1 ^reward R1002 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O1998 = 0.678736816647851)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1998 = 0.3212899096504038)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O1997 = 0.02884852834965246)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1997 = 0.3402456559346386)
=>WM: (14017: S1 ^operator O2000 +)
=>WM: (14016: S1 ^operator O1999 +)
=>WM: (14015: I3 ^dir U)
=>WM: (14014: O2000 ^name predict-no)
=>WM: (14013: O1999 ^name predict-yes)
=>WM: (14012: R1003 ^value 1)
=>WM: (14011: R1 ^reward R1003)
=>WM: (14010: I3 ^see 0)
<=WM: (14001: S1 ^operator O1997 +)
<=WM: (14002: S1 ^operator O1998 +)
<=WM: (14003: S1 ^operator O1998)
<=WM: (13986: I3 ^dir L)
<=WM: (13997: R1 ^reward R1002)
<=WM: (13996: I3 ^see 1)
<=WM: (14000: O1998 ^name predict-no)
<=WM: (13999: O1997 ^name predict-yes)
<=WM: (13998: R1002 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1999 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2000 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1998 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1997 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.641767 -0.320477 0.32129 -> 0.641763 -0.320477 0.321286(R,m,v=1,0.933775,0.0622517)
RL update rl*prefer*rvt*predict-no*H0*2*H1*17 0.358259 0.320477 0.678737 -> 0.358255 0.320477 0.678733(R,m,v=1,1,0)
=>WM: (14018: S1 ^operator O2000)

  1000:    O: O2000 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1000 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N999 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14019: I3 ^predict-no N1000)
<=WM: (14005: N999 ^status complete)
<=WM: (14004: I3 ^predict-no N999)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase ---
=>WM: (14023: I2 ^dir R)
=>WM: (14022: I2 ^reward 1)
=>WM: (14021: I2 ^see 0)
=>WM: (14020: N1000 ^status complete)
<=WM: (14008: I2 ^dir U)
<=WM: (14007: I2 ^reward 1)
<=WM: (14006: I2 ^see 0)
=>WM: (14024: I2 ^level-1 L0-root)
<=WM: (14009: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O1999 = 0.8878820777819987)
Firing rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O2000 = -0.1957074416057287)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1004 ^value 1 +)
 (R1 ^reward R1004 +)
Firing propose*predict-yes
 -->
 (O2001 ^name predict-yes +)
 (S1 ^operator O2001 +)
Firing propose*predict-no
 -->
 (O2002 ^name predict-no +)
 (S1 ^operator O2002 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2000 = 0.3993286818522114)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1999 = 0.1121030519341281)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2000 ^name predict-no +)
 (S1 ^operator O2000 +)
Retracting propose*predict-yes
 -->
 (O1999 ^name predict-yes +)
 (S1 ^operator O1999 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1003 ^value 1 +)
 (R1 ^reward R1003 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2000 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1999 = 0.)
=>WM: (14031: S1 ^operator O2002 +)
=>WM: (14030: S1 ^operator O2001 +)
=>WM: (14029: I3 ^dir R)
=>WM: (14028: O2002 ^name predict-no)
=>WM: (14027: O2001 ^name predict-yes)
=>WM: (14026: R1004 ^value 1)
=>WM: (14025: R1 ^reward R1004)
<=WM: (14016: S1 ^operator O1999 +)
<=WM: (14017: S1 ^operator O2000 +)
<=WM: (14018: S1 ^operator O2000)
<=WM: (14015: I3 ^dir U)
<=WM: (14011: R1 ^reward R1003)
<=WM: (14014: O2000 ^name predict-no)
<=WM: (14013: O1999 ^name predict-yes)
<=WM: (14012: R1003 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O2001 = 0.8878820777819987)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2001 = 0.1121030519341281)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O2002 = -0.1957074416057287)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2002 = 0.3993286818522114)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2000 = 0.3993286818522114)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O2000 = -0.1957074416057287)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1999 = 0.1121030519341281)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O1999 = 0.8878820777819987)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14032: S1 ^operator O2001)

  1001:    O: O2001 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1001 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1000 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14033: I3 ^predict-yes N1001)
<=WM: (14020: N1000 ^status complete)
<=WM: (14019: I3 ^predict-no N1000)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (14037: I2 ^dir L)
=>WM: (14036: I2 ^reward 1)
=>WM: (14035: I2 ^see 1)
=>WM: (14034: N1001 ^status complete)
<=WM: (14023: I2 ^dir R)
<=WM: (14022: I2 ^reward 1)
<=WM: (14021: I2 ^see 0)
=>WM: (14038: I2 ^level-1 R1-root)
<=WM: (14024: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O2002 = 0.03900899329983293)
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O2001 = 0.6597553453917251)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1005 ^value 1 +)
 (R1 ^reward R1005 +)
Firing propose*predict-yes
 -->
 (O2003 ^name predict-yes +)
 (S1 ^operator O2003 +)
Firing propose*predict-no
 -->
 (O2004 ^name predict-no +)
 (S1 ^operator O2004 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2002 = 0.3212859007056656)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2001 = 0.3402456559346386)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2002 ^name predict-no +)
 (S1 ^operator O2002 +)
Retracting propose*predict-yes
 -->
 (O2001 ^name predict-yes +)
 (S1 ^operator O2001 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1004 ^value 1 +)
 (R1 ^reward R1004 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2002 = 0.3993286818522114)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O2002 = -0.1957074416057287)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2001 = 0.1121030519341281)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O2001 = 0.8878820777819987)
=>WM: (14046: S1 ^operator O2004 +)
=>WM: (14045: S1 ^operator O2003 +)
=>WM: (14044: I3 ^dir L)
=>WM: (14043: O2004 ^name predict-no)
=>WM: (14042: O2003 ^name predict-yes)
=>WM: (14041: R1005 ^value 1)
=>WM: (14040: R1 ^reward R1005)
=>WM: (14039: I3 ^see 1)
<=WM: (14030: S1 ^operator O2001 +)
<=WM: (14032: S1 ^operator O2001)
<=WM: (14031: S1 ^operator O2002 +)
<=WM: (14029: I3 ^dir R)
<=WM: (14025: R1 ^reward R1004)
<=WM: (14010: I3 ^see 0)
<=WM: (14028: O2002 ^name predict-no)
<=WM: (14027: O2001 ^name predict-yes)
<=WM: (14026: R1004 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3402456559346386)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O2003 = 0.6597553453917251)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3212859007056656)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O2004 = 0.03900899329983293)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2002 = 0.3212859007056656)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O2002 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2001 = 0.3402456559346386)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O2001 = 0.6597553453917251)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.619026 -0.506923 0.112103 -> 0.619028 -0.506923 0.112105(R,m,v=1,0.90184,0.0890707)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*20 0.380957 0.506925 0.887882 -> 0.380959 0.506925 0.887884(R,m,v=1,1,0)
=>WM: (14047: S1 ^operator O2003)

  1002:    O: O2003 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1002 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1001 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14048: I3 ^predict-yes N1002)
<=WM: (14034: N1001 ^status complete)
<=WM: (14033: I3 ^predict-yes N1001)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
--- Input Phase ---
=>WM: (14052: I2 ^dir L)
=>WM: (14051: I2 ^reward 1)
=>WM: (14050: I2 ^see 1)
=>WM: (14049: N1002 ^status complete)
<=WM: (14037: I2 ^dir L)
<=WM: (14036: I2 ^reward 1)
<=WM: (14035: I2 ^see 1)
=>WM: (14053: I2 ^level-1 L1-root)
<=WM: (14038: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O2003 = 0.02884852834965246)
Firing rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O2004 = 0.6787328077031127)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1006 ^value 1 +)
 (R1 ^reward R1006 +)
Firing propose*predict-yes
 -->
 (O2005 ^name predict-yes +)
 (S1 ^operator O2005 +)
Firing propose*predict-no
 -->
 (O2006 ^name predict-no +)
 (S1 ^operator O2006 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3212859007056656)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3402456559346386)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2004 ^name predict-no +)
 (S1 ^operator O2004 +)
Retracting propose*predict-yes
 -->
 (O2003 ^name predict-yes +)
 (S1 ^operator O2003 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1005 ^value 1 +)
 (R1 ^reward R1005 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
 -->
 (S1 ^operator O2004 = 0.03900899329983293)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3212859007056656)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
 -->
 (S1 ^operator O2003 = 0.6597553453917251)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3402456559346386)
=>WM: (14059: S1 ^operator O2006 +)
=>WM: (14058: S1 ^operator O2005 +)
=>WM: (14057: O2006 ^name predict-no)
=>WM: (14056: O2005 ^name predict-yes)
=>WM: (14055: R1006 ^value 1)
=>WM: (14054: R1 ^reward R1006)
<=WM: (14045: S1 ^operator O2003 +)
<=WM: (14047: S1 ^operator O2003)
<=WM: (14046: S1 ^operator O2004 +)
<=WM: (14040: R1 ^reward R1005)
<=WM: (14043: O2004 ^name predict-no)
<=WM: (14042: O2003 ^name predict-yes)
<=WM: (14041: R1005 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2005 = 0.3402456559346386)
Firing prefer*rvt*predict-yes*H0*1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O2005 = 0.02884852834965246)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2006 = 0.3212859007056656)
Firing prefer*rvt*predict-no*H0*2*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O2006 = 0.6787328077031127)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3212859007056656)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O2004 = 0.6787328077031127)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3402456559346386)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O2003 = 0.02884852834965246)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236932 0.340246 -> 0.577178 -0.236933 0.340246(R,m,v=1,0.89759,0.092479)
RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422822 0.236933 0.659755 -> 0.422822 0.236933 0.659755(R,m,v=1,1,0)
=>WM: (14060: S1 ^operator O2006)

  1003:    O: O2006 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1003 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1002 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14061: I3 ^predict-no N1003)
<=WM: (14049: N1002 ^status complete)
<=WM: (14048: I3 ^predict-yes N1002)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase ---
=>WM: (14065: I2 ^dir R)
=>WM: (14064: I2 ^reward 1)
=>WM: (14063: I2 ^see 0)
=>WM: (14062: N1003 ^status complete)
<=WM: (14052: I2 ^dir L)
<=WM: (14051: I2 ^reward 1)
<=WM: (14050: I2 ^see 1)
=>WM: (14066: I2 ^level-1 L0-root)
<=WM: (14053: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O2005 = 0.8878843083245797)
Firing rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O2006 = -0.1957074416057287)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1007 ^value 1 +)
 (R1 ^reward R1007 +)
Firing propose*predict-yes
 -->
 (O2007 ^name predict-yes +)
 (S1 ^operator O2007 +)
Firing propose*predict-no
 -->
 (O2008 ^name predict-no +)
 (S1 ^operator O2008 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2006 = 0.3993286818522114)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2005 = 0.1121052824767091)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2006 ^name predict-no +)
 (S1 ^operator O2006 +)
Retracting propose*predict-yes
 -->
 (O2005 ^name predict-yes +)
 (S1 ^operator O2005 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1006 ^value 1 +)
 (R1 ^reward R1006 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
 -->
 (S1 ^operator O2006 = 0.6787328077031127)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2006 = 0.3212859007056656)
Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
 -->
 (S1 ^operator O2005 = 0.02884852834965246)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2005 = 0.340245505735684)
=>WM: (14074: S1 ^operator O2008 +)
=>WM: (14073: S1 ^operator O2007 +)
=>WM: (14072: I3 ^dir R)
=>WM: (14071: O2008 ^name predict-no)
=>WM: (14070: O2007 ^name predict-yes)
=>WM: (14069: R1007 ^value 1)
=>WM: (14068: R1 ^reward R1007)
=>WM: (14067: I3 ^see 0)
<=WM: (14058: S1 ^operator O2005 +)
<=WM: (14059: S1 ^operator O2006 +)
<=WM: (14060: S1 ^operator O2006)
<=WM: (14044: I3 ^dir L)
<=WM: (14054: R1 ^reward R1006)
<=WM: (14039: I3 ^see 1)
<=WM: (14057: O2006 ^name predict-no)
<=WM: (14056: O2005 ^name predict-yes)
<=WM: (14055: R1006 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.1121052824767091)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O2007 = 0.8878843083245797)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.3993286818522114)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O2008 = -0.1957074416057287)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2006 = 0.3993286818522114)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O2006 = -0.1957074416057287)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2005 = 0.1121052824767091)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O2005 = 0.8878843083245797)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.641763 -0.320477 0.321286 -> 0.641761 -0.320477 0.321283(R,m,v=1,0.934211,0.0618682)
RL update rl*prefer*rvt*predict-no*H0*2*H1*17 0.358255 0.320477 0.678733 -> 0.358253 0.320477 0.67873(R,m,v=1,1,0)
=>WM: (14075: S1 ^operator O2007)

  1004:    O: O2007 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1004 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1003 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14076: I3 ^predict-yes N1004)
<=WM: (14062: N1003 ^status complete)
<=WM: (14061: I3 ^predict-no N1003)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes
 inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
--- Input Phase ---
=>WM: (14080: I2 ^dir R)
=>WM: (14079: I2 ^reward 1)
=>WM: (14078: I2 ^see 1)
=>WM: (14077: N1004 ^status complete)
<=WM: (14065: I2 ^dir R)
<=WM: (14064: I2 ^reward 1)
<=WM: (14063: I2 ^see 0)
=>WM: (14081: I2 ^level-1 R1-root)
<=WM: (14066: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O2008 = 0.6006730589861906)
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O2007 = 0.1602187148382515)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1008 ^value 1 +)
 (R1 ^reward R1008 +)
Firing propose*predict-yes
 -->
 (O2009 ^name predict-yes +)
 (S1 ^operator O2009 +)
Firing propose*predict-no
 -->
 (O2010 ^name predict-no +)
 (S1 ^operator O2010 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.3993286818522114)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.1121052824767091)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2008 ^name predict-no +)
 (S1 ^operator O2008 +)
Retracting propose*predict-yes
 -->
 (O2007 ^name predict-yes +)
 (S1 ^operator O2007 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1007 ^value 1 +)
 (R1 ^reward R1007 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
 -->
 (S1 ^operator O2008 = -0.1957074416057287)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.3993286818522114)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
 -->
 (S1 ^operator O2007 = 0.8878843083245797)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.1121052824767091)
=>WM: (14088: S1 ^operator O2010 +)
=>WM: (14087: S1 ^operator O2009 +)
=>WM: (14086: O2010 ^name predict-no)
=>WM: (14085: O2009 ^name predict-yes)
=>WM: (14084: R1008 ^value 1)
=>WM: (14083: R1 ^reward R1008)
=>WM: (14082: I3 ^see 1)
<=WM: (14073: S1 ^operator O2007 +)
<=WM: (14075: S1 ^operator O2007)
<=WM: (14074: S1 ^operator O2008 +)
<=WM: (14068: R1 ^reward R1007)
<=WM: (14067: I3 ^see 0)
<=WM: (14071: O2008 ^name predict-no)
<=WM: (14070: O2007 ^name predict-yes)
<=WM: (14069: R1007 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2009 = 0.1121052824767091)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O2009 = 0.1602187148382515)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2010 = 0.3993286818522114)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O2010 = 0.6006730589861906)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.3993286818522114)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O2008 = 0.6006730589861906)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.1121052824767091)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O2007 = 0.1602187148382515)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.619028 -0.506923 0.112105 -> 0.61903 -0.506923 0.112107(R,m,v=1,0.902439,0.088583)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*20 0.380959 0.506925 0.887884 -> 0.380961 0.506925 0.887886(R,m,v=1,1,0)
=>WM: (14089: S1 ^operator O2010)

  1005:    O: O2010 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1005 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1004 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14090: I3 ^predict-no N1005)
<=WM: (14077: N1004 ^status complete)
<=WM: (14076: I3 ^predict-yes N1004)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14094: I2 ^dir U)
=>WM: (14093: I2 ^reward 1)
=>WM: (14092: I2 ^see 0)
=>WM: (14091: N1005 ^status complete)
<=WM: (14080: I2 ^dir R)
<=WM: (14079: I2 ^reward 1)
<=WM: (14078: I2 ^see 1)
=>WM: (14095: I2 ^level-1 R0-root)
<=WM: (14081: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1009 ^value 1 +)
 (R1 ^reward R1009 +)
Firing propose*predict-yes
 -->
 (O2011 ^name predict-yes +)
 (S1 ^operator O2011 +)
Firing propose*predict-no
 -->
 (O2012 ^name predict-no +)
 (S1 ^operator O2012 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2010 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2009 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2010 ^name predict-no +)
 (S1 ^operator O2010 +)
Retracting propose*predict-yes
 -->
 (O2009 ^name predict-yes +)
 (S1 ^operator O2009 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1008 ^value 1 +)
 (R1 ^reward R1008 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
 -->
 (S1 ^operator O2010 = 0.6006730589861906)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2010 = 0.3993286818522114)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
 -->
 (S1 ^operator O2009 = 0.1602187148382515)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2009 = 0.1121068438565158)
=>WM: (14103: S1 ^operator O2012 +)
=>WM: (14102: S1 ^operator O2011 +)
=>WM: (14101: I3 ^dir U)
=>WM: (14100: O2012 ^name predict-no)
=>WM: (14099: O2011 ^name predict-yes)
=>WM: (14098: R1009 ^value 1)
=>WM: (14097: R1 ^reward R1009)
=>WM: (14096: I3 ^see 0)
<=WM: (14087: S1 ^operator O2009 +)
<=WM: (14088: S1 ^operator O2010 +)
<=WM: (14089: S1 ^operator O2010)
<=WM: (14072: I3 ^dir R)
<=WM: (14083: R1 ^reward R1008)
<=WM: (14082: I3 ^see 1)
<=WM: (14086: O2010 ^name predict-no)
<=WM: (14085: O2009 ^name predict-yes)
<=WM: (14084: R1008 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2011 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2012 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2010 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2009 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.558037 -0.158709 0.399329 -> 0.558037 -0.158709 0.399328(R,m,v=1,0.928994,0.0663567)
RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441964 0.158709 0.600673 -> 0.441964 0.158709 0.600673(R,m,v=1,1,0)
=>WM: (14104: S1 ^operator O2012)

  1006:    O: O2012 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1006 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1005 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14105: I3 ^predict-no N1006)
<=WM: (14091: N1005 ^status complete)
<=WM: (14090: I3 ^predict-no N1005)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14109: I2 ^dir R)
=>WM: (14108: I2 ^reward 1)
=>WM: (14107: I2 ^see 0)
=>WM: (14106: N1006 ^status complete)
<=WM: (14094: I2 ^dir U)
<=WM: (14093: I2 ^reward 1)
<=WM: (14092: I2 ^see 0)
=>WM: (14110: I2 ^level-1 R0-root)
<=WM: (14095: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2011 = 0.08295067548181556)
Firing rl*prefer*rvt*predict-no*H0*6*H1*13
 -->
 (S1 ^operator O2012 = 0.600662900915969)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1010 ^value 1 +)
 (R1 ^reward R1010 +)
Firing propose*predict-yes
 -->
 (O2013 ^name predict-yes +)
 (S1 ^operator O2013 +)
Firing propose*predict-no
 -->
 (O2014 ^name predict-no +)
 (S1 ^operator O2014 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2012 = 0.3993284207264511)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2011 = 0.1121068438565158)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2012 ^name predict-no +)
 (S1 ^operator O2012 +)
Retracting propose*predict-yes
 -->
 (O2011 ^name predict-yes +)
 (S1 ^operator O2011 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1009 ^value 1 +)
 (R1 ^reward R1009 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2012 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2011 = 0.)
=>WM: (14117: S1 ^operator O2014 +)
=>WM: (14116: S1 ^operator O2013 +)
=>WM: (14115: I3 ^dir R)
=>WM: (14114: O2014 ^name predict-no)
=>WM: (14113: O2013 ^name predict-yes)
=>WM: (14112: R1010 ^value 1)
=>WM: (14111: R1 ^reward R1010)
<=WM: (14102: S1 ^operator O2011 +)
<=WM: (14103: S1 ^operator O2012 +)
<=WM: (14104: S1 ^operator O2012)
<=WM: (14101: I3 ^dir U)
<=WM: (14097: R1 ^reward R1009)
<=WM: (14100: O2012 ^name predict-no)
<=WM: (14099: O2011 ^name predict-yes)
<=WM: (14098: R1009 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2013 = 0.08295067548181556)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2013 = 0.1121068438565158)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*13
 -->
 (S1 ^operator O2014 = 0.600662900915969)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2014 = 0.3993284207264511)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2012 = 0.3993284207264511)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*13
 -->
 (S1 ^operator O2012 = 0.600662900915969)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2011 = 0.1121068438565158)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2011 = 0.08295067548181556)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14118: S1 ^operator O2014)

  1007:    O: O2014 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1007 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1006 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14119: I3 ^predict-no N1007)
<=WM: (14106: N1006 ^status complete)
<=WM: (14105: I3 ^predict-no N1006)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14123: I2 ^dir R)
=>WM: (14122: I2 ^reward 1)
=>WM: (14121: I2 ^see 0)
=>WM: (14120: N1007 ^status complete)
<=WM: (14109: I2 ^dir R)
<=WM: (14108: I2 ^reward 1)
<=WM: (14107: I2 ^see 0)
=>WM: (14124: I2 ^level-1 R0-root)
<=WM: (14110: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2013 = 0.08295067548181556)
Firing rl*prefer*rvt*predict-no*H0*6*H1*13
 -->
 (S1 ^operator O2014 = 0.600662900915969)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1011 ^value 1 +)
 (R1 ^reward R1011 +)
Firing propose*predict-yes
 -->
 (O2015 ^name predict-yes +)
 (S1 ^operator O2015 +)
Firing propose*predict-no
 -->
 (O2016 ^name predict-no +)
 (S1 ^operator O2016 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2014 = 0.3993284207264511)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2013 = 0.1121068438565158)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2014 ^name predict-no +)
 (S1 ^operator O2014 +)
Retracting propose*predict-yes
 -->
 (O2013 ^name predict-yes +)
 (S1 ^operator O2013 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1010 ^value 1 +)
 (R1 ^reward R1010 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2014 = 0.3993284207264511)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*13
 -->
 (S1 ^operator O2014 = 0.600662900915969)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2013 = 0.1121068438565158)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2013 = 0.08295067548181556)
=>WM: (14130: S1 ^operator O2016 +)
=>WM: (14129: S1 ^operator O2015 +)
=>WM: (14128: O2016 ^name predict-no)
=>WM: (14127: O2015 ^name predict-yes)
=>WM: (14126: R1011 ^value 1)
=>WM: (14125: R1 ^reward R1011)
<=WM: (14116: S1 ^operator O2013 +)
<=WM: (14117: S1 ^operator O2014 +)
<=WM: (14118: S1 ^operator O2014)
<=WM: (14111: R1 ^reward R1010)
<=WM: (14114: O2014 ^name predict-no)
<=WM: (14113: O2013 ^name predict-yes)
<=WM: (14112: R1010 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2015 = 0.08295067548181556)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2015 = 0.1121068438565158)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6*H1*13
 -->
 (S1 ^operator O2016 = 0.600662900915969)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2016 = 0.3993284207264511)
Firing prefer*rvt*predict-no*H0*6*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2014 = 0.3993284207264511)
Retracting rl*prefer*rvt*predict-no*H0*6*H1*13
 -->
 (S1 ^operator O2014 = 0.600662900915969)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2013 = 0.1121068438565158)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2013 = 0.08295067548181556)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.558037 -0.158709 0.399328 -> 0.558038 -0.158709 0.39933(R,m,v=1,0.929412,0.0659937)
RL update rl*prefer*rvt*predict-no*H0*6*H1*13 0.441955 0.158708 0.600663 -> 0.441956 0.158708 0.600664(R,m,v=1,1,0)
=>WM: (14131: S1 ^operator O2016)

  1008:    O: O2016 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1008 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1007 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14132: I3 ^predict-no N1008)
<=WM: (14120: N1007 ^status complete)
<=WM: (14119: I3 ^predict-no N1007)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14136: I2 ^dir L)
=>WM: (14135: I2 ^reward 1)
=>WM: (14134: I2 ^see 0)
=>WM: (14133: N1008 ^status complete)
<=WM: (14123: I2 ^dir R)
<=WM: (14122: I2 ^reward 1)
<=WM: (14121: I2