stdout-flip-2.5K_4.txt

/flipv2/20121113-091142-2.5K-ReLST-asneeded_epochs_50_5runs_noalphadecay/stdout-flip-2.5K_4.txt

https://bitbucket.org/evan13579b/soar-ziggurat
Plain Text | 16322 lines | 15596 code | 726 blank | 0 comment | 0 complexity | d3a0179b931f9256445520fb852fb030 MD5 | raw file
Possible License(s): BSD-3-Clause

Seeding... 4
dir: dir isL
Python-Soar Flip environment.
To accept commands from an external sml process, you'll need to
type 'slave <log file> <n decisons>' at the prompt...
sourcing 'flip_predict.soar'
***********
Total: 11 productions sourced.

seeding Soar with 4 ...

soar> Entering slave mode:
  - log file 'rl-slave-2.5K_4.log'....
  - will exit slave mode after 2500 decisions
  waiting for commands from an externally connected sml process...
-/|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\-/|\-/|\-/|sleeping...
\-/|\-/sleeping...
|1:    O: O2 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
rule alias: '*'

rule alias: '*'

\-/|\-/2:    O: O3 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
|\-3:    O: O6 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
/|\4:    O: O8 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
-/5:    O: O10 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-6:    O: O11 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|7:    O: O14 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/8:    O: O15 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
|\-9:    O: O17 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\-10:    O: O19 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\11:    O: O21 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

-12:    O: O24 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
/|\13:    O: O26 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|14:    O: O28 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/15:    O: O30 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-16:    O: O32 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|17:    O: O34 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-18:    O: O36 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\19:    O: O38 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|20:    O: O39 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
\-/|sleeping...
\21:    O: O42 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-22:    O: O43 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|23:    O: O45 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
\-/24:    O: O48 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-25:    O: O50 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\-26:    O: O52 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
/|27:    O: O53 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/28:    O: O56 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-29:    O: O57 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\30:    O: O60 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
-31:    O: O61 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/32:    O: O64 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isU
|\-33:    O: O66 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\34:    O: O68 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|35:    O: O69 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-36:    O: O72 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isU
/|\37:    O: O74 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|38:    O: O76 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
\-/39:    O: O78 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-40:    O: O80 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\-41:    O: O81 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
/42:    O: O83 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\-/43:    O: O85 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\-44:    O: O87 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\45:    O: O89 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|46:    O: O91 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
\-/47:    O: O94 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\48:    O: O95 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
-/49:    O: O97 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
|\-/50:    O: O99 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-/|\-sleeping...
/sleeping...
|51:    O: O102 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
rule alias: '*'

rule alias: '*'

\52:    O: O103 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/53:    O: O106 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-54:    O: O107 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
/|\55:    O: O110 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|\sleeping...
-56:    O: O112 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
/|57:    O: O113 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-58:    O: O115 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|59:    O: O117 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/60:    O: O119 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-/61:    O: O121 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

|62:    O: O124 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/63:    O: O125 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-64:    O: O127 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|65:    O: O130 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/66:    O: O131 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-67:    O: O133 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|\68:    O: O135 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
-/|69:    O: O138 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-70:    O: O140 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
/|\-71:    O: O141 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

/72:    O: O143 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
|\-73:    O: O145 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
/|\74:    O: O147 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/75:    O: O149 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-76:    O: O151 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/|\77:    O: O154 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|\78:    O: O156 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-79:    O: O158 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\80:    O: O159 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
-/|\81:    O: O161 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

-82:    O: O163 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/|\83:    O: O166 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|\84:    O: O168 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/85:    O: O169 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-/86:    O: O172 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\87:    O: O173 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-/88:    O: O176 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-89:    O: O177 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|90:    O: O180 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/91:    O: O181 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

|92:    O: O183 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/93:    O: O185 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-94:    O: O188 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\95:    O: O189 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|\96:    O: O192 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|97:    O: O194 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/98:    O: O196 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\99:    O: O198 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-100:    O: O199 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\101:    O: O201 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|\-/|\-/|\-/|\-/|\-/|\-/|\-/|\-/|sleeping...
\102:    O: O204 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|\103:    O: O206 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|104:    O: O208 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
\-105:    O: O209 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/106:    O: O211 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
|\107:    O: O214 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/108:    O: O216 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-109:    O: O218 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|110:    O: O220 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-111:    O: O221 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

/112:    O: O223 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-113:    O: O226 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isU
/|\114:    O: O228 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/115:    O: O229 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
|\-116:    O: O231 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\117:    O: O234 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|118:    O: O235 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
\-/119:    O: O238 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-120:    O: O239 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\121:    O: O242 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

-122:    O: O243 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\-123:    O: O245 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\124:    O: O248 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
-125:    O: O249 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\126:    O: O252 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|\127:    O: O254 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|\128:    O: O255 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/129:    O: O257 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-130:    O: O259 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|131:    O: O262 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
rule alias: '*'

rule alias: '*'

\132:    O: O264 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/133:    O: O266 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-134:    O: O268 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|135:    O: O270 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/|136:    O: O272 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-137:    O: O274 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\138:    O: O275 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|\139:    O: O278 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/140:    O: O279 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
|\141:    O: O282 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-142:    O: O283 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
/|\143:    O: O286 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/144:    O: O288 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-145:    O: O289 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|146:    O: O291 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/|147:    O: O293 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
\-/|148:    O: O296 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-149:    O: O298 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|150:    O: O300 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-151:    O: O301 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/152:    O: O303 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
|\-/153:    O: O306 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-/154:    O: O307 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
|\-/sleeping...
|155:    O: O310 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/156:    O: O312 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\157:    O: O314 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|158:    O: O315 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
\-/159:    O: O318 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isU
|\-/160:    O: O319 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
|\-161:    O: O321 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/162:    O: O323 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
|\-163:    O: O326 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\-164:    O: O327 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/|\165:    O: O330 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/166:    O: O332 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\167:    O: O334 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|168:    O: O336 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/169:    O: O338 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-170:    O: O340 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\-171:    O: O342 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/172:    O: O344 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\173:    O: O345 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/174:    O: O348 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-175:    O: O350 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|176:    O: O352 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/177:    O: O353 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\178:    O: O356 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|179:    O: O358 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/180:    O: O360 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-181:    O: O362 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/182:    O: O364 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-183:    O: O366 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|184:    O: O368 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/185:    O: O370 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\186:    O: O371 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|187:    O: O374 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-188:    O: O375 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\189:    O: O378 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|190:    O: O380 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-191:    O: O381 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/192:    O: O384 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\193:    O: O385 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|194:    O: O387 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
\-/|sleeping...
\195:    O: O390 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
-/|\196:    O: O392 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|197:    O: O394 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
\-/198:    O: O396 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-199:    O: O398 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\200:    O: O400 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|201:    O: O401 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-202:    O: O403 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\203:    O: O406 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|204:    O: O408 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-205:    O: O409 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\-206:    O: O412 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\207:    O: O414 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-208:    O: O416 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\209:    O: O417 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|210:    O: O420 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/211:    O: O422 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|212:    O: O423 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/213:    O: O426 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-214:    O: O428 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
/|\215:    O: O430 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/216:    O: O431 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-217:    O: O434 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\218:    O: O435 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|219:    O: O438 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/220:    O: O440 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\221:    O: O442 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-222:    O: O444 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\223:    O: O446 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/224:    O: O447 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|225:    O: O450 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-226:    O: O452 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\227:    O: O454 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|228:    O: O456 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-229:    O: O457 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\230:    O: O460 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|\231:    O: O461 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-232:    O: O463 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|\233:    O: O466 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|234:    O: O468 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/|235:    O: O469 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/236:    O: O472 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-/237:    O: O474 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-238:    O: O475 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
/|\-239:    O: O477 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\240:    O: O479 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/241:    O: O482 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|242:    O: O483 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/243:    O: O485 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
|\244:    O: O488 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|\245:    O: O490 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|246:    O: O491 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/|247:    O: O494 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-248:    O: O496 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|249:    O: O497 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/250:    O: O500 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-/251:    O: O502 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|252:    O: O504 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/253:    O: O506 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-254:    O: O508 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\-255:    O: O509 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\256:    O: O512 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|257:    O: O513 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/|258:    O: O516 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/259:    O: O517 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-260:    O: O520 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\261:    O: O522 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-262:    O: O523 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|263:    O: O526 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/264:    O: O527 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-/265:    O: O530 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-266:    O: O531 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/267:    O: O534 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|268:    O: O535 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/|269:    O: O538 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/270:    O: O540 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\271:    O: O542 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-272:    O: O543 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/|\273:    O: O546 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|274:    O: O547 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/275:    O: O550 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\276:    O: O552 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|277:    O: O554 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/278:    O: O555 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-279:    O: O557 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\280:    O: O560 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|281:    O: O561 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\282:    O: O564 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
-/|283:    O: O565 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/284:    O: O568 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\285:    O: O570 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|286:    O: O572 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/287:    O: O574 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\288:    O: O576 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|289:    O: O577 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
\290:    O: O579 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
-/291:    O: O582 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|292:    O: O584 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/293:    O: O585 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-294:    O: O588 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|295:    O: O590 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-296:    O: O592 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|297:    O: O594 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/298:    O: O596 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-/299:    O: O597 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-300:    O: O599 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\-/|301:    O: O602 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\302:    O: O604 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-303:    O: O606 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\304:    O: O608 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|305:    O: O609 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/|306:    O: O612 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/307:    O: O613 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-308:    O: O616 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\309:    O: O618 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|310:    O: O620 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-311:    O: O621 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/312:    O: O624 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\313:    O: O626 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|314:    O: O627 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-315:    O: O630 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\316:    O: O632 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|317:    O: O634 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/318:    O: O636 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-319:    O: O638 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|320:    O: O639 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/|321:    O: O641 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\322:    O: O643 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-323:    O: O646 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\324:    O: O647 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|325:    O: O650 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/326:    O: O652 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-/327:    O: O653 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
|\-328:    O: O656 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\329:    O: O658 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/330:    O: O659 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\331:    O: O662 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-332:    O: O664 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\-333:    O: O666 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\334:    O: O668 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|335:    O: O669 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/336:    O: O671 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-/337:    O: O674 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-338:    O: O676 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\339:    O: O678 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/340:    O: O680 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-/341:    O: O682 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|342:    O: O684 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/|343:    O: O686 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/344:    O: O688 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-/345:    O: O690 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-346:    O: O691 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|347:    O: O694 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-348:    O: O696 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\-sleeping...
/349:    O: O698 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-/350:    O: O699 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\351:    O: O702 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-352:    O: O703 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|353:    O: O705 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-354:    O: O707 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\355:    O: O710 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|\356:    O: O712 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|\sleeping...
-357:    O: O714 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\-358:    O: O716 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\359:    O: O718 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|\360:    O: O720 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|361:    O: O722 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\362:    O: O724 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|363:    O: O726 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-364:    O: O728 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\365:    O: O729 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|366:    O: O732 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/367:    O: O734 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-/368:    O: O736 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-369:    O: O738 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\370:    O: O740 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|371:    O: O742 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\372:    O: O744 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/373:    O: O746 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-374:    O: O747 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\375:    O: O750 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|376:    O: O751 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-377:    O: O753 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\378:    O: O756 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|\379:    O: O758 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|380:    O: O760 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-381:    O: O761 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/382:    O: O764 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-383:    O: O766 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\-384:    O: O767 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\385:    O: O769 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|\386:    O: O772 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/387:    O: O773 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-388:    O: O776 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\389:    O: O778 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|390:    O: O780 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/|391:    O: O782 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\392:    O: O784 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|393:    O: O786 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/|394:    O: O787 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
\-/395:    O: O790 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-396:    O: O791 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\397:    O: O794 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|398:    O: O795 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
\-/399:    O: O797 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\400:    O: O800 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|\401:    O: O802 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-402:    O: O804 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\403:    O: O806 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|404:    O: O808 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/405:    O: O809 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-406:    O: O812 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/407:    O: O814 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-408:    O: O816 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\409:    O: O817 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/410:    O: O819 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-411:    O: O822 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/412:    O: O823 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-413:    O: O825 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\414:    O: O828 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|415:    O: O830 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/416:    O: O831 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-417:    O: O833 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\418:    O: O835 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|419:    O: O838 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-420:    O: O839 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\421:    O: O841 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-422:    O: O844 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\423:    O: O846 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
-/|\424:    O: O848 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|\425:    O: O850 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/426:    O: O852 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-427:    O: O854 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\428:    O: O856 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|\429:    O: O858 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/430:    O: O860 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\431:    O: O862 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-432:    O: O864 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\-433:    O: O866 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\434:    O: O868 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|\435:    O: O869 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|436:    O: O872 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-437:    O: O873 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\438:    O: O876 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|\439:    O: O878 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|\440:    O: O879 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|441:    O: O882 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\442:    O: O884 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/443:    O: O886 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\444:    O: O888 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-445:    O: O890 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\446:    O: O892 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/447:    O: O894 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-448:    O: O896 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\449:    O: O897 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|\450:    O: O899 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|451:    O: O902 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\452:    O: O903 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/453:    O: O906 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-454:    O: O907 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\-455:    O: O910 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|456:    O: O912 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-457:    O: O913 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|458:    O: O915 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/459:    O: O917 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\460:    O: O920 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/461:    O: O921 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
|462:    O: O924 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/463:    O: O925 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-464:    O: O927 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|465:    O: O930 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-466:    O: O931 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\467:    O: O933 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|468:    O: O936 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-469:    O: O938 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|470:    O: O940 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-471:    O: O941 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/472:    O: O944 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\473:    O: O946 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|474:    O: O948 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/|475:    O: O949 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/476:    O: O951 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\477:    O: O954 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|478:    O: O956 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/479:    O: O958 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-480:    O: O960 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\481:    O: O962 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-482:    O: O963 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\-sleeping...
/483:    O: O966 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-484:    O: O968 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|485:    O: O969 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/486:    O: O971 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-487:    O: O973 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\488:    O: O975 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|489:    O: O977 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-490:    O: O980 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|491:    O: O982 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\492:    O: O984 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/493:    O: O986 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-494:    O: O988 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\495:    O: O990 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|496:    O: O992 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/|497:    O: O994 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-498:    O: O996 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\499:    O: O998 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|500:    O: O1000 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/|\-501:    O: O1002 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/502:    O: O1004 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-503:    O: O1005 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\504:    O: O1007 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-505:    O: O1009 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\506:    O: O1012 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|507:    O: O1014 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-508:    O: O1015 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|509:    O: O1018 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/510:    O: O1020 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-511:    O: O1022 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/512:    O: O1024 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-513:    O: O1026 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\514:    O: O1027 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/515:    O: O1030 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-/516:    O: O1032 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\517:    O: O1033 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|518:    O: O1036 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-519:    O: O1038 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\520:    O: O1040 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/521:    O: O1042 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|522:    O: O1044 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/523:    O: O1046 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-/524:    O: O1048 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-525:    O: O1050 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\-526:    O: O1052 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\527:    O: O1054 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/528:    O: O1056 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-/529:    O: O1057 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-530:    O: O1060 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\-531:    O: O1062 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/532:    O: O1063 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|533:    O: O1066 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/534:    O: O1068 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\535:    O: O1070 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|536:    O: O1072 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-537:    O: O1074 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|538:    O: O1076 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/539:    O: O1078 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-540:    O: O1080 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\541:    O: O1082 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-542:    O: O1084 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\-543:    O: O1086 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\-544:    O: O1088 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|545:    O: O1090 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/546:    O: O1091 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-547:    O: O1094 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\548:    O: O1096 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|549:    O: O1098 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/550:    O: O1100 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-551:    O: O1102 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/552:    O: O1104 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\553:    O: O1105 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|554:    O: O1108 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/|555:    O: O1110 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/556:    O: O1112 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-557:    O: O1113 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\558:    O: O1115 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/559:    O: O1117 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-/560:    O: O1120 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-561:    O: O1122 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/562:    O: O1124 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-563:    O: O1126 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\564:    O: O1127 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|565:    O: O1129 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-566:    O: O1132 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\567:    O: O1133 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/568:    O: O1136 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\569:    O: O1138 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|570:    O: O1140 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/571:    O: O1142 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|572:    O: O1144 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/573:    O: O1145 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-574:    O: O1147 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\575:    O: O1150 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|576:    O: O1152 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/|577:    O: O1153 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/|578:    O: O1155 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/579:    O: O1157 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\580:    O: O1160 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/581:    O: O1162 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|582:    O: O1164 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-583:    O: O1165 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\584:    O: O1168 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/585:    O: O1170 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-586:    O: O1172 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|587:    O: O1173 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/588:    O: O1175 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-589:    O: O1178 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\590:    O: O1179 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|591:    O: O1182 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\592:    O: O1184 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/593:    O: O1186 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-594:    O: O1188 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\-595:    O: O1189 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\596:    O: O1191 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|597:    O: O1194 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-598:    O: O1196 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\599:    O: O1198 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|600:    O: O1199 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/601:    O: O1201 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|602:    O: O1204 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/603:    O: O1205 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\604:    O: O1208 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|605:    O: O1210 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-606:    O: O1212 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\-607:    O: O1213 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\608:    O: O1215 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/609:    O: O1217 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\610:    O: O1219 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|\611:    O: O1221 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-612:    O: O1224 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\-613:    O: O1226 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\614:    O: O1228 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|615:    O: O1230 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-616:    O: O1231 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\-617:    O: O1233 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\618:    O: O1236 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/619:    O: O1238 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-620:    O: O1240 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\621:    O: O1242 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-622:    O: O1243 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|623:    O: O1245 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/624:    O: O1247 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-/625:    O: O1250 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\626:    O: O1251 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|627:    O: O1254 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/|628:    O: O1255 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/629:    O: O1258 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-630:    O: O1259 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\631:    O: O1261 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-632:    O: O1264 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\633:    O: O1266 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|634:    O: O1268 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/635:    O: O1270 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-636:    O: O1272 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|637:    O: O1273 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/638:    O: O1276 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-/639:    O: O1277 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-640:    O: O1280 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|641:    O: O1281 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\642:    O: O1284 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|643:    O: O1285 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-644:    O: O1287 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\645:    O: O1289 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|\646:    O: O1292 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/647:    O: O1294 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-/648:    O: O1296 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\649:    O: O1298 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|\650:    O: O1299 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/651:    O: O1301 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|652:    O: O1303 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-653:    O: O1306 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|654:    O: O1308 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-655:    O: O1309 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\656:    O: O1311 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/657:    O: O1313 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-658:    O: O1315 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\659:    O: O1318 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|660:    O: O1319 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/661:    O: O1322 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|662:    O: O1324 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/663:    O: O1326 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\664:    O: O1328 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-665:    O: O1330 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|666:    O: O1332 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/667:    O: O1334 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-668:    O: O1336 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|669:    O: O1338 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/|670:    O: O1340 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/671:    O: O1341 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|672:    O: O1344 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-673:    O: O1345 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\674:    O: O1347 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|675:    O: O1350 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/676:    O: O1352 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-677:    O: O1354 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\678:    O: O1356 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/679:    O: O1358 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-/680:    O: O1359 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\681:    O: O1361 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-682:    O: O1363 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\-683:    O: O1366 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\684:    O: O1368 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|685:    O: O1370 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/686:    O: O1371 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-687:    O: O1373 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|688:    O: O1375 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-689:    O: O1377 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\690:    O: O1379 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|691:    O: O1381 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\692:    O: O1384 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/693:    O: O1386 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-694:    O: O1388 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\-695:    O: O1389 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\696:    O: O1392 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|697:    O: O1393 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-698:    O: O1395 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\699:    O: O1398 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|\700:    O: O1400 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|701:    O: O1402 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\702:    O: O1403 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/703:    O: O1405 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\704:    O: O1407 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|705:    O: O1409 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/706:    O: O1412 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\707:    O: O1414 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|708:    O: O1416 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/709:    O: O1418 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-710:    O: O1419 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\-711:    O: O1422 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/712:    O: O1424 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-/713:    O: O1426 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|714:    O: O1428 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/|715:    O: O1429 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/716:    O: O1431 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-717:    O: O1434 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|718:    O: O1436 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/719:    O: O1438 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|720:    O: O1440 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/721:    O: O1442 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|722:    O: O1443 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-723:    O: O1445 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|724:    O: O1447 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/725:    O: O1450 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\726:    O: O1451 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|\727:    O: O1454 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|728:    O: O1456 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-729:    O: O1458 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|730:    O: O1460 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\731:    O: O1462 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-732:    O: O1464 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|733:    O: O1466 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/734:    O: O1468 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\735:    O: O1470 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|736:    O: O1472 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/737:    O: O1474 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\738:    O: O1475 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/739:    O: O1478 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-/sleeping...
|740:    O: O1479 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-741:    O: O1482 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/742:    O: O1484 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\743:    O: O1485 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|744:    O: O1487 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/|745:    O: O1489 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/746:    O: O1492 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\747:    O: O1493 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|\748:    O: O1496 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/749:    O: O1498 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\750:    O: O1500 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|\751:    O: O1502 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-752:    O: O1504 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\753:    O: O1506 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|754:    O: O1507 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/755:    O: O1510 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-/756:    O: O1512 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\757:    O: O1514 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|758:    O: O1516 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/759:    O: O1518 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\760:    O: O1520 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-761:    O: O1521 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/762:    O: O1523 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-763:    O: O1526 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\-764:    O: O1528 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\765:    O: O1530 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|766:    O: O1532 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-767:    O: O1534 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\768:    O: O1536 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/769:    O: O1538 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\770:    O: O1539 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|771:    O: O1541 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\772:    O: O1543 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/773:    O: O1546 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-774:    O: O1548 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|775:    O: O1550 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/776:    O: O1551 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\777:    O: O1554 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|\778:    O: O1556 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|779:    O: O1558 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/780:    O: O1559 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\781:    O: O1562 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-782:    O: O1564 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\783:    O: O1565 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|784:    O: O1568 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/785:    O: O1569 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-786:    O: O1572 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|787:    O: O1574 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-788:    O: O1576 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|789:    O: O1578 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-790:    O: O1580 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\-sleeping...
/791:    O: O1581 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|792:    O: O1584 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/793:    O: O1586 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-794:    O: O1588 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\-795:    O: O1590 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\796:    O: O1592 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|797:    O: O1593 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/798:    O: O1595 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-799:    O: O1597 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\800:    O: O1600 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|801:    O: O1601 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\802:    O: O1604 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|803:    O: O1605 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/804:    O: O1608 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\805:    O: O1609 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|806:    O: O1612 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/807:    O: O1614 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\808:    O: O1616 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|809:    O: O1618 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/810:    O: O1620 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-811:    O: O1621 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/812:    O: O1624 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-/813:    O: O1626 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-814:    O: O1628 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\815:    O: O1630 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|816:    O: O1631 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/|817:    O: O1634 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/818:    O: O1635 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-819:    O: O1638 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\-820:    O: O1639 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\821:    O: O1641 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-822:    O: O1644 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|823:    O: O1645 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/|824:    O: O1648 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/825:    O: O1650 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\826:    O: O1652 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|\827:    O: O1654 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-828:    O: O1656 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|829:    O: O1658 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/|830:    O: O1660 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/831:    O: O1662 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|832:    O: O1664 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/|833:    O: O1666 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/834:    O: O1668 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-835:    O: O1670 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|836:    O: O1672 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/837:    O: O1674 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-838:    O: O1675 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|839:    O: O1677 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-840:    O: O1679 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\841:    O: O1681 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-842:    O: O1683 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\-843:    O: O1686 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|844:    O: O1688 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/845:    O: O1690 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\846:    O: O1692 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/847:    O: O1694 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\848:    O: O1696 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|849:    O: O1698 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/850:    O: O1700 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|851:    O: O1702 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\852:    O: O1704 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|\853:    O: O1705 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|854:    O: O1707 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/855:    O: O1710 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\856:    O: O1712 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/857:    O: O1713 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-858:    O: O1716 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|859:    O: O1718 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-860:    O: O1719 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\861:    O: O1722 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-862:    O: O1724 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|863:    O: O1725 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/|864:    O: O1727 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/865:    O: O1730 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-/866:    O: O1732 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\867:    O: O1734 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|868:    O: O1736 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-869:    O: O1738 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|870:    O: O1739 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/871:    O: O1742 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|872:    O: O1744 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-873:    O: O1746 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|874:    O: O1748 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/875:    O: O1750 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-876:    O: O1752 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|877:    O: O1754 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-878:    O: O1755 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\879:    O: O1758 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|880:    O: O1760 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-881:    O: O1762 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/882:    O: O1763 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-883:    O: O1766 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|884:    O: O1768 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/885:    O: O1770 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-/886:    O: O1771 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-887:    O: O1774 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\-888:    O: O1775 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\889:    O: O1778 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|890:    O: O1779 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-891:    O: O1782 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/892:    O: O1783 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\893:    O: O1786 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/894:    O: O1788 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\895:    O: O1790 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/896:    O: O1792 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\897:    O: O1794 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/898:    O: O1796 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\899:    O: O1798 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|900:    O: O1800 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-901:    O: O1801 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/902:    O: O1804 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-903:    O: O1806 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\904:    O: O1807 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|905:    O: O1810 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/|906:    O: O1812 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-907:    O: O1814 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\908:    O: O1816 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/909:    O: O1818 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\910:    O: O1819 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|911:    O: O1821 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\912:    O: O1823 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|\913:    O: O1826 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-914:    O: O1827 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|915:    O: O1830 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/916:    O: O1832 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\917:    O: O1833 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/918:    O: O1836 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-/919:    O: O1838 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-920:    O: O1840 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\921:    O: O1842 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-922:    O: O1844 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\923:    O: O1846 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/924:    O: O1848 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\925:    O: O1849 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-926:    O: O1852 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\927:    O: O1854 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|928:    O: O1856 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/929:    O: O1857 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\930:    O: O1860 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/931:    O: O1861 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|932:    O: O1863 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-933:    O: O1866 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\934:    O: O1868 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|935:    O: O1869 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/936:    O: O1871 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\937:    O: O1873 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|938:    O: O1875 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/939:    O: O1878 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-940:    O: O1880 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\-sleeping...
/941:    O: O1882 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|942:    O: O1884 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/943:    O: O1885 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-944:    O: O1887 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\-945:    O: O1890 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\946:    O: O1892 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|947:    O: O1893 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-948:    O: O1896 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|949:    O: O1897 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/950:    O: O1899 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-/|\-/|--- Input Phase --- 
=>WM: (13313: I2 ^dir U)
=>WM: (13312: I2 ^reward 1)
=>WM: (13311: I2 ^see 1)
=>WM: (13310: N950 ^status complete)
<=WM: (13298: I2 ^dir R)
<=WM: (13297: I2 ^reward 1)
<=WM: (13296: I2 ^see 1)
=>WM: (13314: I2 ^level-1 R1-root)
<=WM: (13299: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R954 ^value 1 +)
 (R1 ^reward R954 +)
Firing propose*predict-yes
 -->
 (O1901 ^name predict-yes +)
 (S1 ^operator O1901 +)
Firing propose*predict-no
 -->
 (O1902 ^name predict-no +)
 (S1 ^operator O1902 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1900 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1899 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1900 ^name predict-no +)
 (S1 ^operator O1900 +)
Retracting propose*predict-yes
 -->
 (O1899 ^name predict-yes +)
 (S1 ^operator O1899 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R953 ^value 1 +)
 (R1 ^reward R953 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1900 = -0.02155734064455064)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1900 = 0.4476192676183378)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1899 = 0.8155729125006117)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1899 = 0.1844075378173239)
=>WM: (13321: S1 ^operator O1902 +)
=>WM: (13320: S1 ^operator O1901 +)
=>WM: (13319: I3 ^dir U)
=>WM: (13318: O1902 ^name predict-no)
=>WM: (13317: O1901 ^name predict-yes)
=>WM: (13316: R954 ^value 1)
=>WM: (13315: R1 ^reward R954)
<=WM: (13306: S1 ^operator O1899 +)
<=WM: (13308: S1 ^operator O1899)
<=WM: (13307: S1 ^operator O1900 +)
<=WM: (13305: I3 ^dir R)
<=WM: (13301: R1 ^reward R953)
<=WM: (13304: O1900 ^name predict-no)
<=WM: (13303: O1899 ^name predict-yes)
<=WM: (13302: R953 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1901 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1902 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1900 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1899 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.675409 -0.491002 0.184408 -> 0.675413 -0.491002 0.18441(R,m,v=1,0.89441,0.0950311)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324566 0.491007 0.815573 -> 0.324569 0.491006 0.815576(R,m,v=1,1,0)
=>WM: (13322: S1 ^operator O1902)

   951:    O: O1902 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N951 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N950 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13323: I3 ^predict-no N951)
<=WM: (13310: N950 ^status complete)
<=WM: (13309: I3 ^predict-yes N950)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\--- Input Phase --- 
=>WM: (13327: I2 ^dir R)
=>WM: (13326: I2 ^reward 1)
=>WM: (13325: I2 ^see 0)
=>WM: (13324: N951 ^status complete)
<=WM: (13313: I2 ^dir U)
<=WM: (13312: I2 ^reward 1)
<=WM: (13311: I2 ^see 1)
=>WM: (13328: I2 ^level-1 R1-root)
<=WM: (13314: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1901 = 0.1398795999120246)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1902 = 0.5523833737960075)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R955 ^value 1 +)
 (R1 ^reward R955 +)
Firing propose*predict-yes
 -->
 (O1903 ^name predict-yes +)
 (S1 ^operator O1903 +)
Firing propose*predict-no
 -->
 (O1904 ^name predict-no +)
 (S1 ^operator O1904 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1902 = 0.4476192676183378)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1901 = 0.1844104702696336)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1902 ^name predict-no +)
 (S1 ^operator O1902 +)
Retracting propose*predict-yes
 -->
 (O1901 ^name predict-yes +)
 (S1 ^operator O1901 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R954 ^value 1 +)
 (R1 ^reward R954 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1902 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1901 = 0.)
=>WM: (13336: S1 ^operator O1904 +)
=>WM: (13335: S1 ^operator O1903 +)
=>WM: (13334: I3 ^dir R)
=>WM: (13333: O1904 ^name predict-no)
=>WM: (13332: O1903 ^name predict-yes)
=>WM: (13331: R955 ^value 1)
=>WM: (13330: R1 ^reward R955)
=>WM: (13329: I3 ^see 0)
<=WM: (13320: S1 ^operator O1901 +)
<=WM: (13321: S1 ^operator O1902 +)
<=WM: (13322: S1 ^operator O1902)
<=WM: (13319: I3 ^dir U)
<=WM: (13315: R1 ^reward R954)
<=WM: (13300: I3 ^see 1)
<=WM: (13318: O1902 ^name predict-no)
<=WM: (13317: O1901 ^name predict-yes)
<=WM: (13316: R954 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1903 = 0.1398795999120246)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1903 = 0.1844104702696336)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1904 = 0.5523833737960075)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1904 = 0.4476192676183378)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1902 = 0.4476192676183378)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1902 = 0.5523833737960075)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1901 = 0.1844104702696336)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1901 = 0.1398795999120246)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13337: S1 ^operator O1904)

   952:    O: O1904 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N952 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N951 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13338: I3 ^predict-no N952)
<=WM: (13324: N951 ^status complete)
<=WM: (13323: I3 ^predict-no N951)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13342: I2 ^dir U)
=>WM: (13341: I2 ^reward 1)
=>WM: (13340: I2 ^see 0)
=>WM: (13339: N952 ^status complete)
<=WM: (13327: I2 ^dir R)
<=WM: (13326: I2 ^reward 1)
<=WM: (13325: I2 ^see 0)
=>WM: (13343: I2 ^level-1 R0-root)
<=WM: (13328: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R956 ^value 1 +)
 (R1 ^reward R956 +)
Firing propose*predict-yes
 -->
 (O1905 ^name predict-yes +)
 (S1 ^operator O1905 +)
Firing propose*predict-no
 -->
 (O1906 ^name predict-no +)
 (S1 ^operator O1906 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1904 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1903 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1904 ^name predict-no +)
 (S1 ^operator O1904 +)
Retracting propose*predict-yes
 -->
 (O1903 ^name predict-yes +)
 (S1 ^operator O1903 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R955 ^value 1 +)
 (R1 ^reward R955 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1904 = 0.4476192676183378)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1904 = 0.5523833737960075)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1903 = 0.1844104702696336)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1903 = 0.1398795999120246)
=>WM: (13350: S1 ^operator O1906 +)
=>WM: (13349: S1 ^operator O1905 +)
=>WM: (13348: I3 ^dir U)
=>WM: (13347: O1906 ^name predict-no)
=>WM: (13346: O1905 ^name predict-yes)
=>WM: (13345: R956 ^value 1)
=>WM: (13344: R1 ^reward R956)
<=WM: (13335: S1 ^operator O1903 +)
<=WM: (13336: S1 ^operator O1904 +)
<=WM: (13337: S1 ^operator O1904)
<=WM: (13334: I3 ^dir R)
<=WM: (13330: R1 ^reward R955)
<=WM: (13333: O1904 ^name predict-no)
<=WM: (13332: O1903 ^name predict-yes)
<=WM: (13331: R955 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1904 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1903 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622532 -0.174914 0.447619(R,m,v=1,0.925,0.069958)
RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377469 0.174914 0.552383 -> 0.377469 0.174914 0.552383(R,m,v=1,1,0)
=>WM: (13351: S1 ^operator O1906)

   953:    O: O1906 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N953 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N952 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13352: I3 ^predict-no N953)
<=WM: (13339: N952 ^status complete)
<=WM: (13338: I3 ^predict-no N952)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13356: I2 ^dir L)
=>WM: (13355: I2 ^reward 1)
=>WM: (13354: I2 ^see 0)
=>WM: (13353: N953 ^status complete)
<=WM: (13342: I2 ^dir U)
<=WM: (13341: I2 ^reward 1)
<=WM: (13340: I2 ^see 0)
=>WM: (13357: I2 ^level-1 R0-root)
<=WM: (13343: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1905 = 0.6104621686166466)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1906 = 0.1063475139796038)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R957 ^value 1 +)
 (R1 ^reward R957 +)
Firing propose*predict-yes
 -->
 (O1907 ^name predict-yes +)
 (S1 ^operator O1907 +)
Firing propose*predict-no
 -->
 (O1908 ^name predict-no +)
 (S1 ^operator O1908 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1906 = 0.3873365065796835)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1905 = 0.3895397770301633)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1906 ^name predict-no +)
 (S1 ^operator O1906 +)
Retracting propose*predict-yes
 -->
 (O1905 ^name predict-yes +)
 (S1 ^operator O1905 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R956 ^value 1 +)
 (R1 ^reward R956 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.)
=>WM: (13364: S1 ^operator O1908 +)
=>WM: (13363: S1 ^operator O1907 +)
=>WM: (13362: I3 ^dir L)
=>WM: (13361: O1908 ^name predict-no)
=>WM: (13360: O1907 ^name predict-yes)
=>WM: (13359: R957 ^value 1)
=>WM: (13358: R1 ^reward R957)
<=WM: (13349: S1 ^operator O1905 +)
<=WM: (13350: S1 ^operator O1906 +)
<=WM: (13351: S1 ^operator O1906)
<=WM: (13348: I3 ^dir U)
<=WM: (13344: R1 ^reward R956)
<=WM: (13347: O1906 ^name predict-no)
<=WM: (13346: O1905 ^name predict-yes)
<=WM: (13345: R956 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1907 = 0.6104621686166466)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1907 = 0.3895397770301633)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1908 = 0.1063475139796038)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1908 = 0.3873365065796835)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1906 = 0.3873365065796835)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1906 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1905 = 0.3895397770301633)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1905 = 0.6104621686166466)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13365: S1 ^operator O1907)

   954:    O: O1907 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N954 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N953 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13366: I3 ^predict-yes N954)
<=WM: (13353: N953 ^status complete)
<=WM: (13352: I3 ^predict-no N953)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (13370: I2 ^dir U)
=>WM: (13369: I2 ^reward 1)
=>WM: (13368: I2 ^see 1)
=>WM: (13367: N954 ^status complete)
<=WM: (13356: I2 ^dir L)
<=WM: (13355: I2 ^reward 1)
<=WM: (13354: I2 ^see 0)
=>WM: (13371: I2 ^level-1 L1-root)
<=WM: (13357: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R958 ^value 1 +)
 (R1 ^reward R958 +)
Firing propose*predict-yes
 -->
 (O1909 ^name predict-yes +)
 (S1 ^operator O1909 +)
Firing propose*predict-no
 -->
 (O1910 ^name predict-no +)
 (S1 ^operator O1910 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1908 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1907 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1908 ^name predict-no +)
 (S1 ^operator O1908 +)
Retracting propose*predict-yes
 -->
 (O1907 ^name predict-yes +)
 (S1 ^operator O1907 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R957 ^value 1 +)
 (R1 ^reward R957 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1908 = 0.3873365065796835)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1908 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1907 = 0.3895397770301633)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1907 = 0.6104621686166466)
=>WM: (13379: S1 ^operator O1910 +)
=>WM: (13378: S1 ^operator O1909 +)
=>WM: (13377: I3 ^dir U)
=>WM: (13376: O1910 ^name predict-no)
=>WM: (13375: O1909 ^name predict-yes)
=>WM: (13374: R958 ^value 1)
=>WM: (13373: R1 ^reward R958)
=>WM: (13372: I3 ^see 1)
<=WM: (13363: S1 ^operator O1907 +)
<=WM: (13365: S1 ^operator O1907)
<=WM: (13364: S1 ^operator O1908 +)
<=WM: (13362: I3 ^dir L)
<=WM: (13358: R1 ^reward R957)
<=WM: (13329: I3 ^see 0)
<=WM: (13361: O1908 ^name predict-no)
<=WM: (13360: O1907 ^name predict-yes)
<=WM: (13359: R957 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1909 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1910 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1908 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1907 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.389539(R,m,v=1,0.886792,0.101027)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322413 0.610462 -> 0.288049 0.322413 0.610462(R,m,v=1,1,0)
=>WM: (13380: S1 ^operator O1910)

   955:    O: O1910 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N955 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N954 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13381: I3 ^predict-no N955)
<=WM: (13367: N954 ^status complete)
<=WM: (13366: I3 ^predict-yes N954)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (13385: I2 ^dir U)
=>WM: (13384: I2 ^reward 1)
=>WM: (13383: I2 ^see 0)
=>WM: (13382: N955 ^status complete)
<=WM: (13370: I2 ^dir U)
<=WM: (13369: I2 ^reward 1)
<=WM: (13368: I2 ^see 1)
=>WM: (13386: I2 ^level-1 L1-root)
<=WM: (13371: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R959 ^value 1 +)
 (R1 ^reward R959 +)
Firing propose*predict-yes
 -->
 (O1911 ^name predict-yes +)
 (S1 ^operator O1911 +)
Firing propose*predict-no
 -->
 (O1912 ^name predict-no +)
 (S1 ^operator O1912 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1910 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1909 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1910 ^name predict-no +)
 (S1 ^operator O1910 +)
Retracting propose*predict-yes
 -->
 (O1909 ^name predict-yes +)
 (S1 ^operator O1909 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R958 ^value 1 +)
 (R1 ^reward R958 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1910 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1909 = 0.)
=>WM: (13393: S1 ^operator O1912 +)
=>WM: (13392: S1 ^operator O1911 +)
=>WM: (13391: O1912 ^name predict-no)
=>WM: (13390: O1911 ^name predict-yes)
=>WM: (13389: R959 ^value 1)
=>WM: (13388: R1 ^reward R959)
=>WM: (13387: I3 ^see 0)
<=WM: (13378: S1 ^operator O1909 +)
<=WM: (13379: S1 ^operator O1910 +)
<=WM: (13380: S1 ^operator O1910)
<=WM: (13373: R1 ^reward R958)
<=WM: (13372: I3 ^see 1)
<=WM: (13376: O1910 ^name predict-no)
<=WM: (13375: O1909 ^name predict-yes)
<=WM: (13374: R958 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1911 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1912 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1910 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1909 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13394: S1 ^operator O1912)

   956:    O: O1912 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N956 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N955 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13395: I3 ^predict-no N956)
<=WM: (13382: N955 ^status complete)
<=WM: (13381: I3 ^predict-no N955)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13399: I2 ^dir L)
=>WM: (13398: I2 ^reward 1)
=>WM: (13397: I2 ^see 0)
=>WM: (13396: N956 ^status complete)
<=WM: (13385: I2 ^dir U)
<=WM: (13384: I2 ^reward 1)
<=WM: (13383: I2 ^see 0)
=>WM: (13400: I2 ^level-1 L1-root)
<=WM: (13386: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1912 = 0.6126622914849755)
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1911 = -0.02274740735326741)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R960 ^value 1 +)
 (R1 ^reward R960 +)
Firing propose*predict-yes
 -->
 (O1913 ^name predict-yes +)
 (S1 ^operator O1913 +)
Firing propose*predict-no
 -->
 (O1914 ^name predict-no +)
 (S1 ^operator O1914 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 0.3873365065796835)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.3895394851831418)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1912 ^name predict-no +)
 (S1 ^operator O1912 +)
Retracting propose*predict-yes
 -->
 (O1911 ^name predict-yes +)
 (S1 ^operator O1911 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R959 ^value 1 +)
 (R1 ^reward R959 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1912 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1911 = 0.)
=>WM: (13407: S1 ^operator O1914 +)
=>WM: (13406: S1 ^operator O1913 +)
=>WM: (13405: I3 ^dir L)
=>WM: (13404: O1914 ^name predict-no)
=>WM: (13403: O1913 ^name predict-yes)
=>WM: (13402: R960 ^value 1)
=>WM: (13401: R1 ^reward R960)
<=WM: (13392: S1 ^operator O1911 +)
<=WM: (13393: S1 ^operator O1912 +)
<=WM: (13394: S1 ^operator O1912)
<=WM: (13377: I3 ^dir U)
<=WM: (13388: R1 ^reward R959)
<=WM: (13391: O1912 ^name predict-no)
<=WM: (13390: O1911 ^name predict-yes)
<=WM: (13389: R959 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1913 = -0.02274740735326741)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1913 = 0.3895394851831418)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1914 = 0.6126622914849755)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1914 = 0.3873365065796835)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 0.3873365065796835)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1912 = 0.6126622914849755)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.3895394851831418)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1911 = -0.02274740735326741)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13408: S1 ^operator O1914)

   957:    O: O1914 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N957 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N956 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13409: I3 ^predict-no N957)
<=WM: (13396: N956 ^status complete)
<=WM: (13395: I3 ^predict-no N956)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13413: I2 ^dir U)
=>WM: (13412: I2 ^reward 1)
=>WM: (13411: I2 ^see 0)
=>WM: (13410: N957 ^status complete)
<=WM: (13399: I2 ^dir L)
<=WM: (13398: I2 ^reward 1)
<=WM: (13397: I2 ^see 0)
=>WM: (13414: I2 ^level-1 L0-root)
<=WM: (13400: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R961 ^value 1 +)
 (R1 ^reward R961 +)
Firing propose*predict-yes
 -->
 (O1915 ^name predict-yes +)
 (S1 ^operator O1915 +)
Firing propose*predict-no
 -->
 (O1916 ^name predict-no +)
 (S1 ^operator O1916 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1914 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1913 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1914 ^name predict-no +)
 (S1 ^operator O1914 +)
Retracting propose*predict-yes
 -->
 (O1913 ^name predict-yes +)
 (S1 ^operator O1913 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R960 ^value 1 +)
 (R1 ^reward R960 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1914 = 0.3873365065796835)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1914 = 0.6126622914849755)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1913 = 0.3895394851831418)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1913 = -0.02274740735326741)
=>WM: (13421: S1 ^operator O1916 +)
=>WM: (13420: S1 ^operator O1915 +)
=>WM: (13419: I3 ^dir U)
=>WM: (13418: O1916 ^name predict-no)
=>WM: (13417: O1915 ^name predict-yes)
=>WM: (13416: R961 ^value 1)
=>WM: (13415: R1 ^reward R961)
<=WM: (13406: S1 ^operator O1913 +)
<=WM: (13407: S1 ^operator O1914 +)
<=WM: (13408: S1 ^operator O1914)
<=WM: (13405: I3 ^dir L)
<=WM: (13401: R1 ^reward R960)
<=WM: (13404: O1914 ^name predict-no)
<=WM: (13403: O1913 ^name predict-yes)
<=WM: (13402: R960 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1915 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1916 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1914 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1913 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.71908 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.930233,0.0652795)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280918 0.331744 0.612662 -> 0.280918 0.331744 0.612662(R,m,v=1,1,0)
=>WM: (13422: S1 ^operator O1916)

   958:    O: O1916 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N958 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N957 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13423: I3 ^predict-no N958)
<=WM: (13410: N957 ^status complete)
<=WM: (13409: I3 ^predict-no N957)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (13427: I2 ^dir R)
=>WM: (13426: I2 ^reward 1)
=>WM: (13425: I2 ^see 0)
=>WM: (13424: N958 ^status complete)
<=WM: (13413: I2 ^dir U)
<=WM: (13412: I2 ^reward 1)
<=WM: (13411: I2 ^see 0)
=>WM: (13428: I2 ^level-1 L0-root)
<=WM: (13414: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1915 = 0.8155985324859676)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1916 = -0.00558448899823713)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R962 ^value 1 +)
 (R1 ^reward R962 +)
Firing propose*predict-yes
 -->
 (O1917 ^name predict-yes +)
 (S1 ^operator O1917 +)
Firing propose*predict-no
 -->
 (O1918 ^name predict-no +)
 (S1 ^operator O1918 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1916 = 0.4476188714061859)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1915 = 0.1844104702696336)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1916 ^name predict-no +)
 (S1 ^operator O1916 +)
Retracting propose*predict-yes
 -->
 (O1915 ^name predict-yes +)
 (S1 ^operator O1915 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R961 ^value 1 +)
 (R1 ^reward R961 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1916 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1915 = 0.)
=>WM: (13435: S1 ^operator O1918 +)
=>WM: (13434: S1 ^operator O1917 +)
=>WM: (13433: I3 ^dir R)
=>WM: (13432: O1918 ^name predict-no)
=>WM: (13431: O1917 ^name predict-yes)
=>WM: (13430: R962 ^value 1)
=>WM: (13429: R1 ^reward R962)
<=WM: (13420: S1 ^operator O1915 +)
<=WM: (13421: S1 ^operator O1916 +)
<=WM: (13422: S1 ^operator O1916)
<=WM: (13419: I3 ^dir U)
<=WM: (13415: R1 ^reward R961)
<=WM: (13418: O1916 ^name predict-no)
<=WM: (13417: O1915 ^name predict-yes)
<=WM: (13416: R961 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1917 = 0.8155985324859676)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1917 = 0.1844104702696336)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1918 = -0.00558448899823713)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1918 = 0.4476188714061859)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1916 = 0.4476188714061859)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1916 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1915 = 0.1844104702696336)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1915 = 0.8155985324859676)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13436: S1 ^operator O1917)

   959:    O: O1917 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N959 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N958 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13437: I3 ^predict-yes N959)
<=WM: (13424: N958 ^status complete)
<=WM: (13423: I3 ^predict-no N958)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13441: I2 ^dir L)
=>WM: (13440: I2 ^reward 1)
=>WM: (13439: I2 ^see 1)
=>WM: (13438: N959 ^status complete)
<=WM: (13427: I2 ^dir R)
<=WM: (13426: I2 ^reward 1)
<=WM: (13425: I2 ^see 0)
=>WM: (13442: I2 ^level-1 R1-root)
<=WM: (13428: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1917 = 0.6104587229728515)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1918 = 0.2714993082286609)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R963 ^value 1 +)
 (R1 ^reward R963 +)
Firing propose*predict-yes
 -->
 (O1919 ^name predict-yes +)
 (S1 ^operator O1919 +)
Firing propose*predict-no
 -->
 (O1920 ^name predict-no +)
 (S1 ^operator O1920 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1918 = 0.3873366868699847)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1917 = 0.3895394851831418)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1918 ^name predict-no +)
 (S1 ^operator O1918 +)
Retracting propose*predict-yes
 -->
 (O1917 ^name predict-yes +)
 (S1 ^operator O1917 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R962 ^value 1 +)
 (R1 ^reward R962 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1918 = 0.4476188714061859)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1918 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1917 = 0.1844104702696336)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1917 = 0.8155985324859676)
=>WM: (13450: S1 ^operator O1920 +)
=>WM: (13449: S1 ^operator O1919 +)
=>WM: (13448: I3 ^dir L)
=>WM: (13447: O1920 ^name predict-no)
=>WM: (13446: O1919 ^name predict-yes)
=>WM: (13445: R963 ^value 1)
=>WM: (13444: R1 ^reward R963)
=>WM: (13443: I3 ^see 1)
<=WM: (13434: S1 ^operator O1917 +)
<=WM: (13436: S1 ^operator O1917)
<=WM: (13435: S1 ^operator O1918 +)
<=WM: (13433: I3 ^dir R)
<=WM: (13429: R1 ^reward R962)
<=WM: (13387: I3 ^see 0)
<=WM: (13432: O1918 ^name predict-no)
<=WM: (13431: O1917 ^name predict-yes)
<=WM: (13430: R962 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1919 = 0.3895394851831418)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1919 = 0.6104587229728515)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1920 = 0.3873366868699847)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1920 = 0.2714993082286609)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1918 = 0.3873366868699847)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1918 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1917 = 0.3895394851831418)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1917 = 0.6104587229728515)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.675413 -0.491002 0.18441 -> 0.675411 -0.491002 0.184409(R,m,v=1,0.895062,0.0945096)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324599 0.491 0.815599 -> 0.324597 0.491 0.815597(R,m,v=1,1,0)
=>WM: (13451: S1 ^operator O1919)

   960:    O: O1919 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N960 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N959 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13452: I3 ^predict-yes N960)
<=WM: (13438: N959 ^status complete)
<=WM: (13437: I3 ^predict-yes N959)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13456: I2 ^dir L)
=>WM: (13455: I2 ^reward 1)
=>WM: (13454: I2 ^see 1)
=>WM: (13453: N960 ^status complete)
<=WM: (13441: I2 ^dir L)
<=WM: (13440: I2 ^reward 1)
<=WM: (13439: I2 ^see 1)
=>WM: (13457: I2 ^level-1 L1-root)
<=WM: (13442: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1920 = 0.6126624717752767)
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1919 = -0.02274740735326741)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R964 ^value 1 +)
 (R1 ^reward R964 +)
Firing propose*predict-yes
 -->
 (O1921 ^name predict-yes +)
 (S1 ^operator O1921 +)
Firing propose*predict-no
 -->
 (O1922 ^name predict-no +)
 (S1 ^operator O1922 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1920 = 0.3873366868699847)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1919 = 0.3895394851831418)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1920 ^name predict-no +)
 (S1 ^operator O1920 +)
Retracting propose*predict-yes
 -->
 (O1919 ^name predict-yes +)
 (S1 ^operator O1919 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R963 ^value 1 +)
 (R1 ^reward R963 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1920 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1920 = 0.3873366868699847)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1919 = 0.6104587229728515)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1919 = 0.3895394851831418)
=>WM: (13463: S1 ^operator O1922 +)
=>WM: (13462: S1 ^operator O1921 +)
=>WM: (13461: O1922 ^name predict-no)
=>WM: (13460: O1921 ^name predict-yes)
=>WM: (13459: R964 ^value 1)
=>WM: (13458: R1 ^reward R964)
<=WM: (13449: S1 ^operator O1919 +)
<=WM: (13451: S1 ^operator O1919)
<=WM: (13450: S1 ^operator O1920 +)
<=WM: (13444: R1 ^reward R963)
<=WM: (13447: O1920 ^name predict-no)
<=WM: (13446: O1919 ^name predict-yes)
<=WM: (13445: R963 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1921 = 0.3895394851831418)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1921 = -0.02274740735326741)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1922 = 0.3873366868699847)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1922 = 0.6126624717752767)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1920 = 0.3873366868699847)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1920 = 0.6126624717752767)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1919 = 0.3895394851831418)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1919 = -0.02274740735326741)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.389539 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.8875,0.100472)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.32241 0.610459 -> 0.288049 0.32241 0.610459(R,m,v=1,1,0)
=>WM: (13464: S1 ^operator O1922)

   961:    O: O1922 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N961 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N960 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13465: I3 ^predict-no N961)
<=WM: (13453: N960 ^status complete)
<=WM: (13452: I3 ^predict-yes N960)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
---- Input Phase --- 
=>WM: (13469: I2 ^dir U)
=>WM: (13468: I2 ^reward 1)
=>WM: (13467: I2 ^see 0)
=>WM: (13466: N961 ^status complete)
<=WM: (13456: I2 ^dir L)
<=WM: (13455: I2 ^reward 1)
<=WM: (13454: I2 ^see 1)
=>WM: (13470: I2 ^level-1 L0-root)
<=WM: (13457: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R965 ^value 1 +)
 (R1 ^reward R965 +)
Firing propose*predict-yes
 -->
 (O1923 ^name predict-yes +)
 (S1 ^operator O1923 +)
Firing propose*predict-no
 -->
 (O1924 ^name predict-no +)
 (S1 ^operator O1924 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1922 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1921 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1922 ^name predict-no +)
 (S1 ^operator O1922 +)
Retracting propose*predict-yes
 -->
 (O1921 ^name predict-yes +)
 (S1 ^operator O1921 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R964 ^value 1 +)
 (R1 ^reward R964 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1922 = 0.6126624717752767)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1922 = 0.3873366868699847)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1921 = -0.02274740735326741)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1921 = 0.3895397539597428)
=>WM: (13478: S1 ^operator O1924 +)
=>WM: (13477: S1 ^operator O1923 +)
=>WM: (13476: I3 ^dir U)
=>WM: (13475: O1924 ^name predict-no)
=>WM: (13474: O1923 ^name predict-yes)
=>WM: (13473: R965 ^value 1)
=>WM: (13472: R1 ^reward R965)
=>WM: (13471: I3 ^see 0)
<=WM: (13462: S1 ^operator O1921 +)
<=WM: (13463: S1 ^operator O1922 +)
<=WM: (13464: S1 ^operator O1922)
<=WM: (13448: I3 ^dir L)
<=WM: (13458: R1 ^reward R964)
<=WM: (13443: I3 ^see 1)
<=WM: (13461: O1922 ^name predict-no)
<=WM: (13460: O1921 ^name predict-yes)
<=WM: (13459: R964 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1923 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1924 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1922 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1921 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.930636,0.0649281)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280918 0.331744 0.612662 -> 0.280918 0.331744 0.612663(R,m,v=1,1,0)
=>WM: (13479: S1 ^operator O1924)

   962:    O: O1924 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N962 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N961 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13480: I3 ^predict-no N962)
<=WM: (13466: N961 ^status complete)
<=WM: (13465: I3 ^predict-no N961)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\---- Input Phase --- 
=>WM: (13484: I2 ^dir R)
=>WM: (13483: I2 ^reward 1)
=>WM: (13482: I2 ^see 0)
=>WM: (13481: N962 ^status complete)
<=WM: (13469: I2 ^dir U)
<=WM: (13468: I2 ^reward 1)
<=WM: (13467: I2 ^see 0)
=>WM: (13485: I2 ^level-1 L0-root)
<=WM: (13470: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1923 = 0.8155971820726273)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1924 = -0.00558448899823713)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R966 ^value 1 +)
 (R1 ^reward R966 +)
Firing propose*predict-yes
 -->
 (O1925 ^name predict-yes +)
 (S1 ^operator O1925 +)
Firing propose*predict-no
 -->
 (O1926 ^name predict-no +)
 (S1 ^operator O1926 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 0.4476188714061859)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.1844091198562935)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1924 ^name predict-no +)
 (S1 ^operator O1924 +)
Retracting propose*predict-yes
 -->
 (O1923 ^name predict-yes +)
 (S1 ^operator O1923 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R965 ^value 1 +)
 (R1 ^reward R965 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1924 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1923 = 0.)
=>WM: (13492: S1 ^operator O1926 +)
=>WM: (13491: S1 ^operator O1925 +)
=>WM: (13490: I3 ^dir R)
=>WM: (13489: O1926 ^name predict-no)
=>WM: (13488: O1925 ^name predict-yes)
=>WM: (13487: R966 ^value 1)
=>WM: (13486: R1 ^reward R966)
<=WM: (13477: S1 ^operator O1923 +)
<=WM: (13478: S1 ^operator O1924 +)
<=WM: (13479: S1 ^operator O1924)
<=WM: (13476: I3 ^dir U)
<=WM: (13472: R1 ^reward R965)
<=WM: (13475: O1924 ^name predict-no)
<=WM: (13474: O1923 ^name predict-yes)
<=WM: (13473: R965 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1925 = 0.8155971820726273)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1925 = 0.1844091198562935)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1926 = -0.00558448899823713)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1926 = 0.4476188714061859)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 0.4476188714061859)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1924 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.1844091198562935)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1923 = 0.8155971820726273)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13493: S1 ^operator O1925)

   963:    O: O1925 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N963 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N962 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13494: I3 ^predict-yes N963)
<=WM: (13481: N962 ^status complete)
<=WM: (13480: I3 ^predict-no N962)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13498: I2 ^dir R)
=>WM: (13497: I2 ^reward 1)
=>WM: (13496: I2 ^see 1)
=>WM: (13495: N963 ^status complete)
<=WM: (13484: I2 ^dir R)
<=WM: (13483: I2 ^reward 1)
<=WM: (13482: I2 ^see 0)
=>WM: (13499: I2 ^level-1 R1-root)
<=WM: (13485: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1925 = 0.1398795999120246)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1926 = 0.5523829775838558)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R967 ^value 1 +)
 (R1 ^reward R967 +)
Firing propose*predict-yes
 -->
 (O1927 ^name predict-yes +)
 (S1 ^operator O1927 +)
Firing propose*predict-no
 -->
 (O1928 ^name predict-no +)
 (S1 ^operator O1928 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1926 = 0.4476188714061859)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1925 = 0.1844091198562935)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1926 ^name predict-no +)
 (S1 ^operator O1926 +)
Retracting propose*predict-yes
 -->
 (O1925 ^name predict-yes +)
 (S1 ^operator O1925 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R966 ^value 1 +)
 (R1 ^reward R966 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1926 = 0.4476188714061859)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1926 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1925 = 0.1844091198562935)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1925 = 0.8155971820726273)
=>WM: (13506: S1 ^operator O1928 +)
=>WM: (13505: S1 ^operator O1927 +)
=>WM: (13504: O1928 ^name predict-no)
=>WM: (13503: O1927 ^name predict-yes)
=>WM: (13502: R967 ^value 1)
=>WM: (13501: R1 ^reward R967)
=>WM: (13500: I3 ^see 1)
<=WM: (13491: S1 ^operator O1925 +)
<=WM: (13493: S1 ^operator O1925)
<=WM: (13492: S1 ^operator O1926 +)
<=WM: (13486: R1 ^reward R966)
<=WM: (13471: I3 ^see 0)
<=WM: (13489: O1926 ^name predict-no)
<=WM: (13488: O1925 ^name predict-yes)
<=WM: (13487: R966 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1927 = 0.1844091198562935)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1927 = 0.1398795999120246)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1928 = 0.4476188714061859)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1928 = 0.5523829775838558)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1926 = 0.4476188714061859)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1926 = 0.5523829775838558)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1925 = 0.1844091198562935)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1925 = 0.1398795999120246)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.675411 -0.491002 0.184409 -> 0.67541 -0.491002 0.184408(R,m,v=1,0.895706,0.0939938)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324597 0.491 0.815597 -> 0.324596 0.491 0.815596(R,m,v=1,1,0)
=>WM: (13507: S1 ^operator O1928)

   964:    O: O1928 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N964 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N963 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13508: I3 ^predict-no N964)
<=WM: (13495: N963 ^status complete)
<=WM: (13494: I3 ^predict-yes N963)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13512: I2 ^dir L)
=>WM: (13511: I2 ^reward 1)
=>WM: (13510: I2 ^see 0)
=>WM: (13509: N964 ^status complete)
<=WM: (13498: I2 ^dir R)
<=WM: (13497: I2 ^reward 1)
<=WM: (13496: I2 ^see 1)
=>WM: (13513: I2 ^level-1 R0-root)
<=WM: (13499: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1927 = 0.6104618767696252)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1928 = 0.1063475139796038)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R968 ^value 1 +)
 (R1 ^reward R968 +)
Firing propose*predict-yes
 -->
 (O1929 ^name predict-yes +)
 (S1 ^operator O1929 +)
Firing propose*predict-no
 -->
 (O1930 ^name predict-no +)
 (S1 ^operator O1930 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1928 = 0.3873368130731955)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1927 = 0.3895397539597428)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1928 ^name predict-no +)
 (S1 ^operator O1928 +)
Retracting propose*predict-yes
 -->
 (O1927 ^name predict-yes +)
 (S1 ^operator O1927 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R967 ^value 1 +)
 (R1 ^reward R967 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1928 = 0.5523829775838558)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1928 = 0.4476188714061859)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1927 = 0.1398795999120246)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1927 = 0.1844081745669553)
=>WM: (13521: S1 ^operator O1930 +)
=>WM: (13520: S1 ^operator O1929 +)
=>WM: (13519: I3 ^dir L)
=>WM: (13518: O1930 ^name predict-no)
=>WM: (13517: O1929 ^name predict-yes)
=>WM: (13516: R968 ^value 1)
=>WM: (13515: R1 ^reward R968)
=>WM: (13514: I3 ^see 0)
<=WM: (13505: S1 ^operator O1927 +)
<=WM: (13506: S1 ^operator O1928 +)
<=WM: (13507: S1 ^operator O1928)
<=WM: (13490: I3 ^dir R)
<=WM: (13501: R1 ^reward R967)
<=WM: (13500: I3 ^see 1)
<=WM: (13504: O1928 ^name predict-no)
<=WM: (13503: O1927 ^name predict-yes)
<=WM: (13502: R967 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1929 = 0.3895397539597428)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1929 = 0.6104618767696252)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1930 = 0.3873368130731955)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1930 = 0.1063475139796038)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1928 = 0.3873368130731955)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1928 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1927 = 0.3895397539597428)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1927 = 0.6104618767696252)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.622532 -0.174914 0.447619 -> 0.622532 -0.174914 0.447619(R,m,v=1,0.92562,0.0694215)
RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377469 0.174914 0.552383 -> 0.377469 0.174914 0.552383(R,m,v=1,1,0)
=>WM: (13522: S1 ^operator O1929)

   965:    O: O1929 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N965 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N964 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13523: I3 ^predict-yes N965)
<=WM: (13509: N964 ^status complete)
<=WM: (13508: I3 ^predict-no N964)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (13527: I2 ^dir L)
=>WM: (13526: I2 ^reward 1)
=>WM: (13525: I2 ^see 1)
=>WM: (13524: N965 ^status complete)
<=WM: (13512: I2 ^dir L)
<=WM: (13511: I2 ^reward 1)
<=WM: (13510: I2 ^see 0)
=>WM: (13528: I2 ^level-1 L1-root)
<=WM: (13513: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1930 = 0.6126625979784875)
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1929 = -0.02274740735326741)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R969 ^value 1 +)
 (R1 ^reward R969 +)
Firing propose*predict-yes
 -->
 (O1931 ^name predict-yes +)
 (S1 ^operator O1931 +)
Firing propose*predict-no
 -->
 (O1932 ^name predict-no +)
 (S1 ^operator O1932 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1930 = 0.3873368130731955)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1929 = 0.3895397539597428)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1930 ^name predict-no +)
 (S1 ^operator O1930 +)
Retracting propose*predict-yes
 -->
 (O1929 ^name predict-yes +)
 (S1 ^operator O1929 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R968 ^value 1 +)
 (R1 ^reward R968 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1930 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1930 = 0.3873368130731955)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1929 = 0.6104618767696252)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1929 = 0.3895397539597428)
=>WM: (13535: S1 ^operator O1932 +)
=>WM: (13534: S1 ^operator O1931 +)
=>WM: (13533: O1932 ^name predict-no)
=>WM: (13532: O1931 ^name predict-yes)
=>WM: (13531: R969 ^value 1)
=>WM: (13530: R1 ^reward R969)
=>WM: (13529: I3 ^see 1)
<=WM: (13520: S1 ^operator O1929 +)
<=WM: (13522: S1 ^operator O1929)
<=WM: (13521: S1 ^operator O1930 +)
<=WM: (13515: R1 ^reward R968)
<=WM: (13514: I3 ^see 0)
<=WM: (13518: O1930 ^name predict-no)
<=WM: (13517: O1929 ^name predict-yes)
<=WM: (13516: R968 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1931 = 0.3895397539597428)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1931 = -0.02274740735326741)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1932 = 0.3873368130731955)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1932 = 0.6126625979784875)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1930 = 0.3873368130731955)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1930 = 0.6126625979784875)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1929 = 0.3895397539597428)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1929 = -0.02274740735326741)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.888199,0.0999224)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322413 0.610462 -> 0.288049 0.322413 0.610462(R,m,v=1,1,0)
=>WM: (13536: S1 ^operator O1932)

   966:    O: O1932 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N966 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N965 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13537: I3 ^predict-no N966)
<=WM: (13524: N965 ^status complete)
<=WM: (13523: I3 ^predict-yes N965)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13541: I2 ^dir R)
=>WM: (13540: I2 ^reward 1)
=>WM: (13539: I2 ^see 0)
=>WM: (13538: N966 ^status complete)
<=WM: (13527: I2 ^dir L)
<=WM: (13526: I2 ^reward 1)
<=WM: (13525: I2 ^see 1)
=>WM: (13542: I2 ^level-1 L0-root)
<=WM: (13528: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1931 = 0.8155962367832892)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1932 = -0.00558448899823713)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R970 ^value 1 +)
 (R1 ^reward R970 +)
Firing propose*predict-yes
 -->
 (O1933 ^name predict-yes +)
 (S1 ^operator O1933 +)
Firing propose*predict-no
 -->
 (O1934 ^name predict-no +)
 (S1 ^operator O1934 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1932 = 0.4476185940576797)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1931 = 0.1844081745669553)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1932 ^name predict-no +)
 (S1 ^operator O1932 +)
Retracting propose*predict-yes
 -->
 (O1931 ^name predict-yes +)
 (S1 ^operator O1931 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R969 ^value 1 +)
 (R1 ^reward R969 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1932 = 0.6126625979784875)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1932 = 0.3873368130731955)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1931 = -0.02274740735326741)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1931 = 0.3895395093503376)
=>WM: (13550: S1 ^operator O1934 +)
=>WM: (13549: S1 ^operator O1933 +)
=>WM: (13548: I3 ^dir R)
=>WM: (13547: O1934 ^name predict-no)
=>WM: (13546: O1933 ^name predict-yes)
=>WM: (13545: R970 ^value 1)
=>WM: (13544: R1 ^reward R970)
=>WM: (13543: I3 ^see 0)
<=WM: (13534: S1 ^operator O1931 +)
<=WM: (13535: S1 ^operator O1932 +)
<=WM: (13536: S1 ^operator O1932)
<=WM: (13519: I3 ^dir L)
<=WM: (13530: R1 ^reward R969)
<=WM: (13529: I3 ^see 1)
<=WM: (13533: O1932 ^name predict-no)
<=WM: (13532: O1931 ^name predict-yes)
<=WM: (13531: R969 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1933 = 0.1844081745669553)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1933 = 0.8155962367832892)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1934 = 0.4476185940576797)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1934 = -0.00558448899823713)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1932 = 0.4476185940576797)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1932 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1931 = 0.1844081745669553)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1931 = 0.8155962367832892)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.931034,0.0645804)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280918 0.331744 0.612663 -> 0.280919 0.331744 0.612663(R,m,v=1,1,0)
=>WM: (13551: S1 ^operator O1933)

   967:    O: O1933 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N967 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N966 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13552: I3 ^predict-yes N967)
<=WM: (13538: N966 ^status complete)
<=WM: (13537: I3 ^predict-no N966)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13556: I2 ^dir R)
=>WM: (13555: I2 ^reward 1)
=>WM: (13554: I2 ^see 1)
=>WM: (13553: N967 ^status complete)
<=WM: (13541: I2 ^dir R)
<=WM: (13540: I2 ^reward 1)
<=WM: (13539: I2 ^see 0)
=>WM: (13557: I2 ^level-1 R1-root)
<=WM: (13542: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1933 = 0.1398795999120246)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1934 = 0.5523827002353495)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R971 ^value 1 +)
 (R1 ^reward R971 +)
Firing propose*predict-yes
 -->
 (O1935 ^name predict-yes +)
 (S1 ^operator O1935 +)
Firing propose*predict-no
 -->
 (O1936 ^name predict-no +)
 (S1 ^operator O1936 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1934 = 0.4476185940576797)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1933 = 0.1844081745669553)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1934 ^name predict-no +)
 (S1 ^operator O1934 +)
Retracting propose*predict-yes
 -->
 (O1933 ^name predict-yes +)
 (S1 ^operator O1933 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R970 ^value 1 +)
 (R1 ^reward R970 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1934 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1934 = 0.4476185940576797)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1933 = 0.8155962367832892)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1933 = 0.1844081745669553)
=>WM: (13564: S1 ^operator O1936 +)
=>WM: (13563: S1 ^operator O1935 +)
=>WM: (13562: O1936 ^name predict-no)
=>WM: (13561: O1935 ^name predict-yes)
=>WM: (13560: R971 ^value 1)
=>WM: (13559: R1 ^reward R971)
=>WM: (13558: I3 ^see 1)
<=WM: (13549: S1 ^operator O1933 +)
<=WM: (13551: S1 ^operator O1933)
<=WM: (13550: S1 ^operator O1934 +)
<=WM: (13544: R1 ^reward R970)
<=WM: (13543: I3 ^see 0)
<=WM: (13547: O1934 ^name predict-no)
<=WM: (13546: O1933 ^name predict-yes)
<=WM: (13545: R970 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1935 = 0.1844081745669553)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1935 = 0.1398795999120246)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1936 = 0.4476185940576797)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1936 = 0.5523827002353495)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1934 = 0.4476185940576797)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1934 = 0.5523827002353495)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1933 = 0.1844081745669553)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1933 = 0.1398795999120246)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.67541 -0.491002 0.184408 -> 0.675409 -0.491002 0.184408(R,m,v=1,0.896341,0.0934835)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324596 0.491 0.815596 -> 0.324595 0.491001 0.815596(R,m,v=1,1,0)
=>WM: (13565: S1 ^operator O1936)

   968:    O: O1936 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N968 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N967 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13566: I3 ^predict-no N968)
<=WM: (13553: N967 ^status complete)
<=WM: (13552: I3 ^predict-yes N967)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13570: I2 ^dir U)
=>WM: (13569: I2 ^reward 1)
=>WM: (13568: I2 ^see 0)
=>WM: (13567: N968 ^status complete)
<=WM: (13556: I2 ^dir R)
<=WM: (13555: I2 ^reward 1)
<=WM: (13554: I2 ^see 1)
=>WM: (13571: I2 ^level-1 R0-root)
<=WM: (13557: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R972 ^value 1 +)
 (R1 ^reward R972 +)
Firing propose*predict-yes
 -->
 (O1937 ^name predict-yes +)
 (S1 ^operator O1937 +)
Firing propose*predict-no
 -->
 (O1938 ^name predict-no +)
 (S1 ^operator O1938 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1936 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1935 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1936 ^name predict-no +)
 (S1 ^operator O1936 +)
Retracting propose*predict-yes
 -->
 (O1935 ^name predict-yes +)
 (S1 ^operator O1935 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R971 ^value 1 +)
 (R1 ^reward R971 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1936 = 0.5523827002353495)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1936 = 0.4476185940576797)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1935 = 0.1398795999120246)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1935 = 0.1844075128644186)
=>WM: (13579: S1 ^operator O1938 +)
=>WM: (13578: S1 ^operator O1937 +)
=>WM: (13577: I3 ^dir U)
=>WM: (13576: O1938 ^name predict-no)
=>WM: (13575: O1937 ^name predict-yes)
=>WM: (13574: R972 ^value 1)
=>WM: (13573: R1 ^reward R972)
=>WM: (13572: I3 ^see 0)
<=WM: (13563: S1 ^operator O1935 +)
<=WM: (13564: S1 ^operator O1936 +)
<=WM: (13565: S1 ^operator O1936)
<=WM: (13548: I3 ^dir R)
<=WM: (13559: R1 ^reward R971)
<=WM: (13558: I3 ^see 1)
<=WM: (13562: O1936 ^name predict-no)
<=WM: (13561: O1935 ^name predict-yes)
<=WM: (13560: R971 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1937 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1938 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1936 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1935 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.622532 -0.174914 0.447619 -> 0.622532 -0.174914 0.447618(R,m,v=1,0.92623,0.0688931)
RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377469 0.174914 0.552383 -> 0.377469 0.174914 0.552383(R,m,v=1,1,0)
=>WM: (13580: S1 ^operator O1938)

   969:    O: O1938 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N969 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N968 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13581: I3 ^predict-no N969)
<=WM: (13567: N968 ^status complete)
<=WM: (13566: I3 ^predict-no N968)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13585: I2 ^dir R)
=>WM: (13584: I2 ^reward 1)
=>WM: (13583: I2 ^see 0)
=>WM: (13582: N969 ^status complete)
<=WM: (13570: I2 ^dir U)
<=WM: (13569: I2 ^reward 1)
<=WM: (13568: I2 ^see 0)
=>WM: (13586: I2 ^level-1 R0-root)
<=WM: (13571: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1937 = 0.1664311307472832)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1938 = 0.5523777234651187)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R973 ^value 1 +)
 (R1 ^reward R973 +)
Firing propose*predict-yes
 -->
 (O1939 ^name predict-yes +)
 (S1 ^operator O1939 +)
Firing propose*predict-no
 -->
 (O1940 ^name predict-no +)
 (S1 ^operator O1940 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1938 = 0.4476183999137253)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1937 = 0.1844075128644186)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1938 ^name predict-no +)
 (S1 ^operator O1938 +)
Retracting propose*predict-yes
 -->
 (O1937 ^name predict-yes +)
 (S1 ^operator O1937 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R972 ^value 1 +)
 (R1 ^reward R972 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1938 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1937 = 0.)
=>WM: (13593: S1 ^operator O1940 +)
=>WM: (13592: S1 ^operator O1939 +)
=>WM: (13591: I3 ^dir R)
=>WM: (13590: O1940 ^name predict-no)
=>WM: (13589: O1939 ^name predict-yes)
=>WM: (13588: R973 ^value 1)
=>WM: (13587: R1 ^reward R973)
<=WM: (13578: S1 ^operator O1937 +)
<=WM: (13579: S1 ^operator O1938 +)
<=WM: (13580: S1 ^operator O1938)
<=WM: (13577: I3 ^dir U)
<=WM: (13573: R1 ^reward R972)
<=WM: (13576: O1938 ^name predict-no)
<=WM: (13575: O1937 ^name predict-yes)
<=WM: (13574: R972 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1939 = 0.1664311307472832)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1939 = 0.1844075128644186)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1940 = 0.5523777234651187)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1940 = 0.4476183999137253)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1938 = 0.4476183999137253)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1938 = 0.5523777234651187)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1937 = 0.1844075128644186)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1937 = 0.1664311307472832)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13594: S1 ^operator O1940)

   970:    O: O1940 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N970 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N969 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13595: I3 ^predict-no N970)
<=WM: (13582: N969 ^status complete)
<=WM: (13581: I3 ^predict-no N969)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13599: I2 ^dir L)
=>WM: (13598: I2 ^reward 1)
=>WM: (13597: I2 ^see 0)
=>WM: (13596: N970 ^status complete)
<=WM: (13585: I2 ^dir R)
<=WM: (13584: I2 ^reward 1)
<=WM: (13583: I2 ^see 0)
=>WM: (13600: I2 ^level-1 R0-root)
<=WM: (13586: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1939 = 0.61046163216022)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1940 = 0.1063475139796038)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R974 ^value 1 +)
 (R1 ^reward R974 +)
Firing propose*predict-yes
 -->
 (O1941 ^name predict-yes +)
 (S1 ^operator O1941 +)
Firing propose*predict-no
 -->
 (O1942 ^name predict-no +)
 (S1 ^operator O1942 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1940 = 0.387336901415443)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1939 = 0.3895395093503376)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1940 ^name predict-no +)
 (S1 ^operator O1940 +)
Retracting propose*predict-yes
 -->
 (O1939 ^name predict-yes +)
 (S1 ^operator O1939 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R973 ^value 1 +)
 (R1 ^reward R973 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1940 = 0.4476183999137253)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1940 = 0.5523777234651187)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1939 = 0.1844075128644186)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1939 = 0.1664311307472832)
=>WM: (13607: S1 ^operator O1942 +)
=>WM: (13606: S1 ^operator O1941 +)
=>WM: (13605: I3 ^dir L)
=>WM: (13604: O1942 ^name predict-no)
=>WM: (13603: O1941 ^name predict-yes)
=>WM: (13602: R974 ^value 1)
=>WM: (13601: R1 ^reward R974)
<=WM: (13592: S1 ^operator O1939 +)
<=WM: (13593: S1 ^operator O1940 +)
<=WM: (13594: S1 ^operator O1940)
<=WM: (13591: I3 ^dir R)
<=WM: (13587: R1 ^reward R973)
<=WM: (13590: O1940 ^name predict-no)
<=WM: (13589: O1939 ^name predict-yes)
<=WM: (13588: R973 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1941 = 0.61046163216022)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1941 = 0.3895395093503376)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1942 = 0.1063475139796038)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1942 = 0.387336901415443)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1940 = 0.387336901415443)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1940 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1939 = 0.3895395093503376)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1939 = 0.61046163216022)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.622532 -0.174914 0.447618 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.926829,0.0683727)
RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377465 0.174913 0.552378 -> 0.377465 0.174913 0.552378(R,m,v=1,1,0)
=>WM: (13608: S1 ^operator O1941)

   971:    O: O1941 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N971 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N970 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13609: I3 ^predict-yes N971)
<=WM: (13596: N970 ^status complete)
<=WM: (13595: I3 ^predict-no N970)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
---- Input Phase --- 
=>WM: (13613: I2 ^dir U)
=>WM: (13612: I2 ^reward 1)
=>WM: (13611: I2 ^see 1)
=>WM: (13610: N971 ^status complete)
<=WM: (13599: I2 ^dir L)
<=WM: (13598: I2 ^reward 1)
<=WM: (13597: I2 ^see 0)
=>WM: (13614: I2 ^level-1 L1-root)
<=WM: (13600: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R975 ^value 1 +)
 (R1 ^reward R975 +)
Firing propose*predict-yes
 -->
 (O1943 ^name predict-yes +)
 (S1 ^operator O1943 +)
Firing propose*predict-no
 -->
 (O1944 ^name predict-no +)
 (S1 ^operator O1944 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1942 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1941 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1942 ^name predict-no +)
 (S1 ^operator O1942 +)
Retracting propose*predict-yes
 -->
 (O1941 ^name predict-yes +)
 (S1 ^operator O1941 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R974 ^value 1 +)
 (R1 ^reward R974 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1942 = 0.387336901415443)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1942 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1941 = 0.3895395093503376)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1941 = 0.61046163216022)
=>WM: (13622: S1 ^operator O1944 +)
=>WM: (13621: S1 ^operator O1943 +)
=>WM: (13620: I3 ^dir U)
=>WM: (13619: O1944 ^name predict-no)
=>WM: (13618: O1943 ^name predict-yes)
=>WM: (13617: R975 ^value 1)
=>WM: (13616: R1 ^reward R975)
=>WM: (13615: I3 ^see 1)
<=WM: (13606: S1 ^operator O1941 +)
<=WM: (13608: S1 ^operator O1941)
<=WM: (13607: S1 ^operator O1942 +)
<=WM: (13605: I3 ^dir L)
<=WM: (13601: R1 ^reward R974)
<=WM: (13572: I3 ^see 0)
<=WM: (13604: O1942 ^name predict-no)
<=WM: (13603: O1941 ^name predict-yes)
<=WM: (13602: R974 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1943 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1944 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1942 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1941 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.389539(R,m,v=1,0.888889,0.0993789)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322413 0.610462 -> 0.288049 0.322413 0.610461(R,m,v=1,1,0)
=>WM: (13623: S1 ^operator O1944)

   972:    O: O1944 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N972 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N971 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13624: I3 ^predict-no N972)
<=WM: (13610: N971 ^status complete)
<=WM: (13609: I3 ^predict-yes N971)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13628: I2 ^dir U)
=>WM: (13627: I2 ^reward 1)
=>WM: (13626: I2 ^see 0)
=>WM: (13625: N972 ^status complete)
<=WM: (13613: I2 ^dir U)
<=WM: (13612: I2 ^reward 1)
<=WM: (13611: I2 ^see 1)
=>WM: (13629: I2 ^level-1 L1-root)
<=WM: (13614: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R976 ^value 1 +)
 (R1 ^reward R976 +)
Firing propose*predict-yes
 -->
 (O1945 ^name predict-yes +)
 (S1 ^operator O1945 +)
Firing propose*predict-no
 -->
 (O1946 ^name predict-no +)
 (S1 ^operator O1946 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1944 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1943 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1944 ^name predict-no +)
 (S1 ^operator O1944 +)
Retracting propose*predict-yes
 -->
 (O1943 ^name predict-yes +)
 (S1 ^operator O1943 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R975 ^value 1 +)
 (R1 ^reward R975 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1944 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1943 = 0.)
=>WM: (13636: S1 ^operator O1946 +)
=>WM: (13635: S1 ^operator O1945 +)
=>WM: (13634: O1946 ^name predict-no)
=>WM: (13633: O1945 ^name predict-yes)
=>WM: (13632: R976 ^value 1)
=>WM: (13631: R1 ^reward R976)
=>WM: (13630: I3 ^see 0)
<=WM: (13621: S1 ^operator O1943 +)
<=WM: (13622: S1 ^operator O1944 +)
<=WM: (13623: S1 ^operator O1944)
<=WM: (13616: R1 ^reward R975)
<=WM: (13615: I3 ^see 1)
<=WM: (13619: O1944 ^name predict-no)
<=WM: (13618: O1943 ^name predict-yes)
<=WM: (13617: R975 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1945 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1946 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1944 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1943 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13637: S1 ^operator O1946)

   973:    O: O1946 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N973 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N972 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13638: I3 ^predict-no N973)
<=WM: (13625: N972 ^status complete)
<=WM: (13624: I3 ^predict-no N972)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (13642: I2 ^dir U)
=>WM: (13641: I2 ^reward 1)
=>WM: (13640: I2 ^see 0)
=>WM: (13639: N973 ^status complete)
<=WM: (13628: I2 ^dir U)
<=WM: (13627: I2 ^reward 1)
<=WM: (13626: I2 ^see 0)
=>WM: (13643: I2 ^level-1 L1-root)
<=WM: (13629: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R977 ^value 1 +)
 (R1 ^reward R977 +)
Firing propose*predict-yes
 -->
 (O1947 ^name predict-yes +)
 (S1 ^operator O1947 +)
Firing propose*predict-no
 -->
 (O1948 ^name predict-no +)
 (S1 ^operator O1948 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1946 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1945 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1946 ^name predict-no +)
 (S1 ^operator O1946 +)
Retracting propose*predict-yes
 -->
 (O1945 ^name predict-yes +)
 (S1 ^operator O1945 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R976 ^value 1 +)
 (R1 ^reward R976 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1946 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1945 = 0.)
=>WM: (13649: S1 ^operator O1948 +)
=>WM: (13648: S1 ^operator O1947 +)
=>WM: (13647: O1948 ^name predict-no)
=>WM: (13646: O1947 ^name predict-yes)
=>WM: (13645: R977 ^value 1)
=>WM: (13644: R1 ^reward R977)
<=WM: (13635: S1 ^operator O1945 +)
<=WM: (13636: S1 ^operator O1946 +)
<=WM: (13637: S1 ^operator O1946)
<=WM: (13631: R1 ^reward R976)
<=WM: (13634: O1946 ^name predict-no)
<=WM: (13633: O1945 ^name predict-yes)
<=WM: (13632: R976 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1947 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1948 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1946 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1945 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13650: S1 ^operator O1948)

   974:    O: O1948 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N974 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N973 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13651: I3 ^predict-no N974)
<=WM: (13639: N973 ^status complete)
<=WM: (13638: I3 ^predict-no N973)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13655: I2 ^dir U)
=>WM: (13654: I2 ^reward 1)
=>WM: (13653: I2 ^see 0)
=>WM: (13652: N974 ^status complete)
<=WM: (13642: I2 ^dir U)
<=WM: (13641: I2 ^reward 1)
<=WM: (13640: I2 ^see 0)
=>WM: (13656: I2 ^level-1 L1-root)
<=WM: (13643: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R978 ^value 1 +)
 (R1 ^reward R978 +)
Firing propose*predict-yes
 -->
 (O1949 ^name predict-yes +)
 (S1 ^operator O1949 +)
Firing propose*predict-no
 -->
 (O1950 ^name predict-no +)
 (S1 ^operator O1950 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1948 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1947 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1948 ^name predict-no +)
 (S1 ^operator O1948 +)
Retracting propose*predict-yes
 -->
 (O1947 ^name predict-yes +)
 (S1 ^operator O1947 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R977 ^value 1 +)
 (R1 ^reward R977 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1948 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1947 = 0.)
=>WM: (13662: S1 ^operator O1950 +)
=>WM: (13661: S1 ^operator O1949 +)
=>WM: (13660: O1950 ^name predict-no)
=>WM: (13659: O1949 ^name predict-yes)
=>WM: (13658: R978 ^value 1)
=>WM: (13657: R1 ^reward R978)
<=WM: (13648: S1 ^operator O1947 +)
<=WM: (13649: S1 ^operator O1948 +)
<=WM: (13650: S1 ^operator O1948)
<=WM: (13644: R1 ^reward R977)
<=WM: (13647: O1948 ^name predict-no)
<=WM: (13646: O1947 ^name predict-yes)
<=WM: (13645: R977 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1949 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1950 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1948 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1947 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13663: S1 ^operator O1950)

   975:    O: O1950 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N975 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N974 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13664: I3 ^predict-no N975)
<=WM: (13652: N974 ^status complete)
<=WM: (13651: I3 ^predict-no N974)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13668: I2 ^dir R)
=>WM: (13667: I2 ^reward 1)
=>WM: (13666: I2 ^see 0)
=>WM: (13665: N975 ^status complete)
<=WM: (13655: I2 ^dir U)
<=WM: (13654: I2 ^reward 1)
<=WM: (13653: I2 ^see 0)
=>WM: (13669: I2 ^level-1 L1-root)
<=WM: (13656: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1950 = -0.02155734064455064)
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1949 = 0.8155758449529213)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R979 ^value 1 +)
 (R1 ^reward R979 +)
Firing propose*predict-yes
 -->
 (O1951 ^name predict-yes +)
 (S1 ^operator O1951 +)
Firing propose*predict-no
 -->
 (O1952 ^name predict-no +)
 (S1 ^operator O1952 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1950 = 0.4476189814068987)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1949 = 0.1844075128644186)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1950 ^name predict-no +)
 (S1 ^operator O1950 +)
Retracting propose*predict-yes
 -->
 (O1949 ^name predict-yes +)
 (S1 ^operator O1949 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R978 ^value 1 +)
 (R1 ^reward R978 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1950 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1949 = 0.)
=>WM: (13676: S1 ^operator O1952 +)
=>WM: (13675: S1 ^operator O1951 +)
=>WM: (13674: I3 ^dir R)
=>WM: (13673: O1952 ^name predict-no)
=>WM: (13672: O1951 ^name predict-yes)
=>WM: (13671: R979 ^value 1)
=>WM: (13670: R1 ^reward R979)
<=WM: (13661: S1 ^operator O1949 +)
<=WM: (13662: S1 ^operator O1950 +)
<=WM: (13663: S1 ^operator O1950)
<=WM: (13620: I3 ^dir U)
<=WM: (13657: R1 ^reward R978)
<=WM: (13660: O1950 ^name predict-no)
<=WM: (13659: O1949 ^name predict-yes)
<=WM: (13658: R978 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1951 = 0.8155758449529213)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1951 = 0.1844075128644186)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1952 = -0.02155734064455064)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1952 = 0.4476189814068987)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1950 = 0.4476189814068987)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1950 = -0.02155734064455064)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1949 = 0.1844075128644186)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1949 = 0.8155758449529213)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13677: S1 ^operator O1951)

   976:    O: O1951 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N976 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N975 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13678: I3 ^predict-yes N976)
<=WM: (13665: N975 ^status complete)
<=WM: (13664: I3 ^predict-no N975)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13682: I2 ^dir L)
=>WM: (13681: I2 ^reward 1)
=>WM: (13680: I2 ^see 1)
=>WM: (13679: N976 ^status complete)
<=WM: (13668: I2 ^dir R)
<=WM: (13667: I2 ^reward 1)
<=WM: (13666: I2 ^see 0)
=>WM: (13683: I2 ^level-1 R1-root)
<=WM: (13669: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1951 = 0.6104589917494525)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1952 = 0.2714993082286609)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R980 ^value 1 +)
 (R1 ^reward R980 +)
Firing propose*predict-yes
 -->
 (O1953 ^name predict-yes +)
 (S1 ^operator O1953 +)
Firing propose*predict-no
 -->
 (O1954 ^name predict-no +)
 (S1 ^operator O1954 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1952 = 0.387336901415443)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1951 = 0.389539338123754)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1952 ^name predict-no +)
 (S1 ^operator O1952 +)
Retracting propose*predict-yes
 -->
 (O1951 ^name predict-yes +)
 (S1 ^operator O1951 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R979 ^value 1 +)
 (R1 ^reward R979 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1952 = 0.4476189814068987)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1952 = -0.02155734064455064)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1951 = 0.1844075128644186)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1951 = 0.8155758449529213)
=>WM: (13691: S1 ^operator O1954 +)
=>WM: (13690: S1 ^operator O1953 +)
=>WM: (13689: I3 ^dir L)
=>WM: (13688: O1954 ^name predict-no)
=>WM: (13687: O1953 ^name predict-yes)
=>WM: (13686: R980 ^value 1)
=>WM: (13685: R1 ^reward R980)
=>WM: (13684: I3 ^see 1)
<=WM: (13675: S1 ^operator O1951 +)
<=WM: (13677: S1 ^operator O1951)
<=WM: (13676: S1 ^operator O1952 +)
<=WM: (13674: I3 ^dir R)
<=WM: (13670: R1 ^reward R979)
<=WM: (13630: I3 ^see 0)
<=WM: (13673: O1952 ^name predict-no)
<=WM: (13672: O1951 ^name predict-yes)
<=WM: (13671: R979 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1953 = 0.389539338123754)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1953 = 0.6104589917494525)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1954 = 0.387336901415443)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1954 = 0.2714993082286609)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1952 = 0.387336901415443)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1952 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1951 = 0.389539338123754)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1951 = 0.6104589917494525)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.675409 -0.491002 0.184408 -> 0.675412 -0.491002 0.18441(R,m,v=1,0.89697,0.0929786)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324569 0.491006 0.815576 -> 0.324573 0.491006 0.815578(R,m,v=1,1,0)
=>WM: (13692: S1 ^operator O1953)

   977:    O: O1953 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N977 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N976 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13693: I3 ^predict-yes N977)
<=WM: (13679: N976 ^status complete)
<=WM: (13678: I3 ^predict-yes N976)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13697: I2 ^dir L)
=>WM: (13696: I2 ^reward 1)
=>WM: (13695: I2 ^see 1)
=>WM: (13694: N977 ^status complete)
<=WM: (13682: I2 ^dir L)
<=WM: (13681: I2 ^reward 1)
<=WM: (13680: I2 ^see 1)
=>WM: (13698: I2 ^level-1 L1-root)
<=WM: (13683: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1954 = 0.6126626863207351)
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1953 = -0.02274740735326741)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R981 ^value 1 +)
 (R1 ^reward R981 +)
Firing propose*predict-yes
 -->
 (O1955 ^name predict-yes +)
 (S1 ^operator O1955 +)
Firing propose*predict-no
 -->
 (O1956 ^name predict-no +)
 (S1 ^operator O1956 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1954 = 0.387336901415443)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1953 = 0.389539338123754)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1954 ^name predict-no +)
 (S1 ^operator O1954 +)
Retracting propose*predict-yes
 -->
 (O1953 ^name predict-yes +)
 (S1 ^operator O1953 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R980 ^value 1 +)
 (R1 ^reward R980 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1954 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1954 = 0.387336901415443)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1953 = 0.6104589917494525)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1953 = 0.389539338123754)
=>WM: (13704: S1 ^operator O1956 +)
=>WM: (13703: S1 ^operator O1955 +)
=>WM: (13702: O1956 ^name predict-no)
=>WM: (13701: O1955 ^name predict-yes)
=>WM: (13700: R981 ^value 1)
=>WM: (13699: R1 ^reward R981)
<=WM: (13690: S1 ^operator O1953 +)
<=WM: (13692: S1 ^operator O1953)
<=WM: (13691: S1 ^operator O1954 +)
<=WM: (13685: R1 ^reward R980)
<=WM: (13688: O1954 ^name predict-no)
<=WM: (13687: O1953 ^name predict-yes)
<=WM: (13686: R980 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1955 = 0.389539338123754)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1955 = -0.02274740735326741)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1956 = 0.387336901415443)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1956 = 0.6126626863207351)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1954 = 0.387336901415443)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1954 = 0.6126626863207351)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1953 = 0.389539338123754)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1953 = -0.02274740735326741)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.389539 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.889571,0.0988412)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.32241 0.610459 -> 0.288049 0.322411 0.610459(R,m,v=1,1,0)
=>WM: (13705: S1 ^operator O1956)

   978:    O: O1956 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N978 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N977 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13706: I3 ^predict-no N978)
<=WM: (13694: N977 ^status complete)
<=WM: (13693: I3 ^predict-yes N977)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13710: I2 ^dir R)
=>WM: (13709: I2 ^reward 1)
=>WM: (13708: I2 ^see 0)
=>WM: (13707: N978 ^status complete)
<=WM: (13697: I2 ^dir L)
<=WM: (13696: I2 ^reward 1)
<=WM: (13695: I2 ^see 1)
=>WM: (13711: I2 ^level-1 L0-root)
<=WM: (13698: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1955 = 0.8155955750807526)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1956 = -0.00558448899823713)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R982 ^value 1 +)
 (R1 ^reward R982 +)
Firing propose*predict-yes
 -->
 (O1957 ^name predict-yes +)
 (S1 ^operator O1957 +)
Firing propose*predict-no
 -->
 (O1958 ^name predict-no +)
 (S1 ^operator O1958 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 0.4476189814068987)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.1844100091918176)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1956 ^name predict-no +)
 (S1 ^operator O1956 +)
Retracting propose*predict-yes
 -->
 (O1955 ^name predict-yes +)
 (S1 ^operator O1955 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R981 ^value 1 +)
 (R1 ^reward R981 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1956 = 0.6126626863207351)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1956 = 0.387336901415443)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1955 = -0.02274740735326741)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1955 = 0.389539588642773)
=>WM: (13719: S1 ^operator O1958 +)
=>WM: (13718: S1 ^operator O1957 +)
=>WM: (13717: I3 ^dir R)
=>WM: (13716: O1958 ^name predict-no)
=>WM: (13715: O1957 ^name predict-yes)
=>WM: (13714: R982 ^value 1)
=>WM: (13713: R1 ^reward R982)
=>WM: (13712: I3 ^see 0)
<=WM: (13703: S1 ^operator O1955 +)
<=WM: (13704: S1 ^operator O1956 +)
<=WM: (13705: S1 ^operator O1956)
<=WM: (13689: I3 ^dir L)
<=WM: (13699: R1 ^reward R981)
<=WM: (13684: I3 ^see 1)
<=WM: (13702: O1956 ^name predict-no)
<=WM: (13701: O1955 ^name predict-yes)
<=WM: (13700: R981 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1957 = 0.1844100091918176)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1957 = 0.8155955750807526)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1958 = 0.4476189814068987)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1958 = -0.00558448899823713)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 0.4476189814068987)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1956 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.1844100091918176)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1955 = 0.8155955750807526)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.931429,0.0642365)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280919 0.331744 0.612663 -> 0.280919 0.331744 0.612663(R,m,v=1,1,0)
=>WM: (13720: S1 ^operator O1957)

   979:    O: O1957 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N979 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N978 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13721: I3 ^predict-yes N979)
<=WM: (13707: N978 ^status complete)
<=WM: (13706: I3 ^predict-no N978)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13725: I2 ^dir U)
=>WM: (13724: I2 ^reward 1)
=>WM: (13723: I2 ^see 1)
=>WM: (13722: N979 ^status complete)
<=WM: (13710: I2 ^dir R)
<=WM: (13709: I2 ^reward 1)
<=WM: (13708: I2 ^see 0)
=>WM: (13726: I2 ^level-1 R1-root)
<=WM: (13711: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R983 ^value 1 +)
 (R1 ^reward R983 +)
Firing propose*predict-yes
 -->
 (O1959 ^name predict-yes +)
 (S1 ^operator O1959 +)
Firing propose*predict-no
 -->
 (O1960 ^name predict-no +)
 (S1 ^operator O1960 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1958 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1957 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1958 ^name predict-no +)
 (S1 ^operator O1958 +)
Retracting propose*predict-yes
 -->
 (O1957 ^name predict-yes +)
 (S1 ^operator O1957 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R982 ^value 1 +)
 (R1 ^reward R982 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1958 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1958 = 0.4476189814068987)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1957 = 0.8155955750807526)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1957 = 0.1844100091918176)
=>WM: (13734: S1 ^operator O1960 +)
=>WM: (13733: S1 ^operator O1959 +)
=>WM: (13732: I3 ^dir U)
=>WM: (13731: O1960 ^name predict-no)
=>WM: (13730: O1959 ^name predict-yes)
=>WM: (13729: R983 ^value 1)
=>WM: (13728: R1 ^reward R983)
=>WM: (13727: I3 ^see 1)
<=WM: (13718: S1 ^operator O1957 +)
<=WM: (13720: S1 ^operator O1957)
<=WM: (13719: S1 ^operator O1958 +)
<=WM: (13717: I3 ^dir R)
<=WM: (13713: R1 ^reward R982)
<=WM: (13712: I3 ^see 0)
<=WM: (13716: O1958 ^name predict-no)
<=WM: (13715: O1957 ^name predict-yes)
<=WM: (13714: R982 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1958 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1957 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.675412 -0.491002 0.18441 -> 0.675411 -0.491002 0.184409(R,m,v=1,0.89759,0.092479)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324595 0.491001 0.815596 -> 0.324594 0.491001 0.815595(R,m,v=1,1,0)
=>WM: (13735: S1 ^operator O1960)

   980:    O: O1960 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N980 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N979 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13736: I3 ^predict-no N980)
<=WM: (13722: N979 ^status complete)
<=WM: (13721: I3 ^predict-yes N979)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13740: I2 ^dir U)
=>WM: (13739: I2 ^reward 1)
=>WM: (13738: I2 ^see 0)
=>WM: (13737: N980 ^status complete)
<=WM: (13725: I2 ^dir U)
<=WM: (13724: I2 ^reward 1)
<=WM: (13723: I2 ^see 1)
=>WM: (13741: I2 ^level-1 R1-root)
<=WM: (13726: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R984 ^value 1 +)
 (R1 ^reward R984 +)
Firing propose*predict-yes
 -->
 (O1961 ^name predict-yes +)
 (S1 ^operator O1961 +)
Firing propose*predict-no
 -->
 (O1962 ^name predict-no +)
 (S1 ^operator O1962 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1960 ^name predict-no +)
 (S1 ^operator O1960 +)
Retracting propose*predict-yes
 -->
 (O1959 ^name predict-yes +)
 (S1 ^operator O1959 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R983 ^value 1 +)
 (R1 ^reward R983 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.)
=>WM: (13748: S1 ^operator O1962 +)
=>WM: (13747: S1 ^operator O1961 +)
=>WM: (13746: O1962 ^name predict-no)
=>WM: (13745: O1961 ^name predict-yes)
=>WM: (13744: R984 ^value 1)
=>WM: (13743: R1 ^reward R984)
=>WM: (13742: I3 ^see 0)
<=WM: (13733: S1 ^operator O1959 +)
<=WM: (13734: S1 ^operator O1960 +)
<=WM: (13735: S1 ^operator O1960)
<=WM: (13728: R1 ^reward R983)
<=WM: (13727: I3 ^see 1)
<=WM: (13731: O1960 ^name predict-no)
<=WM: (13730: O1959 ^name predict-yes)
<=WM: (13729: R983 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1961 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1962 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13749: S1 ^operator O1962)

   981:    O: O1962 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N981 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N980 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13750: I3 ^predict-no N981)
<=WM: (13737: N980 ^status complete)
<=WM: (13736: I3 ^predict-no N980)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\--- Input Phase --- 
=>WM: (13754: I2 ^dir U)
=>WM: (13753: I2 ^reward 1)
=>WM: (13752: I2 ^see 0)
=>WM: (13751: N981 ^status complete)
<=WM: (13740: I2 ^dir U)
<=WM: (13739: I2 ^reward 1)
<=WM: (13738: I2 ^see 0)
=>WM: (13755: I2 ^level-1 R1-root)
<=WM: (13741: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R985 ^value 1 +)
 (R1 ^reward R985 +)
Firing propose*predict-yes
 -->
 (O1963 ^name predict-yes +)
 (S1 ^operator O1963 +)
Firing propose*predict-no
 -->
 (O1964 ^name predict-no +)
 (S1 ^operator O1964 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1962 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1961 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1962 ^name predict-no +)
 (S1 ^operator O1962 +)
Retracting propose*predict-yes
 -->
 (O1961 ^name predict-yes +)
 (S1 ^operator O1961 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R984 ^value 1 +)
 (R1 ^reward R984 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1962 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1961 = 0.)
=>WM: (13761: S1 ^operator O1964 +)
=>WM: (13760: S1 ^operator O1963 +)
=>WM: (13759: O1964 ^name predict-no)
=>WM: (13758: O1963 ^name predict-yes)
=>WM: (13757: R985 ^value 1)
=>WM: (13756: R1 ^reward R985)
<=WM: (13747: S1 ^operator O1961 +)
<=WM: (13748: S1 ^operator O1962 +)
<=WM: (13749: S1 ^operator O1962)
<=WM: (13743: R1 ^reward R984)
<=WM: (13746: O1962 ^name predict-no)
<=WM: (13745: O1961 ^name predict-yes)
<=WM: (13744: R984 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1963 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1964 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1962 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1961 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13762: S1 ^operator O1964)

   982:    O: O1964 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N982 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N981 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13763: I3 ^predict-no N982)
<=WM: (13751: N981 ^status complete)
<=WM: (13750: I3 ^predict-no N981)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13767: I2 ^dir R)
=>WM: (13766: I2 ^reward 1)
=>WM: (13765: I2 ^see 0)
=>WM: (13764: N982 ^status complete)
<=WM: (13754: I2 ^dir U)
<=WM: (13753: I2 ^reward 1)
<=WM: (13752: I2 ^see 0)
=>WM: (13768: I2 ^level-1 R1-root)
<=WM: (13755: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1963 = 0.1398795999120246)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1964 = 0.5523825060913952)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R986 ^value 1 +)
 (R1 ^reward R986 +)
Firing propose*predict-yes
 -->
 (O1965 ^name predict-yes +)
 (S1 ^operator O1965 +)
Firing propose*predict-no
 -->
 (O1966 ^name predict-no +)
 (S1 ^operator O1966 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1964 = 0.4476189814068987)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1963 = 0.1844091715509321)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1964 ^name predict-no +)
 (S1 ^operator O1964 +)
Retracting propose*predict-yes
 -->
 (O1963 ^name predict-yes +)
 (S1 ^operator O1963 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R985 ^value 1 +)
 (R1 ^reward R985 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1964 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1963 = 0.)
=>WM: (13775: S1 ^operator O1966 +)
=>WM: (13774: S1 ^operator O1965 +)
=>WM: (13773: I3 ^dir R)
=>WM: (13772: O1966 ^name predict-no)
=>WM: (13771: O1965 ^name predict-yes)
=>WM: (13770: R986 ^value 1)
=>WM: (13769: R1 ^reward R986)
<=WM: (13760: S1 ^operator O1963 +)
<=WM: (13761: S1 ^operator O1964 +)
<=WM: (13762: S1 ^operator O1964)
<=WM: (13732: I3 ^dir U)
<=WM: (13756: R1 ^reward R985)
<=WM: (13759: O1964 ^name predict-no)
<=WM: (13758: O1963 ^name predict-yes)
<=WM: (13757: R985 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1965 = 0.1398795999120246)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1965 = 0.1844091715509321)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1966 = 0.5523825060913952)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1966 = 0.4476189814068987)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1964 = 0.4476189814068987)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1964 = 0.5523825060913952)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1963 = 0.1844091715509321)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1963 = 0.1398795999120246)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13776: S1 ^operator O1966)

   983:    O: O1966 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N983 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N982 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13777: I3 ^predict-no N983)
<=WM: (13764: N982 ^status complete)
<=WM: (13763: I3 ^predict-no N982)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/|--- Input Phase --- 
=>WM: (13781: I2 ^dir R)
=>WM: (13780: I2 ^reward 1)
=>WM: (13779: I2 ^see 0)
=>WM: (13778: N983 ^status complete)
<=WM: (13767: I2 ^dir R)
<=WM: (13766: I2 ^reward 1)
<=WM: (13765: I2 ^see 0)
=>WM: (13782: I2 ^level-1 R0-root)
<=WM: (13768: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1965 = 0.1664311307472832)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1966 = 0.5523783049582921)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R987 ^value 1 +)
 (R1 ^reward R987 +)
Firing propose*predict-yes
 -->
 (O1967 ^name predict-yes +)
 (S1 ^operator O1967 +)
Firing propose*predict-no
 -->
 (O1968 ^name predict-no +)
 (S1 ^operator O1968 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1966 = 0.4476189814068987)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1965 = 0.1844091715509321)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1966 ^name predict-no +)
 (S1 ^operator O1966 +)
Retracting propose*predict-yes
 -->
 (O1965 ^name predict-yes +)
 (S1 ^operator O1965 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R986 ^value 1 +)
 (R1 ^reward R986 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1966 = 0.4476189814068987)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1966 = 0.5523825060913952)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1965 = 0.1844091715509321)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1965 = 0.1398795999120246)
=>WM: (13788: S1 ^operator O1968 +)
=>WM: (13787: S1 ^operator O1967 +)
=>WM: (13786: O1968 ^name predict-no)
=>WM: (13785: O1967 ^name predict-yes)
=>WM: (13784: R987 ^value 1)
=>WM: (13783: R1 ^reward R987)
<=WM: (13774: S1 ^operator O1965 +)
<=WM: (13775: S1 ^operator O1966 +)
<=WM: (13776: S1 ^operator O1966)
<=WM: (13769: R1 ^reward R986)
<=WM: (13772: O1966 ^name predict-no)
<=WM: (13771: O1965 ^name predict-yes)
<=WM: (13770: R986 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1967 = 0.1844091715509321)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1967 = 0.1664311307472832)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1968 = 0.4476189814068987)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1968 = 0.5523783049582921)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1966 = 0.4476189814068987)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1966 = 0.5523783049582921)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1965 = 0.1844091715509321)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1965 = 0.1664311307472832)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622532 -0.174914 0.447619(R,m,v=1,0.927419,0.06786)
RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377469 0.174914 0.552383 -> 0.377468 0.174914 0.552382(R,m,v=1,1,0)
=>WM: (13789: S1 ^operator O1968)

   984:    O: O1968 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N984 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N983 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13790: I3 ^predict-no N984)
<=WM: (13778: N983 ^status complete)
<=WM: (13777: I3 ^predict-no N983)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13794: I2 ^dir U)
=>WM: (13793: I2 ^reward 1)
=>WM: (13792: I2 ^see 0)
=>WM: (13791: N984 ^status complete)
<=WM: (13781: I2 ^dir R)
<=WM: (13780: I2 ^reward 1)
<=WM: (13779: I2 ^see 0)
=>WM: (13795: I2 ^level-1 R0-root)
<=WM: (13782: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R988 ^value 1 +)
 (R1 ^reward R988 +)
Firing propose*predict-yes
 -->
 (O1969 ^name predict-yes +)
 (S1 ^operator O1969 +)
Firing propose*predict-no
 -->
 (O1970 ^name predict-no +)
 (S1 ^operator O1970 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1968 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1967 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1968 ^name predict-no +)
 (S1 ^operator O1968 +)
Retracting propose*predict-yes
 -->
 (O1967 ^name predict-yes +)
 (S1 ^operator O1967 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R987 ^value 1 +)
 (R1 ^reward R987 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1968 = 0.5523783049582921)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1968 = 0.4476187582821546)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1967 = 0.1664311307472832)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1967 = 0.1844091715509321)
=>WM: (13802: S1 ^operator O1970 +)
=>WM: (13801: S1 ^operator O1969 +)
=>WM: (13800: I3 ^dir U)
=>WM: (13799: O1970 ^name predict-no)
=>WM: (13798: O1969 ^name predict-yes)
=>WM: (13797: R988 ^value 1)
=>WM: (13796: R1 ^reward R988)
<=WM: (13787: S1 ^operator O1967 +)
<=WM: (13788: S1 ^operator O1968 +)
<=WM: (13789: S1 ^operator O1968)
<=WM: (13773: I3 ^dir R)
<=WM: (13783: R1 ^reward R987)
<=WM: (13786: O1968 ^name predict-no)
<=WM: (13785: O1967 ^name predict-yes)
<=WM: (13784: R987 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1969 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1970 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1968 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1967 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.622532 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.928,0.0673548)
RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377465 0.174913 0.552378 -> 0.377466 0.174913 0.552379(R,m,v=1,1,0)
=>WM: (13803: S1 ^operator O1970)

   985:    O: O1970 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N985 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N984 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13804: I3 ^predict-no N985)
<=WM: (13791: N984 ^status complete)
<=WM: (13790: I3 ^predict-no N984)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\-/--- Input Phase --- 
=>WM: (13808: I2 ^dir L)
=>WM: (13807: I2 ^reward 1)
=>WM: (13806: I2 ^see 0)
=>WM: (13805: N985 ^status complete)
<=WM: (13794: I2 ^dir U)
<=WM: (13793: I2 ^reward 1)
<=WM: (13792: I2 ^see 0)
=>WM: (13809: I2 ^level-1 R0-root)
<=WM: (13795: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1969 = 0.6104614609336363)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1970 = 0.1063475139796038)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R989 ^value 1 +)
 (R1 ^reward R989 +)
Firing propose*predict-yes
 -->
 (O1971 ^name predict-yes +)
 (S1 ^operator O1971 +)
Firing propose*predict-no
 -->
 (O1972 ^name predict-no +)
 (S1 ^operator O1972 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1970 = 0.3873369632550164)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1969 = 0.389539588642773)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1970 ^name predict-no +)
 (S1 ^operator O1970 +)
Retracting propose*predict-yes
 -->
 (O1969 ^name predict-yes +)
 (S1 ^operator O1969 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R988 ^value 1 +)
 (R1 ^reward R988 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1970 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1969 = 0.)
=>WM: (13816: S1 ^operator O1972 +)
=>WM: (13815: S1 ^operator O1971 +)
=>WM: (13814: I3 ^dir L)
=>WM: (13813: O1972 ^name predict-no)
=>WM: (13812: O1971 ^name predict-yes)
=>WM: (13811: R989 ^value 1)
=>WM: (13810: R1 ^reward R989)
<=WM: (13801: S1 ^operator O1969 +)
<=WM: (13802: S1 ^operator O1970 +)
<=WM: (13803: S1 ^operator O1970)
<=WM: (13800: I3 ^dir U)
<=WM: (13796: R1 ^reward R988)
<=WM: (13799: O1970 ^name predict-no)
<=WM: (13798: O1969 ^name predict-yes)
<=WM: (13797: R988 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1971 = 0.6104614609336363)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1971 = 0.389539588642773)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1972 = 0.1063475139796038)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1972 = 0.3873369632550164)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1970 = 0.3873369632550164)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1970 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1969 = 0.389539588642773)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1969 = 0.6104614609336363)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13817: S1 ^operator O1971)

   986:    O: O1971 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N986 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N985 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13818: I3 ^predict-yes N986)
<=WM: (13805: N985 ^status complete)
<=WM: (13804: I3 ^predict-no N985)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13822: I2 ^dir R)
=>WM: (13821: I2 ^reward 1)
=>WM: (13820: I2 ^see 1)
=>WM: (13819: N986 ^status complete)
<=WM: (13808: I2 ^dir L)
<=WM: (13807: I2 ^reward 1)
<=WM: (13806: I2 ^see 0)
=>WM: (13823: I2 ^level-1 L1-root)
<=WM: (13809: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1972 = -0.02155734064455064)
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1971 = 0.8155783412803204)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R990 ^value 1 +)
 (R1 ^reward R990 +)
Firing propose*predict-yes
 -->
 (O1973 ^name predict-yes +)
 (S1 ^operator O1973 +)
Firing propose*predict-no
 -->
 (O1974 ^name predict-no +)
 (S1 ^operator O1974 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1972 = 0.4476191987960876)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1971 = 0.1844091715509321)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1972 ^name predict-no +)
 (S1 ^operator O1972 +)
Retracting propose*predict-yes
 -->
 (O1971 ^name predict-yes +)
 (S1 ^operator O1971 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R989 ^value 1 +)
 (R1 ^reward R989 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1972 = 0.3873369632550164)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1972 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1971 = 0.389539588642773)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1971 = 0.6104614609336363)
=>WM: (13831: S1 ^operator O1974 +)
=>WM: (13830: S1 ^operator O1973 +)
=>WM: (13829: I3 ^dir R)
=>WM: (13828: O1974 ^name predict-no)
=>WM: (13827: O1973 ^name predict-yes)
=>WM: (13826: R990 ^value 1)
=>WM: (13825: R1 ^reward R990)
=>WM: (13824: I3 ^see 1)
<=WM: (13815: S1 ^operator O1971 +)
<=WM: (13817: S1 ^operator O1971)
<=WM: (13816: S1 ^operator O1972 +)
<=WM: (13814: I3 ^dir L)
<=WM: (13810: R1 ^reward R989)
<=WM: (13742: I3 ^see 0)
<=WM: (13813: O1972 ^name predict-no)
<=WM: (13812: O1971 ^name predict-yes)
<=WM: (13811: R989 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1973 = 0.1844091715509321)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1973 = 0.8155783412803204)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1974 = 0.4476191987960876)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1974 = -0.02155734064455064)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1972 = 0.4476191987960876)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1972 = -0.02155734064455064)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1971 = 0.1844091715509321)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1971 = 0.8155783412803204)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.389539(R,m,v=1,0.890244,0.0983091)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322413 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
=>WM: (13832: S1 ^operator O1973)

   987:    O: O1973 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N987 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N986 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13833: I3 ^predict-yes N987)
<=WM: (13819: N986 ^status complete)
<=WM: (13818: I3 ^predict-yes N986)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (13837: I2 ^dir R)
=>WM: (13836: I2 ^reward 1)
=>WM: (13835: I2 ^see 1)
=>WM: (13834: N987 ^status complete)
<=WM: (13822: I2 ^dir R)
<=WM: (13821: I2 ^reward 1)
<=WM: (13820: I2 ^see 1)
=>WM: (13838: I2 ^level-1 R1-root)
<=WM: (13823: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1973 = 0.1398795999120246)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1974 = 0.552382282966651)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R991 ^value 1 +)
 (R1 ^reward R991 +)
Firing propose*predict-yes
 -->
 (O1975 ^name predict-yes +)
 (S1 ^operator O1975 +)
Firing propose*predict-no
 -->
 (O1976 ^name predict-no +)
 (S1 ^operator O1976 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1974 = 0.4476191987960876)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1973 = 0.1844091715509321)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1974 ^name predict-no +)
 (S1 ^operator O1974 +)
Retracting propose*predict-yes
 -->
 (O1973 ^name predict-yes +)
 (S1 ^operator O1973 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R990 ^value 1 +)
 (R1 ^reward R990 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1974 = -0.02155734064455064)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1974 = 0.4476191987960876)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1973 = 0.8155783412803204)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1973 = 0.1844091715509321)
=>WM: (13844: S1 ^operator O1976 +)
=>WM: (13843: S1 ^operator O1975 +)
=>WM: (13842: O1976 ^name predict-no)
=>WM: (13841: O1975 ^name predict-yes)
=>WM: (13840: R991 ^value 1)
=>WM: (13839: R1 ^reward R991)
<=WM: (13830: S1 ^operator O1973 +)
<=WM: (13832: S1 ^operator O1973)
<=WM: (13831: S1 ^operator O1974 +)
<=WM: (13825: R1 ^reward R990)
<=WM: (13828: O1974 ^name predict-no)
<=WM: (13827: O1973 ^name predict-yes)
<=WM: (13826: R990 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1975 = 0.1844091715509321)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1975 = 0.1398795999120246)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1976 = 0.4476191987960876)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1976 = 0.552382282966651)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1974 = 0.4476191987960876)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1974 = 0.552382282966651)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1973 = 0.1844091715509321)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1973 = 0.1398795999120246)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.675411 -0.491002 0.184409 -> 0.675414 -0.491003 0.184411(R,m,v=1,0.898204,0.0919847)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324573 0.491006 0.815578 -> 0.324575 0.491005 0.81558(R,m,v=1,1,0)
=>WM: (13845: S1 ^operator O1976)

   988:    O: O1976 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N988 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N987 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13846: I3 ^predict-no N988)
<=WM: (13834: N987 ^status complete)
<=WM: (13833: I3 ^predict-yes N987)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (13850: I2 ^dir R)
=>WM: (13849: I2 ^reward 1)
=>WM: (13848: I2 ^see 0)
=>WM: (13847: N988 ^status complete)
<=WM: (13837: I2 ^dir R)
<=WM: (13836: I2 ^reward 1)
<=WM: (13835: I2 ^see 1)
=>WM: (13851: I2 ^level-1 R0-root)
<=WM: (13838: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1975 = 0.1664311307472832)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1976 = 0.5523787454722251)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R992 ^value 1 +)
 (R1 ^reward R992 +)
Firing propose*predict-yes
 -->
 (O1977 ^name predict-yes +)
 (S1 ^operator O1977 +)
Firing propose*predict-no
 -->
 (O1978 ^name predict-no +)
 (S1 ^operator O1978 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1976 = 0.4476191987960876)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1975 = 0.1844110446262441)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1976 ^name predict-no +)
 (S1 ^operator O1976 +)
Retracting propose*predict-yes
 -->
 (O1975 ^name predict-yes +)
 (S1 ^operator O1975 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R991 ^value 1 +)
 (R1 ^reward R991 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1976 = 0.552382282966651)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1976 = 0.4476191987960876)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1975 = 0.1398795999120246)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1975 = 0.1844110446262441)
=>WM: (13858: S1 ^operator O1978 +)
=>WM: (13857: S1 ^operator O1977 +)
=>WM: (13856: O1978 ^name predict-no)
=>WM: (13855: O1977 ^name predict-yes)
=>WM: (13854: R992 ^value 1)
=>WM: (13853: R1 ^reward R992)
=>WM: (13852: I3 ^see 0)
<=WM: (13843: S1 ^operator O1975 +)
<=WM: (13844: S1 ^operator O1976 +)
<=WM: (13845: S1 ^operator O1976)
<=WM: (13839: R1 ^reward R991)
<=WM: (13824: I3 ^see 1)
<=WM: (13842: O1976 ^name predict-no)
<=WM: (13841: O1975 ^name predict-yes)
<=WM: (13840: R991 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1977 = 0.1844110446262441)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1977 = 0.1664311307472832)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1978 = 0.4476191987960876)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1978 = 0.5523787454722251)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1976 = 0.4476191987960876)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1976 = 0.5523787454722251)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1975 = 0.1844110446262441)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1975 = 0.1664311307472832)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.928571,0.0668571)
RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377468 0.174914 0.552382 -> 0.377468 0.174914 0.552382(R,m,v=1,1,0)
=>WM: (13859: S1 ^operator O1978)

   989:    O: O1978 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N989 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N988 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13860: I3 ^predict-no N989)
<=WM: (13847: N988 ^status complete)
<=WM: (13846: I3 ^predict-no N988)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13864: I2 ^dir R)
=>WM: (13863: I2 ^reward 1)
=>WM: (13862: I2 ^see 0)
=>WM: (13861: N989 ^status complete)
<=WM: (13850: I2 ^dir R)
<=WM: (13849: I2 ^reward 1)
<=WM: (13848: I2 ^see 0)
=>WM: (13865: I2 ^level-1 R0-root)
<=WM: (13851: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1977 = 0.1664311307472832)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1978 = 0.5523787454722251)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R993 ^value 1 +)
 (R1 ^reward R993 +)
Firing propose*predict-yes
 -->
 (O1979 ^name predict-yes +)
 (S1 ^operator O1979 +)
Firing propose*predict-no
 -->
 (O1980 ^name predict-no +)
 (S1 ^operator O1980 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1978 = 0.4476189765316768)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1977 = 0.1844110446262441)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1978 ^name predict-no +)
 (S1 ^operator O1978 +)
Retracting propose*predict-yes
 -->
 (O1977 ^name predict-yes +)
 (S1 ^operator O1977 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R992 ^value 1 +)
 (R1 ^reward R992 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1978 = 0.5523787454722251)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1978 = 0.4476189765316768)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1977 = 0.1664311307472832)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1977 = 0.1844110446262441)
=>WM: (13871: S1 ^operator O1980 +)
=>WM: (13870: S1 ^operator O1979 +)
=>WM: (13869: O1980 ^name predict-no)
=>WM: (13868: O1979 ^name predict-yes)
=>WM: (13867: R993 ^value 1)
=>WM: (13866: R1 ^reward R993)
<=WM: (13857: S1 ^operator O1977 +)
<=WM: (13858: S1 ^operator O1978 +)
<=WM: (13859: S1 ^operator O1978)
<=WM: (13853: R1 ^reward R992)
<=WM: (13856: O1978 ^name predict-no)
<=WM: (13855: O1977 ^name predict-yes)
<=WM: (13854: R992 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1979 = 0.1844110446262441)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1979 = 0.1664311307472832)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1980 = 0.4476189765316768)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1980 = 0.5523787454722251)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1978 = 0.4476189765316768)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1978 = 0.5523787454722251)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1977 = 0.1844110446262441)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1977 = 0.1664311307472832)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.929134,0.0663667)
RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377466 0.174913 0.552379 -> 0.377466 0.174913 0.552379(R,m,v=1,1,0)
=>WM: (13872: S1 ^operator O1980)

   990:    O: O1980 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N990 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N989 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13873: I3 ^predict-no N990)
<=WM: (13861: N989 ^status complete)
<=WM: (13860: I3 ^predict-no N989)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|\--- Input Phase --- 
=>WM: (13877: I2 ^dir L)
=>WM: (13876: I2 ^reward 1)
=>WM: (13875: I2 ^see 0)
=>WM: (13874: N990 ^status complete)
<=WM: (13864: I2 ^dir R)
<=WM: (13863: I2 ^reward 1)
<=WM: (13862: I2 ^see 0)
=>WM: (13878: I2 ^level-1 R0-root)
<=WM: (13865: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1979 = 0.6104613034971749)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1980 = 0.1063475139796038)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R994 ^value 1 +)
 (R1 ^reward R994 +)
Firing propose*predict-yes
 -->
 (O1981 ^name predict-yes +)
 (S1 ^operator O1981 +)
Firing propose*predict-no
 -->
 (O1982 ^name predict-no +)
 (S1 ^operator O1982 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1980 = 0.3873369632550164)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1979 = 0.3895394312063116)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1980 ^name predict-no +)
 (S1 ^operator O1980 +)
Retracting propose*predict-yes
 -->
 (O1979 ^name predict-yes +)
 (S1 ^operator O1979 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R993 ^value 1 +)
 (R1 ^reward R993 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
 -->
 (S1 ^operator O1980 = 0.5523790871716397)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1980 = 0.4476193182310915)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
 -->
 (S1 ^operator O1979 = 0.1664311307472832)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1979 = 0.1844110446262441)
=>WM: (13885: S1 ^operator O1982 +)
=>WM: (13884: S1 ^operator O1981 +)
=>WM: (13883: I3 ^dir L)
=>WM: (13882: O1982 ^name predict-no)
=>WM: (13881: O1981 ^name predict-yes)
=>WM: (13880: R994 ^value 1)
=>WM: (13879: R1 ^reward R994)
<=WM: (13870: S1 ^operator O1979 +)
<=WM: (13871: S1 ^operator O1980 +)
<=WM: (13872: S1 ^operator O1980)
<=WM: (13829: I3 ^dir R)
<=WM: (13866: R1 ^reward R993)
<=WM: (13869: O1980 ^name predict-no)
<=WM: (13868: O1979 ^name predict-yes)
<=WM: (13867: R993 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1981 = 0.6104613034971749)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1981 = 0.3895394312063116)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1982 = 0.1063475139796038)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1982 = 0.3873369632550164)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1980 = 0.3873369632550164)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1980 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1979 = 0.3895394312063116)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1979 = 0.6104613034971749)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174913 0.44762(R,m,v=1,0.929687,0.0658834)
RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377466 0.174913 0.552379 -> 0.377466 0.174913 0.552379(R,m,v=1,1,0)
=>WM: (13886: S1 ^operator O1981)

   991:    O: O1981 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N991 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N990 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13887: I3 ^predict-yes N991)
<=WM: (13874: N990 ^status complete)
<=WM: (13873: I3 ^predict-no N990)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
---- Input Phase --- 
=>WM: (13891: I2 ^dir R)
=>WM: (13890: I2 ^reward 1)
=>WM: (13889: I2 ^see 1)
=>WM: (13888: N991 ^status complete)
<=WM: (13877: I2 ^dir L)
<=WM: (13876: I2 ^reward 1)
<=WM: (13875: I2 ^see 0)
=>WM: (13892: I2 ^level-1 L1-root)
<=WM: (13878: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1982 = -0.02155734064455064)
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1981 = 0.8155802143556325)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R995 ^value 1 +)
 (R1 ^reward R995 +)
Firing propose*predict-yes
 -->
 (O1983 ^name predict-yes +)
 (S1 ^operator O1983 +)
Firing propose*predict-no
 -->
 (O1984 ^name predict-no +)
 (S1 ^operator O1984 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1982 = 0.4476195574206818)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1981 = 0.1844110446262441)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1982 ^name predict-no +)
 (S1 ^operator O1982 +)
Retracting propose*predict-yes
 -->
 (O1981 ^name predict-yes +)
 (S1 ^operator O1981 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R994 ^value 1 +)
 (R1 ^reward R994 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1982 = 0.3873369632550164)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1982 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1981 = 0.3895394312063116)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1981 = 0.6104613034971749)
=>WM: (13900: S1 ^operator O1984 +)
=>WM: (13899: S1 ^operator O1983 +)
=>WM: (13898: I3 ^dir R)
=>WM: (13897: O1984 ^name predict-no)
=>WM: (13896: O1983 ^name predict-yes)
=>WM: (13895: R995 ^value 1)
=>WM: (13894: R1 ^reward R995)
=>WM: (13893: I3 ^see 1)
<=WM: (13884: S1 ^operator O1981 +)
<=WM: (13886: S1 ^operator O1981)
<=WM: (13885: S1 ^operator O1982 +)
<=WM: (13883: I3 ^dir L)
<=WM: (13879: R1 ^reward R994)
<=WM: (13852: I3 ^see 0)
<=WM: (13882: O1982 ^name predict-no)
<=WM: (13881: O1981 ^name predict-yes)
<=WM: (13880: R994 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1983 = 0.1844110446262441)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1983 = 0.8155802143556325)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1984 = 0.4476195574206818)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1984 = -0.02155734064455064)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1982 = 0.4476195574206818)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1982 = -0.02155734064455064)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1981 = 0.1844110446262441)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1981 = 0.8155802143556325)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.389539 -> 0.711951 -0.322412 0.389539(R,m,v=1,0.890909,0.0977827)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
=>WM: (13901: S1 ^operator O1983)

   992:    O: O1983 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N992 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N991 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13902: I3 ^predict-yes N992)
<=WM: (13888: N991 ^status complete)
<=WM: (13887: I3 ^predict-yes N991)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (13906: I2 ^dir L)
=>WM: (13905: I2 ^reward 1)
=>WM: (13904: I2 ^see 1)
=>WM: (13903: N992 ^status complete)
<=WM: (13891: I2 ^dir R)
<=WM: (13890: I2 ^reward 1)
<=WM: (13889: I2 ^see 1)
=>WM: (13907: I2 ^level-1 R1-root)
<=WM: (13892: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1983 = 0.6104592422684716)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1984 = 0.2714993082286609)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R996 ^value 1 +)
 (R1 ^reward R996 +)
Firing propose*predict-yes
 -->
 (O1985 ^name predict-yes +)
 (S1 ^operator O1985 +)
Firing propose*predict-no
 -->
 (O1986 ^name predict-no +)
 (S1 ^operator O1986 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1984 = 0.3873369632550164)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1983 = 0.3895393210007886)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1984 ^name predict-no +)
 (S1 ^operator O1984 +)
Retracting propose*predict-yes
 -->
 (O1983 ^name predict-yes +)
 (S1 ^operator O1983 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R995 ^value 1 +)
 (R1 ^reward R995 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1984 = -0.02155734064455064)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1984 = 0.4476195574206818)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1983 = 0.8155802143556325)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1983 = 0.1844110446262441)
=>WM: (13914: S1 ^operator O1986 +)
=>WM: (13913: S1 ^operator O1985 +)
=>WM: (13912: I3 ^dir L)
=>WM: (13911: O1986 ^name predict-no)
=>WM: (13910: O1985 ^name predict-yes)
=>WM: (13909: R996 ^value 1)
=>WM: (13908: R1 ^reward R996)
<=WM: (13899: S1 ^operator O1983 +)
<=WM: (13901: S1 ^operator O1983)
<=WM: (13900: S1 ^operator O1984 +)
<=WM: (13898: I3 ^dir R)
<=WM: (13894: R1 ^reward R995)
<=WM: (13897: O1984 ^name predict-no)
<=WM: (13896: O1983 ^name predict-yes)
<=WM: (13895: R995 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1985 = 0.3895393210007886)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1985 = 0.6104592422684716)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1986 = 0.3873369632550164)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1986 = 0.2714993082286609)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1984 = 0.3873369632550164)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1984 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1983 = 0.3895393210007886)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1983 = 0.6104592422684716)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.675414 -0.491003 0.184411 -> 0.675415 -0.491003 0.184412(R,m,v=1,0.89881,0.0914956)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324575 0.491005 0.81558 -> 0.324577 0.491005 0.815582(R,m,v=1,1,0)
=>WM: (13915: S1 ^operator O1985)

   993:    O: O1985 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N993 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N992 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13916: I3 ^predict-yes N993)
<=WM: (13903: N992 ^status complete)
<=WM: (13902: I3 ^predict-yes N992)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\--- Input Phase --- 
=>WM: (13920: I2 ^dir R)
=>WM: (13919: I2 ^reward 1)
=>WM: (13918: I2 ^see 1)
=>WM: (13917: N993 ^status complete)
<=WM: (13906: I2 ^dir L)
<=WM: (13905: I2 ^reward 1)
<=WM: (13904: I2 ^see 1)
=>WM: (13921: I2 ^level-1 L1-root)
<=WM: (13907: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1986 = -0.02155734064455064)
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1985 = 0.8155815255083509)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R997 ^value 1 +)
 (R1 ^reward R997 +)
Firing propose*predict-yes
 -->
 (O1987 ^name predict-yes +)
 (S1 ^operator O1987 +)
Firing propose*predict-no
 -->
 (O1988 ^name predict-no +)
 (S1 ^operator O1988 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1986 = 0.4476195574206818)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1985 = 0.1844123557789626)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1986 ^name predict-no +)
 (S1 ^operator O1986 +)
Retracting propose*predict-yes
 -->
 (O1985 ^name predict-yes +)
 (S1 ^operator O1985 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R996 ^value 1 +)
 (R1 ^reward R996 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1986 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1986 = 0.3873369632550164)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1985 = 0.6104592422684716)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1985 = 0.3895393210007886)
=>WM: (13928: S1 ^operator O1988 +)
=>WM: (13927: S1 ^operator O1987 +)
=>WM: (13926: I3 ^dir R)
=>WM: (13925: O1988 ^name predict-no)
=>WM: (13924: O1987 ^name predict-yes)
=>WM: (13923: R997 ^value 1)
=>WM: (13922: R1 ^reward R997)
<=WM: (13913: S1 ^operator O1985 +)
<=WM: (13915: S1 ^operator O1985)
<=WM: (13914: S1 ^operator O1986 +)
<=WM: (13912: I3 ^dir L)
<=WM: (13908: R1 ^reward R996)
<=WM: (13911: O1986 ^name predict-no)
<=WM: (13910: O1985 ^name predict-yes)
<=WM: (13909: R996 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1987 = 0.1844123557789626)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1987 = 0.8155815255083509)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1988 = 0.4476195574206818)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1988 = -0.02155734064455064)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1986 = 0.4476195574206818)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1986 = -0.02155734064455064)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1985 = 0.1844123557789626)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1985 = 0.8155815255083509)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.389539 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.891566,0.0972618)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.610459 -> 0.288049 0.322411 0.610459(R,m,v=1,1,0)
=>WM: (13929: S1 ^operator O1987)

   994:    O: O1987 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N994 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N993 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13930: I3 ^predict-yes N994)
<=WM: (13917: N993 ^status complete)
<=WM: (13916: I3 ^predict-yes N993)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13934: I2 ^dir L)
=>WM: (13933: I2 ^reward 1)
=>WM: (13932: I2 ^see 1)
=>WM: (13931: N994 ^status complete)
<=WM: (13920: I2 ^dir R)
<=WM: (13919: I2 ^reward 1)
<=WM: (13918: I2 ^see 1)
=>WM: (13935: I2 ^level-1 R1-root)
<=WM: (13921: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1987 = 0.6104594577780825)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1988 = 0.2714993082286609)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R998 ^value 1 +)
 (R1 ^reward R998 +)
Firing propose*predict-yes
 -->
 (O1989 ^name predict-yes +)
 (S1 ^operator O1989 +)
Firing propose*predict-no
 -->
 (O1990 ^name predict-no +)
 (S1 ^operator O1990 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1988 = 0.3873369632550164)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1987 = 0.3895395365103996)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1988 ^name predict-no +)
 (S1 ^operator O1988 +)
Retracting propose*predict-yes
 -->
 (O1987 ^name predict-yes +)
 (S1 ^operator O1987 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R997 ^value 1 +)
 (R1 ^reward R997 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1988 = -0.02155734064455064)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1988 = 0.4476195574206818)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1987 = 0.8155815255083509)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1987 = 0.1844123557789626)
=>WM: (13942: S1 ^operator O1990 +)
=>WM: (13941: S1 ^operator O1989 +)
=>WM: (13940: I3 ^dir L)
=>WM: (13939: O1990 ^name predict-no)
=>WM: (13938: O1989 ^name predict-yes)
=>WM: (13937: R998 ^value 1)
=>WM: (13936: R1 ^reward R998)
<=WM: (13927: S1 ^operator O1987 +)
<=WM: (13929: S1 ^operator O1987)
<=WM: (13928: S1 ^operator O1988 +)
<=WM: (13926: I3 ^dir R)
<=WM: (13922: R1 ^reward R997)
<=WM: (13925: O1988 ^name predict-no)
<=WM: (13924: O1987 ^name predict-yes)
<=WM: (13923: R997 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1989 = 0.3895395365103996)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1989 = 0.6104594577780825)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1990 = 0.3873369632550164)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1990 = 0.2714993082286609)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1988 = 0.3873369632550164)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1988 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1987 = 0.3895395365103996)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1987 = 0.6104594577780825)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.675415 -0.491003 0.184412 -> 0.675417 -0.491003 0.184413(R,m,v=1,0.899408,0.0910116)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324577 0.491005 0.815582 -> 0.324578 0.491005 0.815582(R,m,v=1,1,0)
=>WM: (13943: S1 ^operator O1989)

   995:    O: O1989 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N995 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N994 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13944: I3 ^predict-yes N995)
<=WM: (13931: N994 ^status complete)
<=WM: (13930: I3 ^predict-yes N994)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (13948: I2 ^dir L)
=>WM: (13947: I2 ^reward 1)
=>WM: (13946: I2 ^see 1)
=>WM: (13945: N995 ^status complete)
<=WM: (13934: I2 ^dir L)
<=WM: (13933: I2 ^reward 1)
<=WM: (13932: I2 ^see 1)
=>WM: (13949: I2 ^level-1 L1-root)
<=WM: (13935: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1990 = 0.6126627481603084)
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1989 = -0.02274740735326741)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R999 ^value 1 +)
 (R1 ^reward R999 +)
Firing propose*predict-yes
 -->
 (O1991 ^name predict-yes +)
 (S1 ^operator O1991 +)
Firing propose*predict-no
 -->
 (O1992 ^name predict-no +)
 (S1 ^operator O1992 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1990 = 0.3873369632550164)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1989 = 0.3895395365103996)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1990 ^name predict-no +)
 (S1 ^operator O1990 +)
Retracting propose*predict-yes
 -->
 (O1989 ^name predict-yes +)
 (S1 ^operator O1989 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R998 ^value 1 +)
 (R1 ^reward R998 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O1990 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1990 = 0.3873369632550164)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O1989 = 0.6104594577780825)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1989 = 0.3895395365103996)
=>WM: (13955: S1 ^operator O1992 +)
=>WM: (13954: S1 ^operator O1991 +)
=>WM: (13953: O1992 ^name predict-no)
=>WM: (13952: O1991 ^name predict-yes)
=>WM: (13951: R999 ^value 1)
=>WM: (13950: R1 ^reward R999)
<=WM: (13941: S1 ^operator O1989 +)
<=WM: (13943: S1 ^operator O1989)
<=WM: (13942: S1 ^operator O1990 +)
<=WM: (13936: R1 ^reward R998)
<=WM: (13939: O1990 ^name predict-no)
<=WM: (13938: O1989 ^name predict-yes)
<=WM: (13937: R998 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1991 = 0.3895395365103996)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1991 = -0.02274740735326741)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1992 = 0.3873369632550164)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1992 = 0.6126627481603084)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1990 = 0.3873369632550164)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1990 = 0.6126627481603084)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1989 = 0.3895395365103996)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1989 = -0.02274740735326741)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.892216,0.0967463)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.610459 -> 0.288049 0.322411 0.61046(R,m,v=1,1,0)
=>WM: (13956: S1 ^operator O1992)

   996:    O: O1992 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N996 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N995 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13957: I3 ^predict-no N996)
<=WM: (13945: N995 ^status complete)
<=WM: (13944: I3 ^predict-yes N995)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\---- Input Phase --- 
=>WM: (13961: I2 ^dir R)
=>WM: (13960: I2 ^reward 1)
=>WM: (13959: I2 ^see 0)
=>WM: (13958: N996 ^status complete)
<=WM: (13948: I2 ^dir L)
<=WM: (13947: I2 ^reward 1)
<=WM: (13946: I2 ^see 1)
=>WM: (13962: I2 ^level-1 L0-root)
<=WM: (13949: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1991 = 0.8155947374398671)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1992 = -0.00558448899823713)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1000 ^value 1 +)
 (R1 ^reward R1000 +)
Firing propose*predict-yes
 -->
 (O1993 ^name predict-yes +)
 (S1 ^operator O1993 +)
Firing propose*predict-no
 -->
 (O1994 ^name predict-no +)
 (S1 ^operator O1994 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 0.4476195574206818)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.1844132735858656)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1992 ^name predict-no +)
 (S1 ^operator O1992 +)
Retracting propose*predict-yes
 -->
 (O1991 ^name predict-yes +)
 (S1 ^operator O1991 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R999 ^value 1 +)
 (R1 ^reward R999 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O1992 = 0.6126627481603084)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1992 = 0.3873369632550164)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O1991 = -0.02274740735326741)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1991 = 0.3895396873671274)
=>WM: (13970: S1 ^operator O1994 +)
=>WM: (13969: S1 ^operator O1993 +)
=>WM: (13968: I3 ^dir R)
=>WM: (13967: O1994 ^name predict-no)
=>WM: (13966: O1993 ^name predict-yes)
=>WM: (13965: R1000 ^value 1)
=>WM: (13964: R1 ^reward R1000)
=>WM: (13963: I3 ^see 0)
<=WM: (13954: S1 ^operator O1991 +)
<=WM: (13955: S1 ^operator O1992 +)
<=WM: (13956: S1 ^operator O1992)
<=WM: (13940: I3 ^dir L)
<=WM: (13950: R1 ^reward R999)
<=WM: (13893: I3 ^see 1)
<=WM: (13953: O1992 ^name predict-no)
<=WM: (13952: O1991 ^name predict-yes)
<=WM: (13951: R999 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1993 = 0.1844132735858656)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1993 = 0.8155947374398671)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1994 = 0.4476195574206818)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1994 = -0.00558448899823713)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 0.4476195574206818)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1992 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.1844132735858656)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1991 = 0.8155947374398671)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.931818,0.0638961)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280919 0.331744 0.612663 -> 0.280919 0.331744 0.612663(R,m,v=1,1,0)
=>WM: (13971: S1 ^operator O1993)

   997:    O: O1993 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N997 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N996 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13972: I3 ^predict-yes N997)
<=WM: (13958: N996 ^status complete)
<=WM: (13957: I3 ^predict-no N996)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13976: I2 ^dir R)
=>WM: (13975: I2 ^reward 1)
=>WM: (13974: I2 ^see 1)
=>WM: (13973: N997 ^status complete)
<=WM: (13961: I2 ^dir R)
<=WM: (13960: I2 ^reward 1)
<=WM: (13959: I2 ^see 0)
=>WM: (13977: I2 ^level-1 R1-root)
<=WM: (13962: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1993 = 0.1398795999120246)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1994 = 0.5523820607022403)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1001 ^value 1 +)
 (R1 ^reward R1001 +)
Firing propose*predict-yes
 -->
 (O1995 ^name predict-yes +)
 (S1 ^operator O1995 +)
Firing propose*predict-no
 -->
 (O1996 ^name predict-no +)
 (S1 ^operator O1996 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1994 = 0.4476195574206818)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1993 = 0.1844132735858656)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1994 ^name predict-no +)
 (S1 ^operator O1994 +)
Retracting propose*predict-yes
 -->
 (O1993 ^name predict-yes +)
 (S1 ^operator O1993 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1000 ^value 1 +)
 (R1 ^reward R1000 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O1994 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1994 = 0.4476195574206818)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O1993 = 0.8155947374398671)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1993 = 0.1844132735858656)
=>WM: (13984: S1 ^operator O1996 +)
=>WM: (13983: S1 ^operator O1995 +)
=>WM: (13982: O1996 ^name predict-no)
=>WM: (13981: O1995 ^name predict-yes)
=>WM: (13980: R1001 ^value 1)
=>WM: (13979: R1 ^reward R1001)
=>WM: (13978: I3 ^see 1)
<=WM: (13969: S1 ^operator O1993 +)
<=WM: (13971: S1 ^operator O1993)
<=WM: (13970: S1 ^operator O1994 +)
<=WM: (13964: R1 ^reward R1000)
<=WM: (13963: I3 ^see 0)
<=WM: (13967: O1994 ^name predict-no)
<=WM: (13966: O1993 ^name predict-yes)
<=WM: (13965: R1000 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1995 = 0.1844132735858656)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1995 = 0.1398795999120246)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1996 = 0.4476195574206818)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1996 = 0.5523820607022403)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1994 = 0.4476195574206818)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1994 = 0.5523820607022403)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1993 = 0.1844132735858656)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1993 = 0.1398795999120246)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.675417 -0.491003 0.184413 -> 0.675415 -0.491003 0.184412(R,m,v=1,0.9,0.0905325)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324594 0.491001 0.815595 -> 0.324592 0.491001 0.815594(R,m,v=1,1,0)
=>WM: (13985: S1 ^operator O1996)

   998:    O: O1996 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N998 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N997 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13986: I3 ^predict-no N998)
<=WM: (13973: N997 ^status complete)
<=WM: (13972: I3 ^predict-yes N997)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13990: I2 ^dir L)
=>WM: (13989: I2 ^reward 1)
=>WM: (13988: I2 ^see 0)
=>WM: (13987: N998 ^status complete)
<=WM: (13976: I2 ^dir R)
<=WM: (13975: I2 ^reward 1)
<=WM: (13974: I2 ^see 1)
=>WM: (13991: I2 ^level-1 R0-root)
<=WM: (13977: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1995 = 0.6104611932916519)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1996 = 0.1063475139796038)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1002 ^value 1 +)
 (R1 ^reward R1002 +)
Firing propose*predict-yes
 -->
 (O1997 ^name predict-yes +)
 (S1 ^operator O1997 +)
Firing propose*predict-no
 -->
 (O1998 ^name predict-no +)
 (S1 ^operator O1998 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 0.3873370065427176)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.3895396873671274)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1996 ^name predict-no +)
 (S1 ^operator O1996 +)
Retracting propose*predict-yes
 -->
 (O1995 ^name predict-yes +)
 (S1 ^operator O1995 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1001 ^value 1 +)
 (R1 ^reward R1001 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
 -->
 (S1 ^operator O1996 = 0.5523820607022403)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1996 = 0.4476195574206818)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
 -->
 (S1 ^operator O1995 = 0.1398795999120246)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1995 = 0.1844120719320057)
=>WM: (13999: S1 ^operator O1998 +)
=>WM: (13998: S1 ^operator O1997 +)
=>WM: (13997: I3 ^dir L)
=>WM: (13996: O1998 ^name predict-no)
=>WM: (13995: O1997 ^name predict-yes)
=>WM: (13994: R1002 ^value 1)
=>WM: (13993: R1 ^reward R1002)
=>WM: (13992: I3 ^see 0)
<=WM: (13983: S1 ^operator O1995 +)
<=WM: (13984: S1 ^operator O1996 +)
<=WM: (13985: S1 ^operator O1996)
<=WM: (13968: I3 ^dir R)
<=WM: (13979: R1 ^reward R1001)
<=WM: (13978: I3 ^see 1)
<=WM: (13982: O1996 ^name predict-no)
<=WM: (13981: O1995 ^name predict-yes)
<=WM: (13980: R1001 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1997 = 0.3895396873671274)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1997 = 0.6104611932916519)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1998 = 0.3873370065427176)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1998 = 0.1063475139796038)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 0.3873370065427176)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1996 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.3895396873671274)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1995 = 0.6104611932916519)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174913 0.44762 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.930233,0.065407)
RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377468 0.174914 0.552382 -> 0.377468 0.174914 0.552382(R,m,v=1,1,0)
=>WM: (14000: S1 ^operator O1997)

   999:    O: O1997 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N999 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N998 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14001: I3 ^predict-yes N999)
<=WM: (13987: N998 ^status complete)
<=WM: (13986: I3 ^predict-no N998)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14005: I2 ^dir R)
=>WM: (14004: I2 ^reward 1)
=>WM: (14003: I2 ^see 1)
=>WM: (14002: N999 ^status complete)
<=WM: (13990: I2 ^dir L)
<=WM: (13989: I2 ^reward 1)
<=WM: (13988: I2 ^see 0)
=>WM: (14006: I2 ^level-1 L1-root)
<=WM: (13991: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1998 = -0.02155734064455064)
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1997 = 0.815582443315254)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1003 ^value 1 +)
 (R1 ^reward R1003 +)
Firing propose*predict-yes
 -->
 (O1999 ^name predict-yes +)
 (S1 ^operator O1999 +)
Firing propose*predict-no
 -->
 (O2000 ^name predict-no +)
 (S1 ^operator O2000 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1998 = 0.4476193147022436)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1997 = 0.1844120719320057)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1998 ^name predict-no +)
 (S1 ^operator O1998 +)
Retracting propose*predict-yes
 -->
 (O1997 ^name predict-yes +)
 (S1 ^operator O1997 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1002 ^value 1 +)
 (R1 ^reward R1002 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
 -->
 (S1 ^operator O1998 = 0.1063475139796038)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1998 = 0.3873370065427176)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
 -->
 (S1 ^operator O1997 = 0.6104611932916519)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1997 = 0.3895396873671274)
=>WM: (14014: S1 ^operator O2000 +)
=>WM: (14013: S1 ^operator O1999 +)
=>WM: (14012: I3 ^dir R)
=>WM: (14011: O2000 ^name predict-no)
=>WM: (14010: O1999 ^name predict-yes)
=>WM: (14009: R1003 ^value 1)
=>WM: (14008: R1 ^reward R1003)
=>WM: (14007: I3 ^see 1)
<=WM: (13998: S1 ^operator O1997 +)
<=WM: (14000: S1 ^operator O1997)
<=WM: (13999: S1 ^operator O1998 +)
<=WM: (13997: I3 ^dir L)
<=WM: (13993: R1 ^reward R1002)
<=WM: (13992: I3 ^see 0)
<=WM: (13996: O1998 ^name predict-no)
<=WM: (13995: O1997 ^name predict-yes)
<=WM: (13994: R1002 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1999 = 0.1844120719320057)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1999 = 0.815582443315254)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2000 = 0.4476193147022436)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O2000 = -0.02155734064455064)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1998 = 0.4476193147022436)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O1998 = -0.02155734064455064)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1997 = 0.1844120719320057)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1997 = 0.815582443315254)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.892857,0.0962361)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
=>WM: (14015: S1 ^operator O1999)

  1000:    O: O1999 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1000 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N999 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14016: I3 ^predict-yes N1000)
<=WM: (14002: N999 ^status complete)
<=WM: (14001: I3 ^predict-yes N999)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\-/|\-/|\---- Input Phase --- 
=>WM: (14020: I2 ^dir U)
=>WM: (14019: I2 ^reward 1)
=>WM: (14018: I2 ^see 1)
=>WM: (14017: N1000 ^status complete)
<=WM: (14005: I2 ^dir R)
<=WM: (14004: I2 ^reward 1)
<=WM: (14003: I2 ^see 1)
=>WM: (14021: I2 ^level-1 R1-root)
<=WM: (14006: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1004 ^value 1 +)
 (R1 ^reward R1004 +)
Firing propose*predict-yes
 -->
 (O2001 ^name predict-yes +)
 (S1 ^operator O2001 +)
Firing propose*predict-no
 -->
 (O2002 ^name predict-no +)
 (S1 ^operator O2002 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2000 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1999 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2000 ^name predict-no +)
 (S1 ^operator O2000 +)
Retracting propose*predict-yes
 -->
 (O1999 ^name predict-yes +)
 (S1 ^operator O1999 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1003 ^value 1 +)
 (R1 ^reward R1003 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
 -->
 (S1 ^operator O2000 = -0.02155734064455064)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2000 = 0.4476193147022436)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
 -->
 (S1 ^operator O1999 = 0.815582443315254)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1999 = 0.1844120719320057)
=>WM: (14028: S1 ^operator O2002 +)
=>WM: (14027: S1 ^operator O2001 +)
=>WM: (14026: I3 ^dir U)
=>WM: (14025: O2002 ^name predict-no)
=>WM: (14024: O2001 ^name predict-yes)
=>WM: (14023: R1004 ^value 1)
=>WM: (14022: R1 ^reward R1004)
<=WM: (14013: S1 ^operator O1999 +)
<=WM: (14015: S1 ^operator O1999)
<=WM: (14014: S1 ^operator O2000 +)
<=WM: (14012: I3 ^dir R)
<=WM: (14008: R1 ^reward R1003)
<=WM: (14011: O2000 ^name predict-no)
<=WM: (14010: O1999 ^name predict-yes)
<=WM: (14009: R1003 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2001 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2002 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2000 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1999 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.675415 -0.491003 0.184412 -> 0.675416 -0.491003 0.184413(R,m,v=1,0.900585,0.0900585)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324578 0.491005 0.815582 -> 0.324579 0.491004 0.815583(R,m,v=1,1,0)
=>WM: (14029: S1 ^operator O2002)

  1001:    O: O2002 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1001 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1000 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14030: I3 ^predict-no N1001)
<=WM: (14017: N1000 ^status complete)
<=WM: (14016: I3 ^predict-yes N1000)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/--- Input Phase --- 
=>WM: (14034: I2 ^dir L)
=>WM: (14033: I2 ^reward 1)
=>WM: (14032: I2 ^see 0)
=>WM: (14031: N1001 ^status complete)
<=WM: (14020: I2 ^dir U)
<=WM: (14019: I2 ^reward 1)
<=WM: (14018: I2 ^see 1)
=>WM: (14035: I2 ^level-1 R1-root)
<=WM: (14021: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O2001 = 0.6104596086348102)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O2002 = 0.2714993082286609)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1005 ^value 1 +)
 (R1 ^reward R1005 +)
Firing propose*predict-yes
 -->
 (O2003 ^name predict-yes +)
 (S1 ^operator O2003 +)
Firing propose*predict-no
 -->
 (O2004 ^name predict-no +)
 (S1 ^operator O2004 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2002 = 0.3873370065427176)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2001 = 0.3895395552683104)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2002 ^name predict-no +)
 (S1 ^operator O2002 +)
Retracting propose*predict-yes
 -->
 (O2001 ^name predict-yes +)
 (S1 ^operator O2001 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1004 ^value 1 +)
 (R1 ^reward R1004 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2002 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2001 = 0.)
=>WM: (14043: S1 ^operator O2004 +)
=>WM: (14042: S1 ^operator O2003 +)
=>WM: (14041: I3 ^dir L)
=>WM: (14040: O2004 ^name predict-no)
=>WM: (14039: O2003 ^name predict-yes)
=>WM: (14038: R1005 ^value 1)
=>WM: (14037: R1 ^reward R1005)
=>WM: (14036: I3 ^see 0)
<=WM: (14027: S1 ^operator O2001 +)
<=WM: (14028: S1 ^operator O2002 +)
<=WM: (14029: S1 ^operator O2002)
<=WM: (14026: I3 ^dir U)
<=WM: (14022: R1 ^reward R1004)
<=WM: (14007: I3 ^see 1)
<=WM: (14025: O2002 ^name predict-no)
<=WM: (14024: O2001 ^name predict-yes)
<=WM: (14023: R1004 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O2003 = 0.6104596086348102)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3895395552683104)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O2004 = 0.2714993082286609)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3873370065427176)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2002 = 0.3873370065427176)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O2002 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2001 = 0.3895395552683104)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O2001 = 0.6104596086348102)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14044: S1 ^operator O2003)

  1002:    O: O2003 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1002 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1001 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14045: I3 ^predict-yes N1002)
<=WM: (14031: N1001 ^status complete)
<=WM: (14030: I3 ^predict-no N1001)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\-/--- Input Phase --- 
=>WM: (14049: I2 ^dir L)
=>WM: (14048: I2 ^reward 1)
=>WM: (14047: I2 ^see 1)
=>WM: (14046: N1002 ^status complete)
<=WM: (14034: I2 ^dir L)
<=WM: (14033: I2 ^reward 1)
<=WM: (14032: I2 ^see 0)
=>WM: (14050: I2 ^level-1 L1-root)
<=WM: (14035: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O2004 = 0.6126627914480096)
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O2003 = -0.02274740735326741)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1006 ^value 1 +)
 (R1 ^reward R1006 +)
Firing propose*predict-yes
 -->
 (O2005 ^name predict-yes +)
 (S1 ^operator O2005 +)
Firing propose*predict-no
 -->
 (O2006 ^name predict-no +)
 (S1 ^operator O2006 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3873370065427176)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3895395552683104)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2004 ^name predict-no +)
 (S1 ^operator O2004 +)
Retracting propose*predict-yes
 -->
 (O2003 ^name predict-yes +)
 (S1 ^operator O2003 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1005 ^value 1 +)
 (R1 ^reward R1005 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3873370065427176)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O2004 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3895395552683104)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O2003 = 0.6104596086348102)
=>WM: (14057: S1 ^operator O2006 +)
=>WM: (14056: S1 ^operator O2005 +)
=>WM: (14055: O2006 ^name predict-no)
=>WM: (14054: O2005 ^name predict-yes)
=>WM: (14053: R1006 ^value 1)
=>WM: (14052: R1 ^reward R1006)
=>WM: (14051: I3 ^see 1)
<=WM: (14042: S1 ^operator O2003 +)
<=WM: (14044: S1 ^operator O2003)
<=WM: (14043: S1 ^operator O2004 +)
<=WM: (14037: R1 ^reward R1005)
<=WM: (14036: I3 ^see 0)
<=WM: (14040: O2004 ^name predict-no)
<=WM: (14039: O2003 ^name predict-yes)
<=WM: (14038: R1005 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2005 = 0.3895395552683104)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O2005 = -0.02274740735326741)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2006 = 0.3873370065427176)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O2006 = 0.6126627914480096)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 0.3873370065427176)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O2004 = 0.6126627914480096)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.3895395552683104)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O2003 = -0.02274740735326741)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.893491,0.0957312)
RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.61046 -> 0.288049 0.322411 0.61046(R,m,v=1,1,0)
=>WM: (14058: S1 ^operator O2006)

  1003:    O: O2006 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1003 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1002 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14059: I3 ^predict-no N1003)
<=WM: (14046: N1002 ^status complete)
<=WM: (14045: I3 ^predict-yes N1002)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14063: I2 ^dir R)
=>WM: (14062: I2 ^reward 1)
=>WM: (14061: I2 ^see 0)
=>WM: (14060: N1003 ^status complete)
<=WM: (14049: I2 ^dir L)
<=WM: (14048: I2 ^reward 1)
<=WM: (14047: I2 ^see 1)
=>WM: (14064: I2 ^level-1 L0-root)
<=WM: (14050: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O2005 = 0.8155935357860071)
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O2006 = -0.00558448899823713)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1007 ^value 1 +)
 (R1 ^reward R1007 +)
Firing propose*predict-yes
 -->
 (O2007 ^name predict-yes +)
 (S1 ^operator O2007 +)
Firing propose*predict-no
 -->
 (O2008 ^name predict-no +)
 (S1 ^operator O2008 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2006 = 0.4476193147022436)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2005 = 0.1844128946449167)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2006 ^name predict-no +)
 (S1 ^operator O2006 +)
Retracting propose*predict-yes
 -->
 (O2005 ^name predict-yes +)
 (S1 ^operator O2005 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1006 ^value 1 +)
 (R1 ^reward R1006 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
 -->
 (S1 ^operator O2006 = 0.6126627914480096)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2006 = 0.3873370065427176)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
 -->
 (S1 ^operator O2005 = -0.02274740735326741)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2005 = 0.3895396806828423)
=>WM: (14072: S1 ^operator O2008 +)
=>WM: (14071: S1 ^operator O2007 +)
=>WM: (14070: I3 ^dir R)
=>WM: (14069: O2008 ^name predict-no)
=>WM: (14068: O2007 ^name predict-yes)
=>WM: (14067: R1007 ^value 1)
=>WM: (14066: R1 ^reward R1007)
=>WM: (14065: I3 ^see 0)
<=WM: (14056: S1 ^operator O2005 +)
<=WM: (14057: S1 ^operator O2006 +)
<=WM: (14058: S1 ^operator O2006)
<=WM: (14041: I3 ^dir L)
<=WM: (14052: R1 ^reward R1006)
<=WM: (14051: I3 ^see 1)
<=WM: (14055: O2006 ^name predict-no)
<=WM: (14054: O2005 ^name predict-yes)
<=WM: (14053: R1006 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2007 = 0.1844128946449167)
Firing prefer*rvt*predict-yes*H0*3*v1*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O2007 = 0.8155935357860071)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2008 = 0.4476193147022436)
Firing prefer*rvt*predict-no*H0*4*v1*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O2008 = -0.00558448899823713)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2006 = 0.4476193147022436)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O2006 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2005 = 0.1844128946449167)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O2005 = 0.8155935357860071)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.932203,0.0635593)
RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280919 0.331744 0.612663 -> 0.280919 0.331744 0.612663(R,m,v=1,1,0)
=>WM: (14073: S1 ^operator O2007)

  1004:    O: O2007 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1004 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1003 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14074: I3 ^predict-yes N1004)
<=WM: (14060: N1003 ^status complete)
<=WM: (14059: I3 ^predict-no N1003)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (14078: I2 ^dir U)
=>WM: (14077: I2 ^reward 1)
=>WM: (14076: I2 ^see 1)
=>WM: (14075: N1004 ^status complete)
<=WM: (14063: I2 ^dir R)
<=WM: (14062: I2 ^reward 1)
<=WM: (14061: I2 ^see 0)
=>WM: (14079: I2 ^level-1 R1-root)
<=WM: (14064: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1008 ^value 1 +)
 (R1 ^reward R1008 +)
Firing propose*predict-yes
 -->
 (O2009 ^name predict-yes +)
 (S1 ^operator O2009 +)
Firing propose*predict-no
 -->
 (O2010 ^name predict-no +)
 (S1 ^operator O2010 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2008 ^name predict-no +)
 (S1 ^operator O2008 +)
Retracting propose*predict-yes
 -->
 (O2007 ^name predict-yes +)
 (S1 ^operator O2007 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1007 ^value 1 +)
 (R1 ^reward R1007 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
 -->
 (S1 ^operator O2008 = -0.00558448899823713)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2008 = 0.4476193147022436)
Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
 -->
 (S1 ^operator O2007 = 0.8155935357860071)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2007 = 0.1844128946449167)
=>WM: (14087: S1 ^operator O2010 +)
=>WM: (14086: S1 ^operator O2009 +)
=>WM: (14085: I3 ^dir U)
=>WM: (14084: O2010 ^name predict-no)
=>WM: (14083: O2009 ^name predict-yes)
=>WM: (14082: R1008 ^value 1)
=>WM: (14081: R1 ^reward R1008)
=>WM: (14080: I3 ^see 1)
<=WM: (14071: S1 ^operator O2007 +)
<=WM: (14073: S1 ^operator O2007)
<=WM: (14072: S1 ^operator O2008 +)
<=WM: (14070: I3 ^dir R)
<=WM: (14066: R1 ^reward R1007)
<=WM: (14065: I3 ^see 0)
<=WM: (14069: O2008 ^name predict-no)
<=WM: (14068: O2007 ^name predict-yes)
<=WM: (14067: R1007 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2009 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2010 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.675416 -0.491003 0.184413 -> 0.675415 -0.491003 0.184412(R,m,v=1,0.901163,0.0895893)
RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324592 0.491001 0.815594 -> 0.324591 0.491001 0.815593(R,m,v=1,1,0)
=>WM: (14088: S1 ^operator O2010)

  1005:    O: O2010 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1005 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1004 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14089: I3 ^predict-no N1005)
<=WM: (14075: N1004 ^status complete)
<=WM: (14074: I3 ^predict-yes N1004)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/|--- Input Phase --- 
=>WM: (14093: I2 ^dir L)
=>WM: (14092: I2 ^reward 1)
=>WM: (14091: I2 ^see 0)
=>WM: (14090: N1005 ^status complete)
<=WM: (14078: I2 ^dir U)
<=WM: (14077: I2 ^reward 1)
<=WM: (14076: I2 ^see 1)
=>WM: (14094: I2 ^level-1 R1-root)
<=WM: (14079: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O2009 = 0.6104597340493421)
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O2010 = 0.2714993082286609)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1009 ^value 1 +)
 (R1 ^reward R1009 +)
Firing propose*predict-yes
 -->
 (O2011 ^name predict-yes +)
 (S1 ^operator O2011 +)
Firing propose*predict-no
 -->
 (O2012 ^name predict-no +)
 (S1 ^operator O2012 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2010 = 0.3873370368441085)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2009 = 0.3895396806828423)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2010 ^name predict-no +)
 (S1 ^operator O2010 +)
Retracting propose*predict-yes
 -->
 (O2009 ^name predict-yes +)
 (S1 ^operator O2009 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1008 ^value 1 +)
 (R1 ^reward R1008 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2010 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2009 = 0.)
=>WM: (14102: S1 ^operator O2012 +)
=>WM: (14101: S1 ^operator O2011 +)
=>WM: (14100: I3 ^dir L)
=>WM: (14099: O2012 ^name predict-no)
=>WM: (14098: O2011 ^name predict-yes)
=>WM: (14097: R1009 ^value 1)
=>WM: (14096: R1 ^reward R1009)
=>WM: (14095: I3 ^see 0)
<=WM: (14086: S1 ^operator O2009 +)
<=WM: (14087: S1 ^operator O2010 +)
<=WM: (14088: S1 ^operator O2010)
<=WM: (14085: I3 ^dir U)
<=WM: (14081: R1 ^reward R1008)
<=WM: (14080: I3 ^see 1)
<=WM: (14084: O2010 ^name predict-no)
<=WM: (14083: O2009 ^name predict-yes)
<=WM: (14082: R1008 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O2011 = 0.6104597340493421)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2011 = 0.3895396806828423)
Firing prefer*rvt*predict-yes*H0*1*v1*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O2012 = 0.2714993082286609)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2012 = 0.3873370368441085)
Firing prefer*rvt*predict-no*H0*2*v1*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2010 = 0.3873370368441085)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O2010 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2009 = 0.3895396806828423)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O2009 = 0.6104597340493421)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14103: S1 ^operator O2011)

  1006:    O: O2011 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1006 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1005 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14104: I3 ^predict-yes N1006)
<=WM: (14090: N1005 ^status complete)
<=WM: (14089: I3 ^predict-no N1005)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14108: I2 ^dir U)
=>WM: (14107: I2 ^reward 1)
=>WM: (14106: I2 ^see 1)
=>WM: (14105: N1006 ^status complete)
<=WM: (14093: I2 ^dir L)
<=WM: (14092: I2 ^reward 1)
<=WM: (14091: I2 ^see 0)
=>WM: (14109: I2 ^level-1 L1-root)
<=WM: (14094: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1010 ^value 1 +)
 (R1 ^reward R1010 +)
Firing propose*predict-yes
 -->
 (O2013 ^name predict-yes +)
 (S1 ^operator O2013 +)
Firing propose*predict-no
 -->
 (O2014 ^name predict-no +)
 (S1 ^operator O2014 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2012 = 0.9999999999999999)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2011 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2012 ^name predict-no +)
 (S1 ^operator O2012 +)
Retracting propose*predict-yes
 -->
 (O2011 ^name predict-yes +)
 (S1 ^operator O2011 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1009 ^value 1 +)
 (R1 ^reward R1009 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2012 = 0.3873370368441085)
Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
 -->
 (S1 ^operator O2012 = 0.2714993082286609)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2011 = 0.3895396806828423)
Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
 -->
 (S1 ^operator O2011 = 0.6104597340493421)
=>WM: (14117: S1 ^operator O2014 +)
=>WM: (14116: S1 ^operator O2013 +)
=>WM: (14115: I3 ^dir U)
=>WM: (14114: O2014 ^name predict-no)
=>WM: (14113: O2013 ^name predict-yes)
=>WM: (14112: R1010 ^value 1)
=>WM: (14111: R1 ^reward R1010)
=>WM: (14110: I3 ^see 1)
<=WM: (14101: S1 ^operator O2011 +)
<=WM: (14103: S1 ^operator O2011)
<=WM: (14102: S1 ^operator O2012 +)
<=WM: (14100: I3 ^dir L)
<=WM: (14096: R1 ^reward R1009)
<=WM: (14095: I3 ^see 0)
<=WM: (14099: O2012 ^name predict-no)
<=WM: (14098: O2011 ^name predict-yes)
<=WM: (14097: R1009 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2013 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2014 = 0.9999999999999999)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2012 = 0.9999999999999999)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2011 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954