stdout-flip-2.5K_2.txt

/flipv2/20121112-101138-2.5K-ReLST-Evan/stdout-flip-2.5K_2.txt

https://bitbucket.org/evan13579b/soar-ziggurat · Plain Text · 35033 lines · 32933 code · 2100 blank · 0 comment · 0 complexity · 89a52a67c906c4fc3a1a3d7f661a0621 MD5 · raw file

Seeding... 2
dir: dir isU
Python-Soar Flip environment.
To accept commands from an external sml process, you'll need to
type 'slave <log file> <n decisons>' at the prompt...
sourcing 'flip_predict.soar'
***********
Total: 11 productions sourced.

seeding Soar with 2 ...

soar> Entering slave mode:
  - log file 'rl-slave-2.5K_2.log'....
  - will exit slave mode after 2500 decisions
  waiting for commands from an externally connected sml process...
-/|sleeping...
\sleeping...
-sleeping...
/sleeping...
|sleeping...
\-/|\-/|\-sleeping...
/|\-/|\sleeping...
-1:    O: O1 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
rule alias: '*'

rule alias: '*'

/|\-/|\-2:    O: O4 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\-3:    O: O5 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|\4:    O: O7 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
-5:    O: O10 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\6:    O: O12 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|7:    O: O14 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\8:    O: O15 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
-/|9:    O: O17 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/10:    O: O19 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\-11:    O: O21 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

/12:    O: O24 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
|\-13:    O: O25 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\14:    O: O27 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-/15:    O: O30 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\16:    O: O32 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|17:    O: O34 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\18:    O: O35 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
-/|19:    O: O37 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
\-/20:    O: O39 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-21:    O: O42 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/22:    O: O43 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
|\-23:    O: O45 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\24:    O: O48 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
-/25:    O: O49 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-26:    O: O52 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|27:    O: O53 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
\-/28:    O: O55 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
|\-29:    O: O58 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
/30:    O: O60 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\31:    O: O61 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
-32:    O: O64 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\33:    O: O65 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-34:    O: O68 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\35:    O: O70 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-36:    O: O72 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isU
/|\37:    O: O74 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|38:    O: O76 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/39:    O: O77 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
|\-40:    O: O79 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
/|\41:    O: O82 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-42:    O: O84 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\43:    O: O85 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|44:    O: O87 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
\-/45:    O: O90 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\46:    O: O92 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isU
-/|47:    O: O94 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-48:    O: O96 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|49:    O: O97 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/50:    O: O99 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\-/|\-sleeping...
/51:    O: O101 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
rule alias: '*'

|52:    O: O104 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\53:    O: O105 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-54:    O: O108 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\55:    O: O109 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-/|\sleeping...
-56:    O: O111 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
/|57:    O: O114 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\58:    O: O115 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
-/|59:    O: O117 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
\-/60:    O: O120 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-61:    O: O122 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/62:    O: O123 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
|\63:    O: O126 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
-/64:    O: O128 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-65:    O: O130 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/66:    O: O132 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|67:    O: O134 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\68:    O: O136 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/69:    O: O138 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|70:    O: O140 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-71:    O: O142 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

rule alias: '*'

/72:    O: O144 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-73:    O: O145 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|74:    O: O147 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
\-75:    O: O149 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
/|\76:    O: O152 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|77:    O: O154 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
\-/78:    O: O156 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|79:    O: O158 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
\-80:    O: O160 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\81:    O: O162 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
rule alias: '*'

rule alias: '*'

-82:    O: O163 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\83:    O: O166 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|84:    O: O168 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-85:    O: O169 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
/|\86:    O: O172 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|87:    O: O173 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
\-/88:    O: O176 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\89:    O: O177 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/90:    O: O180 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
|\-91:    O: O182 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
rule alias: '*'

rule alias: '*'

/92:    O: O184 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\93:    O: O186 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|94:    O: O187 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/95:    O: O190 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-96:    O: O191 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/97:    O: O194 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-98:    O: O196 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\99:    O: O198 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
-100:    O: O200 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\101:    O: O201 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/102:    O: O203 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-103:    O: O205 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|104:    O: O208 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/105:    O: O210 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\106:    O: O212 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|107:    O: O214 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/108:    O: O216 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-109:    O: O218 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|110:    O: O220 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/111:    O: O222 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
rule alias: '*'

|112:    O: O224 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\113:    O: O226 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/114:    O: O228 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-115:    O: O229 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\-116:    O: O231 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|117:    O: O234 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/118:    O: O236 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|119:    O: O238 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/120:    O: O239 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-121:    O: O241 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
/122:    O: O244 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-123:    O: O245 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|124:    O: O248 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
\-/125:    O: O249 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|126:    O: O251 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/127:    O: O254 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\128:    O: O256 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|129:    O: O258 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\130:    O: O259 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-/131:    O: O262 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|132:    O: O264 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/133:    O: O265 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
|\-134:    O: O267 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/135:    O: O269 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\136:    O: O271 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-137:    O: O274 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\138:    O: O275 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|139:    O: O278 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/140:    O: O279 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-141:    O: O281 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/142:    O: O284 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\143:    O: O286 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/144:    O: O287 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-145:    O: O290 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|146:    O: O292 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-147:    O: O293 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\148:    O: O296 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/149:    O: O297 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-150:    O: O299 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\151:    O: O301 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-152:    O: O304 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\153:    O: O306 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|154:    O: O308 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/155:    O: O310 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|156:    O: O311 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-157:    O: O314 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|158:    O: O316 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-159:    O: O318 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\160:    O: O320 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|161:    O: O321 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\162:    O: O323 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|163:    O: O325 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-164:    O: O327 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\165:    O: O329 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
-/|166:    O: O332 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/167:    O: O334 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\168:    O: O335 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/169:    O: O337 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-170:    O: O340 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|171:    O: O342 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
\172:    O: O344 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/173:    O: O345 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\174:    O: O347 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|\175:    O: O350 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/176:    O: O352 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-177:    O: O354 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|178:    O: O356 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\179:    O: O358 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|180:    O: O360 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\181:    O: O361 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-182:    O: O364 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\183:    O: O365 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-184:    O: O367 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/185:    O: O369 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-186:    O: O372 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\187:    O: O374 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|188:    O: O376 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/189:    O: O378 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
|\-190:    O: O380 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|191:    O: O382 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\192:    O: O383 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|193:    O: O385 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\194:    O: O388 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|195:    O: O389 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/196:    O: O392 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-197:    O: O394 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\198:    O: O395 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/199:    O: O398 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-200:    O: O400 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\-/|201:    O: O402 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\202:    O: O403 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-203:    O: O406 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\204:    O: O407 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|205:    O: O409 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\206:    O: O412 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|207:    O: O414 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/208:    O: O415 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
|\209:    O: O418 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/210:    O: O420 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\211:    O: O422 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-212:    O: O423 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\213:    O: O426 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-214:    O: O428 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\215:    O: O429 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/216:    O: O431 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-217:    O: O433 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\218:    O: O436 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|219:    O: O437 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-220:    O: O440 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\221:    O: O441 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-222:    O: O444 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|223:    O: O446 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-224:    O: O447 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|225:    O: O449 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
\-/226:    O: O452 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\227:    O: O454 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|228:    O: O456 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\229:    O: O457 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|230:    O: O460 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-231:    O: O462 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/232:    O: O464 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\233:    O: O465 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|234:    O: O468 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-235:    O: O470 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|236:    O: O471 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/237:    O: O474 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
|\238:    O: O476 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|\239:    O: O478 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/240:    O: O480 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-241:    O: O481 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
/242:    O: O484 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-243:    O: O486 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\244:    O: O487 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/245:    O: O489 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-246:    O: O491 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
/|247:    O: O494 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/248:    O: O496 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-249:    O: O498 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\250:    O: O500 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|251:    O: O501 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\252:    O: O504 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|253:    O: O506 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
\-/254:    O: O508 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
|\255:    O: O509 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/256:    O: O511 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
|\-257:    O: O514 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\258:    O: O516 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|259:    O: O517 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
\-/260:    O: O519 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|261:    O: O521 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\262:    O: O524 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|263:    O: O525 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/264:    O: O527 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-265:    O: O529 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/266:    O: O532 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-267:    O: O534 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\268:    O: O536 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|269:    O: O537 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-270:    O: O539 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\271:    O: O542 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-272:    O: O544 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\273:    O: O546 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|274:    O: O548 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/275:    O: O549 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-276:    O: O552 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/277:    O: O553 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\278:    O: O555 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-279:    O: O558 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\280:    O: O560 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|281:    O: O561 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\282:    O: O563 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-283:    O: O565 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
/|\284:    O: O567 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|285:    O: O569 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
\-/286:    O: O572 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
|\-287:    O: O574 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\288:    O: O576 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-289:    O: O578 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\290:    O: O580 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|291:    O: O582 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\292:    O: O583 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-293:    O: O585 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\294:    O: O587 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/295:    O: O589 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|296:    O: O592 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/297:    O: O593 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-298:    O: O595 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/299:    O: O598 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-300:    O: O599 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
/|\301:    O: O602 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-302:    O: O604 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/303:    O: O606 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-304:    O: O608 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\305:    O: O610 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/306:    O: O611 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-307:    O: O614 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\308:    O: O615 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/309:    O: O617 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
|\310:    O: O620 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|311:    O: O621 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\312:    O: O624 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
-/|313:    O: O626 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-314:    O: O628 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\315:    O: O630 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-316:    O: O632 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\317:    O: O634 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isR
-/318:    O: O635 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-319:    O: O638 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|320:    O: O640 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/321:    O: O642 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|322:    O: O644 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/323:    O: O645 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|324:    O: O648 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/325:    O: O650 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|326:    O: O651 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/327:    O: O654 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\328:    O: O656 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|329:    O: O657 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/330:    O: O660 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-331:    O: O662 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/332:    O: O664 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-333:    O: O665 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
/|\334:    O: O667 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|335:    O: O669 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/336:    O: O672 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-337:    O: O674 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|338:    O: O675 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-339:    O: O678 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|340:    O: O679 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\341:    O: O681 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-342:    O: O684 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\343:    O: O685 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/|344:    O: O688 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/345:    O: O690 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-346:    O: O692 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|347:    O: O693 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/348:    O: O696 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-349:    O: O698 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\350:    O: O700 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-351:    O: O702 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/352:    O: O703 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\-353:    O: O706 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\354:    O: O707 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|355:    O: O709 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-356:    O: O711 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/357:    O: O714 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-358:    O: O716 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|359:    O: O718 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\360:    O: O720 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|361:    O: O721 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
\362:    O: O723 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/363:    O: O726 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\364:    O: O728 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|365:    O: O730 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\366:    O: O731 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|367:    O: O734 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/368:    O: O735 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|369:    O: O737 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isL
\370:    O: O739 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/371:    O: O742 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|372:    O: O743 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\373:    O: O746 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/374:    O: O748 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\375:    O: O750 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/376:    O: O752 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-377:    O: O754 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|378:    O: O756 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/379:    O: O757 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-380:    O: O759 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\381:    O: O762 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-382:    O: O764 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\383:    O: O766 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|384:    O: O767 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/385:    O: O770 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-386:    O: O772 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|387:    O: O773 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-388:    O: O776 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|389:    O: O778 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/390:    O: O780 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-391:    O: O782 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/392:    O: O783 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\393:    O: O785 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|394:    O: O788 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/395:    O: O790 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\396:    O: O792 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|397:    O: O794 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/398:    O: O796 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-399:    O: O797 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\400:    O: O800 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/401:    O: O802 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
|402:    O: O803 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/403:    O: O806 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-404:    O: O808 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/405:    O: O810 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\406:    O: O812 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-407:    O: O813 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|408:    O: O816 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-409:    O: O817 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\410:    O: O820 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|411:    O: O821 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\412:    O: O823 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/413:    O: O825 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-414:    O: O827 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|415:    O: O829 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
\-/416:    O: O832 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|417:    O: O833 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/418:    O: O836 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-419:    O: O838 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/420:    O: O839 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-421:    O: O842 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/422:    O: O844 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\423:    O: O846 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/424:    O: O848 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\425:    O: O849 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/426:    O: O852 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\427:    O: O854 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|428:    O: O855 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-429:    O: O858 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\430:    O: O859 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-431:    O: O862 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/432:    O: O864 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-433:    O: O866 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|434:    O: O867 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
\-/435:    O: O870 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|436:    O: O872 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/437:    O: O873 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isU
|\438:    O: O876 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|439:    O: O878 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-440:    O: O880 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|441:    O: O882 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\442:    O: O884 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/443:    O: O886 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-444:    O: O888 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\445:    O: O889 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-446:    O: O891 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\-447:    O: O894 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\448:    O: O896 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|449:    O: O898 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/450:    O: O899 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-451:    O: O902 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/452:    O: O903 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-453:    O: O906 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|454:    O: O908 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\455:    O: O909 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/456:    O: O912 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|457:    O: O913 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isL
\458:    O: O916 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|459:    O: O917 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/460:    O: O920 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-461:    O: O922 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/462:    O: O924 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|463:    O: O926 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-464:    O: O928 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\465:    O: O930 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/466:    O: O931 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-467:    O: O934 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/468:    O: O935 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\-469:    O: O937 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\470:    O: O939 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|471:    O: O941 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\472:    O: O943 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
-/473:    O: O946 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\474:    O: O948 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|475:    O: O950 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/476:    O: O952 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-477:    O: O953 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isR
/|\478:    O: O955 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|479:    O: O957 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/|480:    O: O959 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-481:    O: O962 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, False)
predict error 1
dir: dir isL
/482:    O: O964 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\483:    O: O965 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|484:    O: O968 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-485:    O: O970 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\486:    O: O972 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|487:    O: O973 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-488:    O: O976 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\489:    O: O978 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/490:    O: O979 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\491:    O: O982 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-492:    O: O983 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\493:    O: O985 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/494:    O: O988 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-495:    O: O990 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\496:    O: O992 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|497:    O: O994 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/498:    O: O996 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\499:    O: O998 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/500:    O: O1000 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-501:    O: O1002 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/502:    O: O1004 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-503:    O: O1005 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/504:    O: O1008 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-505:    O: O1009 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|506:    O: O1012 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/507:    O: O1014 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-/508:    O: O1016 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-509:    O: O1018 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\510:    O: O1020 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|511:    O: O1022 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\512:    O: O1024 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|\513:    O: O1026 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/514:    O: O1027 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-515:    O: O1030 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\516:    O: O1032 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|517:    O: O1034 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/518:    O: O1036 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-519:    O: O1038 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/520:    O: O1040 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\521:    O: O1042 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-522:    O: O1044 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|523:    O: O1046 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-524:    O: O1048 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|525:    O: O1050 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-526:    O: O1052 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isL
/|\527:    O: O1053 (predict-yes)
I see 0 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/528:    O: O1056 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\529:    O: O1058 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|530:    O: O1060 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/531:    O: O1062 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|532:    O: O1063 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/533:    O: O1065 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-534:    O: O1068 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\535:    O: O1070 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|536:    O: O1071 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/537:    O: O1074 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|538:    O: O1075 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-539:    O: O1077 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\540:    O: O1079 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|541:    O: O1082 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\542:    O: O1084 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|543:    O: O1086 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/544:    O: O1087 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-545:    O: O1090 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\546:    O: O1092 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-547:    O: O1093 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|548:    O: O1095 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/549:    O: O1097 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-550:    O: O1100 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\551:    O: O1102 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-552:    O: O1104 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|553:    O: O1106 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-554:    O: O1108 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|555:    O: O1109 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\556:    O: O1112 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-557:    O: O1114 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|558:    O: O1116 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/559:    O: O1117 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, False)
predict error 1
dir: dir isR
|\560:    O: O1120 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/561:    O: O1122 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|562:    O: O1123 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/|563:    O: O1126 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/564:    O: O1128 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|565:    O: O1129 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-566:    O: O1131 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\567:    O: O1133 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|568:    O: O1135 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\569:    O: O1137 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|570:    O: O1140 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/571:    O: O1142 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|572:    O: O1144 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\573:    O: O1146 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/574:    O: O1148 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\575:    O: O1150 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-576:    O: O1151 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|577:    O: O1154 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/578:    O: O1156 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\579:    O: O1158 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|580:    O: O1160 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\581:    O: O1162 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-582:    O: O1163 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\583:    O: O1165 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|584:    O: O1167 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-585:    O: O1169 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|586:    O: O1172 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/587:    O: O1173 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\588:    O: O1176 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/589:    O: O1178 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\590:    O: O1180 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|591:    O: O1182 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\592:    O: O1184 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|593:    O: O1185 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/594:    O: O1188 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-595:    O: O1190 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\596:    O: O1192 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|597:    O: O1193 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/598:    O: O1195 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\599:    O: O1198 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/600:    O: O1200 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-601:    O: O1201 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/602:    O: O1204 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-603:    O: O1205 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/604:    O: O1208 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|605:    O: O1209 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, False)
predict error 1
dir: dir isU
\-606:    O: O1212 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\607:    O: O1213 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-608:    O: O1215 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|609:    O: O1218 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/610:    O: O1220 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-611:    O: O1222 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/612:    O: O1224 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-613:    O: O1226 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|614:    O: O1227 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-615:    O: O1230 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\616:    O: O1232 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|617:    O: O1234 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\618:    O: O1236 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|619:    O: O1238 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/620:    O: O1239 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|\-621:    O: O1242 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/622:    O: O1244 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-623:    O: O1246 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|624:    O: O1248 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/625:    O: O1250 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-/sleeping...
|626:    O: O1251 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-627:    O: O1254 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\628:    O: O1256 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/629:    O: O1258 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|630:    O: O1259 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\631:    O: O1262 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-632:    O: O1264 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|633:    O: O1266 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-634:    O: O1267 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\635:    O: O1270 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-636:    O: O1272 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/637:    O: O1274 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-638:    O: O1275 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|639:    O: O1278 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/640:    O: O1279 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\641:    O: O1281 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-642:    O: O1283 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/643:    O: O1286 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\644:    O: O1287 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|645:    O: O1290 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\646:    O: O1291 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|647:    O: O1294 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-648:    O: O1295 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\649:    O: O1297 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|650:    O: O1300 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/651:    O: O1302 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|652:    O: O1304 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/653:    O: O1305 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\654:    O: O1308 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/655:    O: O1310 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-656:    O: O1312 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|657:    O: O1314 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\658:    O: O1316 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-659:    O: O1318 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\660:    O: O1320 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/|661:    O: O1322 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\662:    O: O1324 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|663:    O: O1326 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-664:    O: O1327 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|665:    O: O1330 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/666:    O: O1332 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-667:    O: O1334 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|668:    O: O1335 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/669:    O: O1337 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\670:    O: O1339 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|671:    O: O1342 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\672:    O: O1343 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|673:    O: O1346 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-674:    O: O1347 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|675:    O: O1349 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/676:    O: O1352 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-677:    O: O1354 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|678:    O: O1356 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/679:    O: O1358 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-680:    O: O1360 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/681:    O: O1361 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|682:    O: O1364 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-683:    O: O1366 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|684:    O: O1368 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/685:    O: O1370 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\686:    O: O1372 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/687:    O: O1374 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-688:    O: O1375 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\689:    O: O1378 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|690:    O: O1380 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/691:    O: O1381 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|692:    O: O1384 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/693:    O: O1386 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-694:    O: O1387 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|695:    O: O1389 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-696:    O: O1391 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\697:    O: O1394 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/698:    O: O1396 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-699:    O: O1398 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\700:    O: O1400 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|701:    O: O1402 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\702:    O: O1404 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/703:    O: O1405 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\704:    O: O1408 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/705:    O: O1409 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|\-706:    O: O1412 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\707:    O: O1413 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/708:    O: O1416 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-709:    O: O1418 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\710:    O: O1420 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|711:    O: O1422 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\712:    O: O1423 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|713:    O: O1426 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/714:    O: O1427 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|\-715:    O: O1429 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|716:    O: O1432 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/717:    O: O1434 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-718:    O: O1436 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/719:    O: O1438 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\720:    O: O1440 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/721:    O: O1441 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|722:    O: O1444 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/723:    O: O1445 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\724:    O: O1447 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|725:    O: O1450 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\726:    O: O1452 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|727:    O: O1453 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-728:    O: O1456 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\729:    O: O1458 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-730:    O: O1460 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\731:    O: O1461 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-732:    O: O1463 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|733:    O: O1465 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/734:    O: O1468 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\735:    O: O1469 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|736:    O: O1471 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/737:    O: O1474 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\738:    O: O1475 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|739:    O: O1478 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/740:    O: O1480 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-741:    O: O1482 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/742:    O: O1484 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-743:    O: O1486 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\744:    O: O1488 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|745:    O: O1489 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/746:    O: O1492 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-747:    O: O1493 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\748:    O: O1496 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-749:    O: O1498 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|750:    O: O1500 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/751:    O: O1502 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|752:    O: O1504 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-753:    O: O1506 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\754:    O: O1507 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|755:    O: O1509 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/756:    O: O1512 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-757:    O: O1513 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|758:    O: O1516 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\759:    O: O1518 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/760:    O: O1520 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|761:    O: O1522 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\762:    O: O1524 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|763:    O: O1526 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isR
\-/764:    O: O1528 (predict-no)
I see 0 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\765:    O: O1530 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|766:    O: O1532 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\767:    O: O1534 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/768:    O: O1535 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|769:    O: O1538 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-770:    O: O1539 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
/|\771:    O: O1542 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-772:    O: O1544 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\773:    O: O1546 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-774:    O: O1547 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|775:    O: O1550 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-776:    O: O1552 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|777:    O: O1554 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/778:    O: O1556 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\779:    O: O1558 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|780:    O: O1559 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/781:    O: O1562 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|782:    O: O1563 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-/783:    O: O1565 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|\784:    O: O1567 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/785:    O: O1570 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\786:    O: O1572 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-787:    O: O1574 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\788:    O: O1575 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|789:    O: O1578 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-790:    O: O1579 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/791:    O: O1581 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
|792:    O: O1583 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\-793:    O: O1585 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\794:    O: O1588 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/795:    O: O1590 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-796:    O: O1592 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\797:    O: O1593 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/798:    O: O1595 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\-799:    O: O1598 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\800:    O: O1600 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/801:    O: O1602 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|802:    O: O1604 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/803:    O: O1606 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\804:    O: O1608 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/805:    O: O1610 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-806:    O: O1612 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/807:    O: O1614 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-808:    O: O1616 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\809:    O: O1617 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|810:    O: O1619 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-/811:    O: O1621 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
|812:    O: O1623 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\-813:    O: O1625 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
/|\814:    O: O1628 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|815:    O: O1630 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/816:    O: O1631 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\817:    O: O1634 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-/|818:    O: O1635 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
\819:    O: O1637 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|820:    O: O1640 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/821:    O: O1642 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|822:    O: O1643 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
\-/823:    O: O1646 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-824:    O: O1648 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|825:    O: O1650 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/826:    O: O1652 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-827:    O: O1653 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\828:    O: O1655 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|829:    O: O1658 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/830:    O: O1660 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-831:    O: O1662 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/832:    O: O1663 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\833:    O: O1666 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|834:    O: O1668 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/835:    O: O1669 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|836:    O: O1672 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/837:    O: O1673 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
|838:    O: O1676 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-839:    O: O1678 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\840:    O: O1680 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|841:    O: O1682 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\842:    O: O1684 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/843:    O: O1686 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-844:    O: O1688 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\845:    O: O1690 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|846:    O: O1692 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/847:    O: O1694 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\848:    O: O1696 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|849:    O: O1698 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-850:    O: O1700 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\851:    O: O1702 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-852:    O: O1703 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\853:    O: O1705 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/854:    O: O1708 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\855:    O: O1710 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|856:    O: O1712 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/857:    O: O1714 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\858:    O: O1716 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-859:    O: O1718 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|\860:    O: O1720 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|861:    O: O1722 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\862:    O: O1724 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|863:    O: O1726 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/864:    O: O1728 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-865:    O: O1729 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|866:    O: O1731 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/867:    O: O1734 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-868:    O: O1736 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/869:    O: O1738 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-870:    O: O1740 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
/|871:    O: O1742 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\872:    O: O1744 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-873:    O: O1746 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\874:    O: O1748 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|875:    O: O1750 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/876:    O: O1752 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\877:    O: O1754 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|878:    O: O1756 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/879:    O: O1758 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-880:    O: O1760 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\881:    O: O1762 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-882:    O: O1763 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\883:    O: O1765 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
-/|884:    O: O1767 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
\-/885:    O: O1770 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-886:    O: O1771 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
/|\887:    O: O1774 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|888:    O: O1776 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\889:    O: O1777 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-890:    O: O1780 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/891:    O: O1782 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|892:    O: O1784 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
\-/893:    O: O1785 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
|\894:    O: O1788 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
-/|895:    O: O1790 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
\-/896:    O: O1792 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-897:    O: O1793 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
/|\898:    O: O1796 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/899:    O: O1798 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-/sleeping...
|900:    O: O1799 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-901:    O: O1802 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/902:    O: O1803 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-903:    O: O1806 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
/|\904:    O: O1807 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|905:    O: O1810 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
\-/906:    O: O1812 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
|\-907:    O: O1814 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
/|\908:    O: O1816 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
-/|909:    O: O1818 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\910:    O: O1819 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
-/|911:    O: O1821 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
\912:    O: O1823 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/913:    O: O1826 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\914:    O: O1828 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
-/|915:    O: O1829 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
\-/916:    O: O1832 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\917:    O: O1834 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
-918:    O: O1835 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\919:    O: O1837 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/|920:    O: O1840 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/921:    O: O1841 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
|922:    O: O1844 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/923:    O: O1846 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
|\-924:    O: O1848 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/925:    O: O1850 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-926:    O: O1852 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|\927:    O: O1854 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
-/928:    O: O1856 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\-929:    O: O1857 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
/|\930:    O: O1859 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
-/931:    O: O1862 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|932:    O: O1863 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
\-/933:    O: O1866 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
|\-934:    O: O1867 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
/|\935:    O: O1869 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
-/936:    O: O1872 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
|\-937:    O: O1874 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
/|\938:    O: O1875 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
-/|939:    O: O1878 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
\-/940:    O: O1880 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
|\-941:    O: O1882 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/942:    O: O1884 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
|\943:    O: O1885 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
-/|944:    O: O1888 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
\-/945:    O: O1889 (predict-yes)
I see 1 and I'm going to do: predict-yes
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
|\-946:    O: O1892 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
/|947:    O: O1894 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-948:    O: O1896 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
/|\949:    O: O1898 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
-/|950:    O: O1900 (predict-no)
I see 1 and I'm going to do: predict-no
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
\-/|\-/--- Input Phase --- 
=>WM: (13326: I2 ^dir L)
=>WM: (13325: I2 ^reward 1)
=>WM: (13324: I2 ^see 0)
=>WM: (13323: N950 ^status complete)
<=WM: (13312: I2 ^dir U)
<=WM: (13311: I2 ^reward 1)
<=WM: (13310: I2 ^see 0)
=>WM: (13327: I2 ^level-1 L0-root)
<=WM: (13313: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1899 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1900 = 0.6854017956462798)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R954 ^value 1 +)
 (R1 ^reward R954 +)
Firing propose*predict-yes
 -->
 (O1901 ^name predict-yes +)
 (S1 ^operator O1901 +)
Firing propose*predict-no
 -->
 (O1902 ^name predict-no +)
 (S1 ^operator O1902 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1900 = 0.3145080651024651)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1899 = 0.3908143935841644)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1900 ^name predict-no +)
 (S1 ^operator O1900 +)
Retracting propose*predict-yes
 -->
 (O1899 ^name predict-yes +)
 (S1 ^operator O1899 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R953 ^value 1 +)
 (R1 ^reward R953 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1900 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1899 = 0.)
=>WM: (13334: S1 ^operator O1902 +)
=>WM: (13333: S1 ^operator O1901 +)
=>WM: (13332: I3 ^dir L)
=>WM: (13331: O1902 ^name predict-no)
=>WM: (13330: O1901 ^name predict-yes)
=>WM: (13329: R954 ^value 1)
=>WM: (13328: R1 ^reward R954)
<=WM: (13319: S1 ^operator O1899 +)
<=WM: (13320: S1 ^operator O1900 +)
<=WM: (13321: S1 ^operator O1900)
<=WM: (13318: I3 ^dir U)
<=WM: (13314: R1 ^reward R953)
<=WM: (13317: O1900 ^name predict-no)
<=WM: (13316: O1899 ^name predict-yes)
<=WM: (13315: R953 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1901 = -0.208713043145708)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1901 = 0.3908143935841644)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1902 = 0.6854017956462798)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1902 = 0.3145080651024651)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1900 = 0.3145080651024651)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1900 = 0.6854017956462798)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1899 = 0.3908143935841644)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1899 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13335: S1 ^operator O1902)

   951:    O: O1902 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N951 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N950 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13336: I3 ^predict-no N951)
<=WM: (13323: N950 ^status complete)
<=WM: (13322: I3 ^predict-no N950)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|--- Input Phase --- 
=>WM: (13340: I2 ^dir L)
=>WM: (13339: I2 ^reward 1)
=>WM: (13338: I2 ^see 0)
=>WM: (13337: N951 ^status complete)
<=WM: (13326: I2 ^dir L)
<=WM: (13325: I2 ^reward 1)
<=WM: (13324: I2 ^see 0)
=>WM: (13341: I2 ^level-1 L0-root)
<=WM: (13327: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1901 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1902 = 0.6854017956462798)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R955 ^value 1 +)
 (R1 ^reward R955 +)
Firing propose*predict-yes
 -->
 (O1903 ^name predict-yes +)
 (S1 ^operator O1903 +)
Firing propose*predict-no
 -->
 (O1904 ^name predict-no +)
 (S1 ^operator O1904 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1902 = 0.3145080651024651)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1901 = 0.3908143935841644)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1902 ^name predict-no +)
 (S1 ^operator O1902 +)
Retracting propose*predict-yes
 -->
 (O1901 ^name predict-yes +)
 (S1 ^operator O1901 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R954 ^value 1 +)
 (R1 ^reward R954 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1902 = 0.3145080651024651)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1902 = 0.6854017956462798)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1901 = 0.3908143935841644)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1901 = -0.208713043145708)
=>WM: (13347: S1 ^operator O1904 +)
=>WM: (13346: S1 ^operator O1903 +)
=>WM: (13345: O1904 ^name predict-no)
=>WM: (13344: O1903 ^name predict-yes)
=>WM: (13343: R955 ^value 1)
=>WM: (13342: R1 ^reward R955)
<=WM: (13333: S1 ^operator O1901 +)
<=WM: (13334: S1 ^operator O1902 +)
<=WM: (13335: S1 ^operator O1902)
<=WM: (13328: R1 ^reward R954)
<=WM: (13331: O1902 ^name predict-no)
<=WM: (13330: O1901 ^name predict-yes)
<=WM: (13329: R954 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1903 = -0.208713043145708)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1903 = 0.3908143935841644)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1904 = 0.6854017956462798)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1904 = 0.3145080651024651)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1902 = 0.3145080651024651)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1902 = 0.6854017956462798)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1901 = 0.3908143935841644)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1901 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478556 -0.164048 0.314508 -> 0.478563 -0.164047 0.314516(R,m,v=1,0.917808,0.0759565)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521362 0.16404 0.685402 -> 0.52137 0.16404 0.685411(R,m,v=1,1,0)
=>WM: (13348: S1 ^operator O1904)

   952:    O: O1904 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N952 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N951 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13349: I3 ^predict-no N952)
<=WM: (13337: N951 ^status complete)
<=WM: (13336: I3 ^predict-no N951)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13353: I2 ^dir R)
=>WM: (13352: I2 ^reward 1)
=>WM: (13351: I2 ^see 0)
=>WM: (13350: N952 ^status complete)
<=WM: (13340: I2 ^dir L)
<=WM: (13339: I2 ^reward 1)
<=WM: (13338: I2 ^see 0)
=>WM: (13354: I2 ^level-1 L0-root)
<=WM: (13341: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1903 = 0.8783877442642956)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R956 ^value 1 +)
 (R1 ^reward R956 +)
Firing propose*predict-yes
 -->
 (O1905 ^name predict-yes +)
 (S1 ^operator O1905 +)
Firing propose*predict-no
 -->
 (O1906 ^name predict-no +)
 (S1 ^operator O1906 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1904 = 0.999977424773942)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1903 = 0.1215951465100475)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1904 ^name predict-no +)
 (S1 ^operator O1904 +)
Retracting propose*predict-yes
 -->
 (O1903 ^name predict-yes +)
 (S1 ^operator O1903 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R955 ^value 1 +)
 (R1 ^reward R955 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1904 = 0.3145155972863931)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1904 = 0.6854105587116136)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1903 = 0.3908143935841644)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1903 = -0.208713043145708)
=>WM: (13361: S1 ^operator O1906 +)
=>WM: (13360: S1 ^operator O1905 +)
=>WM: (13359: I3 ^dir R)
=>WM: (13358: O1906 ^name predict-no)
=>WM: (13357: O1905 ^name predict-yes)
=>WM: (13356: R956 ^value 1)
=>WM: (13355: R1 ^reward R956)
<=WM: (13346: S1 ^operator O1903 +)
<=WM: (13347: S1 ^operator O1904 +)
<=WM: (13348: S1 ^operator O1904)
<=WM: (13332: I3 ^dir L)
<=WM: (13342: R1 ^reward R955)
<=WM: (13345: O1904 ^name predict-no)
<=WM: (13344: O1903 ^name predict-yes)
<=WM: (13343: R955 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1905 = 0.8783877442642956)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.1215951465100475)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.999977424773942)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1904 = 0.999977424773942)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1903 = 0.1215951465100475)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1903 = 0.8783877442642956)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478563 -0.164047 0.314516 -> 0.478568 -0.164047 0.314522(R,m,v=1,0.918367,0.0754822)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.52137 0.16404 0.685411 -> 0.521377 0.164041 0.685418(R,m,v=1,1,0)
=>WM: (13362: S1 ^operator O1905)

   953:    O: O1905 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N953 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N952 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13363: I3 ^predict-yes N953)
<=WM: (13350: N952 ^status complete)
<=WM: (13349: I3 ^predict-no N952)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (13367: I2 ^dir U)
=>WM: (13366: I2 ^reward 1)
=>WM: (13365: I2 ^see 1)
=>WM: (13364: N953 ^status complete)
<=WM: (13353: I2 ^dir R)
<=WM: (13352: I2 ^reward 1)
<=WM: (13351: I2 ^see 0)
=>WM: (13368: I2 ^level-1 R1-root)
<=WM: (13354: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R957 ^value 1 +)
 (R1 ^reward R957 +)
Firing propose*predict-yes
 -->
 (O1907 ^name predict-yes +)
 (S1 ^operator O1907 +)
Firing propose*predict-no
 -->
 (O1908 ^name predict-no +)
 (S1 ^operator O1908 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1906 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1905 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1906 ^name predict-no +)
 (S1 ^operator O1906 +)
Retracting propose*predict-yes
 -->
 (O1905 ^name predict-yes +)
 (S1 ^operator O1905 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R956 ^value 1 +)
 (R1 ^reward R956 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1906 = 0.999977424773942)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1905 = 0.1215951465100475)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1905 = 0.8783877442642956)
=>WM: (13376: S1 ^operator O1908 +)
=>WM: (13375: S1 ^operator O1907 +)
=>WM: (13374: I3 ^dir U)
=>WM: (13373: O1908 ^name predict-no)
=>WM: (13372: O1907 ^name predict-yes)
=>WM: (13371: R957 ^value 1)
=>WM: (13370: R1 ^reward R957)
=>WM: (13369: I3 ^see 1)
<=WM: (13360: S1 ^operator O1905 +)
<=WM: (13362: S1 ^operator O1905)
<=WM: (13361: S1 ^operator O1906 +)
<=WM: (13359: I3 ^dir R)
<=WM: (13355: R1 ^reward R956)
<=WM: (13272: I3 ^see 0)
<=WM: (13358: O1906 ^name predict-no)
<=WM: (13357: O1905 ^name predict-yes)
<=WM: (13356: R956 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1907 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1908 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1906 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1905 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534522 -0.412927 0.121595 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.857143,0.123182)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465464 0.412924 0.878388 -> 0.465465 0.412924 0.878389(R,m,v=1,1,0)
=>WM: (13377: S1 ^operator O1908)

   954:    O: O1908 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N954 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N953 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13378: I3 ^predict-no N954)
<=WM: (13364: N953 ^status complete)
<=WM: (13363: I3 ^predict-yes N953)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (13382: I2 ^dir L)
=>WM: (13381: I2 ^reward 1)
=>WM: (13380: I2 ^see 0)
=>WM: (13379: N954 ^status complete)
<=WM: (13367: I2 ^dir U)
<=WM: (13366: I2 ^reward 1)
<=WM: (13365: I2 ^see 1)
=>WM: (13383: I2 ^level-1 R1-root)
<=WM: (13368: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1908 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1907 = 0.6093893278107597)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R958 ^value 1 +)
 (R1 ^reward R958 +)
Firing propose*predict-yes
 -->
 (O1909 ^name predict-yes +)
 (S1 ^operator O1909 +)
Firing propose*predict-no
 -->
 (O1910 ^name predict-no +)
 (S1 ^operator O1910 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1908 = 0.3145217607813431)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1907 = 0.3908143935841644)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1908 ^name predict-no +)
 (S1 ^operator O1908 +)
Retracting propose*predict-yes
 -->
 (O1907 ^name predict-yes +)
 (S1 ^operator O1907 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R957 ^value 1 +)
 (R1 ^reward R957 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1908 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1907 = 0.)
=>WM: (13391: S1 ^operator O1910 +)
=>WM: (13390: S1 ^operator O1909 +)
=>WM: (13389: I3 ^dir L)
=>WM: (13388: O1910 ^name predict-no)
=>WM: (13387: O1909 ^name predict-yes)
=>WM: (13386: R958 ^value 1)
=>WM: (13385: R1 ^reward R958)
=>WM: (13384: I3 ^see 0)
<=WM: (13375: S1 ^operator O1907 +)
<=WM: (13376: S1 ^operator O1908 +)
<=WM: (13377: S1 ^operator O1908)
<=WM: (13374: I3 ^dir U)
<=WM: (13370: R1 ^reward R957)
<=WM: (13369: I3 ^see 1)
<=WM: (13373: O1908 ^name predict-no)
<=WM: (13372: O1907 ^name predict-yes)
<=WM: (13371: R957 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1909 = 0.6093893278107597)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1909 = 0.3908143935841644)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1910 = -0.168718511744511)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1910 = 0.3145217607813431)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1908 = 0.3145217607813431)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1908 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1907 = 0.3908143935841644)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1907 = 0.6093893278107597)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13392: S1 ^operator O1909)

   955:    O: O1909 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N955 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N954 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13393: I3 ^predict-yes N955)
<=WM: (13379: N954 ^status complete)
<=WM: (13378: I3 ^predict-no N954)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13397: I2 ^dir U)
=>WM: (13396: I2 ^reward 1)
=>WM: (13395: I2 ^see 1)
=>WM: (13394: N955 ^status complete)
<=WM: (13382: I2 ^dir L)
<=WM: (13381: I2 ^reward 1)
<=WM: (13380: I2 ^see 0)
=>WM: (13398: I2 ^level-1 L1-root)
<=WM: (13383: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R959 ^value 1 +)
 (R1 ^reward R959 +)
Firing propose*predict-yes
 -->
 (O1911 ^name predict-yes +)
 (S1 ^operator O1911 +)
Firing propose*predict-no
 -->
 (O1912 ^name predict-no +)
 (S1 ^operator O1912 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1910 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1909 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1910 ^name predict-no +)
 (S1 ^operator O1910 +)
Retracting propose*predict-yes
 -->
 (O1909 ^name predict-yes +)
 (S1 ^operator O1909 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R958 ^value 1 +)
 (R1 ^reward R958 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1910 = 0.3145217607813431)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1910 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1909 = 0.3908143935841644)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1909 = 0.6093893278107597)
=>WM: (13406: S1 ^operator O1912 +)
=>WM: (13405: S1 ^operator O1911 +)
=>WM: (13404: I3 ^dir U)
=>WM: (13403: O1912 ^name predict-no)
=>WM: (13402: O1911 ^name predict-yes)
=>WM: (13401: R959 ^value 1)
=>WM: (13400: R1 ^reward R959)
=>WM: (13399: I3 ^see 1)
<=WM: (13390: S1 ^operator O1909 +)
<=WM: (13392: S1 ^operator O1909)
<=WM: (13391: S1 ^operator O1910 +)
<=WM: (13389: I3 ^dir L)
<=WM: (13385: R1 ^reward R958)
<=WM: (13384: I3 ^see 0)
<=WM: (13388: O1910 ^name predict-no)
<=WM: (13387: O1909 ^name predict-yes)
<=WM: (13386: R958 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1910 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1909 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472355 -0.0815405 0.390814 -> 0.47234 -0.081543 0.390797(R,m,v=1,0.940789,0.0560735)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527819 0.0815706 0.609389 -> 0.527802 0.0815677 0.60937(R,m,v=1,1,0)
=>WM: (13407: S1 ^operator O1912)

   956:    O: O1912 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N956 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N955 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13408: I3 ^predict-no N956)
<=WM: (13394: N955 ^status complete)
<=WM: (13393: I3 ^predict-yes N955)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13412: I2 ^dir L)
=>WM: (13411: I2 ^reward 1)
=>WM: (13410: I2 ^see 0)
=>WM: (13409: N956 ^status complete)
<=WM: (13397: I2 ^dir U)
<=WM: (13396: I2 ^reward 1)
<=WM: (13395: I2 ^see 1)
=>WM: (13413: I2 ^level-1 L1-root)
<=WM: (13398: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1911 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1912 = 0.6855673437364445)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R960 ^value 1 +)
 (R1 ^reward R960 +)
Firing propose*predict-yes
 -->
 (O1913 ^name predict-yes +)
 (S1 ^operator O1913 +)
Firing propose*predict-no
 -->
 (O1914 ^name predict-no +)
 (S1 ^operator O1914 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1912 = 0.3145217607813431)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1911 = 0.3907974841024591)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1912 ^name predict-no +)
 (S1 ^operator O1912 +)
Retracting propose*predict-yes
 -->
 (O1911 ^name predict-yes +)
 (S1 ^operator O1911 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R959 ^value 1 +)
 (R1 ^reward R959 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1912 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1911 = 0.)
=>WM: (13421: S1 ^operator O1914 +)
=>WM: (13420: S1 ^operator O1913 +)
=>WM: (13419: I3 ^dir L)
=>WM: (13418: O1914 ^name predict-no)
=>WM: (13417: O1913 ^name predict-yes)
=>WM: (13416: R960 ^value 1)
=>WM: (13415: R1 ^reward R960)
=>WM: (13414: I3 ^see 0)
<=WM: (13405: S1 ^operator O1911 +)
<=WM: (13406: S1 ^operator O1912 +)
<=WM: (13407: S1 ^operator O1912)
<=WM: (13404: I3 ^dir U)
<=WM: (13400: R1 ^reward R959)
<=WM: (13399: I3 ^see 1)
<=WM: (13403: O1912 ^name predict-no)
<=WM: (13402: O1911 ^name predict-yes)
<=WM: (13401: R959 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1913 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1913 = 0.3907974841024591)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1914 = 0.6855673437364445)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1914 = 0.3145217607813431)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1912 = 0.3145217607813431)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1912 = 0.6855673437364445)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1911 = 0.3907974841024591)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1911 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13422: S1 ^operator O1914)

   957:    O: O1914 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N957 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N956 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13423: I3 ^predict-no N957)
<=WM: (13409: N956 ^status complete)
<=WM: (13408: I3 ^predict-no N956)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13427: I2 ^dir R)
=>WM: (13426: I2 ^reward 1)
=>WM: (13425: I2 ^see 0)
=>WM: (13424: N957 ^status complete)
<=WM: (13412: I2 ^dir L)
<=WM: (13411: I2 ^reward 1)
<=WM: (13410: I2 ^see 0)
=>WM: (13428: I2 ^level-1 L0-root)
<=WM: (13413: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1913 = 0.8783894024939338)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R961 ^value 1 +)
 (R1 ^reward R961 +)
Firing propose*predict-yes
 -->
 (O1915 ^name predict-yes +)
 (S1 ^operator O1915 +)
Firing propose*predict-no
 -->
 (O1916 ^name predict-no +)
 (S1 ^operator O1916 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1914 = 0.999977424773942)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1913 = 0.1215965434178113)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1914 ^name predict-no +)
 (S1 ^operator O1914 +)
Retracting propose*predict-yes
 -->
 (O1913 ^name predict-yes +)
 (S1 ^operator O1913 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R960 ^value 1 +)
 (R1 ^reward R960 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1914 = 0.3145217607813431)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1914 = 0.6855673437364445)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1913 = 0.3907974841024591)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1913 = -0.2062723012911647)
=>WM: (13435: S1 ^operator O1916 +)
=>WM: (13434: S1 ^operator O1915 +)
=>WM: (13433: I3 ^dir R)
=>WM: (13432: O1916 ^name predict-no)
=>WM: (13431: O1915 ^name predict-yes)
=>WM: (13430: R961 ^value 1)
=>WM: (13429: R1 ^reward R961)
<=WM: (13420: S1 ^operator O1913 +)
<=WM: (13421: S1 ^operator O1914 +)
<=WM: (13422: S1 ^operator O1914)
<=WM: (13419: I3 ^dir L)
<=WM: (13415: R1 ^reward R960)
<=WM: (13418: O1914 ^name predict-no)
<=WM: (13417: O1913 ^name predict-yes)
<=WM: (13416: R960 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1915 = 0.1215965434178113)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1915 = 0.8783894024939338)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1916 = 0.999977424773942)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1914 = 0.999977424773942)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1913 = 0.1215965434178113)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1913 = 0.8783894024939338)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478568 -0.164047 0.314522 -> 0.478562 -0.164047 0.314514(R,m,v=1,0.918919,0.0750138)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521513 0.164055 0.685567 -> 0.521505 0.164054 0.685559(R,m,v=1,1,0)
=>WM: (13436: S1 ^operator O1915)

   958:    O: O1915 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N958 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N957 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13437: I3 ^predict-yes N958)
<=WM: (13424: N957 ^status complete)
<=WM: (13423: I3 ^predict-no N957)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13441: I2 ^dir R)
=>WM: (13440: I2 ^reward 1)
=>WM: (13439: I2 ^see 1)
=>WM: (13438: N958 ^status complete)
<=WM: (13427: I2 ^dir R)
<=WM: (13426: I2 ^reward 1)
<=WM: (13425: I2 ^see 0)
=>WM: (13442: I2 ^level-1 R1-root)
<=WM: (13428: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1915 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R962 ^value 1 +)
 (R1 ^reward R962 +)
Firing propose*predict-yes
 -->
 (O1917 ^name predict-yes +)
 (S1 ^operator O1917 +)
Firing propose*predict-no
 -->
 (O1918 ^name predict-no +)
 (S1 ^operator O1918 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1916 = 0.999977424773942)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1915 = 0.1215965434178113)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1916 ^name predict-no +)
 (S1 ^operator O1916 +)
Retracting propose*predict-yes
 -->
 (O1915 ^name predict-yes +)
 (S1 ^operator O1915 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R961 ^value 1 +)
 (R1 ^reward R961 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1916 = 0.999977424773942)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1915 = 0.8783894024939338)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1915 = 0.1215965434178113)
=>WM: (13449: S1 ^operator O1918 +)
=>WM: (13448: S1 ^operator O1917 +)
=>WM: (13447: O1918 ^name predict-no)
=>WM: (13446: O1917 ^name predict-yes)
=>WM: (13445: R962 ^value 1)
=>WM: (13444: R1 ^reward R962)
=>WM: (13443: I3 ^see 1)
<=WM: (13434: S1 ^operator O1915 +)
<=WM: (13436: S1 ^operator O1915)
<=WM: (13435: S1 ^operator O1916 +)
<=WM: (13429: R1 ^reward R961)
<=WM: (13414: I3 ^see 0)
<=WM: (13432: O1916 ^name predict-no)
<=WM: (13431: O1915 ^name predict-yes)
<=WM: (13430: R961 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1917 = 0.1215965434178113)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1917 = -0.04253361215288998)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1918 = 0.999977424773942)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1916 = 0.999977424773942)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1915 = 0.1215965434178113)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1915 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.857988,0.12257)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465465 0.412924 0.878389 -> 0.465467 0.412924 0.878391(R,m,v=1,1,0)
=>WM: (13450: S1 ^operator O1918)

   959:    O: O1918 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N959 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N958 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13451: I3 ^predict-no N959)
<=WM: (13438: N958 ^status complete)
<=WM: (13437: I3 ^predict-yes N958)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13455: I2 ^dir L)
=>WM: (13454: I2 ^reward 1)
=>WM: (13453: I2 ^see 0)
=>WM: (13452: N959 ^status complete)
<=WM: (13441: I2 ^dir R)
<=WM: (13440: I2 ^reward 1)
<=WM: (13439: I2 ^see 1)
=>WM: (13456: I2 ^level-1 R0-root)
<=WM: (13442: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1918 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1917 = 0.6090773459257411)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R963 ^value 1 +)
 (R1 ^reward R963 +)
Firing propose*predict-yes
 -->
 (O1919 ^name predict-yes +)
 (S1 ^operator O1919 +)
Firing propose*predict-no
 -->
 (O1920 ^name predict-no +)
 (S1 ^operator O1920 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1918 = 0.3145143319532709)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1917 = 0.3907974841024591)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1918 ^name predict-no +)
 (S1 ^operator O1918 +)
Retracting propose*predict-yes
 -->
 (O1917 ^name predict-yes +)
 (S1 ^operator O1917 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R962 ^value 1 +)
 (R1 ^reward R962 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1918 = 0.999977424773942)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1917 = -0.04253361215288998)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1917 = 0.121597689773478)
=>WM: (13464: S1 ^operator O1920 +)
=>WM: (13463: S1 ^operator O1919 +)
=>WM: (13462: I3 ^dir L)
=>WM: (13461: O1920 ^name predict-no)
=>WM: (13460: O1919 ^name predict-yes)
=>WM: (13459: R963 ^value 1)
=>WM: (13458: R1 ^reward R963)
=>WM: (13457: I3 ^see 0)
<=WM: (13448: S1 ^operator O1917 +)
<=WM: (13449: S1 ^operator O1918 +)
<=WM: (13450: S1 ^operator O1918)
<=WM: (13433: I3 ^dir R)
<=WM: (13444: R1 ^reward R962)
<=WM: (13443: I3 ^see 1)
<=WM: (13447: O1918 ^name predict-no)
<=WM: (13446: O1917 ^name predict-yes)
<=WM: (13445: R962 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1919 = 0.3907974841024591)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1919 = 0.6090773459257411)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1920 = 0.3145143319532709)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1920 = -0.1984300550322165)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1918 = 0.3145143319532709)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1918 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1917 = 0.3907974841024591)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1917 = 0.6090773459257411)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999977 0 0.999977 -> 0.999981 0 0.999981(R,m,v=1,0.936782,0.0595641)
=>WM: (13465: S1 ^operator O1919)

   960:    O: O1919 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N960 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N959 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13466: I3 ^predict-yes N960)
<=WM: (13452: N959 ^status complete)
<=WM: (13451: I3 ^predict-no N959)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13470: I2 ^dir U)
=>WM: (13469: I2 ^reward 1)
=>WM: (13468: I2 ^see 1)
=>WM: (13467: N960 ^status complete)
<=WM: (13455: I2 ^dir L)
<=WM: (13454: I2 ^reward 1)
<=WM: (13453: I2 ^see 0)
=>WM: (13471: I2 ^level-1 L1-root)
<=WM: (13456: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R964 ^value 1 +)
 (R1 ^reward R964 +)
Firing propose*predict-yes
 -->
 (O1921 ^name predict-yes +)
 (S1 ^operator O1921 +)
Firing propose*predict-no
 -->
 (O1922 ^name predict-no +)
 (S1 ^operator O1922 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1920 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1919 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1920 ^name predict-no +)
 (S1 ^operator O1920 +)
Retracting propose*predict-yes
 -->
 (O1919 ^name predict-yes +)
 (S1 ^operator O1919 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R963 ^value 1 +)
 (R1 ^reward R963 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1920 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1920 = 0.3145143319532709)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1919 = 0.6090773459257411)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1919 = 0.3907974841024591)
=>WM: (13479: S1 ^operator O1922 +)
=>WM: (13478: S1 ^operator O1921 +)
=>WM: (13477: I3 ^dir U)
=>WM: (13476: O1922 ^name predict-no)
=>WM: (13475: O1921 ^name predict-yes)
=>WM: (13474: R964 ^value 1)
=>WM: (13473: R1 ^reward R964)
=>WM: (13472: I3 ^see 1)
<=WM: (13463: S1 ^operator O1919 +)
<=WM: (13465: S1 ^operator O1919)
<=WM: (13464: S1 ^operator O1920 +)
<=WM: (13462: I3 ^dir L)
<=WM: (13458: R1 ^reward R963)
<=WM: (13457: I3 ^see 0)
<=WM: (13461: O1920 ^name predict-no)
<=WM: (13460: O1919 ^name predict-yes)
<=WM: (13459: R963 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1921 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1922 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1920 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1919 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.47234 -0.081543 0.390797 -> 0.472349 -0.0815415 0.390808(R,m,v=1,0.941176,0.0557276)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527553 0.0815245 0.609077 -> 0.527563 0.0815262 0.609089(R,m,v=1,1,0)
=>WM: (13480: S1 ^operator O1922)

   961:    O: O1922 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N961 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N960 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13481: I3 ^predict-no N961)
<=WM: (13467: N960 ^status complete)
<=WM: (13466: I3 ^predict-yes N960)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
---- Input Phase --- 
=>WM: (13485: I2 ^dir L)
=>WM: (13484: I2 ^reward 1)
=>WM: (13483: I2 ^see 0)
=>WM: (13482: N961 ^status complete)
<=WM: (13470: I2 ^dir U)
<=WM: (13469: I2 ^reward 1)
<=WM: (13468: I2 ^see 1)
=>WM: (13486: I2 ^level-1 L1-root)
<=WM: (13471: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1921 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1922 = 0.685558831823503)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R965 ^value 1 +)
 (R1 ^reward R965 +)
Firing propose*predict-yes
 -->
 (O1923 ^name predict-yes +)
 (S1 ^operator O1923 +)
Firing propose*predict-no
 -->
 (O1924 ^name predict-no +)
 (S1 ^operator O1924 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1922 = 0.3145143319532709)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1921 = 0.390807862285058)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1922 ^name predict-no +)
 (S1 ^operator O1922 +)
Retracting propose*predict-yes
 -->
 (O1921 ^name predict-yes +)
 (S1 ^operator O1921 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R964 ^value 1 +)
 (R1 ^reward R964 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1922 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1921 = 0.)
=>WM: (13494: S1 ^operator O1924 +)
=>WM: (13493: S1 ^operator O1923 +)
=>WM: (13492: I3 ^dir L)
=>WM: (13491: O1924 ^name predict-no)
=>WM: (13490: O1923 ^name predict-yes)
=>WM: (13489: R965 ^value 1)
=>WM: (13488: R1 ^reward R965)
=>WM: (13487: I3 ^see 0)
<=WM: (13478: S1 ^operator O1921 +)
<=WM: (13479: S1 ^operator O1922 +)
<=WM: (13480: S1 ^operator O1922)
<=WM: (13477: I3 ^dir U)
<=WM: (13473: R1 ^reward R964)
<=WM: (13472: I3 ^see 1)
<=WM: (13476: O1922 ^name predict-no)
<=WM: (13475: O1921 ^name predict-yes)
<=WM: (13474: R964 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1923 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.390807862285058)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1924 = 0.685558831823503)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 0.3145143319532709)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1922 = 0.3145143319532709)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1922 = 0.685558831823503)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1921 = 0.390807862285058)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1921 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13495: S1 ^operator O1924)

   962:    O: O1924 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N962 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N961 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13496: I3 ^predict-no N962)
<=WM: (13482: N961 ^status complete)
<=WM: (13481: I3 ^predict-no N961)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13500: I2 ^dir U)
=>WM: (13499: I2 ^reward 1)
=>WM: (13498: I2 ^see 0)
=>WM: (13497: N962 ^status complete)
<=WM: (13485: I2 ^dir L)
<=WM: (13484: I2 ^reward 1)
<=WM: (13483: I2 ^see 0)
=>WM: (13501: I2 ^level-1 L0-root)
<=WM: (13486: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R966 ^value 1 +)
 (R1 ^reward R966 +)
Firing propose*predict-yes
 -->
 (O1925 ^name predict-yes +)
 (S1 ^operator O1925 +)
Firing propose*predict-no
 -->
 (O1926 ^name predict-no +)
 (S1 ^operator O1926 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1924 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1923 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1924 ^name predict-no +)
 (S1 ^operator O1924 +)
Retracting propose*predict-yes
 -->
 (O1923 ^name predict-yes +)
 (S1 ^operator O1923 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R965 ^value 1 +)
 (R1 ^reward R965 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1924 = 0.3145143319532709)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1924 = 0.685558831823503)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1923 = 0.390807862285058)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1923 = -0.2062723012911647)
=>WM: (13508: S1 ^operator O1926 +)
=>WM: (13507: S1 ^operator O1925 +)
=>WM: (13506: I3 ^dir U)
=>WM: (13505: O1926 ^name predict-no)
=>WM: (13504: O1925 ^name predict-yes)
=>WM: (13503: R966 ^value 1)
=>WM: (13502: R1 ^reward R966)
<=WM: (13493: S1 ^operator O1923 +)
<=WM: (13494: S1 ^operator O1924 +)
<=WM: (13495: S1 ^operator O1924)
<=WM: (13492: I3 ^dir L)
<=WM: (13488: R1 ^reward R965)
<=WM: (13491: O1924 ^name predict-no)
<=WM: (13490: O1923 ^name predict-yes)
<=WM: (13489: R965 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1925 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1926 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1924 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1923 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478562 -0.164047 0.314514 -> 0.478556 -0.164048 0.314508(R,m,v=1,0.919463,0.0745511)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521505 0.164054 0.685559 -> 0.521498 0.164053 0.685552(R,m,v=1,1,0)
=>WM: (13509: S1 ^operator O1926)

   963:    O: O1926 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N963 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N962 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13510: I3 ^predict-no N963)
<=WM: (13497: N962 ^status complete)
<=WM: (13496: I3 ^predict-no N962)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (13514: I2 ^dir U)
=>WM: (13513: I2 ^reward 1)
=>WM: (13512: I2 ^see 0)
=>WM: (13511: N963 ^status complete)
<=WM: (13500: I2 ^dir U)
<=WM: (13499: I2 ^reward 1)
<=WM: (13498: I2 ^see 0)
=>WM: (13515: I2 ^level-1 L0-root)
<=WM: (13501: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R967 ^value 1 +)
 (R1 ^reward R967 +)
Firing propose*predict-yes
 -->
 (O1927 ^name predict-yes +)
 (S1 ^operator O1927 +)
Firing propose*predict-no
 -->
 (O1928 ^name predict-no +)
 (S1 ^operator O1928 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1926 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1925 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1926 ^name predict-no +)
 (S1 ^operator O1926 +)
Retracting propose*predict-yes
 -->
 (O1925 ^name predict-yes +)
 (S1 ^operator O1925 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R966 ^value 1 +)
 (R1 ^reward R966 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1926 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1925 = 0.)
=>WM: (13521: S1 ^operator O1928 +)
=>WM: (13520: S1 ^operator O1927 +)
=>WM: (13519: O1928 ^name predict-no)
=>WM: (13518: O1927 ^name predict-yes)
=>WM: (13517: R967 ^value 1)
=>WM: (13516: R1 ^reward R967)
<=WM: (13507: S1 ^operator O1925 +)
<=WM: (13508: S1 ^operator O1926 +)
<=WM: (13509: S1 ^operator O1926)
<=WM: (13502: R1 ^reward R966)
<=WM: (13505: O1926 ^name predict-no)
<=WM: (13504: O1925 ^name predict-yes)
<=WM: (13503: R966 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1927 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1928 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1926 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1925 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13522: S1 ^operator O1928)

   964:    O: O1928 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N964 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N963 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13523: I3 ^predict-no N964)
<=WM: (13511: N963 ^status complete)
<=WM: (13510: I3 ^predict-no N963)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (13527: I2 ^dir R)
=>WM: (13526: I2 ^reward 1)
=>WM: (13525: I2 ^see 0)
=>WM: (13524: N964 ^status complete)
<=WM: (13514: I2 ^dir U)
<=WM: (13513: I2 ^reward 1)
<=WM: (13512: I2 ^see 0)
=>WM: (13528: I2 ^level-1 L0-root)
<=WM: (13515: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1927 = 0.878390760537652)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R968 ^value 1 +)
 (R1 ^reward R968 +)
Firing propose*predict-yes
 -->
 (O1929 ^name predict-yes +)
 (S1 ^operator O1929 +)
Firing propose*predict-no
 -->
 (O1930 ^name predict-no +)
 (S1 ^operator O1930 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1928 = 0.9999810901454903)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1927 = 0.121597689773478)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1928 ^name predict-no +)
 (S1 ^operator O1928 +)
Retracting propose*predict-yes
 -->
 (O1927 ^name predict-yes +)
 (S1 ^operator O1927 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R967 ^value 1 +)
 (R1 ^reward R967 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1928 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1927 = 0.)
=>WM: (13535: S1 ^operator O1930 +)
=>WM: (13534: S1 ^operator O1929 +)
=>WM: (13533: I3 ^dir R)
=>WM: (13532: O1930 ^name predict-no)
=>WM: (13531: O1929 ^name predict-yes)
=>WM: (13530: R968 ^value 1)
=>WM: (13529: R1 ^reward R968)
<=WM: (13520: S1 ^operator O1927 +)
<=WM: (13521: S1 ^operator O1928 +)
<=WM: (13522: S1 ^operator O1928)
<=WM: (13506: I3 ^dir U)
<=WM: (13516: R1 ^reward R967)
<=WM: (13519: O1928 ^name predict-no)
<=WM: (13518: O1927 ^name predict-yes)
<=WM: (13517: R967 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1929 = 0.878390760537652)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1929 = 0.121597689773478)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1930 = 0.9999810901454903)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1928 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1927 = 0.121597689773478)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1927 = 0.878390760537652)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13536: S1 ^operator O1929)

   965:    O: O1929 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N965 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N964 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13537: I3 ^predict-yes N965)
<=WM: (13524: N964 ^status complete)
<=WM: (13523: I3 ^predict-no N964)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13541: I2 ^dir U)
=>WM: (13540: I2 ^reward 1)
=>WM: (13539: I2 ^see 1)
=>WM: (13538: N965 ^status complete)
<=WM: (13527: I2 ^dir R)
<=WM: (13526: I2 ^reward 1)
<=WM: (13525: I2 ^see 0)
=>WM: (13542: I2 ^level-1 R1-root)
<=WM: (13528: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R969 ^value 1 +)
 (R1 ^reward R969 +)
Firing propose*predict-yes
 -->
 (O1931 ^name predict-yes +)
 (S1 ^operator O1931 +)
Firing propose*predict-no
 -->
 (O1932 ^name predict-no +)
 (S1 ^operator O1932 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1930 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1929 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1930 ^name predict-no +)
 (S1 ^operator O1930 +)
Retracting propose*predict-yes
 -->
 (O1929 ^name predict-yes +)
 (S1 ^operator O1929 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R968 ^value 1 +)
 (R1 ^reward R968 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1930 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1929 = 0.121597689773478)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1929 = 0.878390760537652)
=>WM: (13550: S1 ^operator O1932 +)
=>WM: (13549: S1 ^operator O1931 +)
=>WM: (13548: I3 ^dir U)
=>WM: (13547: O1932 ^name predict-no)
=>WM: (13546: O1931 ^name predict-yes)
=>WM: (13545: R969 ^value 1)
=>WM: (13544: R1 ^reward R969)
=>WM: (13543: I3 ^see 1)
<=WM: (13534: S1 ^operator O1929 +)
<=WM: (13536: S1 ^operator O1929)
<=WM: (13535: S1 ^operator O1930 +)
<=WM: (13533: I3 ^dir R)
<=WM: (13529: R1 ^reward R968)
<=WM: (13487: I3 ^see 0)
<=WM: (13532: O1930 ^name predict-no)
<=WM: (13531: O1929 ^name predict-yes)
<=WM: (13530: R968 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1931 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1932 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1930 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1929 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.858824,0.121963)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465467 0.412924 0.878391 -> 0.465467 0.412924 0.878392(R,m,v=1,1,0)
=>WM: (13551: S1 ^operator O1932)

   966:    O: O1932 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N966 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N965 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13552: I3 ^predict-no N966)
<=WM: (13538: N965 ^status complete)
<=WM: (13537: I3 ^predict-yes N965)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (13556: I2 ^dir L)
=>WM: (13555: I2 ^reward 1)
=>WM: (13554: I2 ^see 0)
=>WM: (13553: N966 ^status complete)
<=WM: (13541: I2 ^dir U)
<=WM: (13540: I2 ^reward 1)
<=WM: (13539: I2 ^see 1)
=>WM: (13557: I2 ^level-1 R1-root)
<=WM: (13542: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1932 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1931 = 0.6093697568764296)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R970 ^value 1 +)
 (R1 ^reward R970 +)
Firing propose*predict-yes
 -->
 (O1933 ^name predict-yes +)
 (S1 ^operator O1933 +)
Firing propose*predict-no
 -->
 (O1934 ^name predict-no +)
 (S1 ^operator O1934 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1932 = 0.3145082389793297)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1931 = 0.390807862285058)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1932 ^name predict-no +)
 (S1 ^operator O1932 +)
Retracting propose*predict-yes
 -->
 (O1931 ^name predict-yes +)
 (S1 ^operator O1931 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R969 ^value 1 +)
 (R1 ^reward R969 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1932 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1931 = 0.)
=>WM: (13565: S1 ^operator O1934 +)
=>WM: (13564: S1 ^operator O1933 +)
=>WM: (13563: I3 ^dir L)
=>WM: (13562: O1934 ^name predict-no)
=>WM: (13561: O1933 ^name predict-yes)
=>WM: (13560: R970 ^value 1)
=>WM: (13559: R1 ^reward R970)
=>WM: (13558: I3 ^see 0)
<=WM: (13549: S1 ^operator O1931 +)
<=WM: (13550: S1 ^operator O1932 +)
<=WM: (13551: S1 ^operator O1932)
<=WM: (13548: I3 ^dir U)
<=WM: (13544: R1 ^reward R969)
<=WM: (13543: I3 ^see 1)
<=WM: (13547: O1932 ^name predict-no)
<=WM: (13546: O1931 ^name predict-yes)
<=WM: (13545: R969 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1933 = 0.6093697568764296)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1933 = 0.390807862285058)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1934 = -0.168718511744511)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1934 = 0.3145082389793297)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1932 = 0.3145082389793297)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1932 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1931 = 0.390807862285058)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1931 = 0.6093697568764296)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13566: S1 ^operator O1933)

   967:    O: O1933 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N967 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N966 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13567: I3 ^predict-yes N967)
<=WM: (13553: N966 ^status complete)
<=WM: (13552: I3 ^predict-no N966)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13571: I2 ^dir L)
=>WM: (13570: I2 ^reward 1)
=>WM: (13569: I2 ^see 1)
=>WM: (13568: N967 ^status complete)
<=WM: (13556: I2 ^dir L)
<=WM: (13555: I2 ^reward 1)
<=WM: (13554: I2 ^see 0)
=>WM: (13572: I2 ^level-1 L1-root)
<=WM: (13557: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1933 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1934 = 0.685551861847024)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R971 ^value 1 +)
 (R1 ^reward R971 +)
Firing propose*predict-yes
 -->
 (O1935 ^name predict-yes +)
 (S1 ^operator O1935 +)
Firing propose*predict-no
 -->
 (O1936 ^name predict-no +)
 (S1 ^operator O1936 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1934 = 0.3145082389793297)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1933 = 0.390807862285058)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1934 ^name predict-no +)
 (S1 ^operator O1934 +)
Retracting propose*predict-yes
 -->
 (O1933 ^name predict-yes +)
 (S1 ^operator O1933 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R970 ^value 1 +)
 (R1 ^reward R970 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1934 = 0.3145082389793297)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1934 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1933 = 0.390807862285058)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1933 = 0.6093697568764296)
=>WM: (13579: S1 ^operator O1936 +)
=>WM: (13578: S1 ^operator O1935 +)
=>WM: (13577: O1936 ^name predict-no)
=>WM: (13576: O1935 ^name predict-yes)
=>WM: (13575: R971 ^value 1)
=>WM: (13574: R1 ^reward R971)
=>WM: (13573: I3 ^see 1)
<=WM: (13564: S1 ^operator O1933 +)
<=WM: (13566: S1 ^operator O1933)
<=WM: (13565: S1 ^operator O1934 +)
<=WM: (13559: R1 ^reward R970)
<=WM: (13558: I3 ^see 0)
<=WM: (13562: O1934 ^name predict-no)
<=WM: (13561: O1933 ^name predict-yes)
<=WM: (13560: R970 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1935 = 0.390807862285058)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1935 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1936 = 0.3145082389793297)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1936 = 0.685551861847024)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1934 = 0.3145082389793297)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1934 = 0.685551861847024)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1933 = 0.390807862285058)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1933 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472349 -0.0815415 0.390808 -> 0.472337 -0.0815436 0.390793(R,m,v=1,0.941558,0.0553858)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527802 0.0815677 0.60937 -> 0.527788 0.0815652 0.609353(R,m,v=1,1,0)
=>WM: (13580: S1 ^operator O1936)

   968:    O: O1936 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N968 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N967 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13581: I3 ^predict-no N968)
<=WM: (13568: N967 ^status complete)
<=WM: (13567: I3 ^predict-yes N967)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13585: I2 ^dir R)
=>WM: (13584: I2 ^reward 1)
=>WM: (13583: I2 ^see 0)
=>WM: (13582: N968 ^status complete)
<=WM: (13571: I2 ^dir L)
<=WM: (13570: I2 ^reward 1)
<=WM: (13569: I2 ^see 1)
=>WM: (13586: I2 ^level-1 L0-root)
<=WM: (13572: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1935 = 0.8783918732984659)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R972 ^value 1 +)
 (R1 ^reward R972 +)
Firing propose*predict-yes
 -->
 (O1937 ^name predict-yes +)
 (S1 ^operator O1937 +)
Firing propose*predict-no
 -->
 (O1938 ^name predict-no +)
 (S1 ^operator O1938 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1936 = 0.9999810901454903)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1935 = 0.1215986309459259)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1936 ^name predict-no +)
 (S1 ^operator O1936 +)
Retracting propose*predict-yes
 -->
 (O1935 ^name predict-yes +)
 (S1 ^operator O1935 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R971 ^value 1 +)
 (R1 ^reward R971 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1936 = 0.685551861847024)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1936 = 0.3145082389793297)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1935 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1935 = 0.3907931512898603)
=>WM: (13594: S1 ^operator O1938 +)
=>WM: (13593: S1 ^operator O1937 +)
=>WM: (13592: I3 ^dir R)
=>WM: (13591: O1938 ^name predict-no)
=>WM: (13590: O1937 ^name predict-yes)
=>WM: (13589: R972 ^value 1)
=>WM: (13588: R1 ^reward R972)
=>WM: (13587: I3 ^see 0)
<=WM: (13578: S1 ^operator O1935 +)
<=WM: (13579: S1 ^operator O1936 +)
<=WM: (13580: S1 ^operator O1936)
<=WM: (13563: I3 ^dir L)
<=WM: (13574: R1 ^reward R971)
<=WM: (13573: I3 ^see 1)
<=WM: (13577: O1936 ^name predict-no)
<=WM: (13576: O1935 ^name predict-yes)
<=WM: (13575: R971 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1937 = 0.1215986309459259)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1937 = 0.8783918732984659)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1938 = 0.9999810901454903)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1936 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1935 = 0.1215986309459259)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1935 = 0.8783918732984659)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478556 -0.164048 0.314508 -> 0.478552 -0.164048 0.314503(R,m,v=1,0.92,0.074094)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521498 0.164053 0.685552 -> 0.521493 0.164053 0.685546(R,m,v=1,1,0)
=>WM: (13595: S1 ^operator O1937)

   969:    O: O1937 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N969 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N968 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13596: I3 ^predict-yes N969)
<=WM: (13582: N968 ^status complete)
<=WM: (13581: I3 ^predict-no N968)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13600: I2 ^dir L)
=>WM: (13599: I2 ^reward 1)
=>WM: (13598: I2 ^see 1)
=>WM: (13597: N969 ^status complete)
<=WM: (13585: I2 ^dir R)
<=WM: (13584: I2 ^reward 1)
<=WM: (13583: I2 ^see 0)
=>WM: (13601: I2 ^level-1 R1-root)
<=WM: (13586: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1938 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1937 = 0.6093527419421177)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R973 ^value 1 +)
 (R1 ^reward R973 +)
Firing propose*predict-yes
 -->
 (O1939 ^name predict-yes +)
 (S1 ^operator O1939 +)
Firing propose*predict-no
 -->
 (O1940 ^name predict-no +)
 (S1 ^operator O1940 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1938 = 0.3145032394390637)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1937 = 0.3907931512898603)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1938 ^name predict-no +)
 (S1 ^operator O1938 +)
Retracting propose*predict-yes
 -->
 (O1937 ^name predict-yes +)
 (S1 ^operator O1937 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R972 ^value 1 +)
 (R1 ^reward R972 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1938 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1937 = 0.8783918732984659)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1937 = 0.1215986309459259)
=>WM: (13609: S1 ^operator O1940 +)
=>WM: (13608: S1 ^operator O1939 +)
=>WM: (13607: I3 ^dir L)
=>WM: (13606: O1940 ^name predict-no)
=>WM: (13605: O1939 ^name predict-yes)
=>WM: (13604: R973 ^value 1)
=>WM: (13603: R1 ^reward R973)
=>WM: (13602: I3 ^see 1)
<=WM: (13593: S1 ^operator O1937 +)
<=WM: (13595: S1 ^operator O1937)
<=WM: (13594: S1 ^operator O1938 +)
<=WM: (13592: I3 ^dir R)
<=WM: (13588: R1 ^reward R972)
<=WM: (13587: I3 ^see 0)
<=WM: (13591: O1938 ^name predict-no)
<=WM: (13590: O1937 ^name predict-yes)
<=WM: (13589: R972 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1939 = 0.3907931512898603)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1939 = 0.6093527419421177)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1940 = 0.3145032394390637)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1940 = -0.168718511744511)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1938 = 0.3145032394390637)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1938 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1937 = 0.3907931512898603)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1937 = 0.6093527419421177)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.859649,0.121362)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465467 0.412924 0.878392 -> 0.465468 0.412925 0.878393(R,m,v=1,1,0)
=>WM: (13610: S1 ^operator O1939)

   970:    O: O1939 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N970 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N969 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13611: I3 ^predict-yes N970)
<=WM: (13597: N969 ^status complete)
<=WM: (13596: I3 ^predict-yes N969)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (13615: I2 ^dir U)
=>WM: (13614: I2 ^reward 1)
=>WM: (13613: I2 ^see 1)
=>WM: (13612: N970 ^status complete)
<=WM: (13600: I2 ^dir L)
<=WM: (13599: I2 ^reward 1)
<=WM: (13598: I2 ^see 1)
=>WM: (13616: I2 ^level-1 L1-root)
<=WM: (13601: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R974 ^value 1 +)
 (R1 ^reward R974 +)
Firing propose*predict-yes
 -->
 (O1941 ^name predict-yes +)
 (S1 ^operator O1941 +)
Firing propose*predict-no
 -->
 (O1942 ^name predict-no +)
 (S1 ^operator O1942 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1940 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1939 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1940 ^name predict-no +)
 (S1 ^operator O1940 +)
Retracting propose*predict-yes
 -->
 (O1939 ^name predict-yes +)
 (S1 ^operator O1939 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R973 ^value 1 +)
 (R1 ^reward R973 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1940 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1940 = 0.3145032394390637)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1939 = 0.6093527419421177)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1939 = 0.3907931512898603)
=>WM: (13623: S1 ^operator O1942 +)
=>WM: (13622: S1 ^operator O1941 +)
=>WM: (13621: I3 ^dir U)
=>WM: (13620: O1942 ^name predict-no)
=>WM: (13619: O1941 ^name predict-yes)
=>WM: (13618: R974 ^value 1)
=>WM: (13617: R1 ^reward R974)
<=WM: (13608: S1 ^operator O1939 +)
<=WM: (13610: S1 ^operator O1939)
<=WM: (13609: S1 ^operator O1940 +)
<=WM: (13607: I3 ^dir L)
<=WM: (13603: R1 ^reward R973)
<=WM: (13606: O1940 ^name predict-no)
<=WM: (13605: O1939 ^name predict-yes)
<=WM: (13604: R973 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1941 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1942 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1940 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1939 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472337 -0.0815436 0.390793 -> 0.472327 -0.0815454 0.390781(R,m,v=1,0.941935,0.0550482)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527788 0.0815652 0.609353 -> 0.527776 0.0815632 0.609339(R,m,v=1,1,0)
=>WM: (13624: S1 ^operator O1942)

   971:    O: O1942 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N971 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N970 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13625: I3 ^predict-no N971)
<=WM: (13612: N970 ^status complete)
<=WM: (13611: I3 ^predict-yes N970)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
---- Input Phase --- 
=>WM: (13629: I2 ^dir R)
=>WM: (13628: I2 ^reward 1)
=>WM: (13627: I2 ^see 0)
=>WM: (13626: N971 ^status complete)
<=WM: (13615: I2 ^dir U)
<=WM: (13614: I2 ^reward 1)
<=WM: (13613: I2 ^see 1)
=>WM: (13630: I2 ^level-1 L1-root)
<=WM: (13616: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O1941 = 0.8784169509457307)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R975 ^value 1 +)
 (R1 ^reward R975 +)
Firing propose*predict-yes
 -->
 (O1943 ^name predict-yes +)
 (S1 ^operator O1943 +)
Firing propose*predict-no
 -->
 (O1944 ^name predict-no +)
 (S1 ^operator O1944 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1942 = 0.9999810901454903)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1941 = 0.1215994040064755)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1942 ^name predict-no +)
 (S1 ^operator O1942 +)
Retracting propose*predict-yes
 -->
 (O1941 ^name predict-yes +)
 (S1 ^operator O1941 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R974 ^value 1 +)
 (R1 ^reward R974 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1942 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1941 = 0.)
=>WM: (13638: S1 ^operator O1944 +)
=>WM: (13637: S1 ^operator O1943 +)
=>WM: (13636: I3 ^dir R)
=>WM: (13635: O1944 ^name predict-no)
=>WM: (13634: O1943 ^name predict-yes)
=>WM: (13633: R975 ^value 1)
=>WM: (13632: R1 ^reward R975)
=>WM: (13631: I3 ^see 0)
<=WM: (13622: S1 ^operator O1941 +)
<=WM: (13623: S1 ^operator O1942 +)
<=WM: (13624: S1 ^operator O1942)
<=WM: (13621: I3 ^dir U)
<=WM: (13617: R1 ^reward R974)
<=WM: (13602: I3 ^see 1)
<=WM: (13620: O1942 ^name predict-no)
<=WM: (13619: O1941 ^name predict-yes)
<=WM: (13618: R974 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O1943 = 0.8784169509457307)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1943 = 0.1215994040064755)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1944 = 0.9999810901454903)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1942 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1941 = 0.1215994040064755)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O1941 = 0.8784169509457307)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13639: S1 ^operator O1943)

   972:    O: O1943 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N972 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N971 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13640: I3 ^predict-yes N972)
<=WM: (13626: N971 ^status complete)
<=WM: (13625: I3 ^predict-no N971)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13644: I2 ^dir U)
=>WM: (13643: I2 ^reward 1)
=>WM: (13642: I2 ^see 1)
=>WM: (13641: N972 ^status complete)
<=WM: (13629: I2 ^dir R)
<=WM: (13628: I2 ^reward 1)
<=WM: (13627: I2 ^see 0)
=>WM: (13645: I2 ^level-1 R1-root)
<=WM: (13630: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R976 ^value 1 +)
 (R1 ^reward R976 +)
Firing propose*predict-yes
 -->
 (O1945 ^name predict-yes +)
 (S1 ^operator O1945 +)
Firing propose*predict-no
 -->
 (O1946 ^name predict-no +)
 (S1 ^operator O1946 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1944 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1943 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1944 ^name predict-no +)
 (S1 ^operator O1944 +)
Retracting propose*predict-yes
 -->
 (O1943 ^name predict-yes +)
 (S1 ^operator O1943 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R975 ^value 1 +)
 (R1 ^reward R975 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1944 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1943 = 0.1215994040064755)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O1943 = 0.8784169509457307)
=>WM: (13653: S1 ^operator O1946 +)
=>WM: (13652: S1 ^operator O1945 +)
=>WM: (13651: I3 ^dir U)
=>WM: (13650: O1946 ^name predict-no)
=>WM: (13649: O1945 ^name predict-yes)
=>WM: (13648: R976 ^value 1)
=>WM: (13647: R1 ^reward R976)
=>WM: (13646: I3 ^see 1)
<=WM: (13637: S1 ^operator O1943 +)
<=WM: (13639: S1 ^operator O1943)
<=WM: (13638: S1 ^operator O1944 +)
<=WM: (13636: I3 ^dir R)
<=WM: (13632: R1 ^reward R975)
<=WM: (13631: I3 ^see 0)
<=WM: (13635: O1944 ^name predict-no)
<=WM: (13634: O1943 ^name predict-yes)
<=WM: (13633: R975 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1945 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1946 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1944 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1943 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.860465,0.120767)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465488 0.412929 0.878417 -> 0.465487 0.412928 0.878415(R,m,v=1,1,0)
=>WM: (13654: S1 ^operator O1946)

   973:    O: O1946 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N973 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N972 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13655: I3 ^predict-no N973)
<=WM: (13641: N972 ^status complete)
<=WM: (13640: I3 ^predict-yes N972)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (13659: I2 ^dir L)
=>WM: (13658: I2 ^reward 1)
=>WM: (13657: I2 ^see 0)
=>WM: (13656: N973 ^status complete)
<=WM: (13644: I2 ^dir U)
<=WM: (13643: I2 ^reward 1)
<=WM: (13642: I2 ^see 1)
=>WM: (13660: I2 ^level-1 R1-root)
<=WM: (13645: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1946 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1945 = 0.609338805157315)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R977 ^value 1 +)
 (R1 ^reward R977 +)
Firing propose*predict-yes
 -->
 (O1947 ^name predict-yes +)
 (S1 ^operator O1947 +)
Firing propose*predict-no
 -->
 (O1948 ^name predict-no +)
 (S1 ^operator O1948 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1946 = 0.3145032394390637)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1945 = 0.3907810808803528)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1946 ^name predict-no +)
 (S1 ^operator O1946 +)
Retracting propose*predict-yes
 -->
 (O1945 ^name predict-yes +)
 (S1 ^operator O1945 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R976 ^value 1 +)
 (R1 ^reward R976 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1946 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1945 = 0.)
=>WM: (13668: S1 ^operator O1948 +)
=>WM: (13667: S1 ^operator O1947 +)
=>WM: (13666: I3 ^dir L)
=>WM: (13665: O1948 ^name predict-no)
=>WM: (13664: O1947 ^name predict-yes)
=>WM: (13663: R977 ^value 1)
=>WM: (13662: R1 ^reward R977)
=>WM: (13661: I3 ^see 0)
<=WM: (13652: S1 ^operator O1945 +)
<=WM: (13653: S1 ^operator O1946 +)
<=WM: (13654: S1 ^operator O1946)
<=WM: (13651: I3 ^dir U)
<=WM: (13647: R1 ^reward R976)
<=WM: (13646: I3 ^see 1)
<=WM: (13650: O1946 ^name predict-no)
<=WM: (13649: O1945 ^name predict-yes)
<=WM: (13648: R976 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1947 = 0.609338805157315)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1947 = 0.3907810808803528)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1948 = -0.168718511744511)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1948 = 0.3145032394390637)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1946 = 0.3145032394390637)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1946 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1945 = 0.3907810808803528)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1945 = 0.609338805157315)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13669: S1 ^operator O1947)

   974:    O: O1947 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N974 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N973 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13670: I3 ^predict-yes N974)
<=WM: (13656: N973 ^status complete)
<=WM: (13655: I3 ^predict-no N973)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (13674: I2 ^dir L)
=>WM: (13673: I2 ^reward 1)
=>WM: (13672: I2 ^see 1)
=>WM: (13671: N974 ^status complete)
<=WM: (13659: I2 ^dir L)
<=WM: (13658: I2 ^reward 1)
<=WM: (13657: I2 ^see 0)
=>WM: (13675: I2 ^level-1 L1-root)
<=WM: (13660: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1947 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1948 = 0.6855461517499103)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R978 ^value 1 +)
 (R1 ^reward R978 +)
Firing propose*predict-yes
 -->
 (O1949 ^name predict-yes +)
 (S1 ^operator O1949 +)
Firing propose*predict-no
 -->
 (O1950 ^name predict-no +)
 (S1 ^operator O1950 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1948 = 0.3145032394390637)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1947 = 0.3907810808803528)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1948 ^name predict-no +)
 (S1 ^operator O1948 +)
Retracting propose*predict-yes
 -->
 (O1947 ^name predict-yes +)
 (S1 ^operator O1947 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R977 ^value 1 +)
 (R1 ^reward R977 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1948 = 0.3145032394390637)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1948 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1947 = 0.3907810808803528)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1947 = 0.609338805157315)
=>WM: (13682: S1 ^operator O1950 +)
=>WM: (13681: S1 ^operator O1949 +)
=>WM: (13680: O1950 ^name predict-no)
=>WM: (13679: O1949 ^name predict-yes)
=>WM: (13678: R978 ^value 1)
=>WM: (13677: R1 ^reward R978)
=>WM: (13676: I3 ^see 1)
<=WM: (13667: S1 ^operator O1947 +)
<=WM: (13669: S1 ^operator O1947)
<=WM: (13668: S1 ^operator O1948 +)
<=WM: (13662: R1 ^reward R977)
<=WM: (13661: I3 ^see 0)
<=WM: (13665: O1948 ^name predict-no)
<=WM: (13664: O1947 ^name predict-yes)
<=WM: (13663: R977 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1949 = 0.3907810808803528)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1949 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1950 = 0.3145032394390637)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1950 = 0.6855461517499103)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1948 = 0.3145032394390637)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1948 = 0.6855461517499103)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1947 = 0.3907810808803528)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1947 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472327 -0.0815454 0.390781 -> 0.472318 -0.0815469 0.390771(R,m,v=1,0.942308,0.0547146)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527776 0.0815632 0.609339 -> 0.527766 0.0815615 0.609327(R,m,v=1,1,0)
=>WM: (13683: S1 ^operator O1950)

   975:    O: O1950 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N975 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N974 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13684: I3 ^predict-no N975)
<=WM: (13671: N974 ^status complete)
<=WM: (13670: I3 ^predict-yes N974)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13688: I2 ^dir U)
=>WM: (13687: I2 ^reward 1)
=>WM: (13686: I2 ^see 0)
=>WM: (13685: N975 ^status complete)
<=WM: (13674: I2 ^dir L)
<=WM: (13673: I2 ^reward 1)
<=WM: (13672: I2 ^see 1)
=>WM: (13689: I2 ^level-1 L0-root)
<=WM: (13675: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R979 ^value 1 +)
 (R1 ^reward R979 +)
Firing propose*predict-yes
 -->
 (O1951 ^name predict-yes +)
 (S1 ^operator O1951 +)
Firing propose*predict-no
 -->
 (O1952 ^name predict-no +)
 (S1 ^operator O1952 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1950 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1949 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1950 ^name predict-no +)
 (S1 ^operator O1950 +)
Retracting propose*predict-yes
 -->
 (O1949 ^name predict-yes +)
 (S1 ^operator O1949 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R978 ^value 1 +)
 (R1 ^reward R978 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1950 = 0.6855461517499103)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1950 = 0.3145032394390637)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1949 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1949 = 0.3907711727075364)
=>WM: (13697: S1 ^operator O1952 +)
=>WM: (13696: S1 ^operator O1951 +)
=>WM: (13695: I3 ^dir U)
=>WM: (13694: O1952 ^name predict-no)
=>WM: (13693: O1951 ^name predict-yes)
=>WM: (13692: R979 ^value 1)
=>WM: (13691: R1 ^reward R979)
=>WM: (13690: I3 ^see 0)
<=WM: (13681: S1 ^operator O1949 +)
<=WM: (13682: S1 ^operator O1950 +)
<=WM: (13683: S1 ^operator O1950)
<=WM: (13666: I3 ^dir L)
<=WM: (13677: R1 ^reward R978)
<=WM: (13676: I3 ^see 1)
<=WM: (13680: O1950 ^name predict-no)
<=WM: (13679: O1949 ^name predict-yes)
<=WM: (13678: R978 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1951 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1952 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1950 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1949 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478552 -0.164048 0.314503 -> 0.478548 -0.164049 0.314499(R,m,v=1,0.92053,0.0736424)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521493 0.164053 0.685546 -> 0.521489 0.164052 0.685541(R,m,v=1,1,0)
=>WM: (13698: S1 ^operator O1952)

   976:    O: O1952 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N976 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N975 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13699: I3 ^predict-no N976)
<=WM: (13685: N975 ^status complete)
<=WM: (13684: I3 ^predict-no N975)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13703: I2 ^dir L)
=>WM: (13702: I2 ^reward 1)
=>WM: (13701: I2 ^see 0)
=>WM: (13700: N976 ^status complete)
<=WM: (13688: I2 ^dir U)
<=WM: (13687: I2 ^reward 1)
<=WM: (13686: I2 ^see 0)
=>WM: (13704: I2 ^level-1 L0-root)
<=WM: (13689: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1951 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1952 = 0.6854177156873388)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R980 ^value 1 +)
 (R1 ^reward R980 +)
Firing propose*predict-yes
 -->
 (O1953 ^name predict-yes +)
 (S1 ^operator O1953 +)
Firing propose*predict-no
 -->
 (O1954 ^name predict-no +)
 (S1 ^operator O1954 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1952 = 0.3144991353263821)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1951 = 0.3907711727075364)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1952 ^name predict-no +)
 (S1 ^operator O1952 +)
Retracting propose*predict-yes
 -->
 (O1951 ^name predict-yes +)
 (S1 ^operator O1951 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R979 ^value 1 +)
 (R1 ^reward R979 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1952 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1951 = 0.)
=>WM: (13711: S1 ^operator O1954 +)
=>WM: (13710: S1 ^operator O1953 +)
=>WM: (13709: I3 ^dir L)
=>WM: (13708: O1954 ^name predict-no)
=>WM: (13707: O1953 ^name predict-yes)
=>WM: (13706: R980 ^value 1)
=>WM: (13705: R1 ^reward R980)
<=WM: (13696: S1 ^operator O1951 +)
<=WM: (13697: S1 ^operator O1952 +)
<=WM: (13698: S1 ^operator O1952)
<=WM: (13695: I3 ^dir U)
<=WM: (13691: R1 ^reward R979)
<=WM: (13694: O1952 ^name predict-no)
<=WM: (13693: O1951 ^name predict-yes)
<=WM: (13692: R979 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1953 = -0.208713043145708)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1953 = 0.3907711727075364)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1954 = 0.6854177156873388)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1954 = 0.3144991353263821)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1952 = 0.3144991353263821)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1952 = 0.6854177156873388)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1951 = 0.3907711727075364)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1951 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13712: S1 ^operator O1954)

   977:    O: O1954 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N977 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N976 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13713: I3 ^predict-no N977)
<=WM: (13700: N976 ^status complete)
<=WM: (13699: I3 ^predict-no N976)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13717: I2 ^dir R)
=>WM: (13716: I2 ^reward 1)
=>WM: (13715: I2 ^see 0)
=>WM: (13714: N977 ^status complete)
<=WM: (13703: I2 ^dir L)
<=WM: (13702: I2 ^reward 1)
<=WM: (13701: I2 ^see 0)
=>WM: (13718: I2 ^level-1 L0-root)
<=WM: (13704: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1953 = 0.8783927855286688)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R981 ^value 1 +)
 (R1 ^reward R981 +)
Firing propose*predict-yes
 -->
 (O1955 ^name predict-yes +)
 (S1 ^operator O1955 +)
Firing propose*predict-no
 -->
 (O1956 ^name predict-no +)
 (S1 ^operator O1956 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1954 = 0.9999810901454903)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1953 = 0.1215980737936329)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1954 ^name predict-no +)
 (S1 ^operator O1954 +)
Retracting propose*predict-yes
 -->
 (O1953 ^name predict-yes +)
 (S1 ^operator O1953 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R980 ^value 1 +)
 (R1 ^reward R980 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1954 = 0.3144991353263821)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1954 = 0.6854177156873388)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1953 = 0.3907711727075364)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1953 = -0.208713043145708)
=>WM: (13725: S1 ^operator O1956 +)
=>WM: (13724: S1 ^operator O1955 +)
=>WM: (13723: I3 ^dir R)
=>WM: (13722: O1956 ^name predict-no)
=>WM: (13721: O1955 ^name predict-yes)
=>WM: (13720: R981 ^value 1)
=>WM: (13719: R1 ^reward R981)
<=WM: (13710: S1 ^operator O1953 +)
<=WM: (13711: S1 ^operator O1954 +)
<=WM: (13712: S1 ^operator O1954)
<=WM: (13709: I3 ^dir L)
<=WM: (13705: R1 ^reward R980)
<=WM: (13708: O1954 ^name predict-no)
<=WM: (13707: O1953 ^name predict-yes)
<=WM: (13706: R980 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1955 = 0.8783927855286688)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1955 = 0.1215980737936329)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1956 = 0.9999810901454903)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1954 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1953 = 0.1215980737936329)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1953 = 0.8783927855286688)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478548 -0.164049 0.314499 -> 0.478554 -0.164048 0.314506(R,m,v=1,0.921053,0.0731962)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521377 0.164041 0.685418 -> 0.521384 0.164042 0.685426(R,m,v=1,1,0)
=>WM: (13726: S1 ^operator O1955)

   978:    O: O1955 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N978 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N977 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13727: I3 ^predict-yes N978)
<=WM: (13714: N977 ^status complete)
<=WM: (13713: I3 ^predict-no N977)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13731: I2 ^dir L)
=>WM: (13730: I2 ^reward 1)
=>WM: (13729: I2 ^see 1)
=>WM: (13728: N978 ^status complete)
<=WM: (13717: I2 ^dir R)
<=WM: (13716: I2 ^reward 1)
<=WM: (13715: I2 ^see 0)
=>WM: (13732: I2 ^level-1 R1-root)
<=WM: (13718: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1956 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1955 = 0.6093273841659509)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R982 ^value 1 +)
 (R1 ^reward R982 +)
Firing propose*predict-yes
 -->
 (O1957 ^name predict-yes +)
 (S1 ^operator O1957 +)
Firing propose*predict-no
 -->
 (O1958 ^name predict-no +)
 (S1 ^operator O1958 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 0.3145060369395525)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.3907711727075364)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1956 ^name predict-no +)
 (S1 ^operator O1956 +)
Retracting propose*predict-yes
 -->
 (O1955 ^name predict-yes +)
 (S1 ^operator O1955 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R981 ^value 1 +)
 (R1 ^reward R981 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1956 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1955 = 0.1215980737936329)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1955 = 0.8783927855286688)
=>WM: (13740: S1 ^operator O1958 +)
=>WM: (13739: S1 ^operator O1957 +)
=>WM: (13738: I3 ^dir L)
=>WM: (13737: O1958 ^name predict-no)
=>WM: (13736: O1957 ^name predict-yes)
=>WM: (13735: R982 ^value 1)
=>WM: (13734: R1 ^reward R982)
=>WM: (13733: I3 ^see 1)
<=WM: (13724: S1 ^operator O1955 +)
<=WM: (13726: S1 ^operator O1955)
<=WM: (13725: S1 ^operator O1956 +)
<=WM: (13723: I3 ^dir R)
<=WM: (13719: R1 ^reward R981)
<=WM: (13690: I3 ^see 0)
<=WM: (13722: O1956 ^name predict-no)
<=WM: (13721: O1955 ^name predict-yes)
<=WM: (13720: R981 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1957 = 0.3907711727075364)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1957 = 0.6093273841659509)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1958 = 0.3145060369395525)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1958 = -0.168718511744511)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1956 = 0.3145060369395525)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1956 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1955 = 0.3907711727075364)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1955 = 0.6093273841659509)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.861272,0.120177)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465468 0.412925 0.878393 -> 0.465469 0.412925 0.878394(R,m,v=1,1,0)
=>WM: (13741: S1 ^operator O1957)

   979:    O: O1957 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N979 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N978 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13742: I3 ^predict-yes N979)
<=WM: (13728: N978 ^status complete)
<=WM: (13727: I3 ^predict-yes N978)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13746: I2 ^dir R)
=>WM: (13745: I2 ^reward 1)
=>WM: (13744: I2 ^see 1)
=>WM: (13743: N979 ^status complete)
<=WM: (13731: I2 ^dir L)
<=WM: (13730: I2 ^reward 1)
<=WM: (13729: I2 ^see 1)
=>WM: (13747: I2 ^level-1 L1-root)
<=WM: (13732: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O1957 = 0.8784154092082219)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R983 ^value 1 +)
 (R1 ^reward R983 +)
Firing propose*predict-yes
 -->
 (O1959 ^name predict-yes +)
 (S1 ^operator O1959 +)
Firing propose*predict-no
 -->
 (O1960 ^name predict-no +)
 (S1 ^operator O1960 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1958 = 0.9999810901454903)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1957 = 0.1215988165406292)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1958 ^name predict-no +)
 (S1 ^operator O1958 +)
Retracting propose*predict-yes
 -->
 (O1957 ^name predict-yes +)
 (S1 ^operator O1957 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R982 ^value 1 +)
 (R1 ^reward R982 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1958 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1958 = 0.3145060369395525)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1957 = 0.6093273841659509)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1957 = 0.3907711727075364)
=>WM: (13754: S1 ^operator O1960 +)
=>WM: (13753: S1 ^operator O1959 +)
=>WM: (13752: I3 ^dir R)
=>WM: (13751: O1960 ^name predict-no)
=>WM: (13750: O1959 ^name predict-yes)
=>WM: (13749: R983 ^value 1)
=>WM: (13748: R1 ^reward R983)
<=WM: (13739: S1 ^operator O1957 +)
<=WM: (13741: S1 ^operator O1957)
<=WM: (13740: S1 ^operator O1958 +)
<=WM: (13738: I3 ^dir L)
<=WM: (13734: R1 ^reward R982)
<=WM: (13737: O1958 ^name predict-no)
<=WM: (13736: O1957 ^name predict-yes)
<=WM: (13735: R982 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.1215988165406292)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O1959 = 0.8784154092082219)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.9999810901454903)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1958 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1957 = 0.1215988165406292)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O1957 = 0.8784154092082219)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472318 -0.0815469 0.390771 -> 0.472311 -0.0815481 0.390763(R,m,v=1,0.942675,0.0543851)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527766 0.0815615 0.609327 -> 0.527758 0.0815601 0.609318(R,m,v=1,1,0)
=>WM: (13755: S1 ^operator O1959)

   980:    O: O1959 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N980 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N979 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13756: I3 ^predict-yes N980)
<=WM: (13743: N979 ^status complete)
<=WM: (13742: I3 ^predict-yes N979)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (13760: I2 ^dir R)
=>WM: (13759: I2 ^reward 1)
=>WM: (13758: I2 ^see 1)
=>WM: (13757: N980 ^status complete)
<=WM: (13746: I2 ^dir R)
<=WM: (13745: I2 ^reward 1)
<=WM: (13744: I2 ^see 1)
=>WM: (13761: I2 ^level-1 R1-root)
<=WM: (13747: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1959 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R984 ^value 1 +)
 (R1 ^reward R984 +)
Firing propose*predict-yes
 -->
 (O1961 ^name predict-yes +)
 (S1 ^operator O1961 +)
Firing propose*predict-no
 -->
 (O1962 ^name predict-no +)
 (S1 ^operator O1962 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.9999810901454903)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.1215988165406292)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1960 ^name predict-no +)
 (S1 ^operator O1960 +)
Retracting propose*predict-yes
 -->
 (O1959 ^name predict-yes +)
 (S1 ^operator O1959 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R983 ^value 1 +)
 (R1 ^reward R983 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O1959 = 0.8784154092082219)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.1215988165406292)
=>WM: (13767: S1 ^operator O1962 +)
=>WM: (13766: S1 ^operator O1961 +)
=>WM: (13765: O1962 ^name predict-no)
=>WM: (13764: O1961 ^name predict-yes)
=>WM: (13763: R984 ^value 1)
=>WM: (13762: R1 ^reward R984)
<=WM: (13753: S1 ^operator O1959 +)
<=WM: (13755: S1 ^operator O1959)
<=WM: (13754: S1 ^operator O1960 +)
<=WM: (13748: R1 ^reward R983)
<=WM: (13751: O1960 ^name predict-no)
<=WM: (13750: O1959 ^name predict-yes)
<=WM: (13749: R983 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1961 = 0.1215988165406292)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1961 = -0.04253361215288998)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1962 = 0.9999810901454903)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1960 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1959 = 0.1215988165406292)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1959 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.862069,0.119593)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465487 0.412928 0.878415 -> 0.465486 0.412928 0.878414(R,m,v=1,1,0)
=>WM: (13768: S1 ^operator O1962)

   981:    O: O1962 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N981 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N980 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13769: I3 ^predict-no N981)
<=WM: (13757: N980 ^status complete)
<=WM: (13756: I3 ^predict-yes N980)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/--- Input Phase --- 
=>WM: (13773: I2 ^dir L)
=>WM: (13772: I2 ^reward 1)
=>WM: (13771: I2 ^see 0)
=>WM: (13770: N981 ^status complete)
<=WM: (13760: I2 ^dir R)
<=WM: (13759: I2 ^reward 1)
<=WM: (13758: I2 ^see 1)
=>WM: (13774: I2 ^level-1 R0-root)
<=WM: (13761: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1962 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1961 = 0.609089086334031)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R985 ^value 1 +)
 (R1 ^reward R985 +)
Firing propose*predict-yes
 -->
 (O1963 ^name predict-yes +)
 (S1 ^operator O1963 +)
Firing propose*predict-no
 -->
 (O1964 ^name predict-no +)
 (S1 ^operator O1964 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1962 = 0.3145060369395525)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1961 = 0.39076303591152)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1962 ^name predict-no +)
 (S1 ^operator O1962 +)
Retracting propose*predict-yes
 -->
 (O1961 ^name predict-yes +)
 (S1 ^operator O1961 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R984 ^value 1 +)
 (R1 ^reward R984 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1962 = 0.9999810901454903)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1961 = -0.04253361215288998)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1961 = 0.1215976616761118)
=>WM: (13782: S1 ^operator O1964 +)
=>WM: (13781: S1 ^operator O1963 +)
=>WM: (13780: I3 ^dir L)
=>WM: (13779: O1964 ^name predict-no)
=>WM: (13778: O1963 ^name predict-yes)
=>WM: (13777: R985 ^value 1)
=>WM: (13776: R1 ^reward R985)
=>WM: (13775: I3 ^see 0)
<=WM: (13766: S1 ^operator O1961 +)
<=WM: (13767: S1 ^operator O1962 +)
<=WM: (13768: S1 ^operator O1962)
<=WM: (13752: I3 ^dir R)
<=WM: (13762: R1 ^reward R984)
<=WM: (13733: I3 ^see 1)
<=WM: (13765: O1962 ^name predict-no)
<=WM: (13764: O1961 ^name predict-yes)
<=WM: (13763: R984 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1963 = 0.39076303591152)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1963 = 0.609089086334031)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1964 = 0.3145060369395525)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1964 = -0.1984300550322165)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1962 = 0.3145060369395525)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1962 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1961 = 0.39076303591152)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1961 = 0.609089086334031)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999981 0 0.999981 -> 0.999984 0 0.999984(R,m,v=1,0.937143,0.0592447)
=>WM: (13783: S1 ^operator O1963)

   982:    O: O1963 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N982 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N981 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13784: I3 ^predict-yes N982)
<=WM: (13770: N981 ^status complete)
<=WM: (13769: I3 ^predict-no N981)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (13788: I2 ^dir L)
=>WM: (13787: I2 ^reward 1)
=>WM: (13786: I2 ^see 1)
=>WM: (13785: N982 ^status complete)
<=WM: (13773: I2 ^dir L)
<=WM: (13772: I2 ^reward 1)
<=WM: (13771: I2 ^see 0)
=>WM: (13789: I2 ^level-1 L1-root)
<=WM: (13774: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1963 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1964 = 0.6855414715988584)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R986 ^value 1 +)
 (R1 ^reward R986 +)
Firing propose*predict-yes
 -->
 (O1965 ^name predict-yes +)
 (S1 ^operator O1965 +)
Firing propose*predict-no
 -->
 (O1966 ^name predict-no +)
 (S1 ^operator O1966 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1964 = 0.3145060369395525)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1963 = 0.39076303591152)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1964 ^name predict-no +)
 (S1 ^operator O1964 +)
Retracting propose*predict-yes
 -->
 (O1963 ^name predict-yes +)
 (S1 ^operator O1963 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R985 ^value 1 +)
 (R1 ^reward R985 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1964 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1964 = 0.3145060369395525)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1963 = 0.609089086334031)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1963 = 0.39076303591152)
=>WM: (13796: S1 ^operator O1966 +)
=>WM: (13795: S1 ^operator O1965 +)
=>WM: (13794: O1966 ^name predict-no)
=>WM: (13793: O1965 ^name predict-yes)
=>WM: (13792: R986 ^value 1)
=>WM: (13791: R1 ^reward R986)
=>WM: (13790: I3 ^see 1)
<=WM: (13781: S1 ^operator O1963 +)
<=WM: (13783: S1 ^operator O1963)
<=WM: (13782: S1 ^operator O1964 +)
<=WM: (13776: R1 ^reward R985)
<=WM: (13775: I3 ^see 0)
<=WM: (13779: O1964 ^name predict-no)
<=WM: (13778: O1963 ^name predict-yes)
<=WM: (13777: R985 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1965 = 0.39076303591152)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1965 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1966 = 0.3145060369395525)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1966 = 0.6855414715988584)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1964 = 0.3145060369395525)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1964 = 0.6855414715988584)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1963 = 0.39076303591152)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1963 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472311 -0.0815481 0.390763 -> 0.472322 -0.0815463 0.390775(R,m,v=1,0.943038,0.0540595)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527563 0.0815262 0.609089 -> 0.527575 0.0815283 0.609103(R,m,v=1,1,0)
=>WM: (13797: S1 ^operator O1966)

   983:    O: O1966 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N983 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N982 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13798: I3 ^predict-no N983)
<=WM: (13785: N982 ^status complete)
<=WM: (13784: I3 ^predict-yes N982)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (13802: I2 ^dir R)
=>WM: (13801: I2 ^reward 1)
=>WM: (13800: I2 ^see 0)
=>WM: (13799: N983 ^status complete)
<=WM: (13788: I2 ^dir L)
<=WM: (13787: I2 ^reward 1)
<=WM: (13786: I2 ^see 1)
=>WM: (13803: I2 ^level-1 L0-root)
<=WM: (13789: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1965 = 0.8783936611550894)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R987 ^value 1 +)
 (R1 ^reward R987 +)
Firing propose*predict-yes
 -->
 (O1967 ^name predict-yes +)
 (S1 ^operator O1967 +)
Firing propose*predict-no
 -->
 (O1968 ^name predict-no +)
 (S1 ^operator O1968 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1966 = 0.9999841575438704)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1965 = 0.1215976616761118)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1966 ^name predict-no +)
 (S1 ^operator O1966 +)
Retracting propose*predict-yes
 -->
 (O1965 ^name predict-yes +)
 (S1 ^operator O1965 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R986 ^value 1 +)
 (R1 ^reward R986 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1966 = 0.6855414715988584)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1966 = 0.3145060369395525)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1965 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1965 = 0.390775231823802)
=>WM: (13811: S1 ^operator O1968 +)
=>WM: (13810: S1 ^operator O1967 +)
=>WM: (13809: I3 ^dir R)
=>WM: (13808: O1968 ^name predict-no)
=>WM: (13807: O1967 ^name predict-yes)
=>WM: (13806: R987 ^value 1)
=>WM: (13805: R1 ^reward R987)
=>WM: (13804: I3 ^see 0)
<=WM: (13795: S1 ^operator O1965 +)
<=WM: (13796: S1 ^operator O1966 +)
<=WM: (13797: S1 ^operator O1966)
<=WM: (13780: I3 ^dir L)
<=WM: (13791: R1 ^reward R986)
<=WM: (13790: I3 ^see 1)
<=WM: (13794: O1966 ^name predict-no)
<=WM: (13793: O1965 ^name predict-yes)
<=WM: (13792: R986 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1967 = 0.1215976616761118)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1967 = 0.8783936611550894)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1968 = 0.9999841575438704)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1966 = 0.9999841575438704)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1965 = 0.1215976616761118)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1965 = 0.8783936611550894)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478554 -0.164048 0.314506 -> 0.478551 -0.164048 0.314502(R,m,v=1,0.921569,0.0727554)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521489 0.164052 0.685541 -> 0.521485 0.164052 0.685537(R,m,v=1,1,0)
=>WM: (13812: S1 ^operator O1967)

   984:    O: O1967 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N984 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N983 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13813: I3 ^predict-yes N984)
<=WM: (13799: N983 ^status complete)
<=WM: (13798: I3 ^predict-no N983)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13817: I2 ^dir U)
=>WM: (13816: I2 ^reward 1)
=>WM: (13815: I2 ^see 1)
=>WM: (13814: N984 ^status complete)
<=WM: (13802: I2 ^dir R)
<=WM: (13801: I2 ^reward 1)
<=WM: (13800: I2 ^see 0)
=>WM: (13818: I2 ^level-1 R1-root)
<=WM: (13803: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R988 ^value 1 +)
 (R1 ^reward R988 +)
Firing propose*predict-yes
 -->
 (O1969 ^name predict-yes +)
 (S1 ^operator O1969 +)
Firing propose*predict-no
 -->
 (O1970 ^name predict-no +)
 (S1 ^operator O1970 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1968 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1967 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1968 ^name predict-no +)
 (S1 ^operator O1968 +)
Retracting propose*predict-yes
 -->
 (O1967 ^name predict-yes +)
 (S1 ^operator O1967 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R987 ^value 1 +)
 (R1 ^reward R987 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1968 = 0.9999841575438704)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1967 = 0.8783936611550894)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1967 = 0.1215976616761118)
=>WM: (13826: S1 ^operator O1970 +)
=>WM: (13825: S1 ^operator O1969 +)
=>WM: (13824: I3 ^dir U)
=>WM: (13823: O1970 ^name predict-no)
=>WM: (13822: O1969 ^name predict-yes)
=>WM: (13821: R988 ^value 1)
=>WM: (13820: R1 ^reward R988)
=>WM: (13819: I3 ^see 1)
<=WM: (13810: S1 ^operator O1967 +)
<=WM: (13812: S1 ^operator O1967)
<=WM: (13811: S1 ^operator O1968 +)
<=WM: (13809: I3 ^dir R)
<=WM: (13805: R1 ^reward R987)
<=WM: (13804: I3 ^see 0)
<=WM: (13808: O1968 ^name predict-no)
<=WM: (13807: O1967 ^name predict-yes)
<=WM: (13806: R987 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1969 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1970 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1968 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1967 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.862857,0.119015)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465469 0.412925 0.878394 -> 0.46547 0.412925 0.878394(R,m,v=1,1,0)
=>WM: (13827: S1 ^operator O1970)

   985:    O: O1970 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N985 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N984 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13828: I3 ^predict-no N985)
<=WM: (13814: N984 ^status complete)
<=WM: (13813: I3 ^predict-yes N984)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (13832: I2 ^dir L)
=>WM: (13831: I2 ^reward 1)
=>WM: (13830: I2 ^see 0)
=>WM: (13829: N985 ^status complete)
<=WM: (13817: I2 ^dir U)
<=WM: (13816: I2 ^reward 1)
<=WM: (13815: I2 ^see 1)
=>WM: (13833: I2 ^level-1 R1-root)
<=WM: (13818: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1970 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1969 = 0.6093180204125221)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R989 ^value 1 +)
 (R1 ^reward R989 +)
Firing propose*predict-yes
 -->
 (O1971 ^name predict-yes +)
 (S1 ^operator O1971 +)
Firing propose*predict-no
 -->
 (O1972 ^name predict-no +)
 (S1 ^operator O1972 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1970 = 0.3145020978774952)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1969 = 0.390775231823802)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1970 ^name predict-no +)
 (S1 ^operator O1970 +)
Retracting propose*predict-yes
 -->
 (O1969 ^name predict-yes +)
 (S1 ^operator O1969 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R988 ^value 1 +)
 (R1 ^reward R988 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1970 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1969 = 0.)
=>WM: (13841: S1 ^operator O1972 +)
=>WM: (13840: S1 ^operator O1971 +)
=>WM: (13839: I3 ^dir L)
=>WM: (13838: O1972 ^name predict-no)
=>WM: (13837: O1971 ^name predict-yes)
=>WM: (13836: R989 ^value 1)
=>WM: (13835: R1 ^reward R989)
=>WM: (13834: I3 ^see 0)
<=WM: (13825: S1 ^operator O1969 +)
<=WM: (13826: S1 ^operator O1970 +)
<=WM: (13827: S1 ^operator O1970)
<=WM: (13824: I3 ^dir U)
<=WM: (13820: R1 ^reward R988)
<=WM: (13819: I3 ^see 1)
<=WM: (13823: O1970 ^name predict-no)
<=WM: (13822: O1969 ^name predict-yes)
<=WM: (13821: R988 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1971 = 0.6093180204125221)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1971 = 0.390775231823802)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1972 = -0.168718511744511)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1972 = 0.3145020978774952)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1970 = 0.3145020978774952)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1970 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1969 = 0.390775231823802)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1969 = 0.6093180204125221)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13842: S1 ^operator O1971)

   986:    O: O1971 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N986 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N985 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13843: I3 ^predict-yes N986)
<=WM: (13829: N985 ^status complete)
<=WM: (13828: I3 ^predict-no N985)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13847: I2 ^dir L)
=>WM: (13846: I2 ^reward 1)
=>WM: (13845: I2 ^see 1)
=>WM: (13844: N986 ^status complete)
<=WM: (13832: I2 ^dir L)
<=WM: (13831: I2 ^reward 1)
<=WM: (13830: I2 ^see 0)
=>WM: (13848: I2 ^level-1 L1-root)
<=WM: (13833: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1971 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1972 = 0.6855369815787629)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R990 ^value 1 +)
 (R1 ^reward R990 +)
Firing propose*predict-yes
 -->
 (O1973 ^name predict-yes +)
 (S1 ^operator O1973 +)
Firing propose*predict-no
 -->
 (O1974 ^name predict-no +)
 (S1 ^operator O1974 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1972 = 0.3145020978774952)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1971 = 0.390775231823802)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1972 ^name predict-no +)
 (S1 ^operator O1972 +)
Retracting propose*predict-yes
 -->
 (O1971 ^name predict-yes +)
 (S1 ^operator O1971 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R989 ^value 1 +)
 (R1 ^reward R989 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1972 = 0.3145020978774952)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O1972 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1971 = 0.390775231823802)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O1971 = 0.6093180204125221)
=>WM: (13855: S1 ^operator O1974 +)
=>WM: (13854: S1 ^operator O1973 +)
=>WM: (13853: O1974 ^name predict-no)
=>WM: (13852: O1973 ^name predict-yes)
=>WM: (13851: R990 ^value 1)
=>WM: (13850: R1 ^reward R990)
=>WM: (13849: I3 ^see 1)
<=WM: (13840: S1 ^operator O1971 +)
<=WM: (13842: S1 ^operator O1971)
<=WM: (13841: S1 ^operator O1972 +)
<=WM: (13835: R1 ^reward R989)
<=WM: (13834: I3 ^see 0)
<=WM: (13838: O1972 ^name predict-no)
<=WM: (13837: O1971 ^name predict-yes)
<=WM: (13836: R989 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1973 = 0.390775231823802)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1973 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1974 = 0.3145020978774952)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1974 = 0.6855369815787629)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1972 = 0.3145020978774952)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1972 = 0.6855369815787629)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1971 = 0.390775231823802)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1971 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472322 -0.0815463 0.390775 -> 0.472315 -0.0815474 0.390768(R,m,v=1,0.943396,0.0537378)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527758 0.0815601 0.609318 -> 0.52775 0.0815588 0.609309(R,m,v=1,1,0)
=>WM: (13856: S1 ^operator O1974)

   987:    O: O1974 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N987 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N986 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13857: I3 ^predict-no N987)
<=WM: (13844: N986 ^status complete)
<=WM: (13843: I3 ^predict-yes N986)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13861: I2 ^dir R)
=>WM: (13860: I2 ^reward 1)
=>WM: (13859: I2 ^see 0)
=>WM: (13858: N987 ^status complete)
<=WM: (13847: I2 ^dir L)
<=WM: (13846: I2 ^reward 1)
<=WM: (13845: I2 ^see 1)
=>WM: (13862: I2 ^level-1 L0-root)
<=WM: (13848: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1973 = 0.8783944900614931)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R991 ^value 1 +)
 (R1 ^reward R991 +)
Firing propose*predict-yes
 -->
 (O1975 ^name predict-yes +)
 (S1 ^operator O1975 +)
Firing propose*predict-no
 -->
 (O1976 ^name predict-no +)
 (S1 ^operator O1976 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1974 = 0.9999841575438704)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1973 = 0.1215983654449722)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1974 ^name predict-no +)
 (S1 ^operator O1974 +)
Retracting propose*predict-yes
 -->
 (O1973 ^name predict-yes +)
 (S1 ^operator O1973 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R990 ^value 1 +)
 (R1 ^reward R990 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1974 = 0.6855369815787629)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1974 = 0.3145020978774952)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1973 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1973 = 0.3907675490335307)
=>WM: (13870: S1 ^operator O1976 +)
=>WM: (13869: S1 ^operator O1975 +)
=>WM: (13868: I3 ^dir R)
=>WM: (13867: O1976 ^name predict-no)
=>WM: (13866: O1975 ^name predict-yes)
=>WM: (13865: R991 ^value 1)
=>WM: (13864: R1 ^reward R991)
=>WM: (13863: I3 ^see 0)
<=WM: (13854: S1 ^operator O1973 +)
<=WM: (13855: S1 ^operator O1974 +)
<=WM: (13856: S1 ^operator O1974)
<=WM: (13839: I3 ^dir L)
<=WM: (13850: R1 ^reward R990)
<=WM: (13849: I3 ^see 1)
<=WM: (13853: O1974 ^name predict-no)
<=WM: (13852: O1973 ^name predict-yes)
<=WM: (13851: R990 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1975 = 0.1215983654449722)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1975 = 0.8783944900614931)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1976 = 0.9999841575438704)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1974 = 0.9999841575438704)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1973 = 0.1215983654449722)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1973 = 0.8783944900614931)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478551 -0.164048 0.314502 -> 0.478548 -0.164049 0.314499(R,m,v=1,0.922078,0.0723198)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521485 0.164052 0.685537 -> 0.521482 0.164052 0.685533(R,m,v=1,1,0)
=>WM: (13871: S1 ^operator O1975)

   988:    O: O1975 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N988 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N987 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13872: I3 ^predict-yes N988)
<=WM: (13858: N987 ^status complete)
<=WM: (13857: I3 ^predict-no N987)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13876: I2 ^dir R)
=>WM: (13875: I2 ^reward 1)
=>WM: (13874: I2 ^see 1)
=>WM: (13873: N988 ^status complete)
<=WM: (13861: I2 ^dir R)
<=WM: (13860: I2 ^reward 1)
<=WM: (13859: I2 ^see 0)
=>WM: (13877: I2 ^level-1 R1-root)
<=WM: (13862: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1975 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R992 ^value 1 +)
 (R1 ^reward R992 +)
Firing propose*predict-yes
 -->
 (O1977 ^name predict-yes +)
 (S1 ^operator O1977 +)
Firing propose*predict-no
 -->
 (O1978 ^name predict-no +)
 (S1 ^operator O1978 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1976 = 0.9999841575438704)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1975 = 0.1215983654449722)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1976 ^name predict-no +)
 (S1 ^operator O1976 +)
Retracting propose*predict-yes
 -->
 (O1975 ^name predict-yes +)
 (S1 ^operator O1975 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R991 ^value 1 +)
 (R1 ^reward R991 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1976 = 0.9999841575438704)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1975 = 0.8783944900614931)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1975 = 0.1215983654449722)
=>WM: (13884: S1 ^operator O1978 +)
=>WM: (13883: S1 ^operator O1977 +)
=>WM: (13882: O1978 ^name predict-no)
=>WM: (13881: O1977 ^name predict-yes)
=>WM: (13880: R992 ^value 1)
=>WM: (13879: R1 ^reward R992)
=>WM: (13878: I3 ^see 1)
<=WM: (13869: S1 ^operator O1975 +)
<=WM: (13871: S1 ^operator O1975)
<=WM: (13870: S1 ^operator O1976 +)
<=WM: (13864: R1 ^reward R991)
<=WM: (13863: I3 ^see 0)
<=WM: (13867: O1976 ^name predict-no)
<=WM: (13866: O1975 ^name predict-yes)
<=WM: (13865: R991 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1977 = 0.1215983654449722)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1977 = -0.04253361215288998)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1978 = 0.9999841575438704)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1976 = 0.9999841575438704)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1975 = 0.1215983654449722)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1975 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.863636,0.118442)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.46547 0.412925 0.878394 -> 0.46547 0.412925 0.878395(R,m,v=1,1,0)
=>WM: (13885: S1 ^operator O1978)

   989:    O: O1978 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N989 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N988 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13886: I3 ^predict-no N989)
<=WM: (13873: N988 ^status complete)
<=WM: (13872: I3 ^predict-yes N988)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (13890: I2 ^dir U)
=>WM: (13889: I2 ^reward 1)
=>WM: (13888: I2 ^see 0)
=>WM: (13887: N989 ^status complete)
<=WM: (13876: I2 ^dir R)
<=WM: (13875: I2 ^reward 1)
<=WM: (13874: I2 ^see 1)
=>WM: (13891: I2 ^level-1 R0-root)
<=WM: (13877: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R993 ^value 1 +)
 (R1 ^reward R993 +)
Firing propose*predict-yes
 -->
 (O1979 ^name predict-yes +)
 (S1 ^operator O1979 +)
Firing propose*predict-no
 -->
 (O1980 ^name predict-no +)
 (S1 ^operator O1980 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1978 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1977 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1978 ^name predict-no +)
 (S1 ^operator O1978 +)
Retracting propose*predict-yes
 -->
 (O1977 ^name predict-yes +)
 (S1 ^operator O1977 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R992 ^value 1 +)
 (R1 ^reward R992 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1978 = 0.9999841575438704)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O1977 = -0.04253361215288998)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1977 = 0.1215989443698621)
=>WM: (13899: S1 ^operator O1980 +)
=>WM: (13898: S1 ^operator O1979 +)
=>WM: (13897: I3 ^dir U)
=>WM: (13896: O1980 ^name predict-no)
=>WM: (13895: O1979 ^name predict-yes)
=>WM: (13894: R993 ^value 1)
=>WM: (13893: R1 ^reward R993)
=>WM: (13892: I3 ^see 0)
<=WM: (13883: S1 ^operator O1977 +)
<=WM: (13884: S1 ^operator O1978 +)
<=WM: (13885: S1 ^operator O1978)
<=WM: (13868: I3 ^dir R)
<=WM: (13879: R1 ^reward R992)
<=WM: (13878: I3 ^see 1)
<=WM: (13882: O1978 ^name predict-no)
<=WM: (13881: O1977 ^name predict-yes)
<=WM: (13880: R992 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1979 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1980 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1978 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1977 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999984 0 0.999984 -> 0.999987 0 0.999987(R,m,v=1,0.9375,0.0589286)
=>WM: (13900: S1 ^operator O1980)

   990:    O: O1980 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N990 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N989 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13901: I3 ^predict-no N990)
<=WM: (13887: N989 ^status complete)
<=WM: (13886: I3 ^predict-no N989)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13905: I2 ^dir R)
=>WM: (13904: I2 ^reward 1)
=>WM: (13903: I2 ^see 0)
=>WM: (13902: N990 ^status complete)
<=WM: (13890: I2 ^dir U)
<=WM: (13889: I2 ^reward 1)
<=WM: (13888: I2 ^see 0)
=>WM: (13906: I2 ^level-1 R0-root)
<=WM: (13891: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O1979 = -0.1512366769350551)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R994 ^value 1 +)
 (R1 ^reward R994 +)
Firing propose*predict-yes
 -->
 (O1981 ^name predict-yes +)
 (S1 ^operator O1981 +)
Firing propose*predict-no
 -->
 (O1982 ^name predict-no +)
 (S1 ^operator O1982 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1980 = 0.9999867250014868)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1979 = 0.1215989443698621)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1980 ^name predict-no +)
 (S1 ^operator O1980 +)
Retracting propose*predict-yes
 -->
 (O1979 ^name predict-yes +)
 (S1 ^operator O1979 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R993 ^value 1 +)
 (R1 ^reward R993 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1980 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1979 = 0.)
=>WM: (13913: S1 ^operator O1982 +)
=>WM: (13912: S1 ^operator O1981 +)
=>WM: (13911: I3 ^dir R)
=>WM: (13910: O1982 ^name predict-no)
=>WM: (13909: O1981 ^name predict-yes)
=>WM: (13908: R994 ^value 1)
=>WM: (13907: R1 ^reward R994)
<=WM: (13898: S1 ^operator O1979 +)
<=WM: (13899: S1 ^operator O1980 +)
<=WM: (13900: S1 ^operator O1980)
<=WM: (13897: I3 ^dir U)
<=WM: (13893: R1 ^reward R993)
<=WM: (13896: O1980 ^name predict-no)
<=WM: (13895: O1979 ^name predict-yes)
<=WM: (13894: R993 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O1981 = -0.1512366769350551)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1981 = 0.1215989443698621)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1982 = 0.9999867250014868)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1980 = 0.9999867250014868)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1979 = 0.1215989443698621)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O1979 = -0.1512366769350551)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13914: S1 ^operator O1982)

   991:    O: O1982 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N991 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N990 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13915: I3 ^predict-no N991)
<=WM: (13902: N990 ^status complete)
<=WM: (13901: I3 ^predict-no N990)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/--- Input Phase --- 
=>WM: (13919: I2 ^dir U)
=>WM: (13918: I2 ^reward 1)
=>WM: (13917: I2 ^see 0)
=>WM: (13916: N991 ^status complete)
<=WM: (13905: I2 ^dir R)
<=WM: (13904: I2 ^reward 1)
<=WM: (13903: I2 ^see 0)
=>WM: (13920: I2 ^level-1 R0-root)
<=WM: (13906: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R995 ^value 1 +)
 (R1 ^reward R995 +)
Firing propose*predict-yes
 -->
 (O1983 ^name predict-yes +)
 (S1 ^operator O1983 +)
Firing propose*predict-no
 -->
 (O1984 ^name predict-no +)
 (S1 ^operator O1984 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1982 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1981 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1982 ^name predict-no +)
 (S1 ^operator O1982 +)
Retracting propose*predict-yes
 -->
 (O1981 ^name predict-yes +)
 (S1 ^operator O1981 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R994 ^value 1 +)
 (R1 ^reward R994 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1982 = 0.9999867250014868)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1981 = 0.1215989443698621)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O1981 = -0.1512366769350551)
=>WM: (13927: S1 ^operator O1984 +)
=>WM: (13926: S1 ^operator O1983 +)
=>WM: (13925: I3 ^dir U)
=>WM: (13924: O1984 ^name predict-no)
=>WM: (13923: O1983 ^name predict-yes)
=>WM: (13922: R995 ^value 1)
=>WM: (13921: R1 ^reward R995)
<=WM: (13912: S1 ^operator O1981 +)
<=WM: (13913: S1 ^operator O1982 +)
<=WM: (13914: S1 ^operator O1982)
<=WM: (13911: I3 ^dir R)
<=WM: (13907: R1 ^reward R994)
<=WM: (13910: O1982 ^name predict-no)
<=WM: (13909: O1981 ^name predict-yes)
<=WM: (13908: R994 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1983 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1984 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1982 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1981 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999987 0 0.999987 -> 0.999989 0 0.999989(R,m,v=1,0.937853,0.0586158)
=>WM: (13928: S1 ^operator O1984)

   992:    O: O1984 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N992 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N991 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13929: I3 ^predict-no N992)
<=WM: (13916: N991 ^status complete)
<=WM: (13915: I3 ^predict-no N991)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (13933: I2 ^dir L)
=>WM: (13932: I2 ^reward 1)
=>WM: (13931: I2 ^see 0)
=>WM: (13930: N992 ^status complete)
<=WM: (13919: I2 ^dir U)
<=WM: (13918: I2 ^reward 1)
<=WM: (13917: I2 ^see 0)
=>WM: (13934: I2 ^level-1 R0-root)
<=WM: (13920: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1984 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1983 = 0.6091029227055655)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R996 ^value 1 +)
 (R1 ^reward R996 +)
Firing propose*predict-yes
 -->
 (O1985 ^name predict-yes +)
 (S1 ^operator O1985 +)
Firing propose*predict-no
 -->
 (O1986 ^name predict-no +)
 (S1 ^operator O1986 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1984 = 0.3144988611901438)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1983 = 0.3907675490335307)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1984 ^name predict-no +)
 (S1 ^operator O1984 +)
Retracting propose*predict-yes
 -->
 (O1983 ^name predict-yes +)
 (S1 ^operator O1983 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R995 ^value 1 +)
 (R1 ^reward R995 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1984 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1983 = 0.)
=>WM: (13941: S1 ^operator O1986 +)
=>WM: (13940: S1 ^operator O1985 +)
=>WM: (13939: I3 ^dir L)
=>WM: (13938: O1986 ^name predict-no)
=>WM: (13937: O1985 ^name predict-yes)
=>WM: (13936: R996 ^value 1)
=>WM: (13935: R1 ^reward R996)
<=WM: (13926: S1 ^operator O1983 +)
<=WM: (13927: S1 ^operator O1984 +)
<=WM: (13928: S1 ^operator O1984)
<=WM: (13925: I3 ^dir U)
<=WM: (13921: R1 ^reward R995)
<=WM: (13924: O1984 ^name predict-no)
<=WM: (13923: O1983 ^name predict-yes)
<=WM: (13922: R995 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1985 = 0.6091029227055655)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1985 = 0.3907675490335307)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1986 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1986 = 0.3144988611901438)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1984 = 0.3144988611901438)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1984 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1983 = 0.3907675490335307)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1983 = 0.6091029227055655)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13942: S1 ^operator O1985)

   993:    O: O1985 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N993 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N992 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13943: I3 ^predict-yes N993)
<=WM: (13930: N992 ^status complete)
<=WM: (13929: I3 ^predict-no N992)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (13947: I2 ^dir U)
=>WM: (13946: I2 ^reward 1)
=>WM: (13945: I2 ^see 1)
=>WM: (13944: N993 ^status complete)
<=WM: (13933: I2 ^dir L)
<=WM: (13932: I2 ^reward 1)
<=WM: (13931: I2 ^see 0)
=>WM: (13948: I2 ^level-1 L1-root)
<=WM: (13934: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R997 ^value 1 +)
 (R1 ^reward R997 +)
Firing propose*predict-yes
 -->
 (O1987 ^name predict-yes +)
 (S1 ^operator O1987 +)
Firing propose*predict-no
 -->
 (O1988 ^name predict-no +)
 (S1 ^operator O1988 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1986 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1985 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1986 ^name predict-no +)
 (S1 ^operator O1986 +)
Retracting propose*predict-yes
 -->
 (O1985 ^name predict-yes +)
 (S1 ^operator O1985 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R996 ^value 1 +)
 (R1 ^reward R996 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1986 = 0.3144988611901438)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O1986 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1985 = 0.3907675490335307)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O1985 = 0.6091029227055655)
=>WM: (13956: S1 ^operator O1988 +)
=>WM: (13955: S1 ^operator O1987 +)
=>WM: (13954: I3 ^dir U)
=>WM: (13953: O1988 ^name predict-no)
=>WM: (13952: O1987 ^name predict-yes)
=>WM: (13951: R997 ^value 1)
=>WM: (13950: R1 ^reward R997)
=>WM: (13949: I3 ^see 1)
<=WM: (13940: S1 ^operator O1985 +)
<=WM: (13942: S1 ^operator O1985)
<=WM: (13941: S1 ^operator O1986 +)
<=WM: (13939: I3 ^dir L)
<=WM: (13935: R1 ^reward R996)
<=WM: (13892: I3 ^see 0)
<=WM: (13938: O1986 ^name predict-no)
<=WM: (13937: O1985 ^name predict-yes)
<=WM: (13936: R996 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1987 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1988 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1986 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1985 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472315 -0.0815474 0.390768 -> 0.472324 -0.0815458 0.390778(R,m,v=1,0.94375,0.0534198)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527575 0.0815283 0.609103 -> 0.527585 0.0815301 0.609115(R,m,v=1,1,0)
=>WM: (13957: S1 ^operator O1988)

   994:    O: O1988 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N994 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N993 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13958: I3 ^predict-no N994)
<=WM: (13944: N993 ^status complete)
<=WM: (13943: I3 ^predict-yes N993)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (13962: I2 ^dir L)
=>WM: (13961: I2 ^reward 1)
=>WM: (13960: I2 ^see 0)
=>WM: (13959: N994 ^status complete)
<=WM: (13947: I2 ^dir U)
<=WM: (13946: I2 ^reward 1)
<=WM: (13945: I2 ^see 1)
=>WM: (13963: I2 ^level-1 L1-root)
<=WM: (13948: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1987 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1988 = 0.685533297663165)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R998 ^value 1 +)
 (R1 ^reward R998 +)
Firing propose*predict-yes
 -->
 (O1989 ^name predict-yes +)
 (S1 ^operator O1989 +)
Firing propose*predict-no
 -->
 (O1990 ^name predict-no +)
 (S1 ^operator O1990 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1988 = 0.3144988611901438)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1987 = 0.3907782094907327)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O1988 ^name predict-no +)
 (S1 ^operator O1988 +)
Retracting propose*predict-yes
 -->
 (O1987 ^name predict-yes +)
 (S1 ^operator O1987 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R997 ^value 1 +)
 (R1 ^reward R997 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1988 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1987 = 0.)
=>WM: (13971: S1 ^operator O1990 +)
=>WM: (13970: S1 ^operator O1989 +)
=>WM: (13969: I3 ^dir L)
=>WM: (13968: O1990 ^name predict-no)
=>WM: (13967: O1989 ^name predict-yes)
=>WM: (13966: R998 ^value 1)
=>WM: (13965: R1 ^reward R998)
=>WM: (13964: I3 ^see 0)
<=WM: (13955: S1 ^operator O1987 +)
<=WM: (13956: S1 ^operator O1988 +)
<=WM: (13957: S1 ^operator O1988)
<=WM: (13954: I3 ^dir U)
<=WM: (13950: R1 ^reward R997)
<=WM: (13949: I3 ^see 1)
<=WM: (13953: O1988 ^name predict-no)
<=WM: (13952: O1987 ^name predict-yes)
<=WM: (13951: R997 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1989 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1989 = 0.3907782094907327)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1990 = 0.685533297663165)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1990 = 0.3144988611901438)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1988 = 0.3144988611901438)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1988 = 0.685533297663165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1987 = 0.3907782094907327)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1987 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (13972: S1 ^operator O1990)

   995:    O: O1990 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N995 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N994 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13973: I3 ^predict-no N995)
<=WM: (13959: N994 ^status complete)
<=WM: (13958: I3 ^predict-no N994)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (13977: I2 ^dir L)
=>WM: (13976: I2 ^reward 1)
=>WM: (13975: I2 ^see 0)
=>WM: (13974: N995 ^status complete)
<=WM: (13962: I2 ^dir L)
<=WM: (13961: I2 ^reward 1)
<=WM: (13960: I2 ^see 0)
=>WM: (13978: I2 ^level-1 L0-root)
<=WM: (13963: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1989 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1990 = 0.6854257503571404)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R999 ^value 1 +)
 (R1 ^reward R999 +)
Firing propose*predict-yes
 -->
 (O1991 ^name predict-yes +)
 (S1 ^operator O1991 +)
Firing propose*predict-no
 -->
 (O1992 ^name predict-no +)
 (S1 ^operator O1992 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1990 = 0.3144988611901438)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1989 = 0.3907782094907327)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1990 ^name predict-no +)
 (S1 ^operator O1990 +)
Retracting propose*predict-yes
 -->
 (O1989 ^name predict-yes +)
 (S1 ^operator O1989 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R998 ^value 1 +)
 (R1 ^reward R998 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1990 = 0.3144988611901438)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O1990 = 0.685533297663165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1989 = 0.3907782094907327)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O1989 = -0.2062723012911647)
=>WM: (13984: S1 ^operator O1992 +)
=>WM: (13983: S1 ^operator O1991 +)
=>WM: (13982: O1992 ^name predict-no)
=>WM: (13981: O1991 ^name predict-yes)
=>WM: (13980: R999 ^value 1)
=>WM: (13979: R1 ^reward R999)
<=WM: (13970: S1 ^operator O1989 +)
<=WM: (13971: S1 ^operator O1990 +)
<=WM: (13972: S1 ^operator O1990)
<=WM: (13965: R1 ^reward R998)
<=WM: (13968: O1990 ^name predict-no)
<=WM: (13967: O1989 ^name predict-yes)
<=WM: (13966: R998 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.3907782094907327)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1991 = -0.208713043145708)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 0.3144988611901438)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1992 = 0.6854257503571404)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1990 = 0.3144988611901438)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1990 = 0.6854257503571404)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1989 = 0.3907782094907327)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1989 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478548 -0.164049 0.314499 -> 0.478545 -0.164049 0.314496(R,m,v=1,0.922581,0.0718894)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521482 0.164052 0.685533 -> 0.521479 0.164051 0.68553(R,m,v=1,1,0)
=>WM: (13985: S1 ^operator O1992)

   996:    O: O1992 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N996 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N995 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13986: I3 ^predict-no N996)
<=WM: (13974: N995 ^status complete)
<=WM: (13973: I3 ^predict-no N995)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (13990: I2 ^dir L)
=>WM: (13989: I2 ^reward 1)
=>WM: (13988: I2 ^see 0)
=>WM: (13987: N996 ^status complete)
<=WM: (13977: I2 ^dir L)
<=WM: (13976: I2 ^reward 1)
<=WM: (13975: I2 ^see 0)
=>WM: (13991: I2 ^level-1 L0-root)
<=WM: (13978: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1991 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1992 = 0.6854257503571404)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1000 ^value 1 +)
 (R1 ^reward R1000 +)
Firing propose*predict-yes
 -->
 (O1993 ^name predict-yes +)
 (S1 ^operator O1993 +)
Firing propose*predict-no
 -->
 (O1994 ^name predict-no +)
 (S1 ^operator O1994 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 0.3144962005421928)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.3907782094907327)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1992 ^name predict-no +)
 (S1 ^operator O1992 +)
Retracting propose*predict-yes
 -->
 (O1991 ^name predict-yes +)
 (S1 ^operator O1991 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R999 ^value 1 +)
 (R1 ^reward R999 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1992 = 0.6854257503571404)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 0.3144962005421928)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1991 = -0.208713043145708)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.3907782094907327)
=>WM: (13997: S1 ^operator O1994 +)
=>WM: (13996: S1 ^operator O1993 +)
=>WM: (13995: O1994 ^name predict-no)
=>WM: (13994: O1993 ^name predict-yes)
=>WM: (13993: R1000 ^value 1)
=>WM: (13992: R1 ^reward R1000)
<=WM: (13983: S1 ^operator O1991 +)
<=WM: (13984: S1 ^operator O1992 +)
<=WM: (13985: S1 ^operator O1992)
<=WM: (13979: R1 ^reward R999)
<=WM: (13982: O1992 ^name predict-no)
<=WM: (13981: O1991 ^name predict-yes)
<=WM: (13980: R999 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1993 = 0.3907782094907327)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1993 = -0.208713043145708)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1994 = 0.3144962005421928)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1994 = 0.6854257503571404)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1992 = 0.3144962005421928)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1992 = 0.6854257503571404)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1991 = 0.3907782094907327)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1991 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478545 -0.164049 0.314496 -> 0.478551 -0.164048 0.314503(R,m,v=1,0.923077,0.071464)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521384 0.164042 0.685426 -> 0.521391 0.164042 0.685433(R,m,v=1,1,0)
=>WM: (13998: S1 ^operator O1994)

   997:    O: O1994 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N997 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N996 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (13999: I3 ^predict-no N997)
<=WM: (13987: N996 ^status complete)
<=WM: (13986: I3 ^predict-no N996)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (14003: I2 ^dir U)
=>WM: (14002: I2 ^reward 1)
=>WM: (14001: I2 ^see 0)
=>WM: (14000: N997 ^status complete)
<=WM: (13990: I2 ^dir L)
<=WM: (13989: I2 ^reward 1)
<=WM: (13988: I2 ^see 0)
=>WM: (14004: I2 ^level-1 L0-root)
<=WM: (13991: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1001 ^value 1 +)
 (R1 ^reward R1001 +)
Firing propose*predict-yes
 -->
 (O1995 ^name predict-yes +)
 (S1 ^operator O1995 +)
Firing propose*predict-no
 -->
 (O1996 ^name predict-no +)
 (S1 ^operator O1996 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1994 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1993 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1994 ^name predict-no +)
 (S1 ^operator O1994 +)
Retracting propose*predict-yes
 -->
 (O1993 ^name predict-yes +)
 (S1 ^operator O1993 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1000 ^value 1 +)
 (R1 ^reward R1000 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O1994 = 0.6854332700385593)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O1994 = 0.3145026510346156)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O1993 = -0.208713043145708)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O1993 = 0.3907782094907327)
=>WM: (14011: S1 ^operator O1996 +)
=>WM: (14010: S1 ^operator O1995 +)
=>WM: (14009: I3 ^dir U)
=>WM: (14008: O1996 ^name predict-no)
=>WM: (14007: O1995 ^name predict-yes)
=>WM: (14006: R1001 ^value 1)
=>WM: (14005: R1 ^reward R1001)
<=WM: (13996: S1 ^operator O1993 +)
<=WM: (13997: S1 ^operator O1994 +)
<=WM: (13998: S1 ^operator O1994)
<=WM: (13969: I3 ^dir L)
<=WM: (13992: R1 ^reward R1000)
<=WM: (13995: O1994 ^name predict-no)
<=WM: (13994: O1993 ^name predict-yes)
<=WM: (13993: R1000 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1994 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1993 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478551 -0.164048 0.314503 -> 0.478556 -0.164048 0.314508(R,m,v=1,0.923567,0.0710436)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521391 0.164042 0.685433 -> 0.521396 0.164043 0.685439(R,m,v=1,1,0)
=>WM: (14012: S1 ^operator O1996)

   998:    O: O1996 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N998 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N997 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14013: I3 ^predict-no N998)
<=WM: (14000: N997 ^status complete)
<=WM: (13999: I3 ^predict-no N997)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (14017: I2 ^dir R)
=>WM: (14016: I2 ^reward 1)
=>WM: (14015: I2 ^see 0)
=>WM: (14014: N998 ^status complete)
<=WM: (14003: I2 ^dir U)
<=WM: (14002: I2 ^reward 1)
<=WM: (14001: I2 ^see 0)
=>WM: (14018: I2 ^level-1 L0-root)
<=WM: (14004: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1995 = 0.8783951706845293)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1002 ^value 1 +)
 (R1 ^reward R1002 +)
Firing propose*predict-yes
 -->
 (O1997 ^name predict-yes +)
 (S1 ^operator O1997 +)
Firing propose*predict-no
 -->
 (O1998 ^name predict-no +)
 (S1 ^operator O1998 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1996 = 0.9999888743986174)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1995 = 0.1215989443698621)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1996 ^name predict-no +)
 (S1 ^operator O1996 +)
Retracting propose*predict-yes
 -->
 (O1995 ^name predict-yes +)
 (S1 ^operator O1995 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1001 ^value 1 +)
 (R1 ^reward R1001 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1996 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1995 = 0.)
=>WM: (14025: S1 ^operator O1998 +)
=>WM: (14024: S1 ^operator O1997 +)
=>WM: (14023: I3 ^dir R)
=>WM: (14022: O1998 ^name predict-no)
=>WM: (14021: O1997 ^name predict-yes)
=>WM: (14020: R1002 ^value 1)
=>WM: (14019: R1 ^reward R1002)
<=WM: (14010: S1 ^operator O1995 +)
<=WM: (14011: S1 ^operator O1996 +)
<=WM: (14012: S1 ^operator O1996)
<=WM: (14009: I3 ^dir U)
<=WM: (14005: R1 ^reward R1001)
<=WM: (14008: O1996 ^name predict-no)
<=WM: (14007: O1995 ^name predict-yes)
<=WM: (14006: R1001 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1997 = 0.8783951706845293)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1997 = 0.1215989443698621)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1998 = 0.9999888743986174)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1996 = 0.9999888743986174)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1995 = 0.1215989443698621)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1995 = 0.8783951706845293)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14026: S1 ^operator O1997)

   999:    O: O1997 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N999 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N998 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14027: I3 ^predict-yes N999)
<=WM: (14014: N998 ^status complete)
<=WM: (14013: I3 ^predict-no N998)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14031: I2 ^dir U)
=>WM: (14030: I2 ^reward 1)
=>WM: (14029: I2 ^see 1)
=>WM: (14028: N999 ^status complete)
<=WM: (14017: I2 ^dir R)
<=WM: (14016: I2 ^reward 1)
<=WM: (14015: I2 ^see 0)
=>WM: (14032: I2 ^level-1 R1-root)
<=WM: (14018: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1003 ^value 1 +)
 (R1 ^reward R1003 +)
Firing propose*predict-yes
 -->
 (O1999 ^name predict-yes +)
 (S1 ^operator O1999 +)
Firing propose*predict-no
 -->
 (O2000 ^name predict-no +)
 (S1 ^operator O2000 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1998 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1997 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O1998 ^name predict-no +)
 (S1 ^operator O1998 +)
Retracting propose*predict-yes
 -->
 (O1997 ^name predict-yes +)
 (S1 ^operator O1997 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1002 ^value 1 +)
 (R1 ^reward R1002 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O1998 = 0.9999888743986174)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O1997 = 0.1215989443698621)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O1997 = 0.8783951706845293)
=>WM: (14040: S1 ^operator O2000 +)
=>WM: (14039: S1 ^operator O1999 +)
=>WM: (14038: I3 ^dir U)
=>WM: (14037: O2000 ^name predict-no)
=>WM: (14036: O1999 ^name predict-yes)
=>WM: (14035: R1003 ^value 1)
=>WM: (14034: R1 ^reward R1003)
=>WM: (14033: I3 ^see 1)
<=WM: (14024: S1 ^operator O1997 +)
<=WM: (14026: S1 ^operator O1997)
<=WM: (14025: S1 ^operator O1998 +)
<=WM: (14023: I3 ^dir R)
<=WM: (14019: R1 ^reward R1002)
<=WM: (13964: I3 ^see 0)
<=WM: (14022: O1998 ^name predict-no)
<=WM: (14021: O1997 ^name predict-yes)
<=WM: (14020: R1002 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1999 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2000 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O1998 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1997 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.864407,0.117874)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.46547 0.412925 0.878395 -> 0.465471 0.412925 0.878396(R,m,v=1,1,0)
=>WM: (14041: S1 ^operator O2000)

  1000:    O: O2000 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1000 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N999 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14042: I3 ^predict-no N1000)
<=WM: (14028: N999 ^status complete)
<=WM: (14027: I3 ^predict-yes N999)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/|\-/|--- Input Phase --- 
=>WM: (14046: I2 ^dir U)
=>WM: (14045: I2 ^reward 1)
=>WM: (14044: I2 ^see 0)
=>WM: (14043: N1000 ^status complete)
<=WM: (14031: I2 ^dir U)
<=WM: (14030: I2 ^reward 1)
<=WM: (14029: I2 ^see 1)
=>WM: (14047: I2 ^level-1 R1-root)
<=WM: (14032: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1004 ^value 1 +)
 (R1 ^reward R1004 +)
Firing propose*predict-yes
 -->
 (O2001 ^name predict-yes +)
 (S1 ^operator O2001 +)
Firing propose*predict-no
 -->
 (O2002 ^name predict-no +)
 (S1 ^operator O2002 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2000 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1999 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2000 ^name predict-no +)
 (S1 ^operator O2000 +)
Retracting propose*predict-yes
 -->
 (O1999 ^name predict-yes +)
 (S1 ^operator O1999 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1003 ^value 1 +)
 (R1 ^reward R1003 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2000 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1999 = 0.)
=>WM: (14054: S1 ^operator O2002 +)
=>WM: (14053: S1 ^operator O2001 +)
=>WM: (14052: O2002 ^name predict-no)
=>WM: (14051: O2001 ^name predict-yes)
=>WM: (14050: R1004 ^value 1)
=>WM: (14049: R1 ^reward R1004)
=>WM: (14048: I3 ^see 0)
<=WM: (14039: S1 ^operator O1999 +)
<=WM: (14040: S1 ^operator O2000 +)
<=WM: (14041: S1 ^operator O2000)
<=WM: (14034: R1 ^reward R1003)
<=WM: (14033: I3 ^see 1)
<=WM: (14037: O2000 ^name predict-no)
<=WM: (14036: O1999 ^name predict-yes)
<=WM: (14035: R1003 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2001 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2002 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2000 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O1999 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14055: S1 ^operator O2002)

  1001:    O: O2002 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1001 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1000 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14056: I3 ^predict-no N1001)
<=WM: (14043: N1000 ^status complete)
<=WM: (14042: I3 ^predict-no N1000)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\--- Input Phase --- 
=>WM: (14060: I2 ^dir U)
=>WM: (14059: I2 ^reward 1)
=>WM: (14058: I2 ^see 0)
=>WM: (14057: N1001 ^status complete)
<=WM: (14046: I2 ^dir U)
<=WM: (14045: I2 ^reward 1)
<=WM: (14044: I2 ^see 0)
=>WM: (14061: I2 ^level-1 R1-root)
<=WM: (14047: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1005 ^value 1 +)
 (R1 ^reward R1005 +)
Firing propose*predict-yes
 -->
 (O2003 ^name predict-yes +)
 (S1 ^operator O2003 +)
Firing propose*predict-no
 -->
 (O2004 ^name predict-no +)
 (S1 ^operator O2004 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2002 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2001 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2002 ^name predict-no +)
 (S1 ^operator O2002 +)
Retracting propose*predict-yes
 -->
 (O2001 ^name predict-yes +)
 (S1 ^operator O2001 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1004 ^value 1 +)
 (R1 ^reward R1004 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2002 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2001 = 0.)
=>WM: (14067: S1 ^operator O2004 +)
=>WM: (14066: S1 ^operator O2003 +)
=>WM: (14065: O2004 ^name predict-no)
=>WM: (14064: O2003 ^name predict-yes)
=>WM: (14063: R1005 ^value 1)
=>WM: (14062: R1 ^reward R1005)
<=WM: (14053: S1 ^operator O2001 +)
<=WM: (14054: S1 ^operator O2002 +)
<=WM: (14055: S1 ^operator O2002)
<=WM: (14049: R1 ^reward R1004)
<=WM: (14052: O2002 ^name predict-no)
<=WM: (14051: O2001 ^name predict-yes)
<=WM: (14050: R1004 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2002 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2001 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14068: S1 ^operator O2004)

  1002:    O: O2004 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1002 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1001 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14069: I3 ^predict-no N1002)
<=WM: (14057: N1001 ^status complete)
<=WM: (14056: I3 ^predict-no N1001)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (14073: I2 ^dir R)
=>WM: (14072: I2 ^reward 1)
=>WM: (14071: I2 ^see 0)
=>WM: (14070: N1002 ^status complete)
<=WM: (14060: I2 ^dir U)
<=WM: (14059: I2 ^reward 1)
<=WM: (14058: I2 ^see 0)
=>WM: (14074: I2 ^level-1 R1-root)
<=WM: (14061: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2003 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1006 ^value 1 +)
 (R1 ^reward R1006 +)
Firing propose*predict-yes
 -->
 (O2005 ^name predict-yes +)
 (S1 ^operator O2005 +)
Firing propose*predict-no
 -->
 (O2006 ^name predict-no +)
 (S1 ^operator O2006 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2004 = 0.9999888743986174)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2003 = 0.1215994207949702)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2004 ^name predict-no +)
 (S1 ^operator O2004 +)
Retracting propose*predict-yes
 -->
 (O2003 ^name predict-yes +)
 (S1 ^operator O2003 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1005 ^value 1 +)
 (R1 ^reward R1005 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2004 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2003 = 0.)
=>WM: (14081: S1 ^operator O2006 +)
=>WM: (14080: S1 ^operator O2005 +)
=>WM: (14079: I3 ^dir R)
=>WM: (14078: O2006 ^name predict-no)
=>WM: (14077: O2005 ^name predict-yes)
=>WM: (14076: R1006 ^value 1)
=>WM: (14075: R1 ^reward R1006)
<=WM: (14066: S1 ^operator O2003 +)
<=WM: (14067: S1 ^operator O2004 +)
<=WM: (14068: S1 ^operator O2004)
<=WM: (14038: I3 ^dir U)
<=WM: (14062: R1 ^reward R1005)
<=WM: (14065: O2004 ^name predict-no)
<=WM: (14064: O2003 ^name predict-yes)
<=WM: (14063: R1005 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2005 = -0.04253361215288998)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2005 = 0.1215994207949702)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2006 = 0.9999888743986174)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2004 = 0.9999888743986174)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2003 = 0.1215994207949702)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2003 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14082: S1 ^operator O2006)

  1003:    O: O2006 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1003 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1002 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14083: I3 ^predict-no N1003)
<=WM: (14070: N1002 ^status complete)
<=WM: (14069: I3 ^predict-no N1002)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14087: I2 ^dir R)
=>WM: (14086: I2 ^reward 1)
=>WM: (14085: I2 ^see 0)
=>WM: (14084: N1003 ^status complete)
<=WM: (14073: I2 ^dir R)
<=WM: (14072: I2 ^reward 1)
<=WM: (14071: I2 ^see 0)
=>WM: (14088: I2 ^level-1 R0-root)
<=WM: (14074: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2005 = -0.1512366769350551)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1007 ^value 1 +)
 (R1 ^reward R1007 +)
Firing propose*predict-yes
 -->
 (O2007 ^name predict-yes +)
 (S1 ^operator O2007 +)
Firing propose*predict-no
 -->
 (O2008 ^name predict-no +)
 (S1 ^operator O2008 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2006 = 0.9999888743986174)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2005 = 0.1215994207949702)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2006 ^name predict-no +)
 (S1 ^operator O2006 +)
Retracting propose*predict-yes
 -->
 (O2005 ^name predict-yes +)
 (S1 ^operator O2005 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1006 ^value 1 +)
 (R1 ^reward R1006 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2006 = 0.9999888743986174)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2005 = 0.1215994207949702)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2005 = -0.04253361215288998)
=>WM: (14094: S1 ^operator O2008 +)
=>WM: (14093: S1 ^operator O2007 +)
=>WM: (14092: O2008 ^name predict-no)
=>WM: (14091: O2007 ^name predict-yes)
=>WM: (14090: R1007 ^value 1)
=>WM: (14089: R1 ^reward R1007)
<=WM: (14080: S1 ^operator O2005 +)
<=WM: (14081: S1 ^operator O2006 +)
<=WM: (14082: S1 ^operator O2006)
<=WM: (14075: R1 ^reward R1006)
<=WM: (14078: O2006 ^name predict-no)
<=WM: (14077: O2005 ^name predict-yes)
<=WM: (14076: R1006 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.1215994207949702)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2007 = -0.1512366769350551)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.9999888743986174)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2006 = 0.9999888743986174)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2005 = 0.1215994207949702)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2005 = -0.1512366769350551)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999989 0 0.999989 -> 0.999991 0 0.999991(R,m,v=1,0.938202,0.0583064)
=>WM: (14095: S1 ^operator O2008)

  1004:    O: O2008 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1004 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1003 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14096: I3 ^predict-no N1004)
<=WM: (14084: N1003 ^status complete)
<=WM: (14083: I3 ^predict-no N1003)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14100: I2 ^dir U)
=>WM: (14099: I2 ^reward 1)
=>WM: (14098: I2 ^see 0)
=>WM: (14097: N1004 ^status complete)
<=WM: (14087: I2 ^dir R)
<=WM: (14086: I2 ^reward 1)
<=WM: (14085: I2 ^see 0)
=>WM: (14101: I2 ^level-1 R0-root)
<=WM: (14088: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1008 ^value 1 +)
 (R1 ^reward R1008 +)
Firing propose*predict-yes
 -->
 (O2009 ^name predict-yes +)
 (S1 ^operator O2009 +)
Firing propose*predict-no
 -->
 (O2010 ^name predict-no +)
 (S1 ^operator O2010 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2008 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2007 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2008 ^name predict-no +)
 (S1 ^operator O2008 +)
Retracting propose*predict-yes
 -->
 (O2007 ^name predict-yes +)
 (S1 ^operator O2007 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1007 ^value 1 +)
 (R1 ^reward R1007 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2008 = 0.9999906741383352)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2007 = -0.1512366769350551)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2007 = 0.1215994207949702)
=>WM: (14108: S1 ^operator O2010 +)
=>WM: (14107: S1 ^operator O2009 +)
=>WM: (14106: I3 ^dir U)
=>WM: (14105: O2010 ^name predict-no)
=>WM: (14104: O2009 ^name predict-yes)
=>WM: (14103: R1008 ^value 1)
=>WM: (14102: R1 ^reward R1008)
<=WM: (14093: S1 ^operator O2007 +)
<=WM: (14094: S1 ^operator O2008 +)
<=WM: (14095: S1 ^operator O2008)
<=WM: (14079: I3 ^dir R)
<=WM: (14089: R1 ^reward R1007)
<=WM: (14092: O2008 ^name predict-no)
<=WM: (14091: O2007 ^name predict-yes)
<=WM: (14090: R1007 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2009 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2010 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2008 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2007 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999991 0 0.999991 -> 0.999992 0 0.999992(R,m,v=1,0.938547,0.0580001)
=>WM: (14109: S1 ^operator O2010)

  1005:    O: O2010 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1005 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1004 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14110: I3 ^predict-no N1005)
<=WM: (14097: N1004 ^status complete)
<=WM: (14096: I3 ^predict-no N1004)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14114: I2 ^dir U)
=>WM: (14113: I2 ^reward 1)
=>WM: (14112: I2 ^see 0)
=>WM: (14111: N1005 ^status complete)
<=WM: (14100: I2 ^dir U)
<=WM: (14099: I2 ^reward 1)
<=WM: (14098: I2 ^see 0)
=>WM: (14115: I2 ^level-1 R0-root)
<=WM: (14101: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1009 ^value 1 +)
 (R1 ^reward R1009 +)
Firing propose*predict-yes
 -->
 (O2011 ^name predict-yes +)
 (S1 ^operator O2011 +)
Firing propose*predict-no
 -->
 (O2012 ^name predict-no +)
 (S1 ^operator O2012 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2010 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2009 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2010 ^name predict-no +)
 (S1 ^operator O2010 +)
Retracting propose*predict-yes
 -->
 (O2009 ^name predict-yes +)
 (S1 ^operator O2009 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1008 ^value 1 +)
 (R1 ^reward R1008 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2010 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2009 = 0.)
=>WM: (14121: S1 ^operator O2012 +)
=>WM: (14120: S1 ^operator O2011 +)
=>WM: (14119: O2012 ^name predict-no)
=>WM: (14118: O2011 ^name predict-yes)
=>WM: (14117: R1009 ^value 1)
=>WM: (14116: R1 ^reward R1009)
<=WM: (14107: S1 ^operator O2009 +)
<=WM: (14108: S1 ^operator O2010 +)
<=WM: (14109: S1 ^operator O2010)
<=WM: (14102: R1 ^reward R1008)
<=WM: (14105: O2010 ^name predict-no)
<=WM: (14104: O2009 ^name predict-yes)
<=WM: (14103: R1008 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2011 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2012 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2010 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2009 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14122: S1 ^operator O2012)

  1006:    O: O2012 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1006 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1005 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14123: I3 ^predict-no N1006)
<=WM: (14111: N1005 ^status complete)
<=WM: (14110: I3 ^predict-no N1005)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14127: I2 ^dir L)
=>WM: (14126: I2 ^reward 1)
=>WM: (14125: I2 ^see 0)
=>WM: (14124: N1006 ^status complete)
<=WM: (14114: I2 ^dir U)
<=WM: (14113: I2 ^reward 1)
<=WM: (14112: I2 ^see 0)
=>WM: (14128: I2 ^level-1 R0-root)
<=WM: (14115: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2012 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2011 = 0.6091150129894595)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1010 ^value 1 +)
 (R1 ^reward R1010 +)
Firing propose*predict-yes
 -->
 (O2013 ^name predict-yes +)
 (S1 ^operator O2013 +)
Firing propose*predict-no
 -->
 (O2014 ^name predict-no +)
 (S1 ^operator O2014 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2012 = 0.3145079413521559)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2011 = 0.3907782094907327)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2012 ^name predict-no +)
 (S1 ^operator O2012 +)
Retracting propose*predict-yes
 -->
 (O2011 ^name predict-yes +)
 (S1 ^operator O2011 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1009 ^value 1 +)
 (R1 ^reward R1009 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2012 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2011 = 0.)
=>WM: (14135: S1 ^operator O2014 +)
=>WM: (14134: S1 ^operator O2013 +)
=>WM: (14133: I3 ^dir L)
=>WM: (14132: O2014 ^name predict-no)
=>WM: (14131: O2013 ^name predict-yes)
=>WM: (14130: R1010 ^value 1)
=>WM: (14129: R1 ^reward R1010)
<=WM: (14120: S1 ^operator O2011 +)
<=WM: (14121: S1 ^operator O2012 +)
<=WM: (14122: S1 ^operator O2012)
<=WM: (14106: I3 ^dir U)
<=WM: (14116: R1 ^reward R1009)
<=WM: (14119: O2012 ^name predict-no)
<=WM: (14118: O2011 ^name predict-yes)
<=WM: (14117: R1009 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2013 = 0.6091150129894595)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2013 = 0.3907782094907327)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2014 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2014 = 0.3145079413521559)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2012 = 0.3145079413521559)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2012 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2011 = 0.3907782094907327)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2011 = 0.6091150129894595)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14136: S1 ^operator O2013)

  1007:    O: O2013 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1007 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1006 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14137: I3 ^predict-yes N1007)
<=WM: (14124: N1006 ^status complete)
<=WM: (14123: I3 ^predict-no N1006)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14141: I2 ^dir R)
=>WM: (14140: I2 ^reward 1)
=>WM: (14139: I2 ^see 1)
=>WM: (14138: N1007 ^status complete)
<=WM: (14127: I2 ^dir L)
<=WM: (14126: I2 ^reward 1)
<=WM: (14125: I2 ^see 0)
=>WM: (14142: I2 ^level-1 L1-root)
<=WM: (14128: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2013 = 0.8784140715701729)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1011 ^value 1 +)
 (R1 ^reward R1011 +)
Firing propose*predict-yes
 -->
 (O2015 ^name predict-yes +)
 (S1 ^operator O2015 +)
Firing propose*predict-no
 -->
 (O2016 ^name predict-no +)
 (S1 ^operator O2016 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2014 = 0.9999921813761182)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2013 = 0.1215994207949702)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2014 ^name predict-no +)
 (S1 ^operator O2014 +)
Retracting propose*predict-yes
 -->
 (O2013 ^name predict-yes +)
 (S1 ^operator O2013 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1010 ^value 1 +)
 (R1 ^reward R1010 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2014 = 0.3145079413521559)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2014 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2013 = 0.3907782094907327)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2013 = 0.6091150129894595)
=>WM: (14150: S1 ^operator O2016 +)
=>WM: (14149: S1 ^operator O2015 +)
=>WM: (14148: I3 ^dir R)
=>WM: (14147: O2016 ^name predict-no)
=>WM: (14146: O2015 ^name predict-yes)
=>WM: (14145: R1011 ^value 1)
=>WM: (14144: R1 ^reward R1011)
=>WM: (14143: I3 ^see 1)
<=WM: (14134: S1 ^operator O2013 +)
<=WM: (14136: S1 ^operator O2013)
<=WM: (14135: S1 ^operator O2014 +)
<=WM: (14133: I3 ^dir L)
<=WM: (14129: R1 ^reward R1010)
<=WM: (14048: I3 ^see 0)
<=WM: (14132: O2014 ^name predict-no)
<=WM: (14131: O2013 ^name predict-yes)
<=WM: (14130: R1010 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2015 = 0.1215994207949702)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2015 = 0.8784140715701729)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2016 = 0.9999921813761182)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2014 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2013 = 0.1215994207949702)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2013 = 0.8784140715701729)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472324 -0.0815458 0.390778 -> 0.472332 -0.0815445 0.390787(R,m,v=1,0.944099,0.0531056)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527585 0.0815301 0.609115 -> 0.527593 0.0815315 0.609125(R,m,v=1,1,0)
=>WM: (14151: S1 ^operator O2015)

  1008:    O: O2015 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1008 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1007 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14152: I3 ^predict-yes N1008)
<=WM: (14138: N1007 ^status complete)
<=WM: (14137: I3 ^predict-yes N1007)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (14156: I2 ^dir L)
=>WM: (14155: I2 ^reward 1)
=>WM: (14154: I2 ^see 1)
=>WM: (14153: N1008 ^status complete)
<=WM: (14141: I2 ^dir R)
<=WM: (14140: I2 ^reward 1)
<=WM: (14139: I2 ^see 1)
=>WM: (14157: I2 ^level-1 R1-root)
<=WM: (14142: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2016 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2015 = 0.6093091841289463)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1012 ^value 1 +)
 (R1 ^reward R1012 +)
Firing propose*predict-yes
 -->
 (O2017 ^name predict-yes +)
 (S1 ^operator O2017 +)
Firing propose*predict-no
 -->
 (O2018 ^name predict-no +)
 (S1 ^operator O2018 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2016 = 0.3145079413521559)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2015 = 0.3907869885089824)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2016 ^name predict-no +)
 (S1 ^operator O2016 +)
Retracting propose*predict-yes
 -->
 (O2015 ^name predict-yes +)
 (S1 ^operator O2015 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1011 ^value 1 +)
 (R1 ^reward R1011 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2016 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2015 = 0.8784140715701729)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2015 = 0.1215994207949702)
=>WM: (14164: S1 ^operator O2018 +)
=>WM: (14163: S1 ^operator O2017 +)
=>WM: (14162: I3 ^dir L)
=>WM: (14161: O2018 ^name predict-no)
=>WM: (14160: O2017 ^name predict-yes)
=>WM: (14159: R1012 ^value 1)
=>WM: (14158: R1 ^reward R1012)
<=WM: (14149: S1 ^operator O2015 +)
<=WM: (14151: S1 ^operator O2015)
<=WM: (14150: S1 ^operator O2016 +)
<=WM: (14148: I3 ^dir R)
<=WM: (14144: R1 ^reward R1011)
<=WM: (14147: O2016 ^name predict-no)
<=WM: (14146: O2015 ^name predict-yes)
<=WM: (14145: R1011 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2017 = 0.3907869885089824)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2017 = 0.6093091841289463)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2018 = 0.3145079413521559)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2018 = -0.168718511744511)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2016 = 0.3145079413521559)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2016 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2015 = 0.3907869885089824)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2015 = 0.6093091841289463)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.865169,0.117311)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465486 0.412928 0.878414 -> 0.465485 0.412928 0.878413(R,m,v=1,1,0)
=>WM: (14165: S1 ^operator O2017)

  1009:    O: O2017 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1009 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1008 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14166: I3 ^predict-yes N1009)
<=WM: (14153: N1008 ^status complete)
<=WM: (14152: I3 ^predict-yes N1008)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14170: I2 ^dir L)
=>WM: (14169: I2 ^reward 1)
=>WM: (14168: I2 ^see 1)
=>WM: (14167: N1009 ^status complete)
<=WM: (14156: I2 ^dir L)
<=WM: (14155: I2 ^reward 1)
<=WM: (14154: I2 ^see 1)
=>WM: (14171: I2 ^level-1 L1-root)
<=WM: (14157: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2017 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2018 = 0.685530273786795)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1013 ^value 1 +)
 (R1 ^reward R1013 +)
Firing propose*predict-yes
 -->
 (O2019 ^name predict-yes +)
 (S1 ^operator O2019 +)
Firing propose*predict-no
 -->
 (O2020 ^name predict-no +)
 (S1 ^operator O2020 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2018 = 0.3145079413521559)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2017 = 0.3907869885089824)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2018 ^name predict-no +)
 (S1 ^operator O2018 +)
Retracting propose*predict-yes
 -->
 (O2017 ^name predict-yes +)
 (S1 ^operator O2017 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1012 ^value 1 +)
 (R1 ^reward R1012 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2018 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2018 = 0.3145079413521559)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2017 = 0.6093091841289463)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2017 = 0.3907869885089824)
=>WM: (14177: S1 ^operator O2020 +)
=>WM: (14176: S1 ^operator O2019 +)
=>WM: (14175: O2020 ^name predict-no)
=>WM: (14174: O2019 ^name predict-yes)
=>WM: (14173: R1013 ^value 1)
=>WM: (14172: R1 ^reward R1013)
<=WM: (14163: S1 ^operator O2017 +)
<=WM: (14165: S1 ^operator O2017)
<=WM: (14164: S1 ^operator O2018 +)
<=WM: (14158: R1 ^reward R1012)
<=WM: (14161: O2018 ^name predict-no)
<=WM: (14160: O2017 ^name predict-yes)
<=WM: (14159: R1012 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2019 = 0.3907869885089824)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2019 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2020 = 0.3145079413521559)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2020 = 0.685530273786795)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2018 = 0.3145079413521559)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2018 = 0.685530273786795)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2017 = 0.3907869885089824)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2017 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472332 -0.0815445 0.390787 -> 0.472325 -0.0815457 0.390779(R,m,v=1,0.944444,0.052795)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.52775 0.0815588 0.609309 -> 0.527743 0.0815574 0.6093(R,m,v=1,1,0)
=>WM: (14178: S1 ^operator O2020)

  1010:    O: O2020 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1010 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1009 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14179: I3 ^predict-no N1010)
<=WM: (14167: N1009 ^status complete)
<=WM: (14166: I3 ^predict-yes N1009)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14183: I2 ^dir R)
=>WM: (14182: I2 ^reward 1)
=>WM: (14181: I2 ^see 0)
=>WM: (14180: N1010 ^status complete)
<=WM: (14170: I2 ^dir L)
<=WM: (14169: I2 ^reward 1)
<=WM: (14168: I2 ^see 1)
=>WM: (14184: I2 ^level-1 L0-root)
<=WM: (14171: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2019 = 0.8783957298051434)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1014 ^value 1 +)
 (R1 ^reward R1014 +)
Firing propose*predict-yes
 -->
 (O2021 ^name predict-yes +)
 (S1 ^operator O2021 +)
Firing propose*predict-no
 -->
 (O2022 ^name predict-no +)
 (S1 ^operator O2022 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2020 = 0.9999921813761182)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2019 = 0.121598329494617)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2020 ^name predict-no +)
 (S1 ^operator O2020 +)
Retracting propose*predict-yes
 -->
 (O2019 ^name predict-yes +)
 (S1 ^operator O2019 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1013 ^value 1 +)
 (R1 ^reward R1013 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2020 = 0.685530273786795)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2020 = 0.3145079413521559)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2019 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2019 = 0.3907790894440122)
=>WM: (14192: S1 ^operator O2022 +)
=>WM: (14191: S1 ^operator O2021 +)
=>WM: (14190: I3 ^dir R)
=>WM: (14189: O2022 ^name predict-no)
=>WM: (14188: O2021 ^name predict-yes)
=>WM: (14187: R1014 ^value 1)
=>WM: (14186: R1 ^reward R1014)
=>WM: (14185: I3 ^see 0)
<=WM: (14176: S1 ^operator O2019 +)
<=WM: (14177: S1 ^operator O2020 +)
<=WM: (14178: S1 ^operator O2020)
<=WM: (14162: I3 ^dir L)
<=WM: (14172: R1 ^reward R1013)
<=WM: (14143: I3 ^see 1)
<=WM: (14175: O2020 ^name predict-no)
<=WM: (14174: O2019 ^name predict-yes)
<=WM: (14173: R1013 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2021 = 0.121598329494617)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2021 = 0.8783957298051434)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2022 = 0.9999921813761182)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2020 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2019 = 0.121598329494617)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2019 = 0.8783957298051434)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478556 -0.164048 0.314508 -> 0.478553 -0.164048 0.314505(R,m,v=1,0.924051,0.0706281)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521479 0.164051 0.68553 -> 0.521476 0.164051 0.685527(R,m,v=1,1,0)
=>WM: (14193: S1 ^operator O2021)

  1011:    O: O2021 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1011 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1010 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14194: I3 ^predict-yes N1011)
<=WM: (14180: N1010 ^status complete)
<=WM: (14179: I3 ^predict-no N1010)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/--- Input Phase --- 
=>WM: (14198: I2 ^dir U)
=>WM: (14197: I2 ^reward 1)
=>WM: (14196: I2 ^see 1)
=>WM: (14195: N1011 ^status complete)
<=WM: (14183: I2 ^dir R)
<=WM: (14182: I2 ^reward 1)
<=WM: (14181: I2 ^see 0)
=>WM: (14199: I2 ^level-1 R1-root)
<=WM: (14184: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1015 ^value 1 +)
 (R1 ^reward R1015 +)
Firing propose*predict-yes
 -->
 (O2023 ^name predict-yes +)
 (S1 ^operator O2023 +)
Firing propose*predict-no
 -->
 (O2024 ^name predict-no +)
 (S1 ^operator O2024 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2022 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2021 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2022 ^name predict-no +)
 (S1 ^operator O2022 +)
Retracting propose*predict-yes
 -->
 (O2021 ^name predict-yes +)
 (S1 ^operator O2021 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1014 ^value 1 +)
 (R1 ^reward R1014 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2022 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2021 = 0.8783957298051434)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2021 = 0.121598329494617)
=>WM: (14207: S1 ^operator O2024 +)
=>WM: (14206: S1 ^operator O2023 +)
=>WM: (14205: I3 ^dir U)
=>WM: (14204: O2024 ^name predict-no)
=>WM: (14203: O2023 ^name predict-yes)
=>WM: (14202: R1015 ^value 1)
=>WM: (14201: R1 ^reward R1015)
=>WM: (14200: I3 ^see 1)
<=WM: (14191: S1 ^operator O2021 +)
<=WM: (14193: S1 ^operator O2021)
<=WM: (14192: S1 ^operator O2022 +)
<=WM: (14190: I3 ^dir R)
<=WM: (14186: R1 ^reward R1014)
<=WM: (14185: I3 ^see 0)
<=WM: (14189: O2022 ^name predict-no)
<=WM: (14188: O2021 ^name predict-yes)
<=WM: (14187: R1014 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2023 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2024 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2022 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2021 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.865922,0.116753)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465471 0.412925 0.878396 -> 0.465471 0.412925 0.878396(R,m,v=1,1,0)
=>WM: (14208: S1 ^operator O2024)

  1012:    O: O2024 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1012 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1011 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14209: I3 ^predict-no N1012)
<=WM: (14195: N1011 ^status complete)
<=WM: (14194: I3 ^predict-yes N1011)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (14213: I2 ^dir U)
=>WM: (14212: I2 ^reward 1)
=>WM: (14211: I2 ^see 0)
=>WM: (14210: N1012 ^status complete)
<=WM: (14198: I2 ^dir U)
<=WM: (14197: I2 ^reward 1)
<=WM: (14196: I2 ^see 1)
=>WM: (14214: I2 ^level-1 R1-root)
<=WM: (14199: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1016 ^value 1 +)
 (R1 ^reward R1016 +)
Firing propose*predict-yes
 -->
 (O2025 ^name predict-yes +)
 (S1 ^operator O2025 +)
Firing propose*predict-no
 -->
 (O2026 ^name predict-no +)
 (S1 ^operator O2026 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2024 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2023 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2024 ^name predict-no +)
 (S1 ^operator O2024 +)
Retracting propose*predict-yes
 -->
 (O2023 ^name predict-yes +)
 (S1 ^operator O2023 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1015 ^value 1 +)
 (R1 ^reward R1015 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2024 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2023 = 0.)
=>WM: (14221: S1 ^operator O2026 +)
=>WM: (14220: S1 ^operator O2025 +)
=>WM: (14219: O2026 ^name predict-no)
=>WM: (14218: O2025 ^name predict-yes)
=>WM: (14217: R1016 ^value 1)
=>WM: (14216: R1 ^reward R1016)
=>WM: (14215: I3 ^see 0)
<=WM: (14206: S1 ^operator O2023 +)
<=WM: (14207: S1 ^operator O2024 +)
<=WM: (14208: S1 ^operator O2024)
<=WM: (14201: R1 ^reward R1015)
<=WM: (14200: I3 ^see 1)
<=WM: (14204: O2024 ^name predict-no)
<=WM: (14203: O2023 ^name predict-yes)
<=WM: (14202: R1015 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2025 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2026 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2024 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2023 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14222: S1 ^operator O2026)

  1013:    O: O2026 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1013 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1012 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14223: I3 ^predict-no N1013)
<=WM: (14210: N1012 ^status complete)
<=WM: (14209: I3 ^predict-no N1012)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14227: I2 ^dir U)
=>WM: (14226: I2 ^reward 1)
=>WM: (14225: I2 ^see 0)
=>WM: (14224: N1013 ^status complete)
<=WM: (14213: I2 ^dir U)
<=WM: (14212: I2 ^reward 1)
<=WM: (14211: I2 ^see 0)
=>WM: (14228: I2 ^level-1 R1-root)
<=WM: (14214: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1017 ^value 1 +)
 (R1 ^reward R1017 +)
Firing propose*predict-yes
 -->
 (O2027 ^name predict-yes +)
 (S1 ^operator O2027 +)
Firing propose*predict-no
 -->
 (O2028 ^name predict-no +)
 (S1 ^operator O2028 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2026 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2025 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2026 ^name predict-no +)
 (S1 ^operator O2026 +)
Retracting propose*predict-yes
 -->
 (O2025 ^name predict-yes +)
 (S1 ^operator O2025 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1016 ^value 1 +)
 (R1 ^reward R1016 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2026 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2025 = 0.)
=>WM: (14234: S1 ^operator O2028 +)
=>WM: (14233: S1 ^operator O2027 +)
=>WM: (14232: O2028 ^name predict-no)
=>WM: (14231: O2027 ^name predict-yes)
=>WM: (14230: R1017 ^value 1)
=>WM: (14229: R1 ^reward R1017)
<=WM: (14220: S1 ^operator O2025 +)
<=WM: (14221: S1 ^operator O2026 +)
<=WM: (14222: S1 ^operator O2026)
<=WM: (14216: R1 ^reward R1016)
<=WM: (14219: O2026 ^name predict-no)
<=WM: (14218: O2025 ^name predict-yes)
<=WM: (14217: R1016 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2027 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2028 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2026 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2025 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14235: S1 ^operator O2028)

  1014:    O: O2028 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1014 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1013 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14236: I3 ^predict-no N1014)
<=WM: (14224: N1013 ^status complete)
<=WM: (14223: I3 ^predict-no N1013)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (14240: I2 ^dir L)
=>WM: (14239: I2 ^reward 1)
=>WM: (14238: I2 ^see 0)
=>WM: (14237: N1014 ^status complete)
<=WM: (14227: I2 ^dir U)
<=WM: (14226: I2 ^reward 1)
<=WM: (14225: I2 ^see 0)
=>WM: (14241: I2 ^level-1 R1-root)
<=WM: (14228: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2028 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2027 = 0.6093000948769637)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1018 ^value 1 +)
 (R1 ^reward R1018 +)
Firing propose*predict-yes
 -->
 (O2029 ^name predict-yes +)
 (S1 ^operator O2029 +)
Firing propose*predict-no
 -->
 (O2030 ^name predict-no +)
 (S1 ^operator O2030 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2028 = 0.3145047896375236)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2027 = 0.3907790894440122)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2028 ^name predict-no +)
 (S1 ^operator O2028 +)
Retracting propose*predict-yes
 -->
 (O2027 ^name predict-yes +)
 (S1 ^operator O2027 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1017 ^value 1 +)
 (R1 ^reward R1017 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2028 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2027 = 0.)
=>WM: (14248: S1 ^operator O2030 +)
=>WM: (14247: S1 ^operator O2029 +)
=>WM: (14246: I3 ^dir L)
=>WM: (14245: O2030 ^name predict-no)
=>WM: (14244: O2029 ^name predict-yes)
=>WM: (14243: R1018 ^value 1)
=>WM: (14242: R1 ^reward R1018)
<=WM: (14233: S1 ^operator O2027 +)
<=WM: (14234: S1 ^operator O2028 +)
<=WM: (14235: S1 ^operator O2028)
<=WM: (14205: I3 ^dir U)
<=WM: (14229: R1 ^reward R1017)
<=WM: (14232: O2028 ^name predict-no)
<=WM: (14231: O2027 ^name predict-yes)
<=WM: (14230: R1017 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2029 = 0.6093000948769637)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2029 = 0.3907790894440122)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2030 = -0.168718511744511)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2030 = 0.3145047896375236)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2028 = 0.3145047896375236)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2028 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2027 = 0.3907790894440122)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2027 = 0.6093000948769637)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14249: S1 ^operator O2029)

  1015:    O: O2029 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1015 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1014 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14250: I3 ^predict-yes N1015)
<=WM: (14237: N1014 ^status complete)
<=WM: (14236: I3 ^predict-no N1014)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (14254: I2 ^dir U)
=>WM: (14253: I2 ^reward 1)
=>WM: (14252: I2 ^see 1)
=>WM: (14251: N1015 ^status complete)
<=WM: (14240: I2 ^dir L)
<=WM: (14239: I2 ^reward 1)
<=WM: (14238: I2 ^see 0)
=>WM: (14255: I2 ^level-1 L1-root)
<=WM: (14241: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1019 ^value 1 +)
 (R1 ^reward R1019 +)
Firing propose*predict-yes
 -->
 (O2031 ^name predict-yes +)
 (S1 ^operator O2031 +)
Firing propose*predict-no
 -->
 (O2032 ^name predict-no +)
 (S1 ^operator O2032 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2030 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2029 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2030 ^name predict-no +)
 (S1 ^operator O2030 +)
Retracting propose*predict-yes
 -->
 (O2029 ^name predict-yes +)
 (S1 ^operator O2029 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1018 ^value 1 +)
 (R1 ^reward R1018 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2030 = 0.3145047896375236)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2030 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2029 = 0.3907790894440122)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2029 = 0.6093000948769637)
=>WM: (14263: S1 ^operator O2032 +)
=>WM: (14262: S1 ^operator O2031 +)
=>WM: (14261: I3 ^dir U)
=>WM: (14260: O2032 ^name predict-no)
=>WM: (14259: O2031 ^name predict-yes)
=>WM: (14258: R1019 ^value 1)
=>WM: (14257: R1 ^reward R1019)
=>WM: (14256: I3 ^see 1)
<=WM: (14247: S1 ^operator O2029 +)
<=WM: (14249: S1 ^operator O2029)
<=WM: (14248: S1 ^operator O2030 +)
<=WM: (14246: I3 ^dir L)
<=WM: (14242: R1 ^reward R1018)
<=WM: (14215: I3 ^see 0)
<=WM: (14245: O2030 ^name predict-no)
<=WM: (14244: O2029 ^name predict-yes)
<=WM: (14243: R1018 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2031 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2032 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2030 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2029 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472325 -0.0815457 0.390779 -> 0.472319 -0.0815467 0.390773(R,m,v=1,0.944785,0.0524881)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527743 0.0815574 0.6093 -> 0.527736 0.0815563 0.609293(R,m,v=1,1,0)
=>WM: (14264: S1 ^operator O2032)

  1016:    O: O2032 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1016 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1015 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14265: I3 ^predict-no N1016)
<=WM: (14251: N1015 ^status complete)
<=WM: (14250: I3 ^predict-yes N1015)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14269: I2 ^dir U)
=>WM: (14268: I2 ^reward 1)
=>WM: (14267: I2 ^see 0)
=>WM: (14266: N1016 ^status complete)
<=WM: (14254: I2 ^dir U)
<=WM: (14253: I2 ^reward 1)
<=WM: (14252: I2 ^see 1)
=>WM: (14270: I2 ^level-1 L1-root)
<=WM: (14255: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1020 ^value 1 +)
 (R1 ^reward R1020 +)
Firing propose*predict-yes
 -->
 (O2033 ^name predict-yes +)
 (S1 ^operator O2033 +)
Firing propose*predict-no
 -->
 (O2034 ^name predict-no +)
 (S1 ^operator O2034 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2032 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2031 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2032 ^name predict-no +)
 (S1 ^operator O2032 +)
Retracting propose*predict-yes
 -->
 (O2031 ^name predict-yes +)
 (S1 ^operator O2031 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1019 ^value 1 +)
 (R1 ^reward R1019 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2032 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2031 = 0.)
=>WM: (14277: S1 ^operator O2034 +)
=>WM: (14276: S1 ^operator O2033 +)
=>WM: (14275: O2034 ^name predict-no)
=>WM: (14274: O2033 ^name predict-yes)
=>WM: (14273: R1020 ^value 1)
=>WM: (14272: R1 ^reward R1020)
=>WM: (14271: I3 ^see 0)
<=WM: (14262: S1 ^operator O2031 +)
<=WM: (14263: S1 ^operator O2032 +)
<=WM: (14264: S1 ^operator O2032)
<=WM: (14257: R1 ^reward R1019)
<=WM: (14256: I3 ^see 1)
<=WM: (14260: O2032 ^name predict-no)
<=WM: (14259: O2031 ^name predict-yes)
<=WM: (14258: R1019 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2033 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2034 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2032 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2031 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14278: S1 ^operator O2034)

  1017:    O: O2034 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1017 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1016 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14279: I3 ^predict-no N1017)
<=WM: (14266: N1016 ^status complete)
<=WM: (14265: I3 ^predict-no N1016)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (14283: I2 ^dir U)
=>WM: (14282: I2 ^reward 1)
=>WM: (14281: I2 ^see 0)
=>WM: (14280: N1017 ^status complete)
<=WM: (14269: I2 ^dir U)
<=WM: (14268: I2 ^reward 1)
<=WM: (14267: I2 ^see 0)
=>WM: (14284: I2 ^level-1 L1-root)
<=WM: (14270: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1021 ^value 1 +)
 (R1 ^reward R1021 +)
Firing propose*predict-yes
 -->
 (O2035 ^name predict-yes +)
 (S1 ^operator O2035 +)
Firing propose*predict-no
 -->
 (O2036 ^name predict-no +)
 (S1 ^operator O2036 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2034 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2033 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2034 ^name predict-no +)
 (S1 ^operator O2034 +)
Retracting propose*predict-yes
 -->
 (O2033 ^name predict-yes +)
 (S1 ^operator O2033 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1020 ^value 1 +)
 (R1 ^reward R1020 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2034 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2033 = 0.)
=>WM: (14290: S1 ^operator O2036 +)
=>WM: (14289: S1 ^operator O2035 +)
=>WM: (14288: O2036 ^name predict-no)
=>WM: (14287: O2035 ^name predict-yes)
=>WM: (14286: R1021 ^value 1)
=>WM: (14285: R1 ^reward R1021)
<=WM: (14276: S1 ^operator O2033 +)
<=WM: (14277: S1 ^operator O2034 +)
<=WM: (14278: S1 ^operator O2034)
<=WM: (14272: R1 ^reward R1020)
<=WM: (14275: O2034 ^name predict-no)
<=WM: (14274: O2033 ^name predict-yes)
<=WM: (14273: R1020 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2035 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2036 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2034 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2033 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14291: S1 ^operator O2036)

  1018:    O: O2036 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1018 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1017 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14292: I3 ^predict-no N1018)
<=WM: (14280: N1017 ^status complete)
<=WM: (14279: I3 ^predict-no N1017)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (14296: I2 ^dir R)
=>WM: (14295: I2 ^reward 1)
=>WM: (14294: I2 ^see 0)
=>WM: (14293: N1018 ^status complete)
<=WM: (14283: I2 ^dir U)
<=WM: (14282: I2 ^reward 1)
<=WM: (14281: I2 ^see 0)
=>WM: (14297: I2 ^level-1 L1-root)
<=WM: (14284: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2035 = 0.8784128060439984)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1022 ^value 1 +)
 (R1 ^reward R1022 +)
Firing propose*predict-yes
 -->
 (O2037 ^name predict-yes +)
 (S1 ^operator O2037 +)
Firing propose*predict-no
 -->
 (O2038 ^name predict-no +)
 (S1 ^operator O2038 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2036 = 0.9999921813761182)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2035 = 0.1215988095600619)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2036 ^name predict-no +)
 (S1 ^operator O2036 +)
Retracting propose*predict-yes
 -->
 (O2035 ^name predict-yes +)
 (S1 ^operator O2035 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1021 ^value 1 +)
 (R1 ^reward R1021 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2036 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2035 = 0.)
=>WM: (14304: S1 ^operator O2038 +)
=>WM: (14303: S1 ^operator O2037 +)
=>WM: (14302: I3 ^dir R)
=>WM: (14301: O2038 ^name predict-no)
=>WM: (14300: O2037 ^name predict-yes)
=>WM: (14299: R1022 ^value 1)
=>WM: (14298: R1 ^reward R1022)
<=WM: (14289: S1 ^operator O2035 +)
<=WM: (14290: S1 ^operator O2036 +)
<=WM: (14291: S1 ^operator O2036)
<=WM: (14261: I3 ^dir U)
<=WM: (14285: R1 ^reward R1021)
<=WM: (14288: O2036 ^name predict-no)
<=WM: (14287: O2035 ^name predict-yes)
<=WM: (14286: R1021 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2037 = 0.8784128060439984)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2037 = 0.1215988095600619)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2038 = 0.9999921813761182)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2036 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2035 = 0.1215988095600619)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2035 = 0.8784128060439984)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14305: S1 ^operator O2037)

  1019:    O: O2037 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1019 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1018 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14306: I3 ^predict-yes N1019)
<=WM: (14293: N1018 ^status complete)
<=WM: (14292: I3 ^predict-no N1018)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14310: I2 ^dir L)
=>WM: (14309: I2 ^reward 1)
=>WM: (14308: I2 ^see 1)
=>WM: (14307: N1019 ^status complete)
<=WM: (14296: I2 ^dir R)
<=WM: (14295: I2 ^reward 1)
<=WM: (14294: I2 ^see 0)
=>WM: (14311: I2 ^level-1 R1-root)
<=WM: (14297: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2038 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2037 = 0.6092926303832609)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1023 ^value 1 +)
 (R1 ^reward R1023 +)
Firing propose*predict-yes
 -->
 (O2039 ^name predict-yes +)
 (S1 ^operator O2039 +)
Firing propose*predict-no
 -->
 (O2040 ^name predict-no +)
 (S1 ^operator O2040 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2038 = 0.3145047896375236)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2037 = 0.3907725922691719)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2038 ^name predict-no +)
 (S1 ^operator O2038 +)
Retracting propose*predict-yes
 -->
 (O2037 ^name predict-yes +)
 (S1 ^operator O2037 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1022 ^value 1 +)
 (R1 ^reward R1022 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2038 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2037 = 0.1215988095600619)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2037 = 0.8784128060439984)
=>WM: (14319: S1 ^operator O2040 +)
=>WM: (14318: S1 ^operator O2039 +)
=>WM: (14317: I3 ^dir L)
=>WM: (14316: O2040 ^name predict-no)
=>WM: (14315: O2039 ^name predict-yes)
=>WM: (14314: R1023 ^value 1)
=>WM: (14313: R1 ^reward R1023)
=>WM: (14312: I3 ^see 1)
<=WM: (14303: S1 ^operator O2037 +)
<=WM: (14305: S1 ^operator O2037)
<=WM: (14304: S1 ^operator O2038 +)
<=WM: (14302: I3 ^dir R)
<=WM: (14298: R1 ^reward R1022)
<=WM: (14271: I3 ^see 0)
<=WM: (14301: O2038 ^name predict-no)
<=WM: (14300: O2037 ^name predict-yes)
<=WM: (14299: R1022 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2039 = 0.3907725922691719)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2039 = 0.6092926303832609)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2040 = 0.3145047896375236)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2040 = -0.168718511744511)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2038 = 0.3145047896375236)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2038 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2037 = 0.3907725922691719)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2037 = 0.6092926303832609)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.866667,0.116201)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465485 0.412928 0.878413 -> 0.465484 0.412928 0.878412(R,m,v=1,1,0)
=>WM: (14320: S1 ^operator O2039)

  1020:    O: O2039 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1020 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1019 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14321: I3 ^predict-yes N1020)
<=WM: (14307: N1019 ^status complete)
<=WM: (14306: I3 ^predict-yes N1019)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\---- Input Phase --- 
=>WM: (14325: I2 ^dir L)
=>WM: (14324: I2 ^reward 1)
=>WM: (14323: I2 ^see 1)
=>WM: (14322: N1020 ^status complete)
<=WM: (14310: I2 ^dir L)
<=WM: (14309: I2 ^reward 1)
<=WM: (14308: I2 ^see 1)
=>WM: (14326: I2 ^level-1 L1-root)
<=WM: (14311: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2039 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2040 = 0.6855266893701198)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1024 ^value 1 +)
 (R1 ^reward R1024 +)
Firing propose*predict-yes
 -->
 (O2041 ^name predict-yes +)
 (S1 ^operator O2041 +)
Firing propose*predict-no
 -->
 (O2042 ^name predict-no +)
 (S1 ^operator O2042 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2040 = 0.3145047896375236)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2039 = 0.3907725922691719)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2040 ^name predict-no +)
 (S1 ^operator O2040 +)
Retracting propose*predict-yes
 -->
 (O2039 ^name predict-yes +)
 (S1 ^operator O2039 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1023 ^value 1 +)
 (R1 ^reward R1023 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2040 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2040 = 0.3145047896375236)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2039 = 0.6092926303832609)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2039 = 0.3907725922691719)
=>WM: (14332: S1 ^operator O2042 +)
=>WM: (14331: S1 ^operator O2041 +)
=>WM: (14330: O2042 ^name predict-no)
=>WM: (14329: O2041 ^name predict-yes)
=>WM: (14328: R1024 ^value 1)
=>WM: (14327: R1 ^reward R1024)
<=WM: (14318: S1 ^operator O2039 +)
<=WM: (14320: S1 ^operator O2039)
<=WM: (14319: S1 ^operator O2040 +)
<=WM: (14313: R1 ^reward R1023)
<=WM: (14316: O2040 ^name predict-no)
<=WM: (14315: O2039 ^name predict-yes)
<=WM: (14314: R1023 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2041 = 0.3907725922691719)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2041 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2042 = 0.3145047896375236)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2042 = 0.6855266893701198)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2040 = 0.3145047896375236)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2040 = 0.6855266893701198)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2039 = 0.3907725922691719)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2039 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472319 -0.0815467 0.390773 -> 0.472315 -0.0815475 0.390767(R,m,v=1,0.945122,0.0521846)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527736 0.0815563 0.609293 -> 0.527731 0.0815554 0.609286(R,m,v=1,1,0)
=>WM: (14333: S1 ^operator O2042)

  1021:    O: O2042 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1021 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1020 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14334: I3 ^predict-no N1021)
<=WM: (14322: N1020 ^status complete)
<=WM: (14321: I3 ^predict-yes N1020)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/--- Input Phase --- 
=>WM: (14338: I2 ^dir R)
=>WM: (14337: I2 ^reward 1)
=>WM: (14336: I2 ^see 0)
=>WM: (14335: N1021 ^status complete)
<=WM: (14325: I2 ^dir L)
<=WM: (14324: I2 ^reward 1)
<=WM: (14323: I2 ^see 1)
=>WM: (14339: I2 ^level-1 L0-root)
<=WM: (14326: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2041 = 0.8783962927268922)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1025 ^value 1 +)
 (R1 ^reward R1025 +)
Firing propose*predict-yes
 -->
 (O2043 ^name predict-yes +)
 (S1 ^operator O2043 +)
Firing propose*predict-no
 -->
 (O2044 ^name predict-no +)
 (S1 ^operator O2044 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2042 = 0.9999921813761182)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2041 = 0.1215978717524572)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2042 ^name predict-no +)
 (S1 ^operator O2042 +)
Retracting propose*predict-yes
 -->
 (O2041 ^name predict-yes +)
 (S1 ^operator O2041 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1024 ^value 1 +)
 (R1 ^reward R1024 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2042 = 0.6855266893701198)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2042 = 0.3145047896375236)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2041 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2041 = 0.3907672460330531)
=>WM: (14347: S1 ^operator O2044 +)
=>WM: (14346: S1 ^operator O2043 +)
=>WM: (14345: I3 ^dir R)
=>WM: (14344: O2044 ^name predict-no)
=>WM: (14343: O2043 ^name predict-yes)
=>WM: (14342: R1025 ^value 1)
=>WM: (14341: R1 ^reward R1025)
=>WM: (14340: I3 ^see 0)
<=WM: (14331: S1 ^operator O2041 +)
<=WM: (14332: S1 ^operator O2042 +)
<=WM: (14333: S1 ^operator O2042)
<=WM: (14317: I3 ^dir L)
<=WM: (14327: R1 ^reward R1024)
<=WM: (14312: I3 ^see 1)
<=WM: (14330: O2042 ^name predict-no)
<=WM: (14329: O2041 ^name predict-yes)
<=WM: (14328: R1024 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2043 = 0.1215978717524572)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2043 = 0.8783962927268922)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2044 = 0.9999921813761182)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2042 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2041 = 0.1215978717524572)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2041 = 0.8783962927268922)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478553 -0.164048 0.314505 -> 0.478551 -0.164048 0.314502(R,m,v=1,0.924528,0.0702173)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521476 0.164051 0.685527 -> 0.521473 0.164051 0.685524(R,m,v=1,1,0)
=>WM: (14348: S1 ^operator O2043)

  1022:    O: O2043 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1022 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1021 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14349: I3 ^predict-yes N1022)
<=WM: (14335: N1021 ^status complete)
<=WM: (14334: I3 ^predict-no N1021)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14353: I2 ^dir U)
=>WM: (14352: I2 ^reward 1)
=>WM: (14351: I2 ^see 1)
=>WM: (14350: N1022 ^status complete)
<=WM: (14338: I2 ^dir R)
<=WM: (14337: I2 ^reward 1)
<=WM: (14336: I2 ^see 0)
=>WM: (14354: I2 ^level-1 R1-root)
<=WM: (14339: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1026 ^value 1 +)
 (R1 ^reward R1026 +)
Firing propose*predict-yes
 -->
 (O2045 ^name predict-yes +)
 (S1 ^operator O2045 +)
Firing propose*predict-no
 -->
 (O2046 ^name predict-no +)
 (S1 ^operator O2046 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2044 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2043 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2044 ^name predict-no +)
 (S1 ^operator O2044 +)
Retracting propose*predict-yes
 -->
 (O2043 ^name predict-yes +)
 (S1 ^operator O2043 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1025 ^value 1 +)
 (R1 ^reward R1025 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2044 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2043 = 0.8783962927268922)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2043 = 0.1215978717524572)
=>WM: (14362: S1 ^operator O2046 +)
=>WM: (14361: S1 ^operator O2045 +)
=>WM: (14360: I3 ^dir U)
=>WM: (14359: O2046 ^name predict-no)
=>WM: (14358: O2045 ^name predict-yes)
=>WM: (14357: R1026 ^value 1)
=>WM: (14356: R1 ^reward R1026)
=>WM: (14355: I3 ^see 1)
<=WM: (14346: S1 ^operator O2043 +)
<=WM: (14348: S1 ^operator O2043)
<=WM: (14347: S1 ^operator O2044 +)
<=WM: (14345: I3 ^dir R)
<=WM: (14341: R1 ^reward R1025)
<=WM: (14340: I3 ^see 0)
<=WM: (14344: O2044 ^name predict-no)
<=WM: (14343: O2043 ^name predict-yes)
<=WM: (14342: R1025 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2045 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2046 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2044 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2043 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.867403,0.115654)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465471 0.412925 0.878396 -> 0.465472 0.412925 0.878397(R,m,v=1,1,0)
=>WM: (14363: S1 ^operator O2046)

  1023:    O: O2046 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1023 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1022 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14364: I3 ^predict-no N1023)
<=WM: (14350: N1022 ^status complete)
<=WM: (14349: I3 ^predict-yes N1022)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/--- Input Phase --- 
=>WM: (14368: I2 ^dir L)
=>WM: (14367: I2 ^reward 1)
=>WM: (14366: I2 ^see 0)
=>WM: (14365: N1023 ^status complete)
<=WM: (14353: I2 ^dir U)
<=WM: (14352: I2 ^reward 1)
<=WM: (14351: I2 ^see 1)
=>WM: (14369: I2 ^level-1 R1-root)
<=WM: (14354: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2046 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2045 = 0.6092864975390457)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1027 ^value 1 +)
 (R1 ^reward R1027 +)
Firing propose*predict-yes
 -->
 (O2047 ^name predict-yes +)
 (S1 ^operator O2047 +)
Firing propose*predict-no
 -->
 (O2048 ^name predict-no +)
 (S1 ^operator O2048 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2046 = 0.314502196170351)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2045 = 0.3907672460330531)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2046 ^name predict-no +)
 (S1 ^operator O2046 +)
Retracting propose*predict-yes
 -->
 (O2045 ^name predict-yes +)
 (S1 ^operator O2045 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1026 ^value 1 +)
 (R1 ^reward R1026 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2046 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2045 = 0.)
=>WM: (14377: S1 ^operator O2048 +)
=>WM: (14376: S1 ^operator O2047 +)
=>WM: (14375: I3 ^dir L)
=>WM: (14374: O2048 ^name predict-no)
=>WM: (14373: O2047 ^name predict-yes)
=>WM: (14372: R1027 ^value 1)
=>WM: (14371: R1 ^reward R1027)
=>WM: (14370: I3 ^see 0)
<=WM: (14361: S1 ^operator O2045 +)
<=WM: (14362: S1 ^operator O2046 +)
<=WM: (14363: S1 ^operator O2046)
<=WM: (14360: I3 ^dir U)
<=WM: (14356: R1 ^reward R1026)
<=WM: (14355: I3 ^see 1)
<=WM: (14359: O2046 ^name predict-no)
<=WM: (14358: O2045 ^name predict-yes)
<=WM: (14357: R1026 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2047 = 0.6092864975390457)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2047 = 0.3907672460330531)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2048 = -0.168718511744511)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2048 = 0.314502196170351)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2046 = 0.314502196170351)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2046 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2045 = 0.3907672460330531)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2045 = 0.6092864975390457)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14378: S1 ^operator O2047)

  1024:    O: O2047 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1024 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1023 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14379: I3 ^predict-yes N1024)
<=WM: (14365: N1023 ^status complete)
<=WM: (14364: I3 ^predict-no N1023)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14383: I2 ^dir U)
=>WM: (14382: I2 ^reward 1)
=>WM: (14381: I2 ^see 1)
=>WM: (14380: N1024 ^status complete)
<=WM: (14368: I2 ^dir L)
<=WM: (14367: I2 ^reward 1)
<=WM: (14366: I2 ^see 0)
=>WM: (14384: I2 ^level-1 L1-root)
<=WM: (14369: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1028 ^value 1 +)
 (R1 ^reward R1028 +)
Firing propose*predict-yes
 -->
 (O2049 ^name predict-yes +)
 (S1 ^operator O2049 +)
Firing propose*predict-no
 -->
 (O2050 ^name predict-no +)
 (S1 ^operator O2050 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2048 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2047 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2048 ^name predict-no +)
 (S1 ^operator O2048 +)
Retracting propose*predict-yes
 -->
 (O2047 ^name predict-yes +)
 (S1 ^operator O2047 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1027 ^value 1 +)
 (R1 ^reward R1027 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2048 = 0.314502196170351)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2048 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2047 = 0.3907672460330531)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2047 = 0.6092864975390457)
=>WM: (14392: S1 ^operator O2050 +)
=>WM: (14391: S1 ^operator O2049 +)
=>WM: (14390: I3 ^dir U)
=>WM: (14389: O2050 ^name predict-no)
=>WM: (14388: O2049 ^name predict-yes)
=>WM: (14387: R1028 ^value 1)
=>WM: (14386: R1 ^reward R1028)
=>WM: (14385: I3 ^see 1)
<=WM: (14376: S1 ^operator O2047 +)
<=WM: (14378: S1 ^operator O2047)
<=WM: (14377: S1 ^operator O2048 +)
<=WM: (14375: I3 ^dir L)
<=WM: (14371: R1 ^reward R1027)
<=WM: (14370: I3 ^see 0)
<=WM: (14374: O2048 ^name predict-no)
<=WM: (14373: O2047 ^name predict-yes)
<=WM: (14372: R1027 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2049 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2050 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2048 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2047 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472315 -0.0815475 0.390767 -> 0.472311 -0.0815481 0.390763(R,m,v=1,0.945455,0.0518847)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527731 0.0815554 0.609286 -> 0.527727 0.0815547 0.609281(R,m,v=1,1,0)
=>WM: (14393: S1 ^operator O2050)

  1025:    O: O2050 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1025 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1024 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14394: I3 ^predict-no N1025)
<=WM: (14380: N1024 ^status complete)
<=WM: (14379: I3 ^predict-yes N1024)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14398: I2 ^dir U)
=>WM: (14397: I2 ^reward 1)
=>WM: (14396: I2 ^see 0)
=>WM: (14395: N1025 ^status complete)
<=WM: (14383: I2 ^dir U)
<=WM: (14382: I2 ^reward 1)
<=WM: (14381: I2 ^see 1)
=>WM: (14399: I2 ^level-1 L1-root)
<=WM: (14384: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1029 ^value 1 +)
 (R1 ^reward R1029 +)
Firing propose*predict-yes
 -->
 (O2051 ^name predict-yes +)
 (S1 ^operator O2051 +)
Firing propose*predict-no
 -->
 (O2052 ^name predict-no +)
 (S1 ^operator O2052 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2050 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2049 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2050 ^name predict-no +)
 (S1 ^operator O2050 +)
Retracting propose*predict-yes
 -->
 (O2049 ^name predict-yes +)
 (S1 ^operator O2049 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1028 ^value 1 +)
 (R1 ^reward R1028 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2050 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2049 = 0.)
=>WM: (14406: S1 ^operator O2052 +)
=>WM: (14405: S1 ^operator O2051 +)
=>WM: (14404: O2052 ^name predict-no)
=>WM: (14403: O2051 ^name predict-yes)
=>WM: (14402: R1029 ^value 1)
=>WM: (14401: R1 ^reward R1029)
=>WM: (14400: I3 ^see 0)
<=WM: (14391: S1 ^operator O2049 +)
<=WM: (14392: S1 ^operator O2050 +)
<=WM: (14393: S1 ^operator O2050)
<=WM: (14386: R1 ^reward R1028)
<=WM: (14385: I3 ^see 1)
<=WM: (14389: O2050 ^name predict-no)
<=WM: (14388: O2049 ^name predict-yes)
<=WM: (14387: R1028 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2051 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2052 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2050 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2049 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14407: S1 ^operator O2052)

  1026:    O: O2052 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1026 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1025 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14408: I3 ^predict-no N1026)
<=WM: (14395: N1025 ^status complete)
<=WM: (14394: I3 ^predict-no N1025)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (14412: I2 ^dir R)
=>WM: (14411: I2 ^reward 1)
=>WM: (14410: I2 ^see 0)
=>WM: (14409: N1026 ^status complete)
<=WM: (14398: I2 ^dir U)
<=WM: (14397: I2 ^reward 1)
<=WM: (14396: I2 ^see 0)
=>WM: (14413: I2 ^level-1 L1-root)
<=WM: (14399: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2051 = 0.8784117192151244)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1030 ^value 1 +)
 (R1 ^reward R1030 +)
Firing propose*predict-yes
 -->
 (O2053 ^name predict-yes +)
 (S1 ^operator O2053 +)
Firing propose*predict-no
 -->
 (O2054 ^name predict-no +)
 (S1 ^operator O2054 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2052 = 0.9999921813761182)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2051 = 0.1215983424730706)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2052 ^name predict-no +)
 (S1 ^operator O2052 +)
Retracting propose*predict-yes
 -->
 (O2051 ^name predict-yes +)
 (S1 ^operator O2051 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1029 ^value 1 +)
 (R1 ^reward R1029 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2052 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2051 = 0.)
=>WM: (14420: S1 ^operator O2054 +)
=>WM: (14419: S1 ^operator O2053 +)
=>WM: (14418: I3 ^dir R)
=>WM: (14417: O2054 ^name predict-no)
=>WM: (14416: O2053 ^name predict-yes)
=>WM: (14415: R1030 ^value 1)
=>WM: (14414: R1 ^reward R1030)
<=WM: (14405: S1 ^operator O2051 +)
<=WM: (14406: S1 ^operator O2052 +)
<=WM: (14407: S1 ^operator O2052)
<=WM: (14390: I3 ^dir U)
<=WM: (14401: R1 ^reward R1029)
<=WM: (14404: O2052 ^name predict-no)
<=WM: (14403: O2051 ^name predict-yes)
<=WM: (14402: R1029 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2053 = 0.8784117192151244)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2053 = 0.1215983424730706)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2054 = 0.9999921813761182)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2052 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2051 = 0.1215983424730706)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2051 = 0.8784117192151244)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14421: S1 ^operator O2053)

  1027:    O: O2053 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1027 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1026 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14422: I3 ^predict-yes N1027)
<=WM: (14409: N1026 ^status complete)
<=WM: (14408: I3 ^predict-no N1026)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14426: I2 ^dir U)
=>WM: (14425: I2 ^reward 1)
=>WM: (14424: I2 ^see 1)
=>WM: (14423: N1027 ^status complete)
<=WM: (14412: I2 ^dir R)
<=WM: (14411: I2 ^reward 1)
<=WM: (14410: I2 ^see 0)
=>WM: (14427: I2 ^level-1 R1-root)
<=WM: (14413: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1031 ^value 1 +)
 (R1 ^reward R1031 +)
Firing propose*predict-yes
 -->
 (O2055 ^name predict-yes +)
 (S1 ^operator O2055 +)
Firing propose*predict-no
 -->
 (O2056 ^name predict-no +)
 (S1 ^operator O2056 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2054 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2053 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2054 ^name predict-no +)
 (S1 ^operator O2054 +)
Retracting propose*predict-yes
 -->
 (O2053 ^name predict-yes +)
 (S1 ^operator O2053 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1030 ^value 1 +)
 (R1 ^reward R1030 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2054 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2053 = 0.1215983424730706)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2053 = 0.8784117192151244)
=>WM: (14435: S1 ^operator O2056 +)
=>WM: (14434: S1 ^operator O2055 +)
=>WM: (14433: I3 ^dir U)
=>WM: (14432: O2056 ^name predict-no)
=>WM: (14431: O2055 ^name predict-yes)
=>WM: (14430: R1031 ^value 1)
=>WM: (14429: R1 ^reward R1031)
=>WM: (14428: I3 ^see 1)
<=WM: (14419: S1 ^operator O2053 +)
<=WM: (14421: S1 ^operator O2053)
<=WM: (14420: S1 ^operator O2054 +)
<=WM: (14418: I3 ^dir R)
<=WM: (14414: R1 ^reward R1030)
<=WM: (14400: I3 ^see 0)
<=WM: (14417: O2054 ^name predict-no)
<=WM: (14416: O2053 ^name predict-yes)
<=WM: (14415: R1030 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2055 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2056 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2054 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2053 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.868132,0.115111)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465484 0.412928 0.878412 -> 0.465483 0.412928 0.878411(R,m,v=1,1,0)
=>WM: (14436: S1 ^operator O2056)

  1028:    O: O2056 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1028 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1027 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14437: I3 ^predict-no N1028)
<=WM: (14423: N1027 ^status complete)
<=WM: (14422: I3 ^predict-yes N1027)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (14441: I2 ^dir L)
=>WM: (14440: I2 ^reward 1)
=>WM: (14439: I2 ^see 0)
=>WM: (14438: N1028 ^status complete)
<=WM: (14426: I2 ^dir U)
<=WM: (14425: I2 ^reward 1)
<=WM: (14424: I2 ^see 1)
=>WM: (14442: I2 ^level-1 R1-root)
<=WM: (14427: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2056 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2055 = 0.6092814566217208)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1032 ^value 1 +)
 (R1 ^reward R1032 +)
Firing propose*predict-yes
 -->
 (O2057 ^name predict-yes +)
 (S1 ^operator O2057 +)
Firing propose*predict-no
 -->
 (O2058 ^name predict-no +)
 (S1 ^operator O2058 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2056 = 0.314502196170351)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2055 = 0.3907628451116619)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2056 ^name predict-no +)
 (S1 ^operator O2056 +)
Retracting propose*predict-yes
 -->
 (O2055 ^name predict-yes +)
 (S1 ^operator O2055 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1031 ^value 1 +)
 (R1 ^reward R1031 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2056 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2055 = 0.)
=>WM: (14450: S1 ^operator O2058 +)
=>WM: (14449: S1 ^operator O2057 +)
=>WM: (14448: I3 ^dir L)
=>WM: (14447: O2058 ^name predict-no)
=>WM: (14446: O2057 ^name predict-yes)
=>WM: (14445: R1032 ^value 1)
=>WM: (14444: R1 ^reward R1032)
=>WM: (14443: I3 ^see 0)
<=WM: (14434: S1 ^operator O2055 +)
<=WM: (14435: S1 ^operator O2056 +)
<=WM: (14436: S1 ^operator O2056)
<=WM: (14433: I3 ^dir U)
<=WM: (14429: R1 ^reward R1031)
<=WM: (14428: I3 ^see 1)
<=WM: (14432: O2056 ^name predict-no)
<=WM: (14431: O2055 ^name predict-yes)
<=WM: (14430: R1031 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2057 = 0.6092814566217208)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2057 = 0.3907628451116619)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2058 = -0.168718511744511)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2058 = 0.314502196170351)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2056 = 0.314502196170351)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2056 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2055 = 0.3907628451116619)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2055 = 0.6092814566217208)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14451: S1 ^operator O2057)

  1029:    O: O2057 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1029 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1028 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14452: I3 ^predict-yes N1029)
<=WM: (14438: N1028 ^status complete)
<=WM: (14437: I3 ^predict-no N1028)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (14456: I2 ^dir L)
=>WM: (14455: I2 ^reward 1)
=>WM: (14454: I2 ^see 1)
=>WM: (14453: N1029 ^status complete)
<=WM: (14441: I2 ^dir L)
<=WM: (14440: I2 ^reward 1)
<=WM: (14439: I2 ^see 0)
=>WM: (14457: I2 ^level-1 L1-root)
<=WM: (14442: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2057 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2058 = 0.6855237439964433)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1033 ^value 1 +)
 (R1 ^reward R1033 +)
Firing propose*predict-yes
 -->
 (O2059 ^name predict-yes +)
 (S1 ^operator O2059 +)
Firing propose*predict-no
 -->
 (O2060 ^name predict-no +)
 (S1 ^operator O2060 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2058 = 0.314502196170351)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2057 = 0.3907628451116619)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2058 ^name predict-no +)
 (S1 ^operator O2058 +)
Retracting propose*predict-yes
 -->
 (O2057 ^name predict-yes +)
 (S1 ^operator O2057 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1032 ^value 1 +)
 (R1 ^reward R1032 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2058 = 0.314502196170351)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2058 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2057 = 0.3907628451116619)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2057 = 0.6092814566217208)
=>WM: (14464: S1 ^operator O2060 +)
=>WM: (14463: S1 ^operator O2059 +)
=>WM: (14462: O2060 ^name predict-no)
=>WM: (14461: O2059 ^name predict-yes)
=>WM: (14460: R1033 ^value 1)
=>WM: (14459: R1 ^reward R1033)
=>WM: (14458: I3 ^see 1)
<=WM: (14449: S1 ^operator O2057 +)
<=WM: (14451: S1 ^operator O2057)
<=WM: (14450: S1 ^operator O2058 +)
<=WM: (14444: R1 ^reward R1032)
<=WM: (14443: I3 ^see 0)
<=WM: (14447: O2058 ^name predict-no)
<=WM: (14446: O2057 ^name predict-yes)
<=WM: (14445: R1032 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2059 = 0.3907628451116619)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2059 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2060 = 0.314502196170351)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2060 = 0.6855237439964433)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2058 = 0.314502196170351)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2058 = 0.6855237439964433)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2057 = 0.3907628451116619)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2057 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472311 -0.0815481 0.390763 -> 0.472308 -0.0815487 0.390759(R,m,v=1,0.945783,0.0515882)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527727 0.0815547 0.609281 -> 0.527723 0.0815541 0.609277(R,m,v=1,1,0)
=>WM: (14465: S1 ^operator O2060)

  1030:    O: O2060 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1030 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1029 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14466: I3 ^predict-no N1030)
<=WM: (14453: N1029 ^status complete)
<=WM: (14452: I3 ^predict-yes N1029)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14470: I2 ^dir R)
=>WM: (14469: I2 ^reward 1)
=>WM: (14468: I2 ^see 0)
=>WM: (14467: N1030 ^status complete)
<=WM: (14456: I2 ^dir L)
<=WM: (14455: I2 ^reward 1)
<=WM: (14454: I2 ^see 1)
=>WM: (14471: I2 ^level-1 L0-root)
<=WM: (14457: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2059 = 0.8783968442404908)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1034 ^value 1 +)
 (R1 ^reward R1034 +)
Firing propose*predict-yes
 -->
 (O2061 ^name predict-yes +)
 (S1 ^operator O2061 +)
Firing propose*predict-no
 -->
 (O2062 ^name predict-no +)
 (S1 ^operator O2062 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2060 = 0.9999921813761182)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2059 = 0.1215975315706407)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2060 ^name predict-no +)
 (S1 ^operator O2060 +)
Retracting propose*predict-yes
 -->
 (O2059 ^name predict-yes +)
 (S1 ^operator O2059 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1033 ^value 1 +)
 (R1 ^reward R1033 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2060 = 0.6855237439964433)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2060 = 0.314502196170351)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2059 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2059 = 0.3907592209442947)
=>WM: (14479: S1 ^operator O2062 +)
=>WM: (14478: S1 ^operator O2061 +)
=>WM: (14477: I3 ^dir R)
=>WM: (14476: O2062 ^name predict-no)
=>WM: (14475: O2061 ^name predict-yes)
=>WM: (14474: R1034 ^value 1)
=>WM: (14473: R1 ^reward R1034)
=>WM: (14472: I3 ^see 0)
<=WM: (14463: S1 ^operator O2059 +)
<=WM: (14464: S1 ^operator O2060 +)
<=WM: (14465: S1 ^operator O2060)
<=WM: (14448: I3 ^dir L)
<=WM: (14459: R1 ^reward R1033)
<=WM: (14458: I3 ^see 1)
<=WM: (14462: O2060 ^name predict-no)
<=WM: (14461: O2059 ^name predict-yes)
<=WM: (14460: R1033 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2061 = 0.1215975315706407)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2061 = 0.8783968442404908)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2062 = 0.9999921813761182)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2060 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2059 = 0.1215975315706407)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2059 = 0.8783968442404908)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478551 -0.164048 0.314502 -> 0.478549 -0.164049 0.3145(R,m,v=1,0.925,0.0698113)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521473 0.164051 0.685524 -> 0.521471 0.164051 0.685521(R,m,v=1,1,0)
=>WM: (14480: S1 ^operator O2061)

  1031:    O: O2061 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1031 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1030 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14481: I3 ^predict-yes N1031)
<=WM: (14467: N1030 ^status complete)
<=WM: (14466: I3 ^predict-no N1030)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
---- Input Phase --- 
=>WM: (14485: I2 ^dir U)
=>WM: (14484: I2 ^reward 1)
=>WM: (14483: I2 ^see 1)
=>WM: (14482: N1031 ^status complete)
<=WM: (14470: I2 ^dir R)
<=WM: (14469: I2 ^reward 1)
<=WM: (14468: I2 ^see 0)
=>WM: (14486: I2 ^level-1 R1-root)
<=WM: (14471: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1035 ^value 1 +)
 (R1 ^reward R1035 +)
Firing propose*predict-yes
 -->
 (O2063 ^name predict-yes +)
 (S1 ^operator O2063 +)
Firing propose*predict-no
 -->
 (O2064 ^name predict-no +)
 (S1 ^operator O2064 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2062 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2061 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2062 ^name predict-no +)
 (S1 ^operator O2062 +)
Retracting propose*predict-yes
 -->
 (O2061 ^name predict-yes +)
 (S1 ^operator O2061 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1034 ^value 1 +)
 (R1 ^reward R1034 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2062 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2061 = 0.8783968442404908)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2061 = 0.1215975315706407)
=>WM: (14494: S1 ^operator O2064 +)
=>WM: (14493: S1 ^operator O2063 +)
=>WM: (14492: I3 ^dir U)
=>WM: (14491: O2064 ^name predict-no)
=>WM: (14490: O2063 ^name predict-yes)
=>WM: (14489: R1035 ^value 1)
=>WM: (14488: R1 ^reward R1035)
=>WM: (14487: I3 ^see 1)
<=WM: (14478: S1 ^operator O2061 +)
<=WM: (14480: S1 ^operator O2061)
<=WM: (14479: S1 ^operator O2062 +)
<=WM: (14477: I3 ^dir R)
<=WM: (14473: R1 ^reward R1034)
<=WM: (14472: I3 ^see 0)
<=WM: (14476: O2062 ^name predict-no)
<=WM: (14475: O2061 ^name predict-yes)
<=WM: (14474: R1034 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2063 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2064 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2062 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2061 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.868852,0.114574)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465472 0.412925 0.878397 -> 0.465472 0.412925 0.878397(R,m,v=1,1,0)
=>WM: (14495: S1 ^operator O2064)

  1032:    O: O2064 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1032 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1031 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14496: I3 ^predict-no N1032)
<=WM: (14482: N1031 ^status complete)
<=WM: (14481: I3 ^predict-yes N1031)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14500: I2 ^dir R)
=>WM: (14499: I2 ^reward 1)
=>WM: (14498: I2 ^see 0)
=>WM: (14497: N1032 ^status complete)
<=WM: (14485: I2 ^dir U)
<=WM: (14484: I2 ^reward 1)
<=WM: (14483: I2 ^see 1)
=>WM: (14501: I2 ^level-1 R1-root)
<=WM: (14486: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2063 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1036 ^value 1 +)
 (R1 ^reward R1036 +)
Firing propose*predict-yes
 -->
 (O2065 ^name predict-yes +)
 (S1 ^operator O2065 +)
Firing propose*predict-no
 -->
 (O2066 ^name predict-no +)
 (S1 ^operator O2066 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2064 = 0.9999921813761182)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2063 = 0.1215979844413558)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2064 ^name predict-no +)
 (S1 ^operator O2064 +)
Retracting propose*predict-yes
 -->
 (O2063 ^name predict-yes +)
 (S1 ^operator O2063 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1035 ^value 1 +)
 (R1 ^reward R1035 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2064 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2063 = 0.)
=>WM: (14509: S1 ^operator O2066 +)
=>WM: (14508: S1 ^operator O2065 +)
=>WM: (14507: I3 ^dir R)
=>WM: (14506: O2066 ^name predict-no)
=>WM: (14505: O2065 ^name predict-yes)
=>WM: (14504: R1036 ^value 1)
=>WM: (14503: R1 ^reward R1036)
=>WM: (14502: I3 ^see 0)
<=WM: (14493: S1 ^operator O2063 +)
<=WM: (14494: S1 ^operator O2064 +)
<=WM: (14495: S1 ^operator O2064)
<=WM: (14492: I3 ^dir U)
<=WM: (14488: R1 ^reward R1035)
<=WM: (14487: I3 ^see 1)
<=WM: (14491: O2064 ^name predict-no)
<=WM: (14490: O2063 ^name predict-yes)
<=WM: (14489: R1035 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2065 = -0.04253361215288998)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2065 = 0.1215979844413558)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2066 = 0.9999921813761182)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2064 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2063 = 0.1215979844413558)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2063 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14510: S1 ^operator O2066)

  1033:    O: O2066 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1033 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1032 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14511: I3 ^predict-no N1033)
<=WM: (14497: N1032 ^status complete)
<=WM: (14496: I3 ^predict-no N1032)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (14515: I2 ^dir L)
=>WM: (14514: I2 ^reward 1)
=>WM: (14513: I2 ^see 0)
=>WM: (14512: N1033 ^status complete)
<=WM: (14500: I2 ^dir R)
<=WM: (14499: I2 ^reward 1)
<=WM: (14498: I2 ^see 0)
=>WM: (14516: I2 ^level-1 R0-root)
<=WM: (14501: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2066 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2065 = 0.6091249560527634)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1037 ^value 1 +)
 (R1 ^reward R1037 +)
Firing propose*predict-yes
 -->
 (O2067 ^name predict-yes +)
 (S1 ^operator O2067 +)
Firing propose*predict-no
 -->
 (O2068 ^name predict-no +)
 (S1 ^operator O2068 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2066 = 0.314500061238283)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2065 = 0.3907592209442947)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2066 ^name predict-no +)
 (S1 ^operator O2066 +)
Retracting propose*predict-yes
 -->
 (O2065 ^name predict-yes +)
 (S1 ^operator O2065 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1036 ^value 1 +)
 (R1 ^reward R1036 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2066 = 0.9999921813761182)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2065 = 0.1215979844413558)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2065 = -0.04253361215288998)
=>WM: (14523: S1 ^operator O2068 +)
=>WM: (14522: S1 ^operator O2067 +)
=>WM: (14521: I3 ^dir L)
=>WM: (14520: O2068 ^name predict-no)
=>WM: (14519: O2067 ^name predict-yes)
=>WM: (14518: R1037 ^value 1)
=>WM: (14517: R1 ^reward R1037)
<=WM: (14508: S1 ^operator O2065 +)
<=WM: (14509: S1 ^operator O2066 +)
<=WM: (14510: S1 ^operator O2066)
<=WM: (14507: I3 ^dir R)
<=WM: (14503: R1 ^reward R1036)
<=WM: (14506: O2066 ^name predict-no)
<=WM: (14505: O2065 ^name predict-yes)
<=WM: (14504: R1036 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2067 = 0.3907592209442947)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2067 = 0.6091249560527634)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2068 = 0.314500061238283)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2068 = -0.1984300550322165)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2066 = 0.314500061238283)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2066 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2065 = 0.3907592209442947)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2065 = 0.6091249560527634)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999992 0 0.999992 -> 0.999993 0 0.999993(R,m,v=1,0.938889,0.0576971)
=>WM: (14524: S1 ^operator O2067)

  1034:    O: O2067 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1034 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1033 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14525: I3 ^predict-yes N1034)
<=WM: (14512: N1033 ^status complete)
<=WM: (14511: I3 ^predict-no N1033)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (14529: I2 ^dir U)
=>WM: (14528: I2 ^reward 1)
=>WM: (14527: I2 ^see 1)
=>WM: (14526: N1034 ^status complete)
<=WM: (14515: I2 ^dir L)
<=WM: (14514: I2 ^reward 1)
<=WM: (14513: I2 ^see 0)
=>WM: (14530: I2 ^level-1 L1-root)
<=WM: (14516: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1038 ^value 1 +)
 (R1 ^reward R1038 +)
Firing propose*predict-yes
 -->
 (O2069 ^name predict-yes +)
 (S1 ^operator O2069 +)
Firing propose*predict-no
 -->
 (O2070 ^name predict-no +)
 (S1 ^operator O2070 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2068 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2067 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2068 ^name predict-no +)
 (S1 ^operator O2068 +)
Retracting propose*predict-yes
 -->
 (O2067 ^name predict-yes +)
 (S1 ^operator O2067 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1037 ^value 1 +)
 (R1 ^reward R1037 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2068 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2068 = 0.314500061238283)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2067 = 0.6091249560527634)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2067 = 0.3907592209442947)
=>WM: (14538: S1 ^operator O2070 +)
=>WM: (14537: S1 ^operator O2069 +)
=>WM: (14536: I3 ^dir U)
=>WM: (14535: O2070 ^name predict-no)
=>WM: (14534: O2069 ^name predict-yes)
=>WM: (14533: R1038 ^value 1)
=>WM: (14532: R1 ^reward R1038)
=>WM: (14531: I3 ^see 1)
<=WM: (14522: S1 ^operator O2067 +)
<=WM: (14524: S1 ^operator O2067)
<=WM: (14523: S1 ^operator O2068 +)
<=WM: (14521: I3 ^dir L)
<=WM: (14517: R1 ^reward R1037)
<=WM: (14502: I3 ^see 0)
<=WM: (14520: O2068 ^name predict-no)
<=WM: (14519: O2067 ^name predict-yes)
<=WM: (14518: R1037 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2069 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2070 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2068 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2067 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472308 -0.0815487 0.390759 -> 0.472316 -0.0815473 0.390769(R,m,v=1,0.946108,0.051295)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527593 0.0815315 0.609125 -> 0.527603 0.0815331 0.609136(R,m,v=1,1,0)
=>WM: (14539: S1 ^operator O2070)

  1035:    O: O2070 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1035 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1034 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14540: I3 ^predict-no N1035)
<=WM: (14526: N1034 ^status complete)
<=WM: (14525: I3 ^predict-yes N1034)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14544: I2 ^dir R)
=>WM: (14543: I2 ^reward 1)
=>WM: (14542: I2 ^see 0)
=>WM: (14541: N1035 ^status complete)
<=WM: (14529: I2 ^dir U)
<=WM: (14528: I2 ^reward 1)
<=WM: (14527: I2 ^see 1)
=>WM: (14545: I2 ^level-1 L1-root)
<=WM: (14530: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2069 = 0.8784107800481358)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1039 ^value 1 +)
 (R1 ^reward R1039 +)
Firing propose*predict-yes
 -->
 (O2071 ^name predict-yes +)
 (S1 ^operator O2071 +)
Firing propose*predict-no
 -->
 (O2072 ^name predict-no +)
 (S1 ^operator O2072 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2070 = 0.9999934438786788)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2069 = 0.1215979844413558)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2070 ^name predict-no +)
 (S1 ^operator O2070 +)
Retracting propose*predict-yes
 -->
 (O2069 ^name predict-yes +)
 (S1 ^operator O2069 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1038 ^value 1 +)
 (R1 ^reward R1038 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2070 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2069 = 0.)
=>WM: (14553: S1 ^operator O2072 +)
=>WM: (14552: S1 ^operator O2071 +)
=>WM: (14551: I3 ^dir R)
=>WM: (14550: O2072 ^name predict-no)
=>WM: (14549: O2071 ^name predict-yes)
=>WM: (14548: R1039 ^value 1)
=>WM: (14547: R1 ^reward R1039)
=>WM: (14546: I3 ^see 0)
<=WM: (14537: S1 ^operator O2069 +)
<=WM: (14538: S1 ^operator O2070 +)
<=WM: (14539: S1 ^operator O2070)
<=WM: (14536: I3 ^dir U)
<=WM: (14532: R1 ^reward R1038)
<=WM: (14531: I3 ^see 1)
<=WM: (14535: O2070 ^name predict-no)
<=WM: (14534: O2069 ^name predict-yes)
<=WM: (14533: R1038 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2071 = 0.8784107800481358)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2071 = 0.1215979844413558)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2072 = 0.9999934438786788)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2070 = 0.9999934438786788)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2069 = 0.1215979844413558)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2069 = 0.8784107800481358)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14554: S1 ^operator O2071)

  1036:    O: O2071 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1036 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1035 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14555: I3 ^predict-yes N1036)
<=WM: (14541: N1035 ^status complete)
<=WM: (14540: I3 ^predict-no N1035)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14559: I2 ^dir U)
=>WM: (14558: I2 ^reward 1)
=>WM: (14557: I2 ^see 1)
=>WM: (14556: N1036 ^status complete)
<=WM: (14544: I2 ^dir R)
<=WM: (14543: I2 ^reward 1)
<=WM: (14542: I2 ^see 0)
=>WM: (14560: I2 ^level-1 R1-root)
<=WM: (14545: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1040 ^value 1 +)
 (R1 ^reward R1040 +)
Firing propose*predict-yes
 -->
 (O2073 ^name predict-yes +)
 (S1 ^operator O2073 +)
Firing propose*predict-no
 -->
 (O2074 ^name predict-no +)
 (S1 ^operator O2074 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2072 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2071 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2072 ^name predict-no +)
 (S1 ^operator O2072 +)
Retracting propose*predict-yes
 -->
 (O2071 ^name predict-yes +)
 (S1 ^operator O2071 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1039 ^value 1 +)
 (R1 ^reward R1039 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2072 = 0.9999934438786788)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2071 = 0.1215979844413558)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2071 = 0.8784107800481358)
=>WM: (14568: S1 ^operator O2074 +)
=>WM: (14567: S1 ^operator O2073 +)
=>WM: (14566: I3 ^dir U)
=>WM: (14565: O2074 ^name predict-no)
=>WM: (14564: O2073 ^name predict-yes)
=>WM: (14563: R1040 ^value 1)
=>WM: (14562: R1 ^reward R1040)
=>WM: (14561: I3 ^see 1)
<=WM: (14552: S1 ^operator O2071 +)
<=WM: (14554: S1 ^operator O2071)
<=WM: (14553: S1 ^operator O2072 +)
<=WM: (14551: I3 ^dir R)
<=WM: (14547: R1 ^reward R1039)
<=WM: (14546: I3 ^see 0)
<=WM: (14550: O2072 ^name predict-no)
<=WM: (14549: O2071 ^name predict-yes)
<=WM: (14548: R1039 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2073 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2074 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2072 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2071 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534524 -0.412926 0.121597(R,m,v=1,0.869565,0.114041)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465483 0.412928 0.878411 -> 0.465483 0.412927 0.87841(R,m,v=1,1,0)
=>WM: (14569: S1 ^operator O2074)

  1037:    O: O2074 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1037 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1036 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14570: I3 ^predict-no N1037)
<=WM: (14556: N1036 ^status complete)
<=WM: (14555: I3 ^predict-yes N1036)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (14574: I2 ^dir L)
=>WM: (14573: I2 ^reward 1)
=>WM: (14572: I2 ^see 0)
=>WM: (14571: N1037 ^status complete)
<=WM: (14559: I2 ^dir U)
<=WM: (14558: I2 ^reward 1)
<=WM: (14557: I2 ^see 1)
=>WM: (14575: I2 ^level-1 R1-root)
<=WM: (14560: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2074 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2073 = 0.6092773114732839)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1041 ^value 1 +)
 (R1 ^reward R1041 +)
Firing propose*predict-yes
 -->
 (O2075 ^name predict-yes +)
 (S1 ^operator O2075 +)
Firing propose*predict-no
 -->
 (O2076 ^name predict-no +)
 (S1 ^operator O2076 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2074 = 0.314500061238283)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2073 = 0.3907686867108918)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2074 ^name predict-no +)
 (S1 ^operator O2074 +)
Retracting propose*predict-yes
 -->
 (O2073 ^name predict-yes +)
 (S1 ^operator O2073 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1040 ^value 1 +)
 (R1 ^reward R1040 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2074 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2073 = 0.)
=>WM: (14583: S1 ^operator O2076 +)
=>WM: (14582: S1 ^operator O2075 +)
=>WM: (14581: I3 ^dir L)
=>WM: (14580: O2076 ^name predict-no)
=>WM: (14579: O2075 ^name predict-yes)
=>WM: (14578: R1041 ^value 1)
=>WM: (14577: R1 ^reward R1041)
=>WM: (14576: I3 ^see 0)
<=WM: (14567: S1 ^operator O2073 +)
<=WM: (14568: S1 ^operator O2074 +)
<=WM: (14569: S1 ^operator O2074)
<=WM: (14566: I3 ^dir U)
<=WM: (14562: R1 ^reward R1040)
<=WM: (14561: I3 ^see 1)
<=WM: (14565: O2074 ^name predict-no)
<=WM: (14564: O2073 ^name predict-yes)
<=WM: (14563: R1040 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2075 = 0.6092773114732839)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2075 = 0.3907686867108918)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2076 = -0.168718511744511)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2076 = 0.314500061238283)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2074 = 0.314500061238283)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2074 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2073 = 0.3907686867108918)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2073 = 0.6092773114732839)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14584: S1 ^operator O2075)

  1038:    O: O2075 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1038 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1037 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14585: I3 ^predict-yes N1038)
<=WM: (14571: N1037 ^status complete)
<=WM: (14570: I3 ^predict-no N1037)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14589: I2 ^dir R)
=>WM: (14588: I2 ^reward 1)
=>WM: (14587: I2 ^see 1)
=>WM: (14586: N1038 ^status complete)
<=WM: (14574: I2 ^dir L)
<=WM: (14573: I2 ^reward 1)
<=WM: (14572: I2 ^see 0)
=>WM: (14590: I2 ^level-1 L1-root)
<=WM: (14575: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2075 = 0.8784099639037452)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1042 ^value 1 +)
 (R1 ^reward R1042 +)
Firing propose*predict-yes
 -->
 (O2077 ^name predict-yes +)
 (S1 ^operator O2077 +)
Firing propose*predict-no
 -->
 (O2078 ^name predict-no +)
 (S1 ^operator O2078 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2076 = 0.9999934438786788)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2075 = 0.1215972793263044)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2076 ^name predict-no +)
 (S1 ^operator O2076 +)
Retracting propose*predict-yes
 -->
 (O2075 ^name predict-yes +)
 (S1 ^operator O2075 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1041 ^value 1 +)
 (R1 ^reward R1041 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2076 = 0.314500061238283)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2076 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2075 = 0.3907686867108918)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2075 = 0.6092773114732839)
=>WM: (14598: S1 ^operator O2078 +)
=>WM: (14597: S1 ^operator O2077 +)
=>WM: (14596: I3 ^dir R)
=>WM: (14595: O2078 ^name predict-no)
=>WM: (14594: O2077 ^name predict-yes)
=>WM: (14593: R1042 ^value 1)
=>WM: (14592: R1 ^reward R1042)
=>WM: (14591: I3 ^see 1)
<=WM: (14582: S1 ^operator O2075 +)
<=WM: (14584: S1 ^operator O2075)
<=WM: (14583: S1 ^operator O2076 +)
<=WM: (14581: I3 ^dir L)
<=WM: (14577: R1 ^reward R1041)
<=WM: (14576: I3 ^see 0)
<=WM: (14580: O2076 ^name predict-no)
<=WM: (14579: O2075 ^name predict-yes)
<=WM: (14578: R1041 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2077 = 0.1215972793263044)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2077 = 0.8784099639037452)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2078 = 0.9999934438786788)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2076 = 0.9999934438786788)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2075 = 0.1215972793263044)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2075 = 0.8784099639037452)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472316 -0.0815473 0.390769 -> 0.472313 -0.0815478 0.390765(R,m,v=1,0.946429,0.0510051)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527723 0.0815541 0.609277 -> 0.52772 0.0815534 0.609273(R,m,v=1,1,0)
=>WM: (14599: S1 ^operator O2077)

  1039:    O: O2077 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1039 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1038 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14600: I3 ^predict-yes N1039)
<=WM: (14586: N1038 ^status complete)
<=WM: (14585: I3 ^predict-yes N1038)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14604: I2 ^dir L)
=>WM: (14603: I2 ^reward 1)
=>WM: (14602: I2 ^see 1)
=>WM: (14601: N1039 ^status complete)
<=WM: (14589: I2 ^dir R)
<=WM: (14588: I2 ^reward 1)
<=WM: (14587: I2 ^see 1)
=>WM: (14605: I2 ^level-1 R1-root)
<=WM: (14590: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2078 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2077 = 0.6092730179615714)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1043 ^value 1 +)
 (R1 ^reward R1043 +)
Firing propose*predict-yes
 -->
 (O2079 ^name predict-yes +)
 (S1 ^operator O2079 +)
Firing propose*predict-no
 -->
 (O2080 ^name predict-no +)
 (S1 ^operator O2080 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2078 = 0.314500061238283)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2077 = 0.3907649311218379)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2078 ^name predict-no +)
 (S1 ^operator O2078 +)
Retracting propose*predict-yes
 -->
 (O2077 ^name predict-yes +)
 (S1 ^operator O2077 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1042 ^value 1 +)
 (R1 ^reward R1042 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2078 = 0.9999934438786788)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2077 = 0.8784099639037452)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2077 = 0.1215972793263044)
=>WM: (14612: S1 ^operator O2080 +)
=>WM: (14611: S1 ^operator O2079 +)
=>WM: (14610: I3 ^dir L)
=>WM: (14609: O2080 ^name predict-no)
=>WM: (14608: O2079 ^name predict-yes)
=>WM: (14607: R1043 ^value 1)
=>WM: (14606: R1 ^reward R1043)
<=WM: (14597: S1 ^operator O2077 +)
<=WM: (14599: S1 ^operator O2077)
<=WM: (14598: S1 ^operator O2078 +)
<=WM: (14596: I3 ^dir R)
<=WM: (14592: R1 ^reward R1042)
<=WM: (14595: O2078 ^name predict-no)
<=WM: (14594: O2077 ^name predict-yes)
<=WM: (14593: R1042 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2079 = 0.3907649311218379)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2079 = 0.6092730179615714)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2080 = 0.314500061238283)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2080 = -0.168718511744511)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2078 = 0.314500061238283)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2078 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2077 = 0.3907649311218379)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2077 = 0.6092730179615714)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.87027,0.113514)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465483 0.412927 0.87841 -> 0.465482 0.412927 0.878409(R,m,v=1,1,0)
=>WM: (14613: S1 ^operator O2079)

  1040:    O: O2079 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1040 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1039 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14614: I3 ^predict-yes N1040)
<=WM: (14601: N1039 ^status complete)
<=WM: (14600: I3 ^predict-yes N1039)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14618: I2 ^dir L)
=>WM: (14617: I2 ^reward 1)
=>WM: (14616: I2 ^see 1)
=>WM: (14615: N1040 ^status complete)
<=WM: (14604: I2 ^dir L)
<=WM: (14603: I2 ^reward 1)
<=WM: (14602: I2 ^see 1)
=>WM: (14619: I2 ^level-1 L1-root)
<=WM: (14605: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2079 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2080 = 0.6855213227180397)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1044 ^value 1 +)
 (R1 ^reward R1044 +)
Firing propose*predict-yes
 -->
 (O2081 ^name predict-yes +)
 (S1 ^operator O2081 +)
Firing propose*predict-no
 -->
 (O2082 ^name predict-no +)
 (S1 ^operator O2082 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2080 = 0.314500061238283)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2079 = 0.3907649311218379)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2080 ^name predict-no +)
 (S1 ^operator O2080 +)
Retracting propose*predict-yes
 -->
 (O2079 ^name predict-yes +)
 (S1 ^operator O2079 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1043 ^value 1 +)
 (R1 ^reward R1043 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2080 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2080 = 0.314500061238283)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2079 = 0.6092730179615714)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2079 = 0.3907649311218379)
=>WM: (14625: S1 ^operator O2082 +)
=>WM: (14624: S1 ^operator O2081 +)
=>WM: (14623: O2082 ^name predict-no)
=>WM: (14622: O2081 ^name predict-yes)
=>WM: (14621: R1044 ^value 1)
=>WM: (14620: R1 ^reward R1044)
<=WM: (14611: S1 ^operator O2079 +)
<=WM: (14613: S1 ^operator O2079)
<=WM: (14612: S1 ^operator O2080 +)
<=WM: (14606: R1 ^reward R1043)
<=WM: (14609: O2080 ^name predict-no)
<=WM: (14608: O2079 ^name predict-yes)
<=WM: (14607: R1043 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2081 = 0.3907649311218379)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2081 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2082 = 0.314500061238283)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2082 = 0.6855213227180397)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2080 = 0.314500061238283)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2080 = 0.6855213227180397)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2079 = 0.3907649311218379)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2079 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472313 -0.0815478 0.390765 -> 0.47231 -0.0815483 0.390762(R,m,v=1,0.946746,0.0507185)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.52772 0.0815534 0.609273 -> 0.527717 0.0815529 0.609269(R,m,v=1,1,0)
=>WM: (14626: S1 ^operator O2082)

  1041:    O: O2082 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1041 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1040 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14627: I3 ^predict-no N1041)
<=WM: (14615: N1040 ^status complete)
<=WM: (14614: I3 ^predict-yes N1040)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/--- Input Phase --- 
=>WM: (14631: I2 ^dir R)
=>WM: (14630: I2 ^reward 1)
=>WM: (14629: I2 ^see 0)
=>WM: (14628: N1041 ^status complete)
<=WM: (14618: I2 ^dir L)
<=WM: (14617: I2 ^reward 1)
<=WM: (14616: I2 ^see 1)
=>WM: (14632: I2 ^level-1 L0-root)
<=WM: (14619: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2081 = 0.8783973744177012)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1045 ^value 1 +)
 (R1 ^reward R1045 +)
Firing propose*predict-yes
 -->
 (O2083 ^name predict-yes +)
 (S1 ^operator O2083 +)
Firing propose*predict-no
 -->
 (O2084 ^name predict-no +)
 (S1 ^operator O2084 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2082 = 0.9999934438786788)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2081 = 0.1215966971063918)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2082 ^name predict-no +)
 (S1 ^operator O2082 +)
Retracting propose*predict-yes
 -->
 (O2081 ^name predict-yes +)
 (S1 ^operator O2081 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1044 ^value 1 +)
 (R1 ^reward R1044 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2082 = 0.6855213227180397)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2082 = 0.314500061238283)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2081 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2081 = 0.3907618357131554)
=>WM: (14640: S1 ^operator O2084 +)
=>WM: (14639: S1 ^operator O2083 +)
=>WM: (14638: I3 ^dir R)
=>WM: (14637: O2084 ^name predict-no)
=>WM: (14636: O2083 ^name predict-yes)
=>WM: (14635: R1045 ^value 1)
=>WM: (14634: R1 ^reward R1045)
=>WM: (14633: I3 ^see 0)
<=WM: (14624: S1 ^operator O2081 +)
<=WM: (14625: S1 ^operator O2082 +)
<=WM: (14626: S1 ^operator O2082)
<=WM: (14610: I3 ^dir L)
<=WM: (14620: R1 ^reward R1044)
<=WM: (14591: I3 ^see 1)
<=WM: (14623: O2082 ^name predict-no)
<=WM: (14622: O2081 ^name predict-yes)
<=WM: (14621: R1044 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2083 = 0.1215966971063918)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2083 = 0.8783973744177012)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2084 = 0.9999934438786788)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2082 = 0.9999934438786788)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2081 = 0.1215966971063918)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2081 = 0.8783973744177012)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478549 -0.164049 0.3145 -> 0.478547 -0.164049 0.314498(R,m,v=1,0.925466,0.0694099)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521471 0.164051 0.685521 -> 0.521469 0.16405 0.685519(R,m,v=1,1,0)
=>WM: (14641: S1 ^operator O2083)

  1042:    O: O2083 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1042 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1041 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14642: I3 ^predict-yes N1042)
<=WM: (14628: N1041 ^status complete)
<=WM: (14627: I3 ^predict-no N1041)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (14646: I2 ^dir R)
=>WM: (14645: I2 ^reward 1)
=>WM: (14644: I2 ^see 1)
=>WM: (14643: N1042 ^status complete)
<=WM: (14631: I2 ^dir R)
<=WM: (14630: I2 ^reward 1)
<=WM: (14629: I2 ^see 0)
=>WM: (14647: I2 ^level-1 R1-root)
<=WM: (14632: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2083 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1046 ^value 1 +)
 (R1 ^reward R1046 +)
Firing propose*predict-yes
 -->
 (O2085 ^name predict-yes +)
 (S1 ^operator O2085 +)
Firing propose*predict-no
 -->
 (O2086 ^name predict-no +)
 (S1 ^operator O2086 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2084 = 0.9999934438786788)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2083 = 0.1215966971063918)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2084 ^name predict-no +)
 (S1 ^operator O2084 +)
Retracting propose*predict-yes
 -->
 (O2083 ^name predict-yes +)
 (S1 ^operator O2083 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1045 ^value 1 +)
 (R1 ^reward R1045 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2084 = 0.9999934438786788)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2083 = 0.8783973744177012)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2083 = 0.1215966971063918)
=>WM: (14654: S1 ^operator O2086 +)
=>WM: (14653: S1 ^operator O2085 +)
=>WM: (14652: O2086 ^name predict-no)
=>WM: (14651: O2085 ^name predict-yes)
=>WM: (14650: R1046 ^value 1)
=>WM: (14649: R1 ^reward R1046)
=>WM: (14648: I3 ^see 1)
<=WM: (14639: S1 ^operator O2083 +)
<=WM: (14641: S1 ^operator O2083)
<=WM: (14640: S1 ^operator O2084 +)
<=WM: (14634: R1 ^reward R1045)
<=WM: (14633: I3 ^see 0)
<=WM: (14637: O2084 ^name predict-no)
<=WM: (14636: O2083 ^name predict-yes)
<=WM: (14635: R1045 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2085 = 0.1215966971063918)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2085 = -0.04253361215288998)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2086 = 0.9999934438786788)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2084 = 0.9999934438786788)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2083 = 0.1215966971063918)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2083 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.870968,0.11299)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465472 0.412925 0.878397 -> 0.465473 0.412925 0.878398(R,m,v=1,1,0)
=>WM: (14655: S1 ^operator O2086)

  1043:    O: O2086 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1043 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1042 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14656: I3 ^predict-no N1043)
<=WM: (14643: N1042 ^status complete)
<=WM: (14642: I3 ^predict-yes N1042)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14660: I2 ^dir R)
=>WM: (14659: I2 ^reward 1)
=>WM: (14658: I2 ^see 0)
=>WM: (14657: N1043 ^status complete)
<=WM: (14646: I2 ^dir R)
<=WM: (14645: I2 ^reward 1)
<=WM: (14644: I2 ^see 1)
=>WM: (14661: I2 ^level-1 R0-root)
<=WM: (14647: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2085 = -0.1512366769350551)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1047 ^value 1 +)
 (R1 ^reward R1047 +)
Firing propose*predict-yes
 -->
 (O2087 ^name predict-yes +)
 (S1 ^operator O2087 +)
Firing propose*predict-no
 -->
 (O2088 ^name predict-no +)
 (S1 ^operator O2088 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2086 = 0.9999934438786788)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2085 = 0.1215971732320855)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2086 ^name predict-no +)
 (S1 ^operator O2086 +)
Retracting propose*predict-yes
 -->
 (O2085 ^name predict-yes +)
 (S1 ^operator O2085 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1046 ^value 1 +)
 (R1 ^reward R1046 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2086 = 0.9999934438786788)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2085 = -0.04253361215288998)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2085 = 0.1215971732320855)
=>WM: (14668: S1 ^operator O2088 +)
=>WM: (14667: S1 ^operator O2087 +)
=>WM: (14666: O2088 ^name predict-no)
=>WM: (14665: O2087 ^name predict-yes)
=>WM: (14664: R1047 ^value 1)
=>WM: (14663: R1 ^reward R1047)
=>WM: (14662: I3 ^see 0)
<=WM: (14653: S1 ^operator O2085 +)
<=WM: (14654: S1 ^operator O2086 +)
<=WM: (14655: S1 ^operator O2086)
<=WM: (14649: R1 ^reward R1046)
<=WM: (14648: I3 ^see 1)
<=WM: (14652: O2086 ^name predict-no)
<=WM: (14651: O2085 ^name predict-yes)
<=WM: (14650: R1046 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2087 = 0.1215971732320855)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2087 = -0.1512366769350551)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2088 = 0.9999934438786788)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2086 = 0.9999934438786788)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2085 = 0.1215971732320855)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2085 = -0.1512366769350551)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999993 0 0.999993 -> 0.999995 0 0.999995(R,m,v=1,0.939227,0.0573972)
=>WM: (14669: S1 ^operator O2088)

  1044:    O: O2088 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1044 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1043 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14670: I3 ^predict-no N1044)
<=WM: (14657: N1043 ^status complete)
<=WM: (14656: I3 ^predict-no N1043)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14674: I2 ^dir U)
=>WM: (14673: I2 ^reward 1)
=>WM: (14672: I2 ^see 0)
=>WM: (14671: N1044 ^status complete)
<=WM: (14660: I2 ^dir R)
<=WM: (14659: I2 ^reward 1)
<=WM: (14658: I2 ^see 0)
=>WM: (14675: I2 ^level-1 R0-root)
<=WM: (14661: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1048 ^value 1 +)
 (R1 ^reward R1048 +)
Firing propose*predict-yes
 -->
 (O2089 ^name predict-yes +)
 (S1 ^operator O2089 +)
Firing propose*predict-no
 -->
 (O2090 ^name predict-no +)
 (S1 ^operator O2090 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2088 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2087 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2088 ^name predict-no +)
 (S1 ^operator O2088 +)
Retracting propose*predict-yes
 -->
 (O2087 ^name predict-yes +)
 (S1 ^operator O2087 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1047 ^value 1 +)
 (R1 ^reward R1047 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2088 = 0.999994501574002)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2087 = -0.1512366769350551)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2087 = 0.1215971732320855)
=>WM: (14682: S1 ^operator O2090 +)
=>WM: (14681: S1 ^operator O2089 +)
=>WM: (14680: I3 ^dir U)
=>WM: (14679: O2090 ^name predict-no)
=>WM: (14678: O2089 ^name predict-yes)
=>WM: (14677: R1048 ^value 1)
=>WM: (14676: R1 ^reward R1048)
<=WM: (14667: S1 ^operator O2087 +)
<=WM: (14668: S1 ^operator O2088 +)
<=WM: (14669: S1 ^operator O2088)
<=WM: (14638: I3 ^dir R)
<=WM: (14663: R1 ^reward R1047)
<=WM: (14666: O2088 ^name predict-no)
<=WM: (14665: O2087 ^name predict-yes)
<=WM: (14664: R1047 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2089 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2090 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2088 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2087 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999995 0 0.999995 -> 0.999995 0 0.999995(R,m,v=1,0.93956,0.0571004)
=>WM: (14683: S1 ^operator O2090)

  1045:    O: O2090 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1045 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1044 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14684: I3 ^predict-no N1045)
<=WM: (14671: N1044 ^status complete)
<=WM: (14670: I3 ^predict-no N1044)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14688: I2 ^dir R)
=>WM: (14687: I2 ^reward 1)
=>WM: (14686: I2 ^see 0)
=>WM: (14685: N1045 ^status complete)
<=WM: (14674: I2 ^dir U)
<=WM: (14673: I2 ^reward 1)
<=WM: (14672: I2 ^see 0)
=>WM: (14689: I2 ^level-1 R0-root)
<=WM: (14675: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2089 = -0.1512366769350551)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1049 ^value 1 +)
 (R1 ^reward R1049 +)
Firing propose*predict-yes
 -->
 (O2091 ^name predict-yes +)
 (S1 ^operator O2091 +)
Firing propose*predict-no
 -->
 (O2092 ^name predict-no +)
 (S1 ^operator O2092 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2090 = 0.9999953878441619)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2089 = 0.1215971732320855)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2090 ^name predict-no +)
 (S1 ^operator O2090 +)
Retracting propose*predict-yes
 -->
 (O2089 ^name predict-yes +)
 (S1 ^operator O2089 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1048 ^value 1 +)
 (R1 ^reward R1048 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2090 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2089 = 0.)
=>WM: (14696: S1 ^operator O2092 +)
=>WM: (14695: S1 ^operator O2091 +)
=>WM: (14694: I3 ^dir R)
=>WM: (14693: O2092 ^name predict-no)
=>WM: (14692: O2091 ^name predict-yes)
=>WM: (14691: R1049 ^value 1)
=>WM: (14690: R1 ^reward R1049)
<=WM: (14681: S1 ^operator O2089 +)
<=WM: (14682: S1 ^operator O2090 +)
<=WM: (14683: S1 ^operator O2090)
<=WM: (14680: I3 ^dir U)
<=WM: (14676: R1 ^reward R1048)
<=WM: (14679: O2090 ^name predict-no)
<=WM: (14678: O2089 ^name predict-yes)
<=WM: (14677: R1048 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2091 = -0.1512366769350551)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2091 = 0.1215971732320855)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2092 = 0.9999953878441619)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2090 = 0.9999953878441619)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2089 = 0.1215971732320855)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2089 = -0.1512366769350551)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14697: S1 ^operator O2092)

  1046:    O: O2092 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1046 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1045 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14698: I3 ^predict-no N1046)
<=WM: (14685: N1045 ^status complete)
<=WM: (14684: I3 ^predict-no N1045)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14702: I2 ^dir U)
=>WM: (14701: I2 ^reward 1)
=>WM: (14700: I2 ^see 0)
=>WM: (14699: N1046 ^status complete)
<=WM: (14688: I2 ^dir R)
<=WM: (14687: I2 ^reward 1)
<=WM: (14686: I2 ^see 0)
=>WM: (14703: I2 ^level-1 R0-root)
<=WM: (14689: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1050 ^value 1 +)
 (R1 ^reward R1050 +)
Firing propose*predict-yes
 -->
 (O2093 ^name predict-yes +)
 (S1 ^operator O2093 +)
Firing propose*predict-no
 -->
 (O2094 ^name predict-no +)
 (S1 ^operator O2094 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2092 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2091 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2092 ^name predict-no +)
 (S1 ^operator O2092 +)
Retracting propose*predict-yes
 -->
 (O2091 ^name predict-yes +)
 (S1 ^operator O2091 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1049 ^value 1 +)
 (R1 ^reward R1049 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2092 = 0.9999953878441619)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2091 = 0.1215971732320855)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2091 = -0.1512366769350551)
=>WM: (14710: S1 ^operator O2094 +)
=>WM: (14709: S1 ^operator O2093 +)
=>WM: (14708: I3 ^dir U)
=>WM: (14707: O2094 ^name predict-no)
=>WM: (14706: O2093 ^name predict-yes)
=>WM: (14705: R1050 ^value 1)
=>WM: (14704: R1 ^reward R1050)
<=WM: (14695: S1 ^operator O2091 +)
<=WM: (14696: S1 ^operator O2092 +)
<=WM: (14697: S1 ^operator O2092)
<=WM: (14694: I3 ^dir R)
<=WM: (14690: R1 ^reward R1049)
<=WM: (14693: O2092 ^name predict-no)
<=WM: (14692: O2091 ^name predict-yes)
<=WM: (14691: R1049 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2093 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2094 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2092 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2091 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999995 0 0.999995 -> 0.999996 0 0.999996(R,m,v=1,0.939891,0.0568066)
=>WM: (14711: S1 ^operator O2094)

  1047:    O: O2094 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1047 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1046 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14712: I3 ^predict-no N1047)
<=WM: (14699: N1046 ^status complete)
<=WM: (14698: I3 ^predict-no N1046)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (14716: I2 ^dir R)
=>WM: (14715: I2 ^reward 1)
=>WM: (14714: I2 ^see 0)
=>WM: (14713: N1047 ^status complete)
<=WM: (14702: I2 ^dir U)
<=WM: (14701: I2 ^reward 1)
<=WM: (14700: I2 ^see 0)
=>WM: (14717: I2 ^level-1 R0-root)
<=WM: (14703: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2093 = -0.1512366769350551)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1051 ^value 1 +)
 (R1 ^reward R1051 +)
Firing propose*predict-yes
 -->
 (O2095 ^name predict-yes +)
 (S1 ^operator O2095 +)
Firing propose*predict-no
 -->
 (O2096 ^name predict-no +)
 (S1 ^operator O2096 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2094 = 0.9999961306038242)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2093 = 0.1215971732320855)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2094 ^name predict-no +)
 (S1 ^operator O2094 +)
Retracting propose*predict-yes
 -->
 (O2093 ^name predict-yes +)
 (S1 ^operator O2093 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1050 ^value 1 +)
 (R1 ^reward R1050 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2094 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2093 = 0.)
=>WM: (14724: S1 ^operator O2096 +)
=>WM: (14723: S1 ^operator O2095 +)
=>WM: (14722: I3 ^dir R)
=>WM: (14721: O2096 ^name predict-no)
=>WM: (14720: O2095 ^name predict-yes)
=>WM: (14719: R1051 ^value 1)
=>WM: (14718: R1 ^reward R1051)
<=WM: (14709: S1 ^operator O2093 +)
<=WM: (14710: S1 ^operator O2094 +)
<=WM: (14711: S1 ^operator O2094)
<=WM: (14708: I3 ^dir U)
<=WM: (14704: R1 ^reward R1050)
<=WM: (14707: O2094 ^name predict-no)
<=WM: (14706: O2093 ^name predict-yes)
<=WM: (14705: R1050 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2095 = -0.1512366769350551)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2095 = 0.1215971732320855)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2096 = 0.9999961306038242)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2094 = 0.9999961306038242)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2093 = 0.1215971732320855)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2093 = -0.1512366769350551)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14725: S1 ^operator O2096)

  1048:    O: O2096 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1048 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1047 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14726: I3 ^predict-no N1048)
<=WM: (14713: N1047 ^status complete)
<=WM: (14712: I3 ^predict-no N1047)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14730: I2 ^dir R)
=>WM: (14729: I2 ^reward 1)
=>WM: (14728: I2 ^see 0)
=>WM: (14727: N1048 ^status complete)
<=WM: (14716: I2 ^dir R)
<=WM: (14715: I2 ^reward 1)
<=WM: (14714: I2 ^see 0)
=>WM: (14731: I2 ^level-1 R0-root)
<=WM: (14717: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2095 = -0.1512366769350551)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1052 ^value 1 +)
 (R1 ^reward R1052 +)
Firing propose*predict-yes
 -->
 (O2097 ^name predict-yes +)
 (S1 ^operator O2097 +)
Firing propose*predict-no
 -->
 (O2098 ^name predict-no +)
 (S1 ^operator O2098 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2096 = 0.9999961306038242)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2095 = 0.1215971732320855)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2096 ^name predict-no +)
 (S1 ^operator O2096 +)
Retracting propose*predict-yes
 -->
 (O2095 ^name predict-yes +)
 (S1 ^operator O2095 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1051 ^value 1 +)
 (R1 ^reward R1051 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2096 = 0.9999961306038242)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2095 = 0.1215971732320855)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2095 = -0.1512366769350551)
=>WM: (14737: S1 ^operator O2098 +)
=>WM: (14736: S1 ^operator O2097 +)
=>WM: (14735: O2098 ^name predict-no)
=>WM: (14734: O2097 ^name predict-yes)
=>WM: (14733: R1052 ^value 1)
=>WM: (14732: R1 ^reward R1052)
<=WM: (14723: S1 ^operator O2095 +)
<=WM: (14724: S1 ^operator O2096 +)
<=WM: (14725: S1 ^operator O2096)
<=WM: (14718: R1 ^reward R1051)
<=WM: (14721: O2096 ^name predict-no)
<=WM: (14720: O2095 ^name predict-yes)
<=WM: (14719: R1051 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2097 = -0.1512366769350551)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2097 = 0.1215971732320855)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2098 = 0.9999961306038242)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2096 = 0.9999961306038242)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2095 = 0.1215971732320855)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2095 = -0.1512366769350551)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999996 0 0.999996 -> 0.999997 0 0.999997(R,m,v=1,0.940217,0.0565158)
=>WM: (14738: S1 ^operator O2098)

  1049:    O: O2098 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1049 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1048 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14739: I3 ^predict-no N1049)
<=WM: (14727: N1048 ^status complete)
<=WM: (14726: I3 ^predict-no N1048)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (14743: I2 ^dir L)
=>WM: (14742: I2 ^reward 1)
=>WM: (14741: I2 ^see 0)
=>WM: (14740: N1049 ^status complete)
<=WM: (14730: I2 ^dir R)
<=WM: (14729: I2 ^reward 1)
<=WM: (14728: I2 ^see 0)
=>WM: (14744: I2 ^level-1 R0-root)
<=WM: (14731: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2098 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2097 = 0.6091357162190356)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1053 ^value 1 +)
 (R1 ^reward R1053 +)
Firing propose*predict-yes
 -->
 (O2099 ^name predict-yes +)
 (S1 ^operator O2099 +)
Firing propose*predict-no
 -->
 (O2100 ^name predict-no +)
 (S1 ^operator O2100 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2098 = 0.314498303095341)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2097 = 0.3907618357131554)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2098 ^name predict-no +)
 (S1 ^operator O2098 +)
Retracting propose*predict-yes
 -->
 (O2097 ^name predict-yes +)
 (S1 ^operator O2097 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1052 ^value 1 +)
 (R1 ^reward R1052 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2098 = 0.9999967532001512)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2097 = 0.1215971732320855)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2097 = -0.1512366769350551)
=>WM: (14751: S1 ^operator O2100 +)
=>WM: (14750: S1 ^operator O2099 +)
=>WM: (14749: I3 ^dir L)
=>WM: (14748: O2100 ^name predict-no)
=>WM: (14747: O2099 ^name predict-yes)
=>WM: (14746: R1053 ^value 1)
=>WM: (14745: R1 ^reward R1053)
<=WM: (14736: S1 ^operator O2097 +)
<=WM: (14737: S1 ^operator O2098 +)
<=WM: (14738: S1 ^operator O2098)
<=WM: (14722: I3 ^dir R)
<=WM: (14732: R1 ^reward R1052)
<=WM: (14735: O2098 ^name predict-no)
<=WM: (14734: O2097 ^name predict-yes)
<=WM: (14733: R1052 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2099 = 0.6091357162190356)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2099 = 0.3907618357131554)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2100 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2100 = 0.314498303095341)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2098 = 0.314498303095341)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2098 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2097 = 0.3907618357131554)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2097 = 0.6091357162190356)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999997 0 0.999997 -> 0.999997 0 0.999997(R,m,v=1,0.940541,0.056228)
=>WM: (14752: S1 ^operator O2099)

  1050:    O: O2099 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1050 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1049 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14753: I3 ^predict-yes N1050)
<=WM: (14740: N1049 ^status complete)
<=WM: (14739: I3 ^predict-no N1049)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14757: I2 ^dir U)
=>WM: (14756: I2 ^reward 1)
=>WM: (14755: I2 ^see 1)
=>WM: (14754: N1050 ^status complete)
<=WM: (14743: I2 ^dir L)
<=WM: (14742: I2 ^reward 1)
<=WM: (14741: I2 ^see 0)
=>WM: (14758: I2 ^level-1 L1-root)
<=WM: (14744: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1054 ^value 1 +)
 (R1 ^reward R1054 +)
Firing propose*predict-yes
 -->
 (O2101 ^name predict-yes +)
 (S1 ^operator O2101 +)
Firing propose*predict-no
 -->
 (O2102 ^name predict-no +)
 (S1 ^operator O2102 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2100 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2099 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2100 ^name predict-no +)
 (S1 ^operator O2100 +)
Retracting propose*predict-yes
 -->
 (O2099 ^name predict-yes +)
 (S1 ^operator O2099 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1053 ^value 1 +)
 (R1 ^reward R1053 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2100 = 0.314498303095341)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2100 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2099 = 0.3907618357131554)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2099 = 0.6091357162190356)
=>WM: (14766: S1 ^operator O2102 +)
=>WM: (14765: S1 ^operator O2101 +)
=>WM: (14764: I3 ^dir U)
=>WM: (14763: O2102 ^name predict-no)
=>WM: (14762: O2101 ^name predict-yes)
=>WM: (14761: R1054 ^value 1)
=>WM: (14760: R1 ^reward R1054)
=>WM: (14759: I3 ^see 1)
<=WM: (14750: S1 ^operator O2099 +)
<=WM: (14752: S1 ^operator O2099)
<=WM: (14751: S1 ^operator O2100 +)
<=WM: (14749: I3 ^dir L)
<=WM: (14745: R1 ^reward R1053)
<=WM: (14662: I3 ^see 0)
<=WM: (14748: O2100 ^name predict-no)
<=WM: (14747: O2099 ^name predict-yes)
<=WM: (14746: R1053 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2101 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2102 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2100 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2099 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.47231 -0.0815483 0.390762 -> 0.472317 -0.081547 0.39077(R,m,v=1,0.947059,0.0504351)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527603 0.0815331 0.609136 -> 0.527611 0.0815345 0.609145(R,m,v=1,1,0)
=>WM: (14767: S1 ^operator O2102)

  1051:    O: O2102 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1051 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1050 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14768: I3 ^predict-no N1051)
<=WM: (14754: N1050 ^status complete)
<=WM: (14753: I3 ^predict-yes N1050)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|--- Input Phase --- 
=>WM: (14772: I2 ^dir R)
=>WM: (14771: I2 ^reward 1)
=>WM: (14770: I2 ^see 0)
=>WM: (14769: N1051 ^status complete)
<=WM: (14757: I2 ^dir U)
<=WM: (14756: I2 ^reward 1)
<=WM: (14755: I2 ^see 1)
=>WM: (14773: I2 ^level-1 L1-root)
<=WM: (14758: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2101 = 0.8784092909945846)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1055 ^value 1 +)
 (R1 ^reward R1055 +)
Firing propose*predict-yes
 -->
 (O2103 ^name predict-yes +)
 (S1 ^operator O2103 +)
Firing propose*predict-no
 -->
 (O2104 ^name predict-no +)
 (S1 ^operator O2104 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2102 = 0.9999972751638363)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2101 = 0.1215971732320855)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2102 ^name predict-no +)
 (S1 ^operator O2102 +)
Retracting propose*predict-yes
 -->
 (O2101 ^name predict-yes +)
 (S1 ^operator O2101 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1054 ^value 1 +)
 (R1 ^reward R1054 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2102 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2101 = 0.)
=>WM: (14781: S1 ^operator O2104 +)
=>WM: (14780: S1 ^operator O2103 +)
=>WM: (14779: I3 ^dir R)
=>WM: (14778: O2104 ^name predict-no)
=>WM: (14777: O2103 ^name predict-yes)
=>WM: (14776: R1055 ^value 1)
=>WM: (14775: R1 ^reward R1055)
=>WM: (14774: I3 ^see 0)
<=WM: (14765: S1 ^operator O2101 +)
<=WM: (14766: S1 ^operator O2102 +)
<=WM: (14767: S1 ^operator O2102)
<=WM: (14764: I3 ^dir U)
<=WM: (14760: R1 ^reward R1054)
<=WM: (14759: I3 ^see 1)
<=WM: (14763: O2102 ^name predict-no)
<=WM: (14762: O2101 ^name predict-yes)
<=WM: (14761: R1054 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2103 = 0.8784092909945846)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2103 = 0.1215971732320855)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2104 = 0.9999972751638363)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2102 = 0.9999972751638363)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2101 = 0.1215971732320855)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2101 = 0.8784092909945846)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14782: S1 ^operator O2103)

  1052:    O: O2103 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1052 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1051 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14783: I3 ^predict-yes N1052)
<=WM: (14769: N1051 ^status complete)
<=WM: (14768: I3 ^predict-no N1051)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14787: I2 ^dir L)
=>WM: (14786: I2 ^reward 1)
=>WM: (14785: I2 ^see 1)
=>WM: (14784: N1052 ^status complete)
<=WM: (14772: I2 ^dir R)
<=WM: (14771: I2 ^reward 1)
<=WM: (14770: I2 ^see 0)
=>WM: (14788: I2 ^level-1 R1-root)
<=WM: (14773: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2104 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2103 = 0.6092694841640142)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1056 ^value 1 +)
 (R1 ^reward R1056 +)
Firing propose*predict-yes
 -->
 (O2105 ^name predict-yes +)
 (S1 ^operator O2105 +)
Firing propose*predict-no
 -->
 (O2106 ^name predict-no +)
 (S1 ^operator O2106 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2104 = 0.314498303095341)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2103 = 0.3907701841024368)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2104 ^name predict-no +)
 (S1 ^operator O2104 +)
Retracting propose*predict-yes
 -->
 (O2103 ^name predict-yes +)
 (S1 ^operator O2103 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1055 ^value 1 +)
 (R1 ^reward R1055 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2104 = 0.9999972751638363)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2103 = 0.1215971732320855)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2103 = 0.8784092909945846)
=>WM: (14796: S1 ^operator O2106 +)
=>WM: (14795: S1 ^operator O2105 +)
=>WM: (14794: I3 ^dir L)
=>WM: (14793: O2106 ^name predict-no)
=>WM: (14792: O2105 ^name predict-yes)
=>WM: (14791: R1056 ^value 1)
=>WM: (14790: R1 ^reward R1056)
=>WM: (14789: I3 ^see 1)
<=WM: (14780: S1 ^operator O2103 +)
<=WM: (14782: S1 ^operator O2103)
<=WM: (14781: S1 ^operator O2104 +)
<=WM: (14779: I3 ^dir R)
<=WM: (14775: R1 ^reward R1055)
<=WM: (14774: I3 ^see 0)
<=WM: (14778: O2104 ^name predict-no)
<=WM: (14777: O2103 ^name predict-yes)
<=WM: (14776: R1055 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2105 = 0.3907701841024368)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2105 = 0.6092694841640142)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2106 = 0.314498303095341)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2106 = -0.168718511744511)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2104 = 0.314498303095341)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2104 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2103 = 0.3907701841024368)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2103 = 0.6092694841640142)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.871658,0.112472)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465482 0.412927 0.878409 -> 0.465481 0.412927 0.878409(R,m,v=1,1,0)
=>WM: (14797: S1 ^operator O2105)

  1053:    O: O2105 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1053 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1052 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14798: I3 ^predict-yes N1053)
<=WM: (14784: N1052 ^status complete)
<=WM: (14783: I3 ^predict-yes N1052)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14802: I2 ^dir U)
=>WM: (14801: I2 ^reward 1)
=>WM: (14800: I2 ^see 1)
=>WM: (14799: N1053 ^status complete)
<=WM: (14787: I2 ^dir L)
<=WM: (14786: I2 ^reward 1)
<=WM: (14785: I2 ^see 1)
=>WM: (14803: I2 ^level-1 L1-root)
<=WM: (14788: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1057 ^value 1 +)
 (R1 ^reward R1057 +)
Firing propose*predict-yes
 -->
 (O2107 ^name predict-yes +)
 (S1 ^operator O2107 +)
Firing propose*predict-no
 -->
 (O2108 ^name predict-no +)
 (S1 ^operator O2108 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2106 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2105 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2106 ^name predict-no +)
 (S1 ^operator O2106 +)
Retracting propose*predict-yes
 -->
 (O2105 ^name predict-yes +)
 (S1 ^operator O2105 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1056 ^value 1 +)
 (R1 ^reward R1056 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2106 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2106 = 0.314498303095341)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2105 = 0.6092694841640142)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2105 = 0.3907701841024368)
=>WM: (14810: S1 ^operator O2108 +)
=>WM: (14809: S1 ^operator O2107 +)
=>WM: (14808: I3 ^dir U)
=>WM: (14807: O2108 ^name predict-no)
=>WM: (14806: O2107 ^name predict-yes)
=>WM: (14805: R1057 ^value 1)
=>WM: (14804: R1 ^reward R1057)
<=WM: (14795: S1 ^operator O2105 +)
<=WM: (14797: S1 ^operator O2105)
<=WM: (14796: S1 ^operator O2106 +)
<=WM: (14794: I3 ^dir L)
<=WM: (14790: R1 ^reward R1056)
<=WM: (14793: O2106 ^name predict-no)
<=WM: (14792: O2105 ^name predict-yes)
<=WM: (14791: R1056 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2107 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2108 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2106 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2105 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472317 -0.081547 0.39077 -> 0.472314 -0.0815475 0.390767(R,m,v=1,0.947368,0.0501548)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527717 0.0815529 0.609269 -> 0.527713 0.0815524 0.609266(R,m,v=1,1,0)
=>WM: (14811: S1 ^operator O2108)

  1054:    O: O2108 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1054 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1053 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14812: I3 ^predict-no N1054)
<=WM: (14799: N1053 ^status complete)
<=WM: (14798: I3 ^predict-yes N1053)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14816: I2 ^dir R)
=>WM: (14815: I2 ^reward 1)
=>WM: (14814: I2 ^see 0)
=>WM: (14813: N1054 ^status complete)
<=WM: (14802: I2 ^dir U)
<=WM: (14801: I2 ^reward 1)
<=WM: (14800: I2 ^see 1)
=>WM: (14817: I2 ^level-1 L1-root)
<=WM: (14803: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2107 = 0.8784086918391858)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1058 ^value 1 +)
 (R1 ^reward R1058 +)
Firing propose*predict-yes
 -->
 (O2109 ^name predict-yes +)
 (S1 ^operator O2109 +)
Firing propose*predict-no
 -->
 (O2110 ^name predict-no +)
 (S1 ^operator O2110 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2108 = 0.9999972751638363)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2107 = 0.1215966545261001)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2108 ^name predict-no +)
 (S1 ^operator O2108 +)
Retracting propose*predict-yes
 -->
 (O2107 ^name predict-yes +)
 (S1 ^operator O2107 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1057 ^value 1 +)
 (R1 ^reward R1057 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2108 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2107 = 0.)
=>WM: (14825: S1 ^operator O2110 +)
=>WM: (14824: S1 ^operator O2109 +)
=>WM: (14823: I3 ^dir R)
=>WM: (14822: O2110 ^name predict-no)
=>WM: (14821: O2109 ^name predict-yes)
=>WM: (14820: R1058 ^value 1)
=>WM: (14819: R1 ^reward R1058)
=>WM: (14818: I3 ^see 0)
<=WM: (14809: S1 ^operator O2107 +)
<=WM: (14810: S1 ^operator O2108 +)
<=WM: (14811: S1 ^operator O2108)
<=WM: (14808: I3 ^dir U)
<=WM: (14804: R1 ^reward R1057)
<=WM: (14789: I3 ^see 1)
<=WM: (14807: O2108 ^name predict-no)
<=WM: (14806: O2107 ^name predict-yes)
<=WM: (14805: R1057 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2109 = 0.8784086918391858)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2109 = 0.1215966545261001)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2110 = 0.9999972751638363)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2108 = 0.9999972751638363)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2107 = 0.1215966545261001)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2107 = 0.8784086918391858)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14826: S1 ^operator O2109)

  1055:    O: O2109 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1055 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1054 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14827: I3 ^predict-yes N1055)
<=WM: (14813: N1054 ^status complete)
<=WM: (14812: I3 ^predict-no N1054)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (14831: I2 ^dir R)
=>WM: (14830: I2 ^reward 1)
=>WM: (14829: I2 ^see 1)
=>WM: (14828: N1055 ^status complete)
<=WM: (14816: I2 ^dir R)
<=WM: (14815: I2 ^reward 1)
<=WM: (14814: I2 ^see 0)
=>WM: (14832: I2 ^level-1 R1-root)
<=WM: (14817: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2109 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1059 ^value 1 +)
 (R1 ^reward R1059 +)
Firing propose*predict-yes
 -->
 (O2111 ^name predict-yes +)
 (S1 ^operator O2111 +)
Firing propose*predict-no
 -->
 (O2112 ^name predict-no +)
 (S1 ^operator O2112 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2110 = 0.9999972751638363)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2109 = 0.1215966545261001)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2110 ^name predict-no +)
 (S1 ^operator O2110 +)
Retracting propose*predict-yes
 -->
 (O2109 ^name predict-yes +)
 (S1 ^operator O2109 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1058 ^value 1 +)
 (R1 ^reward R1058 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2110 = 0.9999972751638363)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2109 = 0.1215966545261001)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2109 = 0.8784086918391858)
=>WM: (14839: S1 ^operator O2112 +)
=>WM: (14838: S1 ^operator O2111 +)
=>WM: (14837: O2112 ^name predict-no)
=>WM: (14836: O2111 ^name predict-yes)
=>WM: (14835: R1059 ^value 1)
=>WM: (14834: R1 ^reward R1059)
=>WM: (14833: I3 ^see 1)
<=WM: (14824: S1 ^operator O2109 +)
<=WM: (14826: S1 ^operator O2109)
<=WM: (14825: S1 ^operator O2110 +)
<=WM: (14819: R1 ^reward R1058)
<=WM: (14818: I3 ^see 0)
<=WM: (14822: O2110 ^name predict-no)
<=WM: (14821: O2109 ^name predict-yes)
<=WM: (14820: R1058 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2111 = 0.1215966545261001)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2111 = -0.04253361215288998)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2112 = 0.9999972751638363)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2110 = 0.9999972751638363)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2109 = 0.1215966545261001)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2109 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121596(R,m,v=1,0.87234,0.111958)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465481 0.412927 0.878409 -> 0.465481 0.412927 0.878408(R,m,v=1,1,0)
=>WM: (14840: S1 ^operator O2112)

  1056:    O: O2112 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1056 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1055 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14841: I3 ^predict-no N1056)
<=WM: (14828: N1055 ^status complete)
<=WM: (14827: I3 ^predict-yes N1055)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14845: I2 ^dir U)
=>WM: (14844: I2 ^reward 1)
=>WM: (14843: I2 ^see 0)
=>WM: (14842: N1056 ^status complete)
<=WM: (14831: I2 ^dir R)
<=WM: (14830: I2 ^reward 1)
<=WM: (14829: I2 ^see 1)
=>WM: (14846: I2 ^level-1 R0-root)
<=WM: (14832: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1060 ^value 1 +)
 (R1 ^reward R1060 +)
Firing propose*predict-yes
 -->
 (O2113 ^name predict-yes +)
 (S1 ^operator O2113 +)
Firing propose*predict-no
 -->
 (O2114 ^name predict-no +)
 (S1 ^operator O2114 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2112 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2111 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2112 ^name predict-no +)
 (S1 ^operator O2112 +)
Retracting propose*predict-yes
 -->
 (O2111 ^name predict-yes +)
 (S1 ^operator O2111 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1059 ^value 1 +)
 (R1 ^reward R1059 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2112 = 0.9999972751638363)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2111 = -0.04253361215288998)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2111 = 0.1215962258870366)
=>WM: (14854: S1 ^operator O2114 +)
=>WM: (14853: S1 ^operator O2113 +)
=>WM: (14852: I3 ^dir U)
=>WM: (14851: O2114 ^name predict-no)
=>WM: (14850: O2113 ^name predict-yes)
=>WM: (14849: R1060 ^value 1)
=>WM: (14848: R1 ^reward R1060)
=>WM: (14847: I3 ^see 0)
<=WM: (14838: S1 ^operator O2111 +)
<=WM: (14839: S1 ^operator O2112 +)
<=WM: (14840: S1 ^operator O2112)
<=WM: (14823: I3 ^dir R)
<=WM: (14834: R1 ^reward R1059)
<=WM: (14833: I3 ^see 1)
<=WM: (14837: O2112 ^name predict-no)
<=WM: (14836: O2111 ^name predict-yes)
<=WM: (14835: R1059 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2113 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2114 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2112 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2111 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999997 0 0.999997 -> 0.999998 0 0.999998(R,m,v=1,0.94086,0.055943)
=>WM: (14855: S1 ^operator O2114)

  1057:    O: O2114 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1057 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1056 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14856: I3 ^predict-no N1057)
<=WM: (14842: N1056 ^status complete)
<=WM: (14841: I3 ^predict-no N1056)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/--- Input Phase --- 
=>WM: (14860: I2 ^dir L)
=>WM: (14859: I2 ^reward 1)
=>WM: (14858: I2 ^see 0)
=>WM: (14857: N1057 ^status complete)
<=WM: (14845: I2 ^dir U)
<=WM: (14844: I2 ^reward 1)
<=WM: (14843: I2 ^see 0)
=>WM: (14861: I2 ^level-1 R0-root)
<=WM: (14846: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2114 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2113 = 0.6091452119121891)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1061 ^value 1 +)
 (R1 ^reward R1061 +)
Firing propose*predict-yes
 -->
 (O2115 ^name predict-yes +)
 (S1 ^operator O2115 +)
Firing propose*predict-no
 -->
 (O2116 ^name predict-no +)
 (S1 ^operator O2116 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2114 = 0.314498303095341)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2113 = 0.3907669546625557)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2114 ^name predict-no +)
 (S1 ^operator O2114 +)
Retracting propose*predict-yes
 -->
 (O2113 ^name predict-yes +)
 (S1 ^operator O2113 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1060 ^value 1 +)
 (R1 ^reward R1060 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2114 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2113 = 0.)
=>WM: (14868: S1 ^operator O2116 +)
=>WM: (14867: S1 ^operator O2115 +)
=>WM: (14866: I3 ^dir L)
=>WM: (14865: O2116 ^name predict-no)
=>WM: (14864: O2115 ^name predict-yes)
=>WM: (14863: R1061 ^value 1)
=>WM: (14862: R1 ^reward R1061)
<=WM: (14853: S1 ^operator O2113 +)
<=WM: (14854: S1 ^operator O2114 +)
<=WM: (14855: S1 ^operator O2114)
<=WM: (14852: I3 ^dir U)
<=WM: (14848: R1 ^reward R1060)
<=WM: (14851: O2114 ^name predict-no)
<=WM: (14850: O2113 ^name predict-yes)
<=WM: (14849: R1060 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2115 = 0.6091452119121891)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2115 = 0.3907669546625557)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2116 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2116 = 0.314498303095341)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2114 = 0.314498303095341)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2114 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2113 = 0.3907669546625557)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2113 = 0.6091452119121891)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14869: S1 ^operator O2115)

  1058:    O: O2115 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1058 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1057 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14870: I3 ^predict-yes N1058)
<=WM: (14857: N1057 ^status complete)
<=WM: (14856: I3 ^predict-no N1057)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\-/--- Input Phase --- 
=>WM: (14874: I2 ^dir L)
=>WM: (14873: I2 ^reward 1)
=>WM: (14872: I2 ^see 1)
=>WM: (14871: N1058 ^status complete)
<=WM: (14860: I2 ^dir L)
<=WM: (14859: I2 ^reward 1)
<=WM: (14858: I2 ^see 0)
=>WM: (14875: I2 ^level-1 L1-root)
<=WM: (14861: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2115 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2116 = 0.6855193314559108)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1062 ^value 1 +)
 (R1 ^reward R1062 +)
Firing propose*predict-yes
 -->
 (O2117 ^name predict-yes +)
 (S1 ^operator O2117 +)
Firing propose*predict-no
 -->
 (O2118 ^name predict-no +)
 (S1 ^operator O2118 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2116 = 0.314498303095341)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2115 = 0.3907669546625557)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2116 ^name predict-no +)
 (S1 ^operator O2116 +)
Retracting propose*predict-yes
 -->
 (O2115 ^name predict-yes +)
 (S1 ^operator O2115 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1061 ^value 1 +)
 (R1 ^reward R1061 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2116 = 0.314498303095341)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2116 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2115 = 0.3907669546625557)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2115 = 0.6091452119121891)
=>WM: (14882: S1 ^operator O2118 +)
=>WM: (14881: S1 ^operator O2117 +)
=>WM: (14880: O2118 ^name predict-no)
=>WM: (14879: O2117 ^name predict-yes)
=>WM: (14878: R1062 ^value 1)
=>WM: (14877: R1 ^reward R1062)
=>WM: (14876: I3 ^see 1)
<=WM: (14867: S1 ^operator O2115 +)
<=WM: (14869: S1 ^operator O2115)
<=WM: (14868: S1 ^operator O2116 +)
<=WM: (14862: R1 ^reward R1061)
<=WM: (14847: I3 ^see 0)
<=WM: (14865: O2116 ^name predict-no)
<=WM: (14864: O2115 ^name predict-yes)
<=WM: (14863: R1061 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2117 = 0.3907669546625557)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2117 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2118 = 0.314498303095341)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2118 = 0.6855193314559108)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2116 = 0.314498303095341)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2116 = 0.6855193314559108)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2115 = 0.3907669546625557)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2115 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472314 -0.0815475 0.390767 -> 0.472321 -0.0815465 0.390774(R,m,v=1,0.947674,0.0498776)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527611 0.0815345 0.609145 -> 0.527618 0.0815357 0.609153(R,m,v=1,1,0)
=>WM: (14883: S1 ^operator O2118)

  1059:    O: O2118 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1059 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1058 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14884: I3 ^predict-no N1059)
<=WM: (14871: N1058 ^status complete)
<=WM: (14870: I3 ^predict-yes N1058)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14888: I2 ^dir R)
=>WM: (14887: I2 ^reward 1)
=>WM: (14886: I2 ^see 0)
=>WM: (14885: N1059 ^status complete)
<=WM: (14874: I2 ^dir L)
<=WM: (14873: I2 ^reward 1)
<=WM: (14872: I2 ^see 1)
=>WM: (14889: I2 ^level-1 L0-root)
<=WM: (14875: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2117 = 0.8783979318684918)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1063 ^value 1 +)
 (R1 ^reward R1063 +)
Firing propose*predict-yes
 -->
 (O2119 ^name predict-yes +)
 (S1 ^operator O2119 +)
Firing propose*predict-no
 -->
 (O2120 ^name predict-no +)
 (S1 ^operator O2120 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2118 = 0.9999977128360235)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2117 = 0.1215962258870366)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2118 ^name predict-no +)
 (S1 ^operator O2118 +)
Retracting propose*predict-yes
 -->
 (O2117 ^name predict-yes +)
 (S1 ^operator O2117 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1062 ^value 1 +)
 (R1 ^reward R1062 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2118 = 0.6855193314559108)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2118 = 0.314498303095341)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2117 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2117 = 0.3907740985018537)
=>WM: (14897: S1 ^operator O2120 +)
=>WM: (14896: S1 ^operator O2119 +)
=>WM: (14895: I3 ^dir R)
=>WM: (14894: O2120 ^name predict-no)
=>WM: (14893: O2119 ^name predict-yes)
=>WM: (14892: R1063 ^value 1)
=>WM: (14891: R1 ^reward R1063)
=>WM: (14890: I3 ^see 0)
<=WM: (14881: S1 ^operator O2117 +)
<=WM: (14882: S1 ^operator O2118 +)
<=WM: (14883: S1 ^operator O2118)
<=WM: (14866: I3 ^dir L)
<=WM: (14877: R1 ^reward R1062)
<=WM: (14876: I3 ^see 1)
<=WM: (14880: O2118 ^name predict-no)
<=WM: (14879: O2117 ^name predict-yes)
<=WM: (14878: R1062 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2119 = 0.1215962258870366)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2119 = 0.8783979318684918)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2120 = 0.9999977128360235)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2118 = 0.9999977128360235)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2117 = 0.1215962258870366)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2117 = 0.8783979318684918)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478547 -0.164049 0.314498 -> 0.478546 -0.164049 0.314497(R,m,v=1,0.925926,0.0690131)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521469 0.16405 0.685519 -> 0.521467 0.16405 0.685518(R,m,v=1,1,0)
=>WM: (14898: S1 ^operator O2120)

  1060:    O: O2120 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1060 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1059 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14899: I3 ^predict-no N1060)
<=WM: (14885: N1059 ^status complete)
<=WM: (14884: I3 ^predict-no N1059)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, False)
predict error 1
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14903: I2 ^dir U)
=>WM: (14902: I2 ^reward 0)
=>WM: (14901: I2 ^see 1)
=>WM: (14900: N1060 ^status complete)
<=WM: (14888: I2 ^dir R)
<=WM: (14887: I2 ^reward 1)
<=WM: (14886: I2 ^see 0)
=>WM: (14904: I2 ^level-1 R1-root)
<=WM: (14889: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1064 ^value 0 +)
 (R1 ^reward R1064 +)
Firing propose*predict-yes
 -->
 (O2121 ^name predict-yes +)
 (S1 ^operator O2121 +)
Firing propose*predict-no
 -->
 (O2122 ^name predict-no +)
 (S1 ^operator O2122 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2120 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2119 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2120 ^name predict-no +)
 (S1 ^operator O2120 +)
Retracting propose*predict-yes
 -->
 (O2119 ^name predict-yes +)
 (S1 ^operator O2119 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1063 ^value 1 +)
 (R1 ^reward R1063 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2120 = 0.9999977128360235)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2119 = 0.8783979318684918)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2119 = 0.1215962258870366)
=>WM: (14912: S1 ^operator O2122 +)
=>WM: (14911: S1 ^operator O2121 +)
=>WM: (14910: I3 ^dir U)
=>WM: (14909: O2122 ^name predict-no)
=>WM: (14908: O2121 ^name predict-yes)
=>WM: (14907: R1064 ^value 0)
=>WM: (14906: R1 ^reward R1064)
=>WM: (14905: I3 ^see 1)
<=WM: (14896: S1 ^operator O2119 +)
<=WM: (14897: S1 ^operator O2120 +)
<=WM: (14898: S1 ^operator O2120)
<=WM: (14895: I3 ^dir R)
<=WM: (14891: R1 ^reward R1063)
<=WM: (14890: I3 ^see 0)
<=WM: (14894: O2120 ^name predict-no)
<=WM: (14893: O2119 ^name predict-yes)
<=WM: (14892: R1063 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2121 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2122 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2120 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2119 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.999998 0 0.999998 -> 0.839513 0 0.839513(R,m,v=0,0.935829,0.0603761)
=>WM: (14913: S1 ^operator O2122)

  1061:    O: O2122 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1061 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1060 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14914: I3 ^predict-no N1061)
<=WM: (14900: N1060 ^status complete)
<=WM: (14899: I3 ^predict-no N1060)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 0 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
---- Input Phase --- 
=>WM: (14918: I2 ^dir L)
=>WM: (14917: I2 ^reward 1)
=>WM: (14916: I2 ^see 0)
=>WM: (14915: N1061 ^status complete)
<=WM: (14903: I2 ^dir U)
<=WM: (14902: I2 ^reward 0)
<=WM: (14901: I2 ^see 1)
=>WM: (14919: I2 ^level-1 R1-root)
<=WM: (14904: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2122 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2121 = 0.609265798910378)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1065 ^value 1 +)
 (R1 ^reward R1065 +)
Firing propose*predict-yes
 -->
 (O2123 ^name predict-yes +)
 (S1 ^operator O2123 +)
Firing propose*predict-no
 -->
 (O2124 ^name predict-no +)
 (S1 ^operator O2124 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2122 = 0.3144968546951614)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2121 = 0.3907740985018537)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2122 ^name predict-no +)
 (S1 ^operator O2122 +)
Retracting propose*predict-yes
 -->
 (O2121 ^name predict-yes +)
 (S1 ^operator O2121 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1064 ^value 0 +)
 (R1 ^reward R1064 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2122 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2121 = 0.)
=>WM: (14927: S1 ^operator O2124 +)
=>WM: (14926: S1 ^operator O2123 +)
=>WM: (14925: I3 ^dir L)
=>WM: (14924: O2124 ^name predict-no)
=>WM: (14923: O2123 ^name predict-yes)
=>WM: (14922: R1065 ^value 1)
=>WM: (14921: R1 ^reward R1065)
=>WM: (14920: I3 ^see 0)
<=WM: (14911: S1 ^operator O2121 +)
<=WM: (14912: S1 ^operator O2122 +)
<=WM: (14913: S1 ^operator O2122)
<=WM: (14910: I3 ^dir U)
<=WM: (14906: R1 ^reward R1064)
<=WM: (14905: I3 ^see 1)
<=WM: (14909: O2122 ^name predict-no)
<=WM: (14908: O2121 ^name predict-yes)
<=WM: (14907: R1064 ^value 0)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2123 = 0.609265798910378)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2123 = 0.3907740985018537)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2124 = -0.168718511744511)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2124 = 0.3144968546951614)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2122 = 0.3144968546951614)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2122 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2121 = 0.3907740985018537)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2121 = 0.609265798910378)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14928: S1 ^operator O2123)

  1062:    O: O2123 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1062 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1061 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14929: I3 ^predict-yes N1062)
<=WM: (14915: N1061 ^status complete)
<=WM: (14914: I3 ^predict-no N1061)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (14933: I2 ^dir U)
=>WM: (14932: I2 ^reward 1)
=>WM: (14931: I2 ^see 1)
=>WM: (14930: N1062 ^status complete)
<=WM: (14918: I2 ^dir L)
<=WM: (14917: I2 ^reward 1)
<=WM: (14916: I2 ^see 0)
=>WM: (14934: I2 ^level-1 L1-root)
<=WM: (14919: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1066 ^value 1 +)
 (R1 ^reward R1066 +)
Firing propose*predict-yes
 -->
 (O2125 ^name predict-yes +)
 (S1 ^operator O2125 +)
Firing propose*predict-no
 -->
 (O2126 ^name predict-no +)
 (S1 ^operator O2126 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2124 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2123 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2124 ^name predict-no +)
 (S1 ^operator O2124 +)
Retracting propose*predict-yes
 -->
 (O2123 ^name predict-yes +)
 (S1 ^operator O2123 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1065 ^value 1 +)
 (R1 ^reward R1065 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2124 = 0.3144968546951614)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2124 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2123 = 0.3907740985018537)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2123 = 0.609265798910378)
=>WM: (14942: S1 ^operator O2126 +)
=>WM: (14941: S1 ^operator O2125 +)
=>WM: (14940: I3 ^dir U)
=>WM: (14939: O2126 ^name predict-no)
=>WM: (14938: O2125 ^name predict-yes)
=>WM: (14937: R1066 ^value 1)
=>WM: (14936: R1 ^reward R1066)
=>WM: (14935: I3 ^see 1)
<=WM: (14926: S1 ^operator O2123 +)
<=WM: (14928: S1 ^operator O2123)
<=WM: (14927: S1 ^operator O2124 +)
<=WM: (14925: I3 ^dir L)
<=WM: (14921: R1 ^reward R1065)
<=WM: (14920: I3 ^see 0)
<=WM: (14924: O2124 ^name predict-no)
<=WM: (14923: O2123 ^name predict-yes)
<=WM: (14922: R1065 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2125 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2126 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2124 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2123 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472321 -0.0815465 0.390774 -> 0.472318 -0.0815469 0.390771(R,m,v=1,0.947977,0.0496034)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527713 0.0815524 0.609266 -> 0.52771 0.0815518 0.609262(R,m,v=1,1,0)
=>WM: (14943: S1 ^operator O2126)

  1063:    O: O2126 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1063 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1062 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14944: I3 ^predict-no N1063)
<=WM: (14930: N1062 ^status complete)
<=WM: (14929: I3 ^predict-yes N1062)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (14948: I2 ^dir U)
=>WM: (14947: I2 ^reward 1)
=>WM: (14946: I2 ^see 0)
=>WM: (14945: N1063 ^status complete)
<=WM: (14933: I2 ^dir U)
<=WM: (14932: I2 ^reward 1)
<=WM: (14931: I2 ^see 1)
=>WM: (14949: I2 ^level-1 L1-root)
<=WM: (14934: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1067 ^value 1 +)
 (R1 ^reward R1067 +)
Firing propose*predict-yes
 -->
 (O2127 ^name predict-yes +)
 (S1 ^operator O2127 +)
Firing propose*predict-no
 -->
 (O2128 ^name predict-no +)
 (S1 ^operator O2128 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2126 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2125 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2126 ^name predict-no +)
 (S1 ^operator O2126 +)
Retracting propose*predict-yes
 -->
 (O2125 ^name predict-yes +)
 (S1 ^operator O2125 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1066 ^value 1 +)
 (R1 ^reward R1066 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2126 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2125 = 0.)
=>WM: (14956: S1 ^operator O2128 +)
=>WM: (14955: S1 ^operator O2127 +)
=>WM: (14954: O2128 ^name predict-no)
=>WM: (14953: O2127 ^name predict-yes)
=>WM: (14952: R1067 ^value 1)
=>WM: (14951: R1 ^reward R1067)
=>WM: (14950: I3 ^see 0)
<=WM: (14941: S1 ^operator O2125 +)
<=WM: (14942: S1 ^operator O2126 +)
<=WM: (14943: S1 ^operator O2126)
<=WM: (14936: R1 ^reward R1066)
<=WM: (14935: I3 ^see 1)
<=WM: (14939: O2126 ^name predict-no)
<=WM: (14938: O2125 ^name predict-yes)
<=WM: (14937: R1066 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2127 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2128 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2126 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2125 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14957: S1 ^operator O2128)

  1064:    O: O2128 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1064 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1063 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14958: I3 ^predict-no N1064)
<=WM: (14945: N1063 ^status complete)
<=WM: (14944: I3 ^predict-no N1063)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (14962: I2 ^dir L)
=>WM: (14961: I2 ^reward 1)
=>WM: (14960: I2 ^see 0)
=>WM: (14959: N1064 ^status complete)
<=WM: (14948: I2 ^dir U)
<=WM: (14947: I2 ^reward 1)
<=WM: (14946: I2 ^see 0)
=>WM: (14963: I2 ^level-1 L1-root)
<=WM: (14949: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2127 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2128 = 0.6855176931742328)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1068 ^value 1 +)
 (R1 ^reward R1068 +)
Firing propose*predict-yes
 -->
 (O2129 ^name predict-yes +)
 (S1 ^operator O2129 +)
Firing propose*predict-no
 -->
 (O2130 ^name predict-no +)
 (S1 ^operator O2130 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2128 = 0.3144968546951614)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2127 = 0.390770856544958)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2128 ^name predict-no +)
 (S1 ^operator O2128 +)
Retracting propose*predict-yes
 -->
 (O2127 ^name predict-yes +)
 (S1 ^operator O2127 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1067 ^value 1 +)
 (R1 ^reward R1067 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2128 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2127 = 0.)
=>WM: (14970: S1 ^operator O2130 +)
=>WM: (14969: S1 ^operator O2129 +)
=>WM: (14968: I3 ^dir L)
=>WM: (14967: O2130 ^name predict-no)
=>WM: (14966: O2129 ^name predict-yes)
=>WM: (14965: R1068 ^value 1)
=>WM: (14964: R1 ^reward R1068)
<=WM: (14955: S1 ^operator O2127 +)
<=WM: (14956: S1 ^operator O2128 +)
<=WM: (14957: S1 ^operator O2128)
<=WM: (14940: I3 ^dir U)
<=WM: (14951: R1 ^reward R1067)
<=WM: (14954: O2128 ^name predict-no)
<=WM: (14953: O2127 ^name predict-yes)
<=WM: (14952: R1067 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2129 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2129 = 0.390770856544958)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2130 = 0.6855176931742328)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2130 = 0.3144968546951614)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2128 = 0.3144968546951614)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2128 = 0.6855176931742328)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2127 = 0.390770856544958)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2127 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (14971: S1 ^operator O2130)

  1065:    O: O2130 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1065 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1064 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14972: I3 ^predict-no N1065)
<=WM: (14959: N1064 ^status complete)
<=WM: (14958: I3 ^predict-no N1064)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (14976: I2 ^dir R)
=>WM: (14975: I2 ^reward 1)
=>WM: (14974: I2 ^see 0)
=>WM: (14973: N1065 ^status complete)
<=WM: (14962: I2 ^dir L)
<=WM: (14961: I2 ^reward 1)
<=WM: (14960: I2 ^see 0)
=>WM: (14977: I2 ^level-1 L0-root)
<=WM: (14963: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2129 = 0.8783979318684918)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1069 ^value 1 +)
 (R1 ^reward R1069 +)
Firing propose*predict-yes
 -->
 (O2131 ^name predict-yes +)
 (S1 ^operator O2131 +)
Firing propose*predict-no
 -->
 (O2132 ^name predict-no +)
 (S1 ^operator O2132 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2130 = 0.8395129942530221)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2129 = 0.1215962258870366)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2130 ^name predict-no +)
 (S1 ^operator O2130 +)
Retracting propose*predict-yes
 -->
 (O2129 ^name predict-yes +)
 (S1 ^operator O2129 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1068 ^value 1 +)
 (R1 ^reward R1068 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2130 = 0.3144968546951614)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2130 = 0.6855176931742328)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2129 = 0.390770856544958)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2129 = -0.2062723012911647)
=>WM: (14984: S1 ^operator O2132 +)
=>WM: (14983: S1 ^operator O2131 +)
=>WM: (14982: I3 ^dir R)
=>WM: (14981: O2132 ^name predict-no)
=>WM: (14980: O2131 ^name predict-yes)
=>WM: (14979: R1069 ^value 1)
=>WM: (14978: R1 ^reward R1069)
<=WM: (14969: S1 ^operator O2129 +)
<=WM: (14970: S1 ^operator O2130 +)
<=WM: (14971: S1 ^operator O2130)
<=WM: (14968: I3 ^dir L)
<=WM: (14964: R1 ^reward R1068)
<=WM: (14967: O2130 ^name predict-no)
<=WM: (14966: O2129 ^name predict-yes)
<=WM: (14965: R1068 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2131 = 0.1215962258870366)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2131 = 0.8783979318684918)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2132 = 0.8395129942530221)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2130 = 0.8395129942530221)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2129 = 0.1215962258870366)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2129 = 0.8783979318684918)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478546 -0.164049 0.314497 -> 0.478545 -0.164049 0.314496(R,m,v=1,0.92638,0.0686208)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521467 0.16405 0.685518 -> 0.521466 0.16405 0.685516(R,m,v=1,1,0)
=>WM: (14985: S1 ^operator O2131)

  1066:    O: O2131 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1066 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1065 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (14986: I3 ^predict-yes N1066)
<=WM: (14973: N1065 ^status complete)
<=WM: (14972: I3 ^predict-no N1065)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (14990: I2 ^dir R)
=>WM: (14989: I2 ^reward 1)
=>WM: (14988: I2 ^see 1)
=>WM: (14987: N1066 ^status complete)
<=WM: (14976: I2 ^dir R)
<=WM: (14975: I2 ^reward 1)
<=WM: (14974: I2 ^see 0)
=>WM: (14991: I2 ^level-1 R1-root)
<=WM: (14977: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2131 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1070 ^value 1 +)
 (R1 ^reward R1070 +)
Firing propose*predict-yes
 -->
 (O2133 ^name predict-yes +)
 (S1 ^operator O2133 +)
Firing propose*predict-no
 -->
 (O2134 ^name predict-no +)
 (S1 ^operator O2134 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2132 = 0.8395129942530221)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2131 = 0.1215962258870366)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2132 ^name predict-no +)
 (S1 ^operator O2132 +)
Retracting propose*predict-yes
 -->
 (O2131 ^name predict-yes +)
 (S1 ^operator O2131 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1069 ^value 1 +)
 (R1 ^reward R1069 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2132 = 0.8395129942530221)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2131 = 0.8783979318684918)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2131 = 0.1215962258870366)
=>WM: (14998: S1 ^operator O2134 +)
=>WM: (14997: S1 ^operator O2133 +)
=>WM: (14996: O2134 ^name predict-no)
=>WM: (14995: O2133 ^name predict-yes)
=>WM: (14994: R1070 ^value 1)
=>WM: (14993: R1 ^reward R1070)
=>WM: (14992: I3 ^see 1)
<=WM: (14983: S1 ^operator O2131 +)
<=WM: (14985: S1 ^operator O2131)
<=WM: (14984: S1 ^operator O2132 +)
<=WM: (14978: R1 ^reward R1069)
<=WM: (14950: I3 ^see 0)
<=WM: (14981: O2132 ^name predict-no)
<=WM: (14980: O2131 ^name predict-yes)
<=WM: (14979: R1069 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2133 = 0.1215962258870366)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2133 = -0.04253361215288998)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2134 = 0.8395129942530221)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2132 = 0.8395129942530221)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2131 = 0.1215962258870366)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2131 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121596 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.873016,0.111449)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465473 0.412925 0.878398 -> 0.465473 0.412926 0.878398(R,m,v=1,1,0)
=>WM: (14999: S1 ^operator O2134)

  1067:    O: O2134 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1067 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1066 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15000: I3 ^predict-no N1067)
<=WM: (14987: N1066 ^status complete)
<=WM: (14986: I3 ^predict-yes N1066)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15004: I2 ^dir U)
=>WM: (15003: I2 ^reward 1)
=>WM: (15002: I2 ^see 0)
=>WM: (15001: N1067 ^status complete)
<=WM: (14990: I2 ^dir R)
<=WM: (14989: I2 ^reward 1)
<=WM: (14988: I2 ^see 1)
=>WM: (15005: I2 ^level-1 R0-root)
<=WM: (14991: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1071 ^value 1 +)
 (R1 ^reward R1071 +)
Firing propose*predict-yes
 -->
 (O2135 ^name predict-yes +)
 (S1 ^operator O2135 +)
Firing propose*predict-no
 -->
 (O2136 ^name predict-no +)
 (S1 ^operator O2136 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2134 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2133 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2134 ^name predict-no +)
 (S1 ^operator O2134 +)
Retracting propose*predict-yes
 -->
 (O2133 ^name predict-yes +)
 (S1 ^operator O2133 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1070 ^value 1 +)
 (R1 ^reward R1070 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2134 = 0.8395129942530221)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2133 = -0.04253361215288998)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2133 = 0.1215966938845745)
=>WM: (15013: S1 ^operator O2136 +)
=>WM: (15012: S1 ^operator O2135 +)
=>WM: (15011: I3 ^dir U)
=>WM: (15010: O2136 ^name predict-no)
=>WM: (15009: O2135 ^name predict-yes)
=>WM: (15008: R1071 ^value 1)
=>WM: (15007: R1 ^reward R1071)
=>WM: (15006: I3 ^see 0)
<=WM: (14997: S1 ^operator O2133 +)
<=WM: (14998: S1 ^operator O2134 +)
<=WM: (14999: S1 ^operator O2134)
<=WM: (14982: I3 ^dir R)
<=WM: (14993: R1 ^reward R1070)
<=WM: (14992: I3 ^see 1)
<=WM: (14996: O2134 ^name predict-no)
<=WM: (14995: O2133 ^name predict-yes)
<=WM: (14994: R1070 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2135 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2136 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2134 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2133 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.839513 0 0.839513 -> 0.865247 0 0.865247(R,m,v=1,0.93617,0.0600751)
=>WM: (15014: S1 ^operator O2136)

  1068:    O: O2136 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1068 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1067 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15015: I3 ^predict-no N1068)
<=WM: (15001: N1067 ^status complete)
<=WM: (15000: I3 ^predict-no N1067)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (15019: I2 ^dir R)
=>WM: (15018: I2 ^reward 1)
=>WM: (15017: I2 ^see 0)
=>WM: (15016: N1068 ^status complete)
<=WM: (15004: I2 ^dir U)
<=WM: (15003: I2 ^reward 1)
<=WM: (15002: I2 ^see 0)
=>WM: (15020: I2 ^level-1 R0-root)
<=WM: (15005: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2135 = -0.1512366769350551)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1072 ^value 1 +)
 (R1 ^reward R1072 +)
Firing propose*predict-yes
 -->
 (O2137 ^name predict-yes +)
 (S1 ^operator O2137 +)
Firing propose*predict-no
 -->
 (O2138 ^name predict-no +)
 (S1 ^operator O2138 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2136 = 0.8652467390234381)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2135 = 0.1215966938845745)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2136 ^name predict-no +)
 (S1 ^operator O2136 +)
Retracting propose*predict-yes
 -->
 (O2135 ^name predict-yes +)
 (S1 ^operator O2135 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1071 ^value 1 +)
 (R1 ^reward R1071 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2136 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2135 = 0.)
=>WM: (15027: S1 ^operator O2138 +)
=>WM: (15026: S1 ^operator O2137 +)
=>WM: (15025: I3 ^dir R)
=>WM: (15024: O2138 ^name predict-no)
=>WM: (15023: O2137 ^name predict-yes)
=>WM: (15022: R1072 ^value 1)
=>WM: (15021: R1 ^reward R1072)
<=WM: (15012: S1 ^operator O2135 +)
<=WM: (15013: S1 ^operator O2136 +)
<=WM: (15014: S1 ^operator O2136)
<=WM: (15011: I3 ^dir U)
<=WM: (15007: R1 ^reward R1071)
<=WM: (15010: O2136 ^name predict-no)
<=WM: (15009: O2135 ^name predict-yes)
<=WM: (15008: R1071 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2137 = -0.1512366769350551)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2137 = 0.1215966938845745)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2138 = 0.8652467390234381)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2136 = 0.8652467390234381)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2135 = 0.1215966938845745)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2135 = -0.1512366769350551)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15028: S1 ^operator O2138)

  1069:    O: O2138 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1069 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1068 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15029: I3 ^predict-no N1069)
<=WM: (15016: N1068 ^status complete)
<=WM: (15015: I3 ^predict-no N1068)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|\--- Input Phase --- 
=>WM: (15033: I2 ^dir L)
=>WM: (15032: I2 ^reward 1)
=>WM: (15031: I2 ^see 0)
=>WM: (15030: N1069 ^status complete)
<=WM: (15019: I2 ^dir R)
<=WM: (15018: I2 ^reward 1)
<=WM: (15017: I2 ^see 0)
=>WM: (15034: I2 ^level-1 R0-root)
<=WM: (15020: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2138 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2137 = 0.6091533345297356)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1073 ^value 1 +)
 (R1 ^reward R1073 +)
Firing propose*predict-yes
 -->
 (O2139 ^name predict-yes +)
 (S1 ^operator O2139 +)
Firing propose*predict-no
 -->
 (O2140 ^name predict-no +)
 (S1 ^operator O2140 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2138 = 0.3144956610238658)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2137 = 0.390770856544958)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2138 ^name predict-no +)
 (S1 ^operator O2138 +)
Retracting propose*predict-yes
 -->
 (O2137 ^name predict-yes +)
 (S1 ^operator O2137 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1072 ^value 1 +)
 (R1 ^reward R1072 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2138 = 0.8652467390234381)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2137 = 0.1215966938845745)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2137 = -0.1512366769350551)
=>WM: (15041: S1 ^operator O2140 +)
=>WM: (15040: S1 ^operator O2139 +)
=>WM: (15039: I3 ^dir L)
=>WM: (15038: O2140 ^name predict-no)
=>WM: (15037: O2139 ^name predict-yes)
=>WM: (15036: R1073 ^value 1)
=>WM: (15035: R1 ^reward R1073)
<=WM: (15026: S1 ^operator O2137 +)
<=WM: (15027: S1 ^operator O2138 +)
<=WM: (15028: S1 ^operator O2138)
<=WM: (15025: I3 ^dir R)
<=WM: (15021: R1 ^reward R1072)
<=WM: (15024: O2138 ^name predict-no)
<=WM: (15023: O2137 ^name predict-yes)
<=WM: (15022: R1072 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2139 = 0.6091533345297356)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2139 = 0.390770856544958)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2140 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2140 = 0.3144956610238658)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2138 = 0.3144956610238658)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2138 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2137 = 0.390770856544958)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2137 = 0.6091533345297356)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.865247 0 0.865247 -> 0.886836 0 0.886836(R,m,v=1,0.936508,0.0597771)
=>WM: (15042: S1 ^operator O2139)

  1070:    O: O2139 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1070 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1069 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15043: I3 ^predict-yes N1070)
<=WM: (15030: N1069 ^status complete)
<=WM: (15029: I3 ^predict-no N1069)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (15047: I2 ^dir R)
=>WM: (15046: I2 ^reward 1)
=>WM: (15045: I2 ^see 1)
=>WM: (15044: N1070 ^status complete)
<=WM: (15033: I2 ^dir L)
<=WM: (15032: I2 ^reward 1)
<=WM: (15031: I2 ^see 0)
=>WM: (15048: I2 ^level-1 L1-root)
<=WM: (15034: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2139 = 0.8784081974205705)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1074 ^value 1 +)
 (R1 ^reward R1074 +)
Firing propose*predict-yes
 -->
 (O2141 ^name predict-yes +)
 (S1 ^operator O2141 +)
Firing propose*predict-no
 -->
 (O2142 ^name predict-no +)
 (S1 ^operator O2142 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2140 = 0.886835768609456)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2139 = 0.1215966938845745)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2140 ^name predict-no +)
 (S1 ^operator O2140 +)
Retracting propose*predict-yes
 -->
 (O2139 ^name predict-yes +)
 (S1 ^operator O2139 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1073 ^value 1 +)
 (R1 ^reward R1073 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2140 = 0.3144956610238658)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2140 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2139 = 0.390770856544958)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2139 = 0.6091533345297356)
=>WM: (15056: S1 ^operator O2142 +)
=>WM: (15055: S1 ^operator O2141 +)
=>WM: (15054: I3 ^dir R)
=>WM: (15053: O2142 ^name predict-no)
=>WM: (15052: O2141 ^name predict-yes)
=>WM: (15051: R1074 ^value 1)
=>WM: (15050: R1 ^reward R1074)
=>WM: (15049: I3 ^see 1)
<=WM: (15040: S1 ^operator O2139 +)
<=WM: (15042: S1 ^operator O2139)
<=WM: (15041: S1 ^operator O2140 +)
<=WM: (15039: I3 ^dir L)
<=WM: (15035: R1 ^reward R1073)
<=WM: (15006: I3 ^see 0)
<=WM: (15038: O2140 ^name predict-no)
<=WM: (15037: O2139 ^name predict-yes)
<=WM: (15036: R1073 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2141 = 0.1215966938845745)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2141 = 0.8784081974205705)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2142 = 0.886835768609456)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2140 = 0.886835768609456)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2139 = 0.1215966938845745)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2139 = 0.8784081974205705)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472318 -0.0815469 0.390771 -> 0.472323 -0.081546 0.390777(R,m,v=1,0.948276,0.0493323)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527618 0.0815357 0.609153 -> 0.527624 0.0815368 0.60916(R,m,v=1,1,0)
=>WM: (15057: S1 ^operator O2141)

  1071:    O: O2141 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1071 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1070 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15058: I3 ^predict-yes N1071)
<=WM: (15044: N1070 ^status complete)
<=WM: (15043: I3 ^predict-yes N1070)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|--- Input Phase --- 
=>WM: (15062: I2 ^dir L)
=>WM: (15061: I2 ^reward 1)
=>WM: (15060: I2 ^see 1)
=>WM: (15059: N1071 ^status complete)
<=WM: (15047: I2 ^dir R)
<=WM: (15046: I2 ^reward 1)
<=WM: (15045: I2 ^see 1)
=>WM: (15063: I2 ^level-1 R1-root)
<=WM: (15048: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2142 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2141 = 0.6092621009042343)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1075 ^value 1 +)
 (R1 ^reward R1075 +)
Firing propose*predict-yes
 -->
 (O2143 ^name predict-yes +)
 (S1 ^operator O2143 +)
Firing propose*predict-no
 -->
 (O2144 ^name predict-no +)
 (S1 ^operator O2144 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2142 = 0.3144956610238658)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2141 = 0.3907770108106386)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2142 ^name predict-no +)
 (S1 ^operator O2142 +)
Retracting propose*predict-yes
 -->
 (O2141 ^name predict-yes +)
 (S1 ^operator O2141 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1074 ^value 1 +)
 (R1 ^reward R1074 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2142 = 0.886835768609456)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2141 = 0.8784081974205705)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2141 = 0.1215966938845745)
=>WM: (15070: S1 ^operator O2144 +)
=>WM: (15069: S1 ^operator O2143 +)
=>WM: (15068: I3 ^dir L)
=>WM: (15067: O2144 ^name predict-no)
=>WM: (15066: O2143 ^name predict-yes)
=>WM: (15065: R1075 ^value 1)
=>WM: (15064: R1 ^reward R1075)
<=WM: (15055: S1 ^operator O2141 +)
<=WM: (15057: S1 ^operator O2141)
<=WM: (15056: S1 ^operator O2142 +)
<=WM: (15054: I3 ^dir R)
<=WM: (15050: R1 ^reward R1074)
<=WM: (15053: O2142 ^name predict-no)
<=WM: (15052: O2141 ^name predict-yes)
<=WM: (15051: R1074 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2143 = 0.3907770108106386)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2143 = 0.6092621009042343)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2144 = 0.3144956610238658)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2144 = -0.168718511744511)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2142 = 0.3144956610238658)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2142 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2141 = 0.3907770108106386)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2141 = 0.6092621009042343)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121596(R,m,v=1,0.873684,0.110944)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465481 0.412927 0.878408 -> 0.465481 0.412927 0.878408(R,m,v=1,1,0)
=>WM: (15071: S1 ^operator O2143)

  1072:    O: O2143 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1072 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1071 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15072: I3 ^predict-yes N1072)
<=WM: (15059: N1071 ^status complete)
<=WM: (15058: I3 ^predict-yes N1071)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (15076: I2 ^dir R)
=>WM: (15075: I2 ^reward 1)
=>WM: (15074: I2 ^see 1)
=>WM: (15073: N1072 ^status complete)
<=WM: (15062: I2 ^dir L)
<=WM: (15061: I2 ^reward 1)
<=WM: (15060: I2 ^see 1)
=>WM: (15077: I2 ^level-1 L1-root)
<=WM: (15063: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2143 = 0.878407746096616)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1076 ^value 1 +)
 (R1 ^reward R1076 +)
Firing propose*predict-yes
 -->
 (O2145 ^name predict-yes +)
 (S1 ^operator O2145 +)
Firing propose*predict-no
 -->
 (O2146 ^name predict-no +)
 (S1 ^operator O2146 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2144 = 0.886835768609456)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2143 = 0.1215963023937551)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2144 ^name predict-no +)
 (S1 ^operator O2144 +)
Retracting propose*predict-yes
 -->
 (O2143 ^name predict-yes +)
 (S1 ^operator O2143 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1075 ^value 1 +)
 (R1 ^reward R1075 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2144 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2144 = 0.3144956610238658)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2143 = 0.6092621009042343)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2143 = 0.3907770108106386)
=>WM: (15084: S1 ^operator O2146 +)
=>WM: (15083: S1 ^operator O2145 +)
=>WM: (15082: I3 ^dir R)
=>WM: (15081: O2146 ^name predict-no)
=>WM: (15080: O2145 ^name predict-yes)
=>WM: (15079: R1076 ^value 1)
=>WM: (15078: R1 ^reward R1076)
<=WM: (15069: S1 ^operator O2143 +)
<=WM: (15071: S1 ^operator O2143)
<=WM: (15070: S1 ^operator O2144 +)
<=WM: (15068: I3 ^dir L)
<=WM: (15064: R1 ^reward R1075)
<=WM: (15067: O2144 ^name predict-no)
<=WM: (15066: O2143 ^name predict-yes)
<=WM: (15065: R1075 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2145 = 0.1215963023937551)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2145 = 0.878407746096616)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2146 = 0.886835768609456)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2144 = 0.886835768609456)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2143 = 0.1215963023937551)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2143 = 0.878407746096616)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472323 -0.081546 0.390777 -> 0.47232 -0.0815465 0.390774(R,m,v=1,0.948571,0.049064)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.52771 0.0815518 0.609262 -> 0.527707 0.0815513 0.609258(R,m,v=1,1,0)
=>WM: (15085: S1 ^operator O2145)

  1073:    O: O2145 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1073 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1072 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15086: I3 ^predict-yes N1073)
<=WM: (15073: N1072 ^status complete)
<=WM: (15072: I3 ^predict-yes N1072)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15090: I2 ^dir R)
=>WM: (15089: I2 ^reward 1)
=>WM: (15088: I2 ^see 1)
=>WM: (15087: N1073 ^status complete)
<=WM: (15076: I2 ^dir R)
<=WM: (15075: I2 ^reward 1)
<=WM: (15074: I2 ^see 1)
=>WM: (15091: I2 ^level-1 R1-root)
<=WM: (15077: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2145 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1077 ^value 1 +)
 (R1 ^reward R1077 +)
Firing propose*predict-yes
 -->
 (O2147 ^name predict-yes +)
 (S1 ^operator O2147 +)
Firing propose*predict-no
 -->
 (O2148 ^name predict-no +)
 (S1 ^operator O2148 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2146 = 0.886835768609456)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2145 = 0.1215963023937551)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2146 ^name predict-no +)
 (S1 ^operator O2146 +)
Retracting propose*predict-yes
 -->
 (O2145 ^name predict-yes +)
 (S1 ^operator O2145 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1076 ^value 1 +)
 (R1 ^reward R1076 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2146 = 0.886835768609456)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2145 = 0.878407746096616)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2145 = 0.1215963023937551)
=>WM: (15097: S1 ^operator O2148 +)
=>WM: (15096: S1 ^operator O2147 +)
=>WM: (15095: O2148 ^name predict-no)
=>WM: (15094: O2147 ^name predict-yes)
=>WM: (15093: R1077 ^value 1)
=>WM: (15092: R1 ^reward R1077)
<=WM: (15083: S1 ^operator O2145 +)
<=WM: (15085: S1 ^operator O2145)
<=WM: (15084: S1 ^operator O2146 +)
<=WM: (15078: R1 ^reward R1076)
<=WM: (15081: O2146 ^name predict-no)
<=WM: (15080: O2145 ^name predict-yes)
<=WM: (15079: R1076 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2147 = 0.1215963023937551)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2147 = -0.04253361215288998)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2148 = 0.886835768609456)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2146 = 0.886835768609456)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2145 = 0.1215963023937551)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2145 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121596 -> 0.534522 -0.412926 0.121596(R,m,v=1,0.874346,0.110444)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465481 0.412927 0.878408 -> 0.46548 0.412927 0.878407(R,m,v=1,1,0)
=>WM: (15098: S1 ^operator O2148)

  1074:    O: O2148 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1074 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1073 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15099: I3 ^predict-no N1074)
<=WM: (15087: N1073 ^status complete)
<=WM: (15086: I3 ^predict-yes N1073)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|\--- Input Phase --- 
=>WM: (15103: I2 ^dir L)
=>WM: (15102: I2 ^reward 1)
=>WM: (15101: I2 ^see 0)
=>WM: (15100: N1074 ^status complete)
<=WM: (15090: I2 ^dir R)
<=WM: (15089: I2 ^reward 1)
<=WM: (15088: I2 ^see 1)
=>WM: (15104: I2 ^level-1 R0-root)
<=WM: (15091: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2148 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2147 = 0.6091603294693171)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1078 ^value 1 +)
 (R1 ^reward R1078 +)
Firing propose*predict-yes
 -->
 (O2149 ^name predict-yes +)
 (S1 ^operator O2149 +)
Firing propose*predict-no
 -->
 (O2150 ^name predict-no +)
 (S1 ^operator O2150 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2148 = 0.3144956610238658)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2147 = 0.3907738386230689)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2148 ^name predict-no +)
 (S1 ^operator O2148 +)
Retracting propose*predict-yes
 -->
 (O2147 ^name predict-yes +)
 (S1 ^operator O2147 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1077 ^value 1 +)
 (R1 ^reward R1077 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2148 = 0.886835768609456)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2147 = -0.04253361215288998)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2147 = 0.1215959786322932)
=>WM: (15112: S1 ^operator O2150 +)
=>WM: (15111: S1 ^operator O2149 +)
=>WM: (15110: I3 ^dir L)
=>WM: (15109: O2150 ^name predict-no)
=>WM: (15108: O2149 ^name predict-yes)
=>WM: (15107: R1078 ^value 1)
=>WM: (15106: R1 ^reward R1078)
=>WM: (15105: I3 ^see 0)
<=WM: (15096: S1 ^operator O2147 +)
<=WM: (15097: S1 ^operator O2148 +)
<=WM: (15098: S1 ^operator O2148)
<=WM: (15082: I3 ^dir R)
<=WM: (15092: R1 ^reward R1077)
<=WM: (15049: I3 ^see 1)
<=WM: (15095: O2148 ^name predict-no)
<=WM: (15094: O2147 ^name predict-yes)
<=WM: (15093: R1077 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2149 = 0.3907738386230689)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2149 = 0.6091603294693171)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2150 = 0.3144956610238658)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2150 = -0.1984300550322165)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2148 = 0.3144956610238658)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2148 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2147 = 0.3907738386230689)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2147 = 0.6091603294693171)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.886836 0 0.886836 -> 0.904951 0 0.904951(R,m,v=1,0.936842,0.059482)
=>WM: (15113: S1 ^operator O2149)

  1075:    O: O2149 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1075 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1074 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15114: I3 ^predict-yes N1075)
<=WM: (15100: N1074 ^status complete)
<=WM: (15099: I3 ^predict-no N1074)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15118: I2 ^dir L)
=>WM: (15117: I2 ^reward 1)
=>WM: (15116: I2 ^see 1)
=>WM: (15115: N1075 ^status complete)
<=WM: (15103: I2 ^dir L)
<=WM: (15102: I2 ^reward 1)
<=WM: (15101: I2 ^see 0)
=>WM: (15119: I2 ^level-1 L1-root)
<=WM: (15104: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2149 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2150 = 0.6855163447632109)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1079 ^value 1 +)
 (R1 ^reward R1079 +)
Firing propose*predict-yes
 -->
 (O2151 ^name predict-yes +)
 (S1 ^operator O2151 +)
Firing propose*predict-no
 -->
 (O2152 ^name predict-no +)
 (S1 ^operator O2152 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2150 = 0.3144956610238658)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2149 = 0.3907738386230689)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2150 ^name predict-no +)
 (S1 ^operator O2150 +)
Retracting propose*predict-yes
 -->
 (O2149 ^name predict-yes +)
 (S1 ^operator O2149 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1078 ^value 1 +)
 (R1 ^reward R1078 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2150 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2150 = 0.3144956610238658)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2149 = 0.6091603294693171)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2149 = 0.3907738386230689)
=>WM: (15126: S1 ^operator O2152 +)
=>WM: (15125: S1 ^operator O2151 +)
=>WM: (15124: O2152 ^name predict-no)
=>WM: (15123: O2151 ^name predict-yes)
=>WM: (15122: R1079 ^value 1)
=>WM: (15121: R1 ^reward R1079)
=>WM: (15120: I3 ^see 1)
<=WM: (15111: S1 ^operator O2149 +)
<=WM: (15113: S1 ^operator O2149)
<=WM: (15112: S1 ^operator O2150 +)
<=WM: (15106: R1 ^reward R1078)
<=WM: (15105: I3 ^see 0)
<=WM: (15109: O2150 ^name predict-no)
<=WM: (15108: O2149 ^name predict-yes)
<=WM: (15107: R1078 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2151 = 0.3907738386230689)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2151 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2152 = 0.3144956610238658)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2152 = 0.6855163447632109)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2150 = 0.3144956610238658)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2150 = 0.6855163447632109)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2149 = 0.3907738386230689)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2149 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.47232 -0.0815465 0.390774 -> 0.472325 -0.0815457 0.390779(R,m,v=1,0.948864,0.0487987)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527624 0.0815368 0.60916 -> 0.527629 0.0815377 0.609166(R,m,v=1,1,0)
=>WM: (15127: S1 ^operator O2152)

  1076:    O: O2152 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1076 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1075 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15128: I3 ^predict-no N1076)
<=WM: (15115: N1075 ^status complete)
<=WM: (15114: I3 ^predict-yes N1075)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15132: I2 ^dir R)
=>WM: (15131: I2 ^reward 1)
=>WM: (15130: I2 ^see 0)
=>WM: (15129: N1076 ^status complete)
<=WM: (15118: I2 ^dir L)
<=WM: (15117: I2 ^reward 1)
<=WM: (15116: I2 ^see 1)
=>WM: (15133: I2 ^level-1 L0-root)
<=WM: (15119: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2151 = 0.8783984798460494)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1080 ^value 1 +)
 (R1 ^reward R1080 +)
Firing propose*predict-yes
 -->
 (O2153 ^name predict-yes +)
 (S1 ^operator O2153 +)
Firing propose*predict-no
 -->
 (O2154 ^name predict-no +)
 (S1 ^operator O2154 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2152 = 0.9049506710147235)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2151 = 0.1215959786322932)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2152 ^name predict-no +)
 (S1 ^operator O2152 +)
Retracting propose*predict-yes
 -->
 (O2151 ^name predict-yes +)
 (S1 ^operator O2151 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1079 ^value 1 +)
 (R1 ^reward R1079 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2152 = 0.6855163447632109)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2152 = 0.3144956610238658)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2151 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2151 = 0.390779173043162)
=>WM: (15141: S1 ^operator O2154 +)
=>WM: (15140: S1 ^operator O2153 +)
=>WM: (15139: I3 ^dir R)
=>WM: (15138: O2154 ^name predict-no)
=>WM: (15137: O2153 ^name predict-yes)
=>WM: (15136: R1080 ^value 1)
=>WM: (15135: R1 ^reward R1080)
=>WM: (15134: I3 ^see 0)
<=WM: (15125: S1 ^operator O2151 +)
<=WM: (15126: S1 ^operator O2152 +)
<=WM: (15127: S1 ^operator O2152)
<=WM: (15110: I3 ^dir L)
<=WM: (15121: R1 ^reward R1079)
<=WM: (15120: I3 ^see 1)
<=WM: (15124: O2152 ^name predict-no)
<=WM: (15123: O2151 ^name predict-yes)
<=WM: (15122: R1079 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2153 = 0.1215959786322932)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2153 = 0.8783984798460494)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2154 = 0.9049506710147235)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2152 = 0.9049506710147235)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2151 = 0.1215959786322932)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2151 = 0.8783984798460494)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478545 -0.164049 0.314496 -> 0.478544 -0.164049 0.314495(R,m,v=1,0.926829,0.0682328)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521466 0.16405 0.685516 -> 0.521465 0.16405 0.685515(R,m,v=1,1,0)
=>WM: (15142: S1 ^operator O2153)

  1077:    O: O2153 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1077 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1076 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15143: I3 ^predict-yes N1077)
<=WM: (15129: N1076 ^status complete)
<=WM: (15128: I3 ^predict-no N1076)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15147: I2 ^dir R)
=>WM: (15146: I2 ^reward 1)
=>WM: (15145: I2 ^see 1)
=>WM: (15144: N1077 ^status complete)
<=WM: (15132: I2 ^dir R)
<=WM: (15131: I2 ^reward 1)
<=WM: (15130: I2 ^see 0)
=>WM: (15148: I2 ^level-1 R1-root)
<=WM: (15133: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2153 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1081 ^value 1 +)
 (R1 ^reward R1081 +)
Firing propose*predict-yes
 -->
 (O2155 ^name predict-yes +)
 (S1 ^operator O2155 +)
Firing propose*predict-no
 -->
 (O2156 ^name predict-no +)
 (S1 ^operator O2156 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2154 = 0.9049506710147235)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2153 = 0.1215959786322932)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2154 ^name predict-no +)
 (S1 ^operator O2154 +)
Retracting propose*predict-yes
 -->
 (O2153 ^name predict-yes +)
 (S1 ^operator O2153 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1080 ^value 1 +)
 (R1 ^reward R1080 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2154 = 0.9049506710147235)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2153 = 0.8783984798460494)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2153 = 0.1215959786322932)
=>WM: (15155: S1 ^operator O2156 +)
=>WM: (15154: S1 ^operator O2155 +)
=>WM: (15153: O2156 ^name predict-no)
=>WM: (15152: O2155 ^name predict-yes)
=>WM: (15151: R1081 ^value 1)
=>WM: (15150: R1 ^reward R1081)
=>WM: (15149: I3 ^see 1)
<=WM: (15140: S1 ^operator O2153 +)
<=WM: (15142: S1 ^operator O2153)
<=WM: (15141: S1 ^operator O2154 +)
<=WM: (15135: R1 ^reward R1080)
<=WM: (15134: I3 ^see 0)
<=WM: (15138: O2154 ^name predict-no)
<=WM: (15137: O2153 ^name predict-yes)
<=WM: (15136: R1080 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2155 = 0.1215959786322932)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2155 = -0.04253361215288998)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2156 = 0.9049506710147235)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2154 = 0.9049506710147235)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2153 = 0.1215959786322932)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2153 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534522 -0.412926 0.121596 -> 0.534523 -0.412926 0.121596(R,m,v=1,0.875,0.109948)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465473 0.412926 0.878398 -> 0.465473 0.412926 0.878399(R,m,v=1,1,0)
=>WM: (15156: S1 ^operator O2156)

  1078:    O: O2156 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1078 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1077 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15157: I3 ^predict-no N1078)
<=WM: (15144: N1077 ^status complete)
<=WM: (15143: I3 ^predict-yes N1077)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15161: I2 ^dir R)
=>WM: (15160: I2 ^reward 1)
=>WM: (15159: I2 ^see 0)
=>WM: (15158: N1078 ^status complete)
<=WM: (15147: I2 ^dir R)
<=WM: (15146: I2 ^reward 1)
<=WM: (15145: I2 ^see 1)
=>WM: (15162: I2 ^level-1 R0-root)
<=WM: (15148: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2155 = -0.1512366769350551)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1082 ^value 1 +)
 (R1 ^reward R1082 +)
Firing propose*predict-yes
 -->
 (O2157 ^name predict-yes +)
 (S1 ^operator O2157 +)
Firing propose*predict-no
 -->
 (O2158 ^name predict-no +)
 (S1 ^operator O2158 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2156 = 0.9049506710147235)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2155 = 0.1215964214230049)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2156 ^name predict-no +)
 (S1 ^operator O2156 +)
Retracting propose*predict-yes
 -->
 (O2155 ^name predict-yes +)
 (S1 ^operator O2155 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1081 ^value 1 +)
 (R1 ^reward R1081 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2156 = 0.9049506710147235)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2155 = -0.04253361215288998)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2155 = 0.1215964214230049)
=>WM: (15169: S1 ^operator O2158 +)
=>WM: (15168: S1 ^operator O2157 +)
=>WM: (15167: O2158 ^name predict-no)
=>WM: (15166: O2157 ^name predict-yes)
=>WM: (15165: R1082 ^value 1)
=>WM: (15164: R1 ^reward R1082)
=>WM: (15163: I3 ^see 0)
<=WM: (15154: S1 ^operator O2155 +)
<=WM: (15155: S1 ^operator O2156 +)
<=WM: (15156: S1 ^operator O2156)
<=WM: (15150: R1 ^reward R1081)
<=WM: (15149: I3 ^see 1)
<=WM: (15153: O2156 ^name predict-no)
<=WM: (15152: O2155 ^name predict-yes)
<=WM: (15151: R1081 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2157 = 0.1215964214230049)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2157 = -0.1512366769350551)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2158 = 0.9049506710147235)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2156 = 0.9049506710147235)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2155 = 0.1215964214230049)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2155 = -0.1512366769350551)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.904951 0 0.904951 -> 0.920153 0 0.920153(R,m,v=1,0.937173,0.0591899)
=>WM: (15170: S1 ^operator O2158)

  1079:    O: O2158 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1079 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1078 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15171: I3 ^predict-no N1079)
<=WM: (15158: N1078 ^status complete)
<=WM: (15157: I3 ^predict-no N1078)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15175: I2 ^dir U)
=>WM: (15174: I2 ^reward 1)
=>WM: (15173: I2 ^see 0)
=>WM: (15172: N1079 ^status complete)
<=WM: (15161: I2 ^dir R)
<=WM: (15160: I2 ^reward 1)
<=WM: (15159: I2 ^see 0)
=>WM: (15176: I2 ^level-1 R0-root)
<=WM: (15162: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1083 ^value 1 +)
 (R1 ^reward R1083 +)
Firing propose*predict-yes
 -->
 (O2159 ^name predict-yes +)
 (S1 ^operator O2159 +)
Firing propose*predict-no
 -->
 (O2160 ^name predict-no +)
 (S1 ^operator O2160 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2158 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2157 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2158 ^name predict-no +)
 (S1 ^operator O2158 +)
Retracting propose*predict-yes
 -->
 (O2157 ^name predict-yes +)
 (S1 ^operator O2157 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1082 ^value 1 +)
 (R1 ^reward R1082 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2158 = 0.920153033815893)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2157 = -0.1512366769350551)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2157 = 0.1215964214230049)
=>WM: (15183: S1 ^operator O2160 +)
=>WM: (15182: S1 ^operator O2159 +)
=>WM: (15181: I3 ^dir U)
=>WM: (15180: O2160 ^name predict-no)
=>WM: (15179: O2159 ^name predict-yes)
=>WM: (15178: R1083 ^value 1)
=>WM: (15177: R1 ^reward R1083)
<=WM: (15168: S1 ^operator O2157 +)
<=WM: (15169: S1 ^operator O2158 +)
<=WM: (15170: S1 ^operator O2158)
<=WM: (15139: I3 ^dir R)
<=WM: (15164: R1 ^reward R1082)
<=WM: (15167: O2158 ^name predict-no)
<=WM: (15166: O2157 ^name predict-yes)
<=WM: (15165: R1082 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2159 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2160 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2158 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2157 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.920153 0 0.920153 -> 0.932913 0 0.932913(R,m,v=1,0.9375,0.0589005)
=>WM: (15184: S1 ^operator O2160)

  1080:    O: O2160 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1080 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1079 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15185: I3 ^predict-no N1080)
<=WM: (15172: N1079 ^status complete)
<=WM: (15171: I3 ^predict-no N1079)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (15189: I2 ^dir L)
=>WM: (15188: I2 ^reward 1)
=>WM: (15187: I2 ^see 0)
=>WM: (15186: N1080 ^status complete)
<=WM: (15175: I2 ^dir U)
<=WM: (15174: I2 ^reward 1)
<=WM: (15173: I2 ^see 0)
=>WM: (15190: I2 ^level-1 R0-root)
<=WM: (15176: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2160 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2159 = 0.6091663904275534)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1084 ^value 1 +)
 (R1 ^reward R1084 +)
Firing propose*predict-yes
 -->
 (O2161 ^name predict-yes +)
 (S1 ^operator O2161 +)
Firing propose*predict-no
 -->
 (O2162 ^name predict-no +)
 (S1 ^operator O2162 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2160 = 0.3144946769214089)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2159 = 0.390779173043162)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2160 ^name predict-no +)
 (S1 ^operator O2160 +)
Retracting propose*predict-yes
 -->
 (O2159 ^name predict-yes +)
 (S1 ^operator O2159 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1083 ^value 1 +)
 (R1 ^reward R1083 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2160 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2159 = 0.)
=>WM: (15197: S1 ^operator O2162 +)
=>WM: (15196: S1 ^operator O2161 +)
=>WM: (15195: I3 ^dir L)
=>WM: (15194: O2162 ^name predict-no)
=>WM: (15193: O2161 ^name predict-yes)
=>WM: (15192: R1084 ^value 1)
=>WM: (15191: R1 ^reward R1084)
<=WM: (15182: S1 ^operator O2159 +)
<=WM: (15183: S1 ^operator O2160 +)
<=WM: (15184: S1 ^operator O2160)
<=WM: (15181: I3 ^dir U)
<=WM: (15177: R1 ^reward R1083)
<=WM: (15180: O2160 ^name predict-no)
<=WM: (15179: O2159 ^name predict-yes)
<=WM: (15178: R1083 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2161 = 0.6091663904275534)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2161 = 0.390779173043162)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2162 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2162 = 0.3144946769214089)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2160 = 0.3144946769214089)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2160 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2159 = 0.390779173043162)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2159 = 0.6091663904275534)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15198: S1 ^operator O2161)

  1081:    O: O2161 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1081 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1080 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15199: I3 ^predict-yes N1081)
<=WM: (15186: N1080 ^status complete)
<=WM: (15185: I3 ^predict-no N1080)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/--- Input Phase --- 
=>WM: (15203: I2 ^dir R)
=>WM: (15202: I2 ^reward 1)
=>WM: (15201: I2 ^see 1)
=>WM: (15200: N1081 ^status complete)
<=WM: (15189: I2 ^dir L)
<=WM: (15188: I2 ^reward 1)
<=WM: (15187: I2 ^see 0)
=>WM: (15204: I2 ^level-1 L1-root)
<=WM: (15190: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2161 = 0.8784073733635152)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1085 ^value 1 +)
 (R1 ^reward R1085 +)
Firing propose*predict-yes
 -->
 (O2163 ^name predict-yes +)
 (S1 ^operator O2163 +)
Firing propose*predict-no
 -->
 (O2164 ^name predict-no +)
 (S1 ^operator O2164 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2162 = 0.9329132455998342)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2161 = 0.1215964214230049)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2162 ^name predict-no +)
 (S1 ^operator O2162 +)
Retracting propose*predict-yes
 -->
 (O2161 ^name predict-yes +)
 (S1 ^operator O2161 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1084 ^value 1 +)
 (R1 ^reward R1084 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2162 = 0.3144946769214089)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2162 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2161 = 0.390779173043162)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2161 = 0.6091663904275534)
=>WM: (15212: S1 ^operator O2164 +)
=>WM: (15211: S1 ^operator O2163 +)
=>WM: (15210: I3 ^dir R)
=>WM: (15209: O2164 ^name predict-no)
=>WM: (15208: O2163 ^name predict-yes)
=>WM: (15207: R1085 ^value 1)
=>WM: (15206: R1 ^reward R1085)
=>WM: (15205: I3 ^see 1)
<=WM: (15196: S1 ^operator O2161 +)
<=WM: (15198: S1 ^operator O2161)
<=WM: (15197: S1 ^operator O2162 +)
<=WM: (15195: I3 ^dir L)
<=WM: (15191: R1 ^reward R1084)
<=WM: (15163: I3 ^see 0)
<=WM: (15194: O2162 ^name predict-no)
<=WM: (15193: O2161 ^name predict-yes)
<=WM: (15192: R1084 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2163 = 0.1215964214230049)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2163 = 0.8784073733635152)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2164 = 0.9329132455998342)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2162 = 0.9329132455998342)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2161 = 0.1215964214230049)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2161 = 0.8784073733635152)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472325 -0.0815457 0.390779 -> 0.472329 -0.0815451 0.390784(R,m,v=1,0.949153,0.0485362)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527629 0.0815377 0.609166 -> 0.527633 0.0815384 0.609171(R,m,v=1,1,0)
=>WM: (15213: S1 ^operator O2163)

  1082:    O: O2163 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1082 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1081 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15214: I3 ^predict-yes N1082)
<=WM: (15200: N1081 ^status complete)
<=WM: (15199: I3 ^predict-yes N1081)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (15218: I2 ^dir R)
=>WM: (15217: I2 ^reward 1)
=>WM: (15216: I2 ^see 1)
=>WM: (15215: N1082 ^status complete)
<=WM: (15203: I2 ^dir R)
<=WM: (15202: I2 ^reward 1)
<=WM: (15201: I2 ^see 1)
=>WM: (15219: I2 ^level-1 R1-root)
<=WM: (15204: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2163 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1086 ^value 1 +)
 (R1 ^reward R1086 +)
Firing propose*predict-yes
 -->
 (O2165 ^name predict-yes +)
 (S1 ^operator O2165 +)
Firing propose*predict-no
 -->
 (O2166 ^name predict-no +)
 (S1 ^operator O2166 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2164 = 0.9329132455998342)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2163 = 0.1215964214230049)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2164 ^name predict-no +)
 (S1 ^operator O2164 +)
Retracting propose*predict-yes
 -->
 (O2163 ^name predict-yes +)
 (S1 ^operator O2163 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1085 ^value 1 +)
 (R1 ^reward R1085 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2164 = 0.9329132455998342)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2163 = 0.8784073733635152)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2163 = 0.1215964214230049)
=>WM: (15225: S1 ^operator O2166 +)
=>WM: (15224: S1 ^operator O2165 +)
=>WM: (15223: O2166 ^name predict-no)
=>WM: (15222: O2165 ^name predict-yes)
=>WM: (15221: R1086 ^value 1)
=>WM: (15220: R1 ^reward R1086)
<=WM: (15211: S1 ^operator O2163 +)
<=WM: (15213: S1 ^operator O2163)
<=WM: (15212: S1 ^operator O2164 +)
<=WM: (15206: R1 ^reward R1085)
<=WM: (15209: O2164 ^name predict-no)
<=WM: (15208: O2163 ^name predict-yes)
<=WM: (15207: R1085 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2165 = 0.1215964214230049)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2165 = -0.04253361215288998)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2166 = 0.9329132455998342)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2164 = 0.9329132455998342)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2163 = 0.1215964214230049)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2163 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121596 -> 0.534523 -0.412926 0.121596(R,m,v=1,0.875648,0.109456)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.46548 0.412927 0.878407 -> 0.46548 0.412927 0.878407(R,m,v=1,1,0)
=>WM: (15226: S1 ^operator O2166)

  1083:    O: O2166 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1083 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1082 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15227: I3 ^predict-no N1083)
<=WM: (15215: N1082 ^status complete)
<=WM: (15214: I3 ^predict-yes N1082)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (15231: I2 ^dir L)
=>WM: (15230: I2 ^reward 1)
=>WM: (15229: I2 ^see 0)
=>WM: (15228: N1083 ^status complete)
<=WM: (15218: I2 ^dir R)
<=WM: (15217: I2 ^reward 1)
<=WM: (15216: I2 ^see 1)
=>WM: (15232: I2 ^level-1 R0-root)
<=WM: (15219: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2166 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2165 = 0.6091713913477592)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1087 ^value 1 +)
 (R1 ^reward R1087 +)
Firing propose*predict-yes
 -->
 (O2167 ^name predict-yes +)
 (S1 ^operator O2167 +)
Firing propose*predict-no
 -->
 (O2168 ^name predict-no +)
 (S1 ^operator O2168 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2166 = 0.3144946769214089)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2165 = 0.3907835800387532)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2166 ^name predict-no +)
 (S1 ^operator O2166 +)
Retracting propose*predict-yes
 -->
 (O2165 ^name predict-yes +)
 (S1 ^operator O2165 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1086 ^value 1 +)
 (R1 ^reward R1086 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2166 = 0.9329132455998342)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2165 = -0.04253361215288998)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2165 = 0.1215961184552382)
=>WM: (15240: S1 ^operator O2168 +)
=>WM: (15239: S1 ^operator O2167 +)
=>WM: (15238: I3 ^dir L)
=>WM: (15237: O2168 ^name predict-no)
=>WM: (15236: O2167 ^name predict-yes)
=>WM: (15235: R1087 ^value 1)
=>WM: (15234: R1 ^reward R1087)
=>WM: (15233: I3 ^see 0)
<=WM: (15224: S1 ^operator O2165 +)
<=WM: (15225: S1 ^operator O2166 +)
<=WM: (15226: S1 ^operator O2166)
<=WM: (15210: I3 ^dir R)
<=WM: (15220: R1 ^reward R1086)
<=WM: (15205: I3 ^see 1)
<=WM: (15223: O2166 ^name predict-no)
<=WM: (15222: O2165 ^name predict-yes)
<=WM: (15221: R1086 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2167 = 0.3907835800387532)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2167 = 0.6091713913477592)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2168 = 0.3144946769214089)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2168 = -0.1984300550322165)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2166 = 0.3144946769214089)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2166 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2165 = 0.3907835800387532)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2165 = 0.6091713913477592)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.932913 0 0.932913 -> 0.943625 0 0.943625(R,m,v=1,0.937824,0.058614)
=>WM: (15241: S1 ^operator O2167)

  1084:    O: O2167 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1084 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1083 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15242: I3 ^predict-yes N1084)
<=WM: (15228: N1083 ^status complete)
<=WM: (15227: I3 ^predict-no N1083)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (15246: I2 ^dir L)
=>WM: (15245: I2 ^reward 1)
=>WM: (15244: I2 ^see 1)
=>WM: (15243: N1084 ^status complete)
<=WM: (15231: I2 ^dir L)
<=WM: (15230: I2 ^reward 1)
<=WM: (15229: I2 ^see 0)
=>WM: (15247: I2 ^level-1 L1-root)
<=WM: (15232: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2167 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2168 = 0.6855152344977683)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1088 ^value 1 +)
 (R1 ^reward R1088 +)
Firing propose*predict-yes
 -->
 (O2169 ^name predict-yes +)
 (S1 ^operator O2169 +)
Firing propose*predict-no
 -->
 (O2170 ^name predict-no +)
 (S1 ^operator O2170 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2168 = 0.3144946769214089)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2167 = 0.3907835800387532)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2168 ^name predict-no +)
 (S1 ^operator O2168 +)
Retracting propose*predict-yes
 -->
 (O2167 ^name predict-yes +)
 (S1 ^operator O2167 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1087 ^value 1 +)
 (R1 ^reward R1087 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2168 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2168 = 0.3144946769214089)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2167 = 0.6091713913477592)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2167 = 0.3907835800387532)
=>WM: (15254: S1 ^operator O2170 +)
=>WM: (15253: S1 ^operator O2169 +)
=>WM: (15252: O2170 ^name predict-no)
=>WM: (15251: O2169 ^name predict-yes)
=>WM: (15250: R1088 ^value 1)
=>WM: (15249: R1 ^reward R1088)
=>WM: (15248: I3 ^see 1)
<=WM: (15239: S1 ^operator O2167 +)
<=WM: (15241: S1 ^operator O2167)
<=WM: (15240: S1 ^operator O2168 +)
<=WM: (15234: R1 ^reward R1087)
<=WM: (15233: I3 ^see 0)
<=WM: (15237: O2168 ^name predict-no)
<=WM: (15236: O2167 ^name predict-yes)
<=WM: (15235: R1087 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2169 = 0.3907835800387532)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2169 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2170 = 0.3144946769214089)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2170 = 0.6855152344977683)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2168 = 0.3144946769214089)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2168 = 0.6855152344977683)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2167 = 0.3907835800387532)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2167 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472329 -0.0815451 0.390784 -> 0.472332 -0.0815445 0.390787(R,m,v=1,0.949438,0.0482765)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527633 0.0815384 0.609171 -> 0.527637 0.081539 0.609176(R,m,v=1,1,0)
=>WM: (15255: S1 ^operator O2170)

  1085:    O: O2170 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1085 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1084 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15256: I3 ^predict-no N1085)
<=WM: (15243: N1084 ^status complete)
<=WM: (15242: I3 ^predict-yes N1084)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15260: I2 ^dir U)
=>WM: (15259: I2 ^reward 1)
=>WM: (15258: I2 ^see 0)
=>WM: (15257: N1085 ^status complete)
<=WM: (15246: I2 ^dir L)
<=WM: (15245: I2 ^reward 1)
<=WM: (15244: I2 ^see 1)
=>WM: (15261: I2 ^level-1 L0-root)
<=WM: (15247: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1089 ^value 1 +)
 (R1 ^reward R1089 +)
Firing propose*predict-yes
 -->
 (O2171 ^name predict-yes +)
 (S1 ^operator O2171 +)
Firing propose*predict-no
 -->
 (O2172 ^name predict-no +)
 (S1 ^operator O2172 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2170 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2169 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2170 ^name predict-no +)
 (S1 ^operator O2170 +)
Retracting propose*predict-yes
 -->
 (O2169 ^name predict-yes +)
 (S1 ^operator O2169 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1088 ^value 1 +)
 (R1 ^reward R1088 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2170 = 0.6855152344977683)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2170 = 0.3144946769214089)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2169 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2169 = 0.3907872220793651)
=>WM: (15269: S1 ^operator O2172 +)
=>WM: (15268: S1 ^operator O2171 +)
=>WM: (15267: I3 ^dir U)
=>WM: (15266: O2172 ^name predict-no)
=>WM: (15265: O2171 ^name predict-yes)
=>WM: (15264: R1089 ^value 1)
=>WM: (15263: R1 ^reward R1089)
=>WM: (15262: I3 ^see 0)
<=WM: (15253: S1 ^operator O2169 +)
<=WM: (15254: S1 ^operator O2170 +)
<=WM: (15255: S1 ^operator O2170)
<=WM: (15238: I3 ^dir L)
<=WM: (15249: R1 ^reward R1088)
<=WM: (15248: I3 ^see 1)
<=WM: (15252: O2170 ^name predict-no)
<=WM: (15251: O2169 ^name predict-yes)
<=WM: (15250: R1088 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2171 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2172 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2170 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2169 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478544 -0.164049 0.314495 -> 0.478543 -0.164049 0.314494(R,m,v=1,0.927273,0.0678492)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521465 0.16405 0.685515 -> 0.521464 0.16405 0.685514(R,m,v=1,1,0)
=>WM: (15270: S1 ^operator O2172)

  1086:    O: O2172 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1086 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1085 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15271: I3 ^predict-no N1086)
<=WM: (15257: N1085 ^status complete)
<=WM: (15256: I3 ^predict-no N1085)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15275: I2 ^dir U)
=>WM: (15274: I2 ^reward 1)
=>WM: (15273: I2 ^see 0)
=>WM: (15272: N1086 ^status complete)
<=WM: (15260: I2 ^dir U)
<=WM: (15259: I2 ^reward 1)
<=WM: (15258: I2 ^see 0)
=>WM: (15276: I2 ^level-1 L0-root)
<=WM: (15261: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1090 ^value 1 +)
 (R1 ^reward R1090 +)
Firing propose*predict-yes
 -->
 (O2173 ^name predict-yes +)
 (S1 ^operator O2173 +)
Firing propose*predict-no
 -->
 (O2174 ^name predict-no +)
 (S1 ^operator O2174 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2172 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2171 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2172 ^name predict-no +)
 (S1 ^operator O2172 +)
Retracting propose*predict-yes
 -->
 (O2171 ^name predict-yes +)
 (S1 ^operator O2171 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1089 ^value 1 +)
 (R1 ^reward R1089 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2172 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2171 = 0.)
=>WM: (15282: S1 ^operator O2174 +)
=>WM: (15281: S1 ^operator O2173 +)
=>WM: (15280: O2174 ^name predict-no)
=>WM: (15279: O2173 ^name predict-yes)
=>WM: (15278: R1090 ^value 1)
=>WM: (15277: R1 ^reward R1090)
<=WM: (15268: S1 ^operator O2171 +)
<=WM: (15269: S1 ^operator O2172 +)
<=WM: (15270: S1 ^operator O2172)
<=WM: (15263: R1 ^reward R1089)
<=WM: (15266: O2172 ^name predict-no)
<=WM: (15265: O2171 ^name predict-yes)
<=WM: (15264: R1089 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2173 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2174 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2172 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2171 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15283: S1 ^operator O2174)

  1087:    O: O2174 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1087 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1086 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15284: I3 ^predict-no N1087)
<=WM: (15272: N1086 ^status complete)
<=WM: (15271: I3 ^predict-no N1086)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|--- Input Phase --- 
=>WM: (15288: I2 ^dir U)
=>WM: (15287: I2 ^reward 1)
=>WM: (15286: I2 ^see 0)
=>WM: (15285: N1087 ^status complete)
<=WM: (15275: I2 ^dir U)
<=WM: (15274: I2 ^reward 1)
<=WM: (15273: I2 ^see 0)
=>WM: (15289: I2 ^level-1 L0-root)
<=WM: (15276: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1091 ^value 1 +)
 (R1 ^reward R1091 +)
Firing propose*predict-yes
 -->
 (O2175 ^name predict-yes +)
 (S1 ^operator O2175 +)
Firing propose*predict-no
 -->
 (O2176 ^name predict-no +)
 (S1 ^operator O2176 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2174 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2173 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2174 ^name predict-no +)
 (S1 ^operator O2174 +)
Retracting propose*predict-yes
 -->
 (O2173 ^name predict-yes +)
 (S1 ^operator O2173 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1090 ^value 1 +)
 (R1 ^reward R1090 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2174 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2173 = 0.)
=>WM: (15295: S1 ^operator O2176 +)
=>WM: (15294: S1 ^operator O2175 +)
=>WM: (15293: O2176 ^name predict-no)
=>WM: (15292: O2175 ^name predict-yes)
=>WM: (15291: R1091 ^value 1)
=>WM: (15290: R1 ^reward R1091)
<=WM: (15281: S1 ^operator O2173 +)
<=WM: (15282: S1 ^operator O2174 +)
<=WM: (15283: S1 ^operator O2174)
<=WM: (15277: R1 ^reward R1090)
<=WM: (15280: O2174 ^name predict-no)
<=WM: (15279: O2173 ^name predict-yes)
<=WM: (15278: R1090 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2175 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2176 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2174 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2173 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15296: S1 ^operator O2176)

  1088:    O: O2176 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1088 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1087 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15297: I3 ^predict-no N1088)
<=WM: (15285: N1087 ^status complete)
<=WM: (15284: I3 ^predict-no N1087)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (15301: I2 ^dir L)
=>WM: (15300: I2 ^reward 1)
=>WM: (15299: I2 ^see 0)
=>WM: (15298: N1088 ^status complete)
<=WM: (15288: I2 ^dir U)
<=WM: (15287: I2 ^reward 1)
<=WM: (15286: I2 ^see 0)
=>WM: (15302: I2 ^level-1 L0-root)
<=WM: (15289: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2175 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2176 = 0.6854394259185996)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1092 ^value 1 +)
 (R1 ^reward R1092 +)
Firing propose*predict-yes
 -->
 (O2177 ^name predict-yes +)
 (S1 ^operator O2177 +)
Firing propose*predict-no
 -->
 (O2178 ^name predict-no +)
 (S1 ^operator O2178 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2176 = 0.3144938653010612)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2175 = 0.3907872220793651)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2176 ^name predict-no +)
 (S1 ^operator O2176 +)
Retracting propose*predict-yes
 -->
 (O2175 ^name predict-yes +)
 (S1 ^operator O2175 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1091 ^value 1 +)
 (R1 ^reward R1091 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2176 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2175 = 0.)
=>WM: (15309: S1 ^operator O2178 +)
=>WM: (15308: S1 ^operator O2177 +)
=>WM: (15307: I3 ^dir L)
=>WM: (15306: O2178 ^name predict-no)
=>WM: (15305: O2177 ^name predict-yes)
=>WM: (15304: R1092 ^value 1)
=>WM: (15303: R1 ^reward R1092)
<=WM: (15294: S1 ^operator O2175 +)
<=WM: (15295: S1 ^operator O2176 +)
<=WM: (15296: S1 ^operator O2176)
<=WM: (15267: I3 ^dir U)
<=WM: (15290: R1 ^reward R1091)
<=WM: (15293: O2176 ^name predict-no)
<=WM: (15292: O2175 ^name predict-yes)
<=WM: (15291: R1091 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2177 = -0.208713043145708)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2177 = 0.3907872220793651)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2178 = 0.6854394259185996)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2178 = 0.3144938653010612)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2176 = 0.3144938653010612)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2176 = 0.6854394259185996)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2175 = 0.3907872220793651)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2175 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15310: S1 ^operator O2178)

  1089:    O: O2178 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1089 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1088 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15311: I3 ^predict-no N1089)
<=WM: (15298: N1088 ^status complete)
<=WM: (15297: I3 ^predict-no N1088)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15315: I2 ^dir L)
=>WM: (15314: I2 ^reward 1)
=>WM: (15313: I2 ^see 0)
=>WM: (15312: N1089 ^status complete)
<=WM: (15301: I2 ^dir L)
<=WM: (15300: I2 ^reward 1)
<=WM: (15299: I2 ^see 0)
=>WM: (15316: I2 ^level-1 L0-root)
<=WM: (15302: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2177 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2178 = 0.6854394259185996)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1093 ^value 1 +)
 (R1 ^reward R1093 +)
Firing propose*predict-yes
 -->
 (O2179 ^name predict-yes +)
 (S1 ^operator O2179 +)
Firing propose*predict-no
 -->
 (O2180 ^name predict-no +)
 (S1 ^operator O2180 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2178 = 0.3144938653010612)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2177 = 0.3907872220793651)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2178 ^name predict-no +)
 (S1 ^operator O2178 +)
Retracting propose*predict-yes
 -->
 (O2177 ^name predict-yes +)
 (S1 ^operator O2177 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1092 ^value 1 +)
 (R1 ^reward R1092 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2178 = 0.3144938653010612)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2178 = 0.6854394259185996)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2177 = 0.3907872220793651)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2177 = -0.208713043145708)
=>WM: (15322: S1 ^operator O2180 +)
=>WM: (15321: S1 ^operator O2179 +)
=>WM: (15320: O2180 ^name predict-no)
=>WM: (15319: O2179 ^name predict-yes)
=>WM: (15318: R1093 ^value 1)
=>WM: (15317: R1 ^reward R1093)
<=WM: (15308: S1 ^operator O2177 +)
<=WM: (15309: S1 ^operator O2178 +)
<=WM: (15310: S1 ^operator O2178)
<=WM: (15303: R1 ^reward R1092)
<=WM: (15306: O2178 ^name predict-no)
<=WM: (15305: O2177 ^name predict-yes)
<=WM: (15304: R1092 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2179 = -0.208713043145708)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2179 = 0.3907872220793651)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2180 = 0.6854394259185996)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2180 = 0.3144938653010612)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2178 = 0.3144938653010612)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2178 = 0.6854394259185996)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2177 = 0.3907872220793651)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2177 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478543 -0.164049 0.314494 -> 0.478548 -0.164049 0.314499(R,m,v=1,0.927711,0.0674699)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521396 0.164043 0.685439 -> 0.521402 0.164044 0.685446(R,m,v=1,1,0)
=>WM: (15323: S1 ^operator O2180)

  1090:    O: O2180 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1090 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1089 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15324: I3 ^predict-no N1090)
<=WM: (15312: N1089 ^status complete)
<=WM: (15311: I3 ^predict-no N1089)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15328: I2 ^dir L)
=>WM: (15327: I2 ^reward 1)
=>WM: (15326: I2 ^see 0)
=>WM: (15325: N1090 ^status complete)
<=WM: (15315: I2 ^dir L)
<=WM: (15314: I2 ^reward 1)
<=WM: (15313: I2 ^see 0)
=>WM: (15329: I2 ^level-1 L0-root)
<=WM: (15316: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2179 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2180 = 0.6854458162511854)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1094 ^value 1 +)
 (R1 ^reward R1094 +)
Firing propose*predict-yes
 -->
 (O2181 ^name predict-yes +)
 (S1 ^operator O2181 +)
Firing propose*predict-no
 -->
 (O2182 ^name predict-no +)
 (S1 ^operator O2182 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2180 = 0.3144993225093091)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2179 = 0.3907872220793651)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2180 ^name predict-no +)
 (S1 ^operator O2180 +)
Retracting propose*predict-yes
 -->
 (O2179 ^name predict-yes +)
 (S1 ^operator O2179 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1093 ^value 1 +)
 (R1 ^reward R1093 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2180 = 0.3144993225093091)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2180 = 0.6854458162511854)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2179 = 0.3907872220793651)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2179 = -0.208713043145708)
=>WM: (15335: S1 ^operator O2182 +)
=>WM: (15334: S1 ^operator O2181 +)
=>WM: (15333: O2182 ^name predict-no)
=>WM: (15332: O2181 ^name predict-yes)
=>WM: (15331: R1094 ^value 1)
=>WM: (15330: R1 ^reward R1094)
<=WM: (15321: S1 ^operator O2179 +)
<=WM: (15322: S1 ^operator O2180 +)
<=WM: (15323: S1 ^operator O2180)
<=WM: (15317: R1 ^reward R1093)
<=WM: (15320: O2180 ^name predict-no)
<=WM: (15319: O2179 ^name predict-yes)
<=WM: (15318: R1093 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2181 = -0.208713043145708)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2181 = 0.3907872220793651)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2182 = 0.6854458162511854)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2182 = 0.3144993225093091)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2180 = 0.3144993225093091)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2180 = 0.6854458162511854)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2179 = 0.3907872220793651)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2179 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478548 -0.164049 0.314499 -> 0.478552 -0.164048 0.314504(R,m,v=1,0.928144,0.0670947)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521402 0.164044 0.685446 -> 0.521407 0.164044 0.685451(R,m,v=1,1,0)
=>WM: (15336: S1 ^operator O2182)

  1091:    O: O2182 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1091 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1090 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15337: I3 ^predict-no N1091)
<=WM: (15325: N1090 ^status complete)
<=WM: (15324: I3 ^predict-no N1090)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\--- Input Phase --- 
=>WM: (15341: I2 ^dir L)
=>WM: (15340: I2 ^reward 1)
=>WM: (15339: I2 ^see 0)
=>WM: (15338: N1091 ^status complete)
<=WM: (15328: I2 ^dir L)
<=WM: (15327: I2 ^reward 1)
<=WM: (15326: I2 ^see 0)
=>WM: (15342: I2 ^level-1 L0-root)
<=WM: (15329: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2181 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2182 = 0.685451056996617)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1095 ^value 1 +)
 (R1 ^reward R1095 +)
Firing propose*predict-yes
 -->
 (O2183 ^name predict-yes +)
 (S1 ^operator O2183 +)
Firing propose*predict-no
 -->
 (O2184 ^name predict-no +)
 (S1 ^operator O2184 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2182 = 0.3145038061064807)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2181 = 0.3907872220793651)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2182 ^name predict-no +)
 (S1 ^operator O2182 +)
Retracting propose*predict-yes
 -->
 (O2181 ^name predict-yes +)
 (S1 ^operator O2181 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1094 ^value 1 +)
 (R1 ^reward R1094 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2182 = 0.3145038061064807)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2182 = 0.685451056996617)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2181 = 0.3907872220793651)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2181 = -0.208713043145708)
=>WM: (15348: S1 ^operator O2184 +)
=>WM: (15347: S1 ^operator O2183 +)
=>WM: (15346: O2184 ^name predict-no)
=>WM: (15345: O2183 ^name predict-yes)
=>WM: (15344: R1095 ^value 1)
=>WM: (15343: R1 ^reward R1095)
<=WM: (15334: S1 ^operator O2181 +)
<=WM: (15335: S1 ^operator O2182 +)
<=WM: (15336: S1 ^operator O2182)
<=WM: (15330: R1 ^reward R1094)
<=WM: (15333: O2182 ^name predict-no)
<=WM: (15332: O2181 ^name predict-yes)
<=WM: (15331: R1094 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2183 = -0.208713043145708)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2183 = 0.3907872220793651)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2184 = 0.685451056996617)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2184 = 0.3145038061064807)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2182 = 0.3145038061064807)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2182 = 0.685451056996617)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2181 = 0.3907872220793651)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2181 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478552 -0.164048 0.314504 -> 0.478555 -0.164048 0.314507(R,m,v=1,0.928571,0.0667237)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521407 0.164044 0.685451 -> 0.521411 0.164044 0.685455(R,m,v=1,1,0)
=>WM: (15349: S1 ^operator O2184)

  1092:    O: O2184 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1092 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1091 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15350: I3 ^predict-no N1092)
<=WM: (15338: N1091 ^status complete)
<=WM: (15337: I3 ^predict-no N1091)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15354: I2 ^dir R)
=>WM: (15353: I2 ^reward 1)
=>WM: (15352: I2 ^see 0)
=>WM: (15351: N1092 ^status complete)
<=WM: (15341: I2 ^dir L)
<=WM: (15340: I2 ^reward 1)
<=WM: (15339: I2 ^see 0)
=>WM: (15355: I2 ^level-1 L0-root)
<=WM: (15342: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2183 = 0.8783989983456222)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1096 ^value 1 +)
 (R1 ^reward R1096 +)
Firing propose*predict-yes
 -->
 (O2185 ^name predict-yes +)
 (S1 ^operator O2185 +)
Firing propose*predict-no
 -->
 (O2186 ^name predict-no +)
 (S1 ^operator O2186 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2184 = 0.9436253760703815)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2183 = 0.1215961184552382)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2184 ^name predict-no +)
 (S1 ^operator O2184 +)
Retracting propose*predict-yes
 -->
 (O2183 ^name predict-yes +)
 (S1 ^operator O2183 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1095 ^value 1 +)
 (R1 ^reward R1095 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2184 = 0.3145074913744749)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2184 = 0.685455356981167)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2183 = 0.3907872220793651)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2183 = -0.208713043145708)
=>WM: (15362: S1 ^operator O2186 +)
=>WM: (15361: S1 ^operator O2185 +)
=>WM: (15360: I3 ^dir R)
=>WM: (15359: O2186 ^name predict-no)
=>WM: (15358: O2185 ^name predict-yes)
=>WM: (15357: R1096 ^value 1)
=>WM: (15356: R1 ^reward R1096)
<=WM: (15347: S1 ^operator O2183 +)
<=WM: (15348: S1 ^operator O2184 +)
<=WM: (15349: S1 ^operator O2184)
<=WM: (15307: I3 ^dir L)
<=WM: (15343: R1 ^reward R1095)
<=WM: (15346: O2184 ^name predict-no)
<=WM: (15345: O2183 ^name predict-yes)
<=WM: (15344: R1095 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2185 = 0.8783989983456222)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2185 = 0.1215961184552382)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2186 = 0.9436253760703815)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2184 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2183 = 0.1215961184552382)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2183 = 0.8783989983456222)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478555 -0.164048 0.314507 -> 0.478558 -0.164048 0.314511(R,m,v=1,0.928994,0.0663567)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521411 0.164044 0.685455 -> 0.521414 0.164045 0.685459(R,m,v=1,1,0)
=>WM: (15363: S1 ^operator O2185)

  1093:    O: O2185 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1093 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1092 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15364: I3 ^predict-yes N1093)
<=WM: (15351: N1092 ^status complete)
<=WM: (15350: I3 ^predict-no N1092)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15368: I2 ^dir L)
=>WM: (15367: I2 ^reward 1)
=>WM: (15366: I2 ^see 1)
=>WM: (15365: N1093 ^status complete)
<=WM: (15354: I2 ^dir R)
<=WM: (15353: I2 ^reward 1)
<=WM: (15352: I2 ^see 0)
=>WM: (15369: I2 ^level-1 R1-root)
<=WM: (15355: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2186 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2185 = 0.6092584839497481)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1097 ^value 1 +)
 (R1 ^reward R1097 +)
Firing propose*predict-yes
 -->
 (O2187 ^name predict-yes +)
 (S1 ^operator O2187 +)
Firing propose*predict-no
 -->
 (O2188 ^name predict-no +)
 (S1 ^operator O2188 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2186 = 0.3145105217381143)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2185 = 0.3907872220793651)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2186 ^name predict-no +)
 (S1 ^operator O2186 +)
Retracting propose*predict-yes
 -->
 (O2185 ^name predict-yes +)
 (S1 ^operator O2185 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1096 ^value 1 +)
 (R1 ^reward R1096 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2186 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2185 = 0.1215961184552382)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2185 = 0.8783989983456222)
=>WM: (15377: S1 ^operator O2188 +)
=>WM: (15376: S1 ^operator O2187 +)
=>WM: (15375: I3 ^dir L)
=>WM: (15374: O2188 ^name predict-no)
=>WM: (15373: O2187 ^name predict-yes)
=>WM: (15372: R1097 ^value 1)
=>WM: (15371: R1 ^reward R1097)
=>WM: (15370: I3 ^see 1)
<=WM: (15361: S1 ^operator O2185 +)
<=WM: (15363: S1 ^operator O2185)
<=WM: (15362: S1 ^operator O2186 +)
<=WM: (15360: I3 ^dir R)
<=WM: (15356: R1 ^reward R1096)
<=WM: (15262: I3 ^see 0)
<=WM: (15359: O2186 ^name predict-no)
<=WM: (15358: O2185 ^name predict-yes)
<=WM: (15357: R1096 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2187 = 0.3907872220793651)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2187 = 0.6092584839497481)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2188 = 0.3145105217381143)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2188 = -0.168718511744511)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2186 = 0.3145105217381143)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2186 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2185 = 0.3907872220793651)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2185 = 0.6092584839497481)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121596 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.876289,0.108969)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465473 0.412926 0.878399 -> 0.465474 0.412926 0.878399(R,m,v=1,1,0)
=>WM: (15378: S1 ^operator O2187)

  1094:    O: O2187 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1094 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1093 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15379: I3 ^predict-yes N1094)
<=WM: (15365: N1093 ^status complete)
<=WM: (15364: I3 ^predict-yes N1093)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (15383: I2 ^dir R)
=>WM: (15382: I2 ^reward 1)
=>WM: (15381: I2 ^see 1)
=>WM: (15380: N1094 ^status complete)
<=WM: (15368: I2 ^dir L)
<=WM: (15367: I2 ^reward 1)
<=WM: (15366: I2 ^see 1)
=>WM: (15384: I2 ^level-1 L1-root)
<=WM: (15369: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2187 = 0.8784070247478919)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1098 ^value 1 +)
 (R1 ^reward R1098 +)
Firing propose*predict-yes
 -->
 (O2189 ^name predict-yes +)
 (S1 ^operator O2189 +)
Firing propose*predict-no
 -->
 (O2190 ^name predict-no +)
 (S1 ^operator O2190 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2188 = 0.9436253760703815)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2187 = 0.1215965079981263)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2188 ^name predict-no +)
 (S1 ^operator O2188 +)
Retracting propose*predict-yes
 -->
 (O2187 ^name predict-yes +)
 (S1 ^operator O2187 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1097 ^value 1 +)
 (R1 ^reward R1097 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2188 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2188 = 0.3145105217381143)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2187 = 0.6092584839497481)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2187 = 0.3907872220793651)
=>WM: (15391: S1 ^operator O2190 +)
=>WM: (15390: S1 ^operator O2189 +)
=>WM: (15389: I3 ^dir R)
=>WM: (15388: O2190 ^name predict-no)
=>WM: (15387: O2189 ^name predict-yes)
=>WM: (15386: R1098 ^value 1)
=>WM: (15385: R1 ^reward R1098)
<=WM: (15376: S1 ^operator O2187 +)
<=WM: (15378: S1 ^operator O2187)
<=WM: (15377: S1 ^operator O2188 +)
<=WM: (15375: I3 ^dir L)
<=WM: (15371: R1 ^reward R1097)
<=WM: (15374: O2188 ^name predict-no)
<=WM: (15373: O2187 ^name predict-yes)
<=WM: (15372: R1097 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2189 = 0.1215965079981263)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2189 = 0.8784070247478919)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2190 = 0.9436253760703815)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2188 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2187 = 0.1215965079981263)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2187 = 0.8784070247478919)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472332 -0.0815445 0.390787 -> 0.472329 -0.0815451 0.390784(R,m,v=1,0.949721,0.0480196)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527707 0.0815513 0.609258 -> 0.527704 0.0815506 0.609254(R,m,v=1,1,0)
=>WM: (15392: S1 ^operator O2189)

  1095:    O: O2189 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1095 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1094 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15393: I3 ^predict-yes N1095)
<=WM: (15380: N1094 ^status complete)
<=WM: (15379: I3 ^predict-yes N1094)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15397: I2 ^dir U)
=>WM: (15396: I2 ^reward 1)
=>WM: (15395: I2 ^see 1)
=>WM: (15394: N1095 ^status complete)
<=WM: (15383: I2 ^dir R)
<=WM: (15382: I2 ^reward 1)
<=WM: (15381: I2 ^see 1)
=>WM: (15398: I2 ^level-1 R1-root)
<=WM: (15384: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1099 ^value 1 +)
 (R1 ^reward R1099 +)
Firing propose*predict-yes
 -->
 (O2191 ^name predict-yes +)
 (S1 ^operator O2191 +)
Firing propose*predict-no
 -->
 (O2192 ^name predict-no +)
 (S1 ^operator O2192 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2190 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2189 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2190 ^name predict-no +)
 (S1 ^operator O2190 +)
Retracting propose*predict-yes
 -->
 (O2189 ^name predict-yes +)
 (S1 ^operator O2189 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1098 ^value 1 +)
 (R1 ^reward R1098 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2190 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2189 = 0.8784070247478919)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2189 = 0.1215965079981263)
=>WM: (15405: S1 ^operator O2192 +)
=>WM: (15404: S1 ^operator O2191 +)
=>WM: (15403: I3 ^dir U)
=>WM: (15402: O2192 ^name predict-no)
=>WM: (15401: O2191 ^name predict-yes)
=>WM: (15400: R1099 ^value 1)
=>WM: (15399: R1 ^reward R1099)
<=WM: (15390: S1 ^operator O2189 +)
<=WM: (15392: S1 ^operator O2189)
<=WM: (15391: S1 ^operator O2190 +)
<=WM: (15389: I3 ^dir R)
<=WM: (15385: R1 ^reward R1098)
<=WM: (15388: O2190 ^name predict-no)
<=WM: (15387: O2189 ^name predict-yes)
<=WM: (15386: R1098 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2191 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2192 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2190 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2189 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121596(R,m,v=1,0.876923,0.108485)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.46548 0.412927 0.878407 -> 0.46548 0.412927 0.878407(R,m,v=1,1,0)
=>WM: (15406: S1 ^operator O2192)

  1096:    O: O2192 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1096 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1095 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15407: I3 ^predict-no N1096)
<=WM: (15394: N1095 ^status complete)
<=WM: (15393: I3 ^predict-yes N1095)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (15411: I2 ^dir U)
=>WM: (15410: I2 ^reward 1)
=>WM: (15409: I2 ^see 0)
=>WM: (15408: N1096 ^status complete)
<=WM: (15397: I2 ^dir U)
<=WM: (15396: I2 ^reward 1)
<=WM: (15395: I2 ^see 1)
=>WM: (15412: I2 ^level-1 R1-root)
<=WM: (15398: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1100 ^value 1 +)
 (R1 ^reward R1100 +)
Firing propose*predict-yes
 -->
 (O2193 ^name predict-yes +)
 (S1 ^operator O2193 +)
Firing propose*predict-no
 -->
 (O2194 ^name predict-no +)
 (S1 ^operator O2194 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2192 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2191 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2192 ^name predict-no +)
 (S1 ^operator O2192 +)
Retracting propose*predict-yes
 -->
 (O2191 ^name predict-yes +)
 (S1 ^operator O2191 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1099 ^value 1 +)
 (R1 ^reward R1099 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2192 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2191 = 0.)
=>WM: (15419: S1 ^operator O2194 +)
=>WM: (15418: S1 ^operator O2193 +)
=>WM: (15417: O2194 ^name predict-no)
=>WM: (15416: O2193 ^name predict-yes)
=>WM: (15415: R1100 ^value 1)
=>WM: (15414: R1 ^reward R1100)
=>WM: (15413: I3 ^see 0)
<=WM: (15404: S1 ^operator O2191 +)
<=WM: (15405: S1 ^operator O2192 +)
<=WM: (15406: S1 ^operator O2192)
<=WM: (15399: R1 ^reward R1099)
<=WM: (15370: I3 ^see 1)
<=WM: (15402: O2192 ^name predict-no)
<=WM: (15401: O2191 ^name predict-yes)
<=WM: (15400: R1099 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2193 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2194 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2192 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2191 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15420: S1 ^operator O2194)

  1097:    O: O2194 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1097 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1096 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15421: I3 ^predict-no N1097)
<=WM: (15408: N1096 ^status complete)
<=WM: (15407: I3 ^predict-no N1096)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (15425: I2 ^dir L)
=>WM: (15424: I2 ^reward 1)
=>WM: (15423: I2 ^see 0)
=>WM: (15422: N1097 ^status complete)
<=WM: (15411: I2 ^dir U)
<=WM: (15410: I2 ^reward 1)
<=WM: (15409: I2 ^see 0)
=>WM: (15426: I2 ^level-1 R1-root)
<=WM: (15412: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2194 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2193 = 0.6092542666242702)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1101 ^value 1 +)
 (R1 ^reward R1101 +)
Firing propose*predict-yes
 -->
 (O2195 ^name predict-yes +)
 (S1 ^operator O2195 +)
Firing propose*predict-no
 -->
 (O2196 ^name predict-no +)
 (S1 ^operator O2196 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2194 = 0.3145105217381143)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2193 = 0.3907835285947055)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2194 ^name predict-no +)
 (S1 ^operator O2194 +)
Retracting propose*predict-yes
 -->
 (O2193 ^name predict-yes +)
 (S1 ^operator O2193 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1100 ^value 1 +)
 (R1 ^reward R1100 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2194 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2193 = 0.)
=>WM: (15433: S1 ^operator O2196 +)
=>WM: (15432: S1 ^operator O2195 +)
=>WM: (15431: I3 ^dir L)
=>WM: (15430: O2196 ^name predict-no)
=>WM: (15429: O2195 ^name predict-yes)
=>WM: (15428: R1101 ^value 1)
=>WM: (15427: R1 ^reward R1101)
<=WM: (15418: S1 ^operator O2193 +)
<=WM: (15419: S1 ^operator O2194 +)
<=WM: (15420: S1 ^operator O2194)
<=WM: (15403: I3 ^dir U)
<=WM: (15414: R1 ^reward R1100)
<=WM: (15417: O2194 ^name predict-no)
<=WM: (15416: O2193 ^name predict-yes)
<=WM: (15415: R1100 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2195 = 0.6092542666242702)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2195 = 0.3907835285947055)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2196 = -0.168718511744511)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2196 = 0.3145105217381143)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2194 = 0.3145105217381143)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2194 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2193 = 0.3907835285947055)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2193 = 0.6092542666242702)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15434: S1 ^operator O2195)

  1098:    O: O2195 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1098 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1097 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15435: I3 ^predict-yes N1098)
<=WM: (15422: N1097 ^status complete)
<=WM: (15421: I3 ^predict-no N1097)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15439: I2 ^dir L)
=>WM: (15438: I2 ^reward 1)
=>WM: (15437: I2 ^see 1)
=>WM: (15436: N1098 ^status complete)
<=WM: (15425: I2 ^dir L)
<=WM: (15424: I2 ^reward 1)
<=WM: (15423: I2 ^see 0)
=>WM: (15440: I2 ^level-1 L1-root)
<=WM: (15426: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2195 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2196 = 0.685514319964578)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1102 ^value 1 +)
 (R1 ^reward R1102 +)
Firing propose*predict-yes
 -->
 (O2197 ^name predict-yes +)
 (S1 ^operator O2197 +)
Firing propose*predict-no
 -->
 (O2198 ^name predict-no +)
 (S1 ^operator O2198 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2196 = 0.3145105217381143)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2195 = 0.3907835285947055)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2196 ^name predict-no +)
 (S1 ^operator O2196 +)
Retracting propose*predict-yes
 -->
 (O2195 ^name predict-yes +)
 (S1 ^operator O2195 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1101 ^value 1 +)
 (R1 ^reward R1101 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2196 = 0.3145105217381143)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2196 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2195 = 0.3907835285947055)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2195 = 0.6092542666242702)
=>WM: (15447: S1 ^operator O2198 +)
=>WM: (15446: S1 ^operator O2197 +)
=>WM: (15445: O2198 ^name predict-no)
=>WM: (15444: O2197 ^name predict-yes)
=>WM: (15443: R1102 ^value 1)
=>WM: (15442: R1 ^reward R1102)
=>WM: (15441: I3 ^see 1)
<=WM: (15432: S1 ^operator O2195 +)
<=WM: (15434: S1 ^operator O2195)
<=WM: (15433: S1 ^operator O2196 +)
<=WM: (15427: R1 ^reward R1101)
<=WM: (15413: I3 ^see 0)
<=WM: (15430: O2196 ^name predict-no)
<=WM: (15429: O2195 ^name predict-yes)
<=WM: (15428: R1101 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2197 = 0.3907835285947055)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2197 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2198 = 0.3145105217381143)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2198 = 0.685514319964578)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2196 = 0.3145105217381143)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2196 = 0.685514319964578)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2195 = 0.3907835285947055)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2195 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472329 -0.0815451 0.390784 -> 0.472326 -0.0815455 0.39078(R,m,v=1,0.95,0.0477654)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527704 0.0815506 0.609254 -> 0.527701 0.0815501 0.609251(R,m,v=1,1,0)
=>WM: (15448: S1 ^operator O2198)

  1099:    O: O2198 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1099 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1098 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15449: I3 ^predict-no N1099)
<=WM: (15436: N1098 ^status complete)
<=WM: (15435: I3 ^predict-yes N1098)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15453: I2 ^dir U)
=>WM: (15452: I2 ^reward 1)
=>WM: (15451: I2 ^see 0)
=>WM: (15450: N1099 ^status complete)
<=WM: (15439: I2 ^dir L)
<=WM: (15438: I2 ^reward 1)
<=WM: (15437: I2 ^see 1)
=>WM: (15454: I2 ^level-1 L0-root)
<=WM: (15440: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1103 ^value 1 +)
 (R1 ^reward R1103 +)
Firing propose*predict-yes
 -->
 (O2199 ^name predict-yes +)
 (S1 ^operator O2199 +)
Firing propose*predict-no
 -->
 (O2200 ^name predict-no +)
 (S1 ^operator O2200 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2198 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2197 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2198 ^name predict-no +)
 (S1 ^operator O2198 +)
Retracting propose*predict-yes
 -->
 (O2197 ^name predict-yes +)
 (S1 ^operator O2197 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1102 ^value 1 +)
 (R1 ^reward R1102 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2198 = 0.685514319964578)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2198 = 0.3145105217381143)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2197 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2197 = 0.3907804771267326)
=>WM: (15462: S1 ^operator O2200 +)
=>WM: (15461: S1 ^operator O2199 +)
=>WM: (15460: I3 ^dir U)
=>WM: (15459: O2200 ^name predict-no)
=>WM: (15458: O2199 ^name predict-yes)
=>WM: (15457: R1103 ^value 1)
=>WM: (15456: R1 ^reward R1103)
=>WM: (15455: I3 ^see 0)
<=WM: (15446: S1 ^operator O2197 +)
<=WM: (15447: S1 ^operator O2198 +)
<=WM: (15448: S1 ^operator O2198)
<=WM: (15431: I3 ^dir L)
<=WM: (15442: R1 ^reward R1102)
<=WM: (15441: I3 ^see 1)
<=WM: (15445: O2198 ^name predict-no)
<=WM: (15444: O2197 ^name predict-yes)
<=WM: (15443: R1102 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2199 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2200 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2198 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2197 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478558 -0.164048 0.314511 -> 0.478556 -0.164048 0.314508(R,m,v=1,0.929412,0.0659937)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521464 0.16405 0.685514 -> 0.521462 0.16405 0.685512(R,m,v=1,1,0)
=>WM: (15463: S1 ^operator O2200)

  1100:    O: O2200 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1100 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1099 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15464: I3 ^predict-no N1100)
<=WM: (15450: N1099 ^status complete)
<=WM: (15449: I3 ^predict-no N1099)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15468: I2 ^dir R)
=>WM: (15467: I2 ^reward 1)
=>WM: (15466: I2 ^see 0)
=>WM: (15465: N1100 ^status complete)
<=WM: (15453: I2 ^dir U)
<=WM: (15452: I2 ^reward 1)
<=WM: (15451: I2 ^see 0)
=>WM: (15469: I2 ^level-1 L0-root)
<=WM: (15454: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2199 = 0.878399454147804)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1104 ^value 1 +)
 (R1 ^reward R1104 +)
Firing propose*predict-yes
 -->
 (O2201 ^name predict-yes +)
 (S1 ^operator O2201 +)
Firing propose*predict-no
 -->
 (O2202 ^name predict-no +)
 (S1 ^operator O2202 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2200 = 0.9436253760703815)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2199 = 0.1215962264146522)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2200 ^name predict-no +)
 (S1 ^operator O2200 +)
Retracting propose*predict-yes
 -->
 (O2199 ^name predict-yes +)
 (S1 ^operator O2199 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1103 ^value 1 +)
 (R1 ^reward R1103 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2200 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2199 = 0.)
=>WM: (15476: S1 ^operator O2202 +)
=>WM: (15475: S1 ^operator O2201 +)
=>WM: (15474: I3 ^dir R)
=>WM: (15473: O2202 ^name predict-no)
=>WM: (15472: O2201 ^name predict-yes)
=>WM: (15471: R1104 ^value 1)
=>WM: (15470: R1 ^reward R1104)
<=WM: (15461: S1 ^operator O2199 +)
<=WM: (15462: S1 ^operator O2200 +)
<=WM: (15463: S1 ^operator O2200)
<=WM: (15460: I3 ^dir U)
<=WM: (15456: R1 ^reward R1103)
<=WM: (15459: O2200 ^name predict-no)
<=WM: (15458: O2199 ^name predict-yes)
<=WM: (15457: R1103 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2201 = 0.878399454147804)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2201 = 0.1215962264146522)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2202 = 0.9436253760703815)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2200 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2199 = 0.1215962264146522)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2199 = 0.878399454147804)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15477: S1 ^operator O2201)

  1101:    O: O2201 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1101 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1100 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15478: I3 ^predict-yes N1101)
<=WM: (15465: N1100 ^status complete)
<=WM: (15464: I3 ^predict-no N1100)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
---- Input Phase --- 
=>WM: (15482: I2 ^dir L)
=>WM: (15481: I2 ^reward 1)
=>WM: (15480: I2 ^see 1)
=>WM: (15479: N1101 ^status complete)
<=WM: (15468: I2 ^dir R)
<=WM: (15467: I2 ^reward 1)
<=WM: (15466: I2 ^see 0)
=>WM: (15483: I2 ^level-1 R1-root)
<=WM: (15469: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2202 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2201 = 0.6092507869249565)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1105 ^value 1 +)
 (R1 ^reward R1105 +)
Firing propose*predict-yes
 -->
 (O2203 ^name predict-yes +)
 (S1 ^operator O2203 +)
Firing propose*predict-no
 -->
 (O2204 ^name predict-no +)
 (S1 ^operator O2204 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2202 = 0.3145084974129228)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2201 = 0.3907804771267326)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2202 ^name predict-no +)
 (S1 ^operator O2202 +)
Retracting propose*predict-yes
 -->
 (O2201 ^name predict-yes +)
 (S1 ^operator O2201 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1104 ^value 1 +)
 (R1 ^reward R1104 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2202 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2201 = 0.1215962264146522)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2201 = 0.878399454147804)
=>WM: (15491: S1 ^operator O2204 +)
=>WM: (15490: S1 ^operator O2203 +)
=>WM: (15489: I3 ^dir L)
=>WM: (15488: O2204 ^name predict-no)
=>WM: (15487: O2203 ^name predict-yes)
=>WM: (15486: R1105 ^value 1)
=>WM: (15485: R1 ^reward R1105)
=>WM: (15484: I3 ^see 1)
<=WM: (15475: S1 ^operator O2201 +)
<=WM: (15477: S1 ^operator O2201)
<=WM: (15476: S1 ^operator O2202 +)
<=WM: (15474: I3 ^dir R)
<=WM: (15470: R1 ^reward R1104)
<=WM: (15455: I3 ^see 0)
<=WM: (15473: O2202 ^name predict-no)
<=WM: (15472: O2201 ^name predict-yes)
<=WM: (15471: R1104 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2203 = 0.3907804771267326)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2203 = 0.6092507869249565)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2204 = 0.3145084974129228)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2204 = -0.168718511744511)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2202 = 0.3145084974129228)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2202 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2201 = 0.3907804771267326)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2201 = 0.6092507869249565)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121596 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.877551,0.108006)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465474 0.412926 0.878399 -> 0.465474 0.412926 0.8784(R,m,v=1,1,0)
=>WM: (15492: S1 ^operator O2203)

  1102:    O: O2203 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1102 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1101 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15493: I3 ^predict-yes N1102)
<=WM: (15479: N1101 ^status complete)
<=WM: (15478: I3 ^predict-yes N1101)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (15497: I2 ^dir L)
=>WM: (15496: I2 ^reward 1)
=>WM: (15495: I2 ^see 1)
=>WM: (15494: N1102 ^status complete)
<=WM: (15482: I2 ^dir L)
<=WM: (15481: I2 ^reward 1)
<=WM: (15480: I2 ^see 1)
=>WM: (15498: I2 ^level-1 L1-root)
<=WM: (15483: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2203 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2204 = 0.6855120328590087)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1106 ^value 1 +)
 (R1 ^reward R1106 +)
Firing propose*predict-yes
 -->
 (O2205 ^name predict-yes +)
 (S1 ^operator O2205 +)
Firing propose*predict-no
 -->
 (O2206 ^name predict-no +)
 (S1 ^operator O2206 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2204 = 0.3145084974129228)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2203 = 0.3907804771267326)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2204 ^name predict-no +)
 (S1 ^operator O2204 +)
Retracting propose*predict-yes
 -->
 (O2203 ^name predict-yes +)
 (S1 ^operator O2203 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1105 ^value 1 +)
 (R1 ^reward R1105 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2204 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2204 = 0.3145084974129228)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2203 = 0.6092507869249565)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2203 = 0.3907804771267326)
=>WM: (15504: S1 ^operator O2206 +)
=>WM: (15503: S1 ^operator O2205 +)
=>WM: (15502: O2206 ^name predict-no)
=>WM: (15501: O2205 ^name predict-yes)
=>WM: (15500: R1106 ^value 1)
=>WM: (15499: R1 ^reward R1106)
<=WM: (15490: S1 ^operator O2203 +)
<=WM: (15492: S1 ^operator O2203)
<=WM: (15491: S1 ^operator O2204 +)
<=WM: (15485: R1 ^reward R1105)
<=WM: (15488: O2204 ^name predict-no)
<=WM: (15487: O2203 ^name predict-yes)
<=WM: (15486: R1105 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2205 = 0.3907804771267326)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2205 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2206 = 0.3145084974129228)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2206 = 0.6855120328590087)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2204 = 0.3145084974129228)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2204 = 0.6855120328590087)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2203 = 0.3907804771267326)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2203 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472326 -0.0815455 0.39078 -> 0.472324 -0.0815459 0.390778(R,m,v=1,0.950276,0.0475138)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527701 0.0815501 0.609251 -> 0.527698 0.0815497 0.609248(R,m,v=1,1,0)
=>WM: (15505: S1 ^operator O2206)

  1103:    O: O2206 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1103 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1102 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15506: I3 ^predict-no N1103)
<=WM: (15494: N1102 ^status complete)
<=WM: (15493: I3 ^predict-yes N1102)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15510: I2 ^dir L)
=>WM: (15509: I2 ^reward 1)
=>WM: (15508: I2 ^see 0)
=>WM: (15507: N1103 ^status complete)
<=WM: (15497: I2 ^dir L)
<=WM: (15496: I2 ^reward 1)
<=WM: (15495: I2 ^see 1)
=>WM: (15511: I2 ^level-1 L0-root)
<=WM: (15498: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2205 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2206 = 0.6854588867079627)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1107 ^value 1 +)
 (R1 ^reward R1107 +)
Firing propose*predict-yes
 -->
 (O2207 ^name predict-yes +)
 (S1 ^operator O2207 +)
Firing propose*predict-no
 -->
 (O2208 ^name predict-no +)
 (S1 ^operator O2208 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2206 = 0.3145084974129228)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2205 = 0.3907779552208955)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2206 ^name predict-no +)
 (S1 ^operator O2206 +)
Retracting propose*predict-yes
 -->
 (O2205 ^name predict-yes +)
 (S1 ^operator O2205 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1106 ^value 1 +)
 (R1 ^reward R1106 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2206 = 0.6855120328590087)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2206 = 0.3145084974129228)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2205 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2205 = 0.3907779552208955)
=>WM: (15518: S1 ^operator O2208 +)
=>WM: (15517: S1 ^operator O2207 +)
=>WM: (15516: O2208 ^name predict-no)
=>WM: (15515: O2207 ^name predict-yes)
=>WM: (15514: R1107 ^value 1)
=>WM: (15513: R1 ^reward R1107)
=>WM: (15512: I3 ^see 0)
<=WM: (15503: S1 ^operator O2205 +)
<=WM: (15504: S1 ^operator O2206 +)
<=WM: (15505: S1 ^operator O2206)
<=WM: (15499: R1 ^reward R1106)
<=WM: (15484: I3 ^see 1)
<=WM: (15502: O2206 ^name predict-no)
<=WM: (15501: O2205 ^name predict-yes)
<=WM: (15500: R1106 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2207 = 0.3907779552208955)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2207 = -0.208713043145708)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2208 = 0.3145084974129228)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2208 = 0.6854588867079627)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2206 = 0.3145084974129228)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2206 = 0.6854588867079627)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2205 = 0.3907779552208955)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2205 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478556 -0.164048 0.314508 -> 0.478555 -0.164048 0.314507(R,m,v=1,0.929825,0.0656347)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521462 0.16405 0.685512 -> 0.521461 0.16405 0.68551(R,m,v=1,1,0)
=>WM: (15519: S1 ^operator O2208)

  1104:    O: O2208 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1104 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1103 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15520: I3 ^predict-no N1104)
<=WM: (15507: N1103 ^status complete)
<=WM: (15506: I3 ^predict-no N1103)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15524: I2 ^dir L)
=>WM: (15523: I2 ^reward 1)
=>WM: (15522: I2 ^see 0)
=>WM: (15521: N1104 ^status complete)
<=WM: (15510: I2 ^dir L)
<=WM: (15509: I2 ^reward 1)
<=WM: (15508: I2 ^see 0)
=>WM: (15525: I2 ^level-1 L0-root)
<=WM: (15511: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2207 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2208 = 0.6854588867079627)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1108 ^value 1 +)
 (R1 ^reward R1108 +)
Firing propose*predict-yes
 -->
 (O2209 ^name predict-yes +)
 (S1 ^operator O2209 +)
Firing propose*predict-no
 -->
 (O2210 ^name predict-no +)
 (S1 ^operator O2210 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2208 = 0.3145068260195175)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2207 = 0.3907779552208955)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2208 ^name predict-no +)
 (S1 ^operator O2208 +)
Retracting propose*predict-yes
 -->
 (O2207 ^name predict-yes +)
 (S1 ^operator O2207 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1107 ^value 1 +)
 (R1 ^reward R1107 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2208 = 0.6854588867079627)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2208 = 0.3145068260195175)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2207 = -0.208713043145708)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2207 = 0.3907779552208955)
=>WM: (15531: S1 ^operator O2210 +)
=>WM: (15530: S1 ^operator O2209 +)
=>WM: (15529: O2210 ^name predict-no)
=>WM: (15528: O2209 ^name predict-yes)
=>WM: (15527: R1108 ^value 1)
=>WM: (15526: R1 ^reward R1108)
<=WM: (15517: S1 ^operator O2207 +)
<=WM: (15518: S1 ^operator O2208 +)
<=WM: (15519: S1 ^operator O2208)
<=WM: (15513: R1 ^reward R1107)
<=WM: (15516: O2208 ^name predict-no)
<=WM: (15515: O2207 ^name predict-yes)
<=WM: (15514: R1107 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2209 = 0.3907779552208955)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2209 = -0.208713043145708)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2210 = 0.3145068260195175)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2210 = 0.6854588867079627)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2208 = 0.3145068260195175)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2208 = 0.6854588867079627)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2207 = 0.3907779552208955)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2207 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478555 -0.164048 0.314507 -> 0.478557 -0.164048 0.31451(R,m,v=1,0.930233,0.0652795)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521414 0.164045 0.685459 -> 0.521417 0.164045 0.685462(R,m,v=1,1,0)
=>WM: (15532: S1 ^operator O2210)

  1105:    O: O2210 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1105 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1104 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15533: I3 ^predict-no N1105)
<=WM: (15521: N1104 ^status complete)
<=WM: (15520: I3 ^predict-no N1104)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|--- Input Phase --- 
=>WM: (15537: I2 ^dir L)
=>WM: (15536: I2 ^reward 1)
=>WM: (15535: I2 ^see 0)
=>WM: (15534: N1105 ^status complete)
<=WM: (15524: I2 ^dir L)
<=WM: (15523: I2 ^reward 1)
<=WM: (15522: I2 ^see 0)
=>WM: (15538: I2 ^level-1 L0-root)
<=WM: (15525: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2209 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2210 = 0.6854621356602126)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1109 ^value 1 +)
 (R1 ^reward R1109 +)
Firing propose*predict-yes
 -->
 (O2211 ^name predict-yes +)
 (S1 ^operator O2211 +)
Firing propose*predict-no
 -->
 (O2212 ^name predict-no +)
 (S1 ^operator O2212 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2210 = 0.3145096147387795)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2209 = 0.3907779552208955)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2210 ^name predict-no +)
 (S1 ^operator O2210 +)
Retracting propose*predict-yes
 -->
 (O2209 ^name predict-yes +)
 (S1 ^operator O2209 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1108 ^value 1 +)
 (R1 ^reward R1108 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2210 = 0.6854621356602126)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2210 = 0.3145096147387795)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2209 = -0.208713043145708)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2209 = 0.3907779552208955)
=>WM: (15544: S1 ^operator O2212 +)
=>WM: (15543: S1 ^operator O2211 +)
=>WM: (15542: O2212 ^name predict-no)
=>WM: (15541: O2211 ^name predict-yes)
=>WM: (15540: R1109 ^value 1)
=>WM: (15539: R1 ^reward R1109)
<=WM: (15530: S1 ^operator O2209 +)
<=WM: (15531: S1 ^operator O2210 +)
<=WM: (15532: S1 ^operator O2210)
<=WM: (15526: R1 ^reward R1108)
<=WM: (15529: O2210 ^name predict-no)
<=WM: (15528: O2209 ^name predict-yes)
<=WM: (15527: R1108 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2211 = 0.3907779552208955)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2211 = -0.208713043145708)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2212 = 0.3145096147387795)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2212 = 0.6854621356602126)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2210 = 0.3145096147387795)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2210 = 0.6854621356602126)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2209 = 0.3907779552208955)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2209 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478557 -0.164048 0.31451 -> 0.478559 -0.164048 0.314512(R,m,v=1,0.930636,0.0649281)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521417 0.164045 0.685462 -> 0.521419 0.164045 0.685465(R,m,v=1,1,0)
=>WM: (15545: S1 ^operator O2212)

  1106:    O: O2212 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1106 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1105 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15546: I3 ^predict-no N1106)
<=WM: (15534: N1105 ^status complete)
<=WM: (15533: I3 ^predict-no N1105)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15550: I2 ^dir L)
=>WM: (15549: I2 ^reward 1)
=>WM: (15548: I2 ^see 0)
=>WM: (15547: N1106 ^status complete)
<=WM: (15537: I2 ^dir L)
<=WM: (15536: I2 ^reward 1)
<=WM: (15535: I2 ^see 0)
=>WM: (15551: I2 ^level-1 L0-root)
<=WM: (15538: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2211 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2212 = 0.685464805522946)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1110 ^value 1 +)
 (R1 ^reward R1110 +)
Firing propose*predict-yes
 -->
 (O2213 ^name predict-yes +)
 (S1 ^operator O2213 +)
Firing propose*predict-no
 -->
 (O2214 ^name predict-no +)
 (S1 ^operator O2214 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2212 = 0.3145119102257212)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2211 = 0.3907779552208955)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2212 ^name predict-no +)
 (S1 ^operator O2212 +)
Retracting propose*predict-yes
 -->
 (O2211 ^name predict-yes +)
 (S1 ^operator O2211 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1109 ^value 1 +)
 (R1 ^reward R1109 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2212 = 0.685464805522946)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2212 = 0.3145119102257212)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2211 = -0.208713043145708)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2211 = 0.3907779552208955)
=>WM: (15557: S1 ^operator O2214 +)
=>WM: (15556: S1 ^operator O2213 +)
=>WM: (15555: O2214 ^name predict-no)
=>WM: (15554: O2213 ^name predict-yes)
=>WM: (15553: R1110 ^value 1)
=>WM: (15552: R1 ^reward R1110)
<=WM: (15543: S1 ^operator O2211 +)
<=WM: (15544: S1 ^operator O2212 +)
<=WM: (15545: S1 ^operator O2212)
<=WM: (15539: R1 ^reward R1109)
<=WM: (15542: O2212 ^name predict-no)
<=WM: (15541: O2211 ^name predict-yes)
<=WM: (15540: R1109 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2213 = 0.3907779552208955)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2213 = -0.208713043145708)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2214 = 0.3145119102257212)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2214 = 0.685464805522946)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2212 = 0.3145119102257212)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2212 = 0.685464805522946)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2211 = 0.3907779552208955)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2211 = -0.208713043145708)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478559 -0.164048 0.314512 -> 0.478561 -0.164047 0.314514(R,m,v=1,0.931034,0.0645804)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521419 0.164045 0.685465 -> 0.521421 0.164046 0.685467(R,m,v=1,1,0)
=>WM: (15558: S1 ^operator O2214)

  1107:    O: O2214 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1107 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1106 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15559: I3 ^predict-no N1107)
<=WM: (15547: N1106 ^status complete)
<=WM: (15546: I3 ^predict-no N1106)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\---- Input Phase --- 
=>WM: (15563: I2 ^dir R)
=>WM: (15562: I2 ^reward 1)
=>WM: (15561: I2 ^see 0)
=>WM: (15560: N1107 ^status complete)
<=WM: (15550: I2 ^dir L)
<=WM: (15549: I2 ^reward 1)
<=WM: (15548: I2 ^see 0)
=>WM: (15564: I2 ^level-1 L0-root)
<=WM: (15551: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2213 = 0.8783998563714275)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1111 ^value 1 +)
 (R1 ^reward R1111 +)
Firing propose*predict-yes
 -->
 (O2215 ^name predict-yes +)
 (S1 ^operator O2215 +)
Firing propose*predict-no
 -->
 (O2216 ^name predict-no +)
 (S1 ^operator O2216 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2214 = 0.9436253760703815)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2213 = 0.1215965704221909)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2214 ^name predict-no +)
 (S1 ^operator O2214 +)
Retracting propose*predict-yes
 -->
 (O2213 ^name predict-yes +)
 (S1 ^operator O2213 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1110 ^value 1 +)
 (R1 ^reward R1110 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2214 = 0.685467000466911)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2214 = 0.3145138004710756)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2213 = -0.208713043145708)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2213 = 0.3907779552208955)
=>WM: (15571: S1 ^operator O2216 +)
=>WM: (15570: S1 ^operator O2215 +)
=>WM: (15569: I3 ^dir R)
=>WM: (15568: O2216 ^name predict-no)
=>WM: (15567: O2215 ^name predict-yes)
=>WM: (15566: R1111 ^value 1)
=>WM: (15565: R1 ^reward R1111)
<=WM: (15556: S1 ^operator O2213 +)
<=WM: (15557: S1 ^operator O2214 +)
<=WM: (15558: S1 ^operator O2214)
<=WM: (15489: I3 ^dir L)
<=WM: (15552: R1 ^reward R1110)
<=WM: (15555: O2214 ^name predict-no)
<=WM: (15554: O2213 ^name predict-yes)
<=WM: (15553: R1110 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2215 = 0.8783998563714275)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2215 = 0.1215965704221909)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2216 = 0.9436253760703815)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2214 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2213 = 0.1215965704221909)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2213 = 0.8783998563714275)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478561 -0.164047 0.314514 -> 0.478563 -0.164047 0.314515(R,m,v=1,0.931429,0.0642365)
RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521421 0.164046 0.685467 -> 0.521423 0.164046 0.685469(R,m,v=1,1,0)
=>WM: (15572: S1 ^operator O2215)

  1108:    O: O2215 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1108 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1107 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15573: I3 ^predict-yes N1108)
<=WM: (15560: N1107 ^status complete)
<=WM: (15559: I3 ^predict-no N1107)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15577: I2 ^dir U)
=>WM: (15576: I2 ^reward 1)
=>WM: (15575: I2 ^see 1)
=>WM: (15574: N1108 ^status complete)
<=WM: (15563: I2 ^dir R)
<=WM: (15562: I2 ^reward 1)
<=WM: (15561: I2 ^see 0)
=>WM: (15578: I2 ^level-1 R1-root)
<=WM: (15564: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1112 ^value 1 +)
 (R1 ^reward R1112 +)
Firing propose*predict-yes
 -->
 (O2217 ^name predict-yes +)
 (S1 ^operator O2217 +)
Firing propose*predict-no
 -->
 (O2218 ^name predict-no +)
 (S1 ^operator O2218 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2216 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2215 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2216 ^name predict-no +)
 (S1 ^operator O2216 +)
Retracting propose*predict-yes
 -->
 (O2215 ^name predict-yes +)
 (S1 ^operator O2215 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1111 ^value 1 +)
 (R1 ^reward R1111 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2216 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2215 = 0.1215965704221909)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2215 = 0.8783998563714275)
=>WM: (15586: S1 ^operator O2218 +)
=>WM: (15585: S1 ^operator O2217 +)
=>WM: (15584: I3 ^dir U)
=>WM: (15583: O2218 ^name predict-no)
=>WM: (15582: O2217 ^name predict-yes)
=>WM: (15581: R1112 ^value 1)
=>WM: (15580: R1 ^reward R1112)
=>WM: (15579: I3 ^see 1)
<=WM: (15570: S1 ^operator O2215 +)
<=WM: (15572: S1 ^operator O2215)
<=WM: (15571: S1 ^operator O2216 +)
<=WM: (15569: I3 ^dir R)
<=WM: (15565: R1 ^reward R1111)
<=WM: (15512: I3 ^see 0)
<=WM: (15568: O2216 ^name predict-no)
<=WM: (15567: O2215 ^name predict-yes)
<=WM: (15566: R1111 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2217 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2218 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2216 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2215 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.878173,0.107531)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465474 0.412926 0.8784 -> 0.465474 0.412926 0.8784(R,m,v=1,1,0)
=>WM: (15587: S1 ^operator O2218)

  1109:    O: O2218 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1109 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1108 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15588: I3 ^predict-no N1109)
<=WM: (15574: N1108 ^status complete)
<=WM: (15573: I3 ^predict-yes N1108)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (15592: I2 ^dir L)
=>WM: (15591: I2 ^reward 1)
=>WM: (15590: I2 ^see 0)
=>WM: (15589: N1109 ^status complete)
<=WM: (15577: I2 ^dir U)
<=WM: (15576: I2 ^reward 1)
<=WM: (15575: I2 ^see 1)
=>WM: (15593: I2 ^level-1 R1-root)
<=WM: (15578: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2218 = -0.168718511744511)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2217 = 0.6092479147905668)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1113 ^value 1 +)
 (R1 ^reward R1113 +)
Firing propose*predict-yes
 -->
 (O2219 ^name predict-yes +)
 (S1 ^operator O2219 +)
Firing propose*predict-no
 -->
 (O2220 ^name predict-no +)
 (S1 ^operator O2220 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2218 = 0.3145153576266763)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2217 = 0.3907779552208955)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2218 ^name predict-no +)
 (S1 ^operator O2218 +)
Retracting propose*predict-yes
 -->
 (O2217 ^name predict-yes +)
 (S1 ^operator O2217 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1112 ^value 1 +)
 (R1 ^reward R1112 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2218 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2217 = 0.)
=>WM: (15601: S1 ^operator O2220 +)
=>WM: (15600: S1 ^operator O2219 +)
=>WM: (15599: I3 ^dir L)
=>WM: (15598: O2220 ^name predict-no)
=>WM: (15597: O2219 ^name predict-yes)
=>WM: (15596: R1113 ^value 1)
=>WM: (15595: R1 ^reward R1113)
=>WM: (15594: I3 ^see 0)
<=WM: (15585: S1 ^operator O2217 +)
<=WM: (15586: S1 ^operator O2218 +)
<=WM: (15587: S1 ^operator O2218)
<=WM: (15584: I3 ^dir U)
<=WM: (15580: R1 ^reward R1112)
<=WM: (15579: I3 ^see 1)
<=WM: (15583: O2218 ^name predict-no)
<=WM: (15582: O2217 ^name predict-yes)
<=WM: (15581: R1112 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2219 = 0.6092479147905668)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2219 = 0.3907779552208955)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2220 = -0.168718511744511)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2220 = 0.3145153576266763)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2218 = 0.3145153576266763)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2218 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2217 = 0.3907779552208955)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2217 = 0.6092479147905668)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15602: S1 ^operator O2219)

  1110:    O: O2219 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1110 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1109 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15603: I3 ^predict-yes N1110)
<=WM: (15589: N1109 ^status complete)
<=WM: (15588: I3 ^predict-no N1109)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (15607: I2 ^dir R)
=>WM: (15606: I2 ^reward 1)
=>WM: (15605: I2 ^see 1)
=>WM: (15604: N1110 ^status complete)
<=WM: (15592: I2 ^dir L)
<=WM: (15591: I2 ^reward 1)
<=WM: (15590: I2 ^see 0)
=>WM: (15608: I2 ^level-1 L1-root)
<=WM: (15593: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2219 = 0.8784067009010752)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1114 ^value 1 +)
 (R1 ^reward R1114 +)
Firing propose*predict-yes
 -->
 (O2221 ^name predict-yes +)
 (S1 ^operator O2221 +)
Firing propose*predict-no
 -->
 (O2222 ^name predict-no +)
 (S1 ^operator O2222 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2220 = 0.9436253760703815)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2219 = 0.1215968547680865)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2220 ^name predict-no +)
 (S1 ^operator O2220 +)
Retracting propose*predict-yes
 -->
 (O2219 ^name predict-yes +)
 (S1 ^operator O2219 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1113 ^value 1 +)
 (R1 ^reward R1113 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2220 = 0.3145153576266763)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
 -->
 (S1 ^operator O2220 = -0.168718511744511)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2219 = 0.3907779552208955)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
 -->
 (S1 ^operator O2219 = 0.6092479147905668)
=>WM: (15616: S1 ^operator O2222 +)
=>WM: (15615: S1 ^operator O2221 +)
=>WM: (15614: I3 ^dir R)
=>WM: (15613: O2222 ^name predict-no)
=>WM: (15612: O2221 ^name predict-yes)
=>WM: (15611: R1114 ^value 1)
=>WM: (15610: R1 ^reward R1114)
=>WM: (15609: I3 ^see 1)
<=WM: (15600: S1 ^operator O2219 +)
<=WM: (15602: S1 ^operator O2219)
<=WM: (15601: S1 ^operator O2220 +)
<=WM: (15599: I3 ^dir L)
<=WM: (15595: R1 ^reward R1113)
<=WM: (15594: I3 ^see 0)
<=WM: (15598: O2220 ^name predict-no)
<=WM: (15597: O2219 ^name predict-yes)
<=WM: (15596: R1113 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2221 = 0.1215968547680865)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2221 = 0.8784067009010752)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2222 = 0.9436253760703815)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2220 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2219 = 0.1215968547680865)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2219 = 0.8784067009010752)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472324 -0.0815459 0.390778 -> 0.472322 -0.0815462 0.390776(R,m,v=1,0.950549,0.0472649)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527698 0.0815497 0.609248 -> 0.527696 0.0815494 0.609246(R,m,v=1,1,0)
=>WM: (15617: S1 ^operator O2221)

  1111:    O: O2221 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1111 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1110 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15618: I3 ^predict-yes N1111)
<=WM: (15604: N1110 ^status complete)
<=WM: (15603: I3 ^predict-yes N1110)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
---- Input Phase --- 
=>WM: (15622: I2 ^dir R)
=>WM: (15621: I2 ^reward 1)
=>WM: (15620: I2 ^see 1)
=>WM: (15619: N1111 ^status complete)
<=WM: (15607: I2 ^dir R)
<=WM: (15606: I2 ^reward 1)
<=WM: (15605: I2 ^see 1)
=>WM: (15623: I2 ^level-1 R1-root)
<=WM: (15608: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2221 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1115 ^value 1 +)
 (R1 ^reward R1115 +)
Firing propose*predict-yes
 -->
 (O2223 ^name predict-yes +)
 (S1 ^operator O2223 +)
Firing propose*predict-no
 -->
 (O2224 ^name predict-no +)
 (S1 ^operator O2224 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2222 = 0.9436253760703815)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2221 = 0.1215968547680865)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2222 ^name predict-no +)
 (S1 ^operator O2222 +)
Retracting propose*predict-yes
 -->
 (O2221 ^name predict-yes +)
 (S1 ^operator O2221 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1114 ^value 1 +)
 (R1 ^reward R1114 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2222 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
 -->
 (S1 ^operator O2221 = 0.8784067009010752)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2221 = 0.1215968547680865)
=>WM: (15629: S1 ^operator O2224 +)
=>WM: (15628: S1 ^operator O2223 +)
=>WM: (15627: O2224 ^name predict-no)
=>WM: (15626: O2223 ^name predict-yes)
=>WM: (15625: R1115 ^value 1)
=>WM: (15624: R1 ^reward R1115)
<=WM: (15615: S1 ^operator O2221 +)
<=WM: (15617: S1 ^operator O2221)
<=WM: (15616: S1 ^operator O2222 +)
<=WM: (15610: R1 ^reward R1114)
<=WM: (15613: O2222 ^name predict-no)
<=WM: (15612: O2221 ^name predict-yes)
<=WM: (15611: R1114 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2223 = 0.1215968547680865)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2223 = -0.04253361215288998)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2224 = 0.9436253760703815)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2222 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2221 = 0.1215968547680865)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2221 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.878788,0.10706)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.46548 0.412927 0.878407 -> 0.46548 0.412927 0.878406(R,m,v=1,1,0)
=>WM: (15630: S1 ^operator O2224)

  1112:    O: O2224 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1112 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1111 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15631: I3 ^predict-no N1112)
<=WM: (15619: N1111 ^status complete)
<=WM: (15618: I3 ^predict-yes N1111)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15635: I2 ^dir U)
=>WM: (15634: I2 ^reward 1)
=>WM: (15633: I2 ^see 0)
=>WM: (15632: N1112 ^status complete)
<=WM: (15622: I2 ^dir R)
<=WM: (15621: I2 ^reward 1)
<=WM: (15620: I2 ^see 1)
=>WM: (15636: I2 ^level-1 R0-root)
<=WM: (15623: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1116 ^value 1 +)
 (R1 ^reward R1116 +)
Firing propose*predict-yes
 -->
 (O2225 ^name predict-yes +)
 (S1 ^operator O2225 +)
Firing propose*predict-no
 -->
 (O2226 ^name predict-no +)
 (S1 ^operator O2226 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2224 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2223 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2224 ^name predict-no +)
 (S1 ^operator O2224 +)
Retracting propose*predict-yes
 -->
 (O2223 ^name predict-yes +)
 (S1 ^operator O2223 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1115 ^value 1 +)
 (R1 ^reward R1115 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2224 = 0.9436253760703815)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2223 = -0.04253361215288998)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2223 = 0.1215965720455857)
=>WM: (15644: S1 ^operator O2226 +)
=>WM: (15643: S1 ^operator O2225 +)
=>WM: (15642: I3 ^dir U)
=>WM: (15641: O2226 ^name predict-no)
=>WM: (15640: O2225 ^name predict-yes)
=>WM: (15639: R1116 ^value 1)
=>WM: (15638: R1 ^reward R1116)
=>WM: (15637: I3 ^see 0)
<=WM: (15628: S1 ^operator O2223 +)
<=WM: (15629: S1 ^operator O2224 +)
<=WM: (15630: S1 ^operator O2224)
<=WM: (15614: I3 ^dir R)
<=WM: (15624: R1 ^reward R1115)
<=WM: (15609: I3 ^see 1)
<=WM: (15627: O2224 ^name predict-no)
<=WM: (15626: O2223 ^name predict-yes)
<=WM: (15625: R1115 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2225 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2226 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2224 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2223 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.943625 0 0.943625 -> 0.95262 0 0.95262(R,m,v=1,0.938144,0.0583302)
=>WM: (15645: S1 ^operator O2226)

  1113:    O: O2226 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1113 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1112 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15646: I3 ^predict-no N1113)
<=WM: (15632: N1112 ^status complete)
<=WM: (15631: I3 ^predict-no N1112)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
-/--- Input Phase --- 
=>WM: (15650: I2 ^dir U)
=>WM: (15649: I2 ^reward 1)
=>WM: (15648: I2 ^see 0)
=>WM: (15647: N1113 ^status complete)
<=WM: (15635: I2 ^dir U)
<=WM: (15634: I2 ^reward 1)
<=WM: (15633: I2 ^see 0)
=>WM: (15651: I2 ^level-1 R0-root)
<=WM: (15636: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1117 ^value 1 +)
 (R1 ^reward R1117 +)
Firing propose*predict-yes
 -->
 (O2227 ^name predict-yes +)
 (S1 ^operator O2227 +)
Firing propose*predict-no
 -->
 (O2228 ^name predict-no +)
 (S1 ^operator O2228 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2226 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2225 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2226 ^name predict-no +)
 (S1 ^operator O2226 +)
Retracting propose*predict-yes
 -->
 (O2225 ^name predict-yes +)
 (S1 ^operator O2225 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1116 ^value 1 +)
 (R1 ^reward R1116 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2226 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2225 = 0.)
=>WM: (15657: S1 ^operator O2228 +)
=>WM: (15656: S1 ^operator O2227 +)
=>WM: (15655: O2228 ^name predict-no)
=>WM: (15654: O2227 ^name predict-yes)
=>WM: (15653: R1117 ^value 1)
=>WM: (15652: R1 ^reward R1117)
<=WM: (15643: S1 ^operator O2225 +)
<=WM: (15644: S1 ^operator O2226 +)
<=WM: (15645: S1 ^operator O2226)
<=WM: (15638: R1 ^reward R1116)
<=WM: (15641: O2226 ^name predict-no)
<=WM: (15640: O2225 ^name predict-yes)
<=WM: (15639: R1116 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2227 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2228 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2226 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2225 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15658: S1 ^operator O2228)

  1114:    O: O2228 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1114 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1113 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15659: I3 ^predict-no N1114)
<=WM: (15647: N1113 ^status complete)
<=WM: (15646: I3 ^predict-no N1113)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isU
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (15663: I2 ^dir U)
=>WM: (15662: I2 ^reward 1)
=>WM: (15661: I2 ^see 0)
=>WM: (15660: N1114 ^status complete)
<=WM: (15650: I2 ^dir U)
<=WM: (15649: I2 ^reward 1)
<=WM: (15648: I2 ^see 0)
=>WM: (15664: I2 ^level-1 R0-root)
<=WM: (15651: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1118 ^value 1 +)
 (R1 ^reward R1118 +)
Firing propose*predict-yes
 -->
 (O2229 ^name predict-yes +)
 (S1 ^operator O2229 +)
Firing propose*predict-no
 -->
 (O2230 ^name predict-no +)
 (S1 ^operator O2230 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2228 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2227 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2228 ^name predict-no +)
 (S1 ^operator O2228 +)
Retracting propose*predict-yes
 -->
 (O2227 ^name predict-yes +)
 (S1 ^operator O2227 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1117 ^value 1 +)
 (R1 ^reward R1117 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2228 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2227 = 0.)
=>WM: (15670: S1 ^operator O2230 +)
=>WM: (15669: S1 ^operator O2229 +)
=>WM: (15668: O2230 ^name predict-no)
=>WM: (15667: O2229 ^name predict-yes)
=>WM: (15666: R1118 ^value 1)
=>WM: (15665: R1 ^reward R1118)
<=WM: (15656: S1 ^operator O2227 +)
<=WM: (15657: S1 ^operator O2228 +)
<=WM: (15658: S1 ^operator O2228)
<=WM: (15652: R1 ^reward R1117)
<=WM: (15655: O2228 ^name predict-no)
<=WM: (15654: O2227 ^name predict-yes)
<=WM: (15653: R1117 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2229 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2230 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2228 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2227 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15671: S1 ^operator O2230)

  1115:    O: O2230 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1115 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1114 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15672: I3 ^predict-no N1115)
<=WM: (15660: N1114 ^status complete)
<=WM: (15659: I3 ^predict-no N1114)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-B
In  State-B moving U
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15676: I2 ^dir L)
=>WM: (15675: I2 ^reward 1)
=>WM: (15674: I2 ^see 0)
=>WM: (15673: N1115 ^status complete)
<=WM: (15663: I2 ^dir U)
<=WM: (15662: I2 ^reward 1)
<=WM: (15661: I2 ^see 0)
=>WM: (15677: I2 ^level-1 R0-root)
<=WM: (15664: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2230 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2229 = 0.6091755191206203)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1119 ^value 1 +)
 (R1 ^reward R1119 +)
Firing propose*predict-yes
 -->
 (O2231 ^name predict-yes +)
 (S1 ^operator O2231 +)
Firing propose*predict-no
 -->
 (O2232 ^name predict-no +)
 (S1 ^operator O2232 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2230 = 0.3145153576266763)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2229 = 0.3907758702770224)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2230 ^name predict-no +)
 (S1 ^operator O2230 +)
Retracting propose*predict-yes
 -->
 (O2229 ^name predict-yes +)
 (S1 ^operator O2229 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1118 ^value 1 +)
 (R1 ^reward R1118 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2230 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2229 = 0.)
=>WM: (15684: S1 ^operator O2232 +)
=>WM: (15683: S1 ^operator O2231 +)
=>WM: (15682: I3 ^dir L)
=>WM: (15681: O2232 ^name predict-no)
=>WM: (15680: O2231 ^name predict-yes)
=>WM: (15679: R1119 ^value 1)
=>WM: (15678: R1 ^reward R1119)
<=WM: (15669: S1 ^operator O2229 +)
<=WM: (15670: S1 ^operator O2230 +)
<=WM: (15671: S1 ^operator O2230)
<=WM: (15642: I3 ^dir U)
<=WM: (15665: R1 ^reward R1118)
<=WM: (15668: O2230 ^name predict-no)
<=WM: (15667: O2229 ^name predict-yes)
<=WM: (15666: R1118 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2231 = 0.6091755191206203)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2231 = 0.3907758702770224)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2232 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2232 = 0.3145153576266763)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2230 = 0.3145153576266763)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2230 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2229 = 0.3907758702770224)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2229 = 0.6091755191206203)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15685: S1 ^operator O2231)

  1116:    O: O2231 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1116 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1115 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15686: I3 ^predict-yes N1116)
<=WM: (15673: N1115 ^status complete)
<=WM: (15672: I3 ^predict-no N1115)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isU
--- END Output Phase ---
\-/--- Input Phase --- 
=>WM: (15690: I2 ^dir U)
=>WM: (15689: I2 ^reward 1)
=>WM: (15688: I2 ^see 1)
=>WM: (15687: N1116 ^status complete)
<=WM: (15676: I2 ^dir L)
<=WM: (15675: I2 ^reward 1)
<=WM: (15674: I2 ^see 0)
=>WM: (15691: I2 ^level-1 L1-root)
<=WM: (15677: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1120 ^value 1 +)
 (R1 ^reward R1120 +)
Firing propose*predict-yes
 -->
 (O2233 ^name predict-yes +)
 (S1 ^operator O2233 +)
Firing propose*predict-no
 -->
 (O2234 ^name predict-no +)
 (S1 ^operator O2234 +)
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2232 = 1.)
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2231 = 0.)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2232 ^name predict-no +)
 (S1 ^operator O2232 +)
Retracting propose*predict-yes
 -->
 (O2231 ^name predict-yes +)
 (S1 ^operator O2231 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1119 ^value 1 +)
 (R1 ^reward R1119 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2232 = 0.3145153576266763)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2232 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2231 = 0.3907758702770224)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2231 = 0.6091755191206203)
=>WM: (15699: S1 ^operator O2234 +)
=>WM: (15698: S1 ^operator O2233 +)
=>WM: (15697: I3 ^dir U)
=>WM: (15696: O2234 ^name predict-no)
=>WM: (15695: O2233 ^name predict-yes)
=>WM: (15694: R1120 ^value 1)
=>WM: (15693: R1 ^reward R1120)
=>WM: (15692: I3 ^see 1)
<=WM: (15683: S1 ^operator O2231 +)
<=WM: (15685: S1 ^operator O2231)
<=WM: (15684: S1 ^operator O2232 +)
<=WM: (15682: I3 ^dir L)
<=WM: (15678: R1 ^reward R1119)
<=WM: (15637: I3 ^see 0)
<=WM: (15681: O2232 ^name predict-no)
<=WM: (15680: O2231 ^name predict-yes)
<=WM: (15679: R1119 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2233 = 0.)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2234 = 1.)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2232 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2231 = 0.)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472322 -0.0815462 0.390776 -> 0.472325 -0.0815456 0.39078(R,m,v=1,0.95082,0.0470186)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527637 0.081539 0.609176 -> 0.52764 0.0815397 0.60918(R,m,v=1,1,0)
=>WM: (15700: S1 ^operator O2234)

  1117:    O: O2234 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1117 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1116 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15701: I3 ^predict-no N1117)
<=WM: (15687: N1116 ^status complete)
<=WM: (15686: I3 ^predict-yes N1116)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction U in state State-A
In  State-A moving U
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
|\--- Input Phase --- 
=>WM: (15705: I2 ^dir L)
=>WM: (15704: I2 ^reward 1)
=>WM: (15703: I2 ^see 0)
=>WM: (15702: N1117 ^status complete)
<=WM: (15690: I2 ^dir U)
<=WM: (15689: I2 ^reward 1)
<=WM: (15688: I2 ^see 1)
=>WM: (15706: I2 ^level-1 L1-root)
<=WM: (15691: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2233 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2234 = 0.6855101468046794)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1121 ^value 1 +)
 (R1 ^reward R1121 +)
Firing propose*predict-yes
 -->
 (O2235 ^name predict-yes +)
 (S1 ^operator O2235 +)
Firing propose*predict-no
 -->
 (O2236 ^name predict-no +)
 (S1 ^operator O2236 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2234 = 0.3145153576266763)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2233 = 0.3907797844980353)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2234 ^name predict-no +)
 (S1 ^operator O2234 +)
Retracting propose*predict-yes
 -->
 (O2233 ^name predict-yes +)
 (S1 ^operator O2233 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1120 ^value 1 +)
 (R1 ^reward R1120 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir U +)
Retracting rl*prefer*rvt*predict-no*H0*2
 -->
 (S1 ^operator O2234 = 1.)
Retracting rl*prefer*rvt*predict-yes*H0*1
 -->
 (S1 ^operator O2233 = 0.)
=>WM: (15714: S1 ^operator O2236 +)
=>WM: (15713: S1 ^operator O2235 +)
=>WM: (15712: I3 ^dir L)
=>WM: (15711: O2236 ^name predict-no)
=>WM: (15710: O2235 ^name predict-yes)
=>WM: (15709: R1121 ^value 1)
=>WM: (15708: R1 ^reward R1121)
=>WM: (15707: I3 ^see 0)
<=WM: (15698: S1 ^operator O2233 +)
<=WM: (15699: S1 ^operator O2234 +)
<=WM: (15700: S1 ^operator O2234)
<=WM: (15697: I3 ^dir U)
<=WM: (15693: R1 ^reward R1120)
<=WM: (15692: I3 ^see 1)
<=WM: (15696: O2234 ^name predict-no)
<=WM: (15695: O2233 ^name predict-yes)
<=WM: (15694: R1120 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2235 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2235 = 0.3907797844980353)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2236 = 0.6855101468046794)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2236 = 0.3145153576266763)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2234 = 0.3145153576266763)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2234 = 0.6855101468046794)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2233 = 0.3907797844980353)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2233 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
=>WM: (15715: S1 ^operator O2236)

  1118:    O: O2236 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1118 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1117 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15716: I3 ^predict-no N1118)
<=WM: (15702: N1117 ^status complete)
<=WM: (15701: I3 ^predict-no N1117)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15720: I2 ^dir R)
=>WM: (15719: I2 ^reward 1)
=>WM: (15718: I2 ^see 0)
=>WM: (15717: N1118 ^status complete)
<=WM: (15705: I2 ^dir L)
<=WM: (15704: I2 ^reward 1)
<=WM: (15703: I2 ^see 0)
=>WM: (15721: I2 ^level-1 L0-root)
<=WM: (15706: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2235 = 0.8784001883287573)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1122 ^value 1 +)
 (R1 ^reward R1122 +)
Firing propose*predict-yes
 -->
 (O2237 ^name predict-yes +)
 (S1 ^operator O2237 +)
Firing propose*predict-no
 -->
 (O2238 ^name predict-no +)
 (S1 ^operator O2238 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2236 = 0.9526196166066165)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2235 = 0.1215965720455857)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2236 ^name predict-no +)
 (S1 ^operator O2236 +)
Retracting propose*predict-yes
 -->
 (O2235 ^name predict-yes +)
 (S1 ^operator O2235 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1121 ^value 1 +)
 (R1 ^reward R1121 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2236 = 0.3145153576266763)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2236 = 0.6855101468046794)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2235 = 0.3907797844980353)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2235 = -0.2062723012911647)
=>WM: (15728: S1 ^operator O2238 +)
=>WM: (15727: S1 ^operator O2237 +)
=>WM: (15726: I3 ^dir R)
=>WM: (15725: O2238 ^name predict-no)
=>WM: (15724: O2237 ^name predict-yes)
=>WM: (15723: R1122 ^value 1)
=>WM: (15722: R1 ^reward R1122)
<=WM: (15713: S1 ^operator O2235 +)
<=WM: (15714: S1 ^operator O2236 +)
<=WM: (15715: S1 ^operator O2236)
<=WM: (15712: I3 ^dir L)
<=WM: (15708: R1 ^reward R1121)
<=WM: (15711: O2236 ^name predict-no)
<=WM: (15710: O2235 ^name predict-yes)
<=WM: (15709: R1121 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2237 = 0.1215965720455857)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2237 = 0.8784001883287573)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2238 = 0.9526196166066165)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2236 = 0.9526196166066165)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2235 = 0.1215965720455857)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2235 = 0.8784001883287573)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*4 0.478563 -0.164047 0.314515 -> 0.478561 -0.164047 0.314513(R,m,v=1,0.931818,0.0638961)
RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521461 0.16405 0.68551 -> 0.521458 0.164049 0.685508(R,m,v=1,1,0)
=>WM: (15729: S1 ^operator O2237)

  1119:    O: O2237 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1119 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1118 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15730: I3 ^predict-yes N1119)
<=WM: (15717: N1118 ^status complete)
<=WM: (15716: I3 ^predict-no N1118)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction R in state State-A
In  State-A moving R
ENV: (next state, see, prediction correct?) = (State-B, 1, True)
predict error 0
dir: dir isR
--- END Output Phase ---
\---- Input Phase --- 
=>WM: (15734: I2 ^dir R)
=>WM: (15733: I2 ^reward 1)
=>WM: (15732: I2 ^see 1)
=>WM: (15731: N1119 ^status complete)
<=WM: (15720: I2 ^dir R)
<=WM: (15719: I2 ^reward 1)
<=WM: (15718: I2 ^see 0)
=>WM: (15735: I2 ^level-1 R1-root)
<=WM: (15721: I2 ^level-1 L0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2237 = -0.04253361215288998)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1123 ^value 1 +)
 (R1 ^reward R1123 +)
Firing propose*predict-yes
 -->
 (O2239 ^name predict-yes +)
 (S1 ^operator O2239 +)
Firing propose*predict-no
 -->
 (O2240 ^name predict-no +)
 (S1 ^operator O2240 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2238 = 0.9526196166066165)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2237 = 0.1215965720455857)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2238 ^name predict-no +)
 (S1 ^operator O2238 +)
Retracting propose*predict-yes
 -->
 (O2237 ^name predict-yes +)
 (S1 ^operator O2237 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1122 ^value 1 +)
 (R1 ^reward R1122 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2238 = 0.9526196166066165)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
 -->
 (S1 ^operator O2237 = 0.8784001883287573)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2237 = 0.1215965720455857)
=>WM: (15742: S1 ^operator O2240 +)
=>WM: (15741: S1 ^operator O2239 +)
=>WM: (15740: O2240 ^name predict-no)
=>WM: (15739: O2239 ^name predict-yes)
=>WM: (15738: R1123 ^value 1)
=>WM: (15737: R1 ^reward R1123)
=>WM: (15736: I3 ^see 1)
<=WM: (15727: S1 ^operator O2237 +)
<=WM: (15729: S1 ^operator O2237)
<=WM: (15728: S1 ^operator O2238 +)
<=WM: (15722: R1 ^reward R1122)
<=WM: (15707: I3 ^see 0)
<=WM: (15725: O2238 ^name predict-no)
<=WM: (15724: O2237 ^name predict-yes)
<=WM: (15723: R1122 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2239 = 0.1215965720455857)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2239 = -0.04253361215288998)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2240 = 0.9526196166066165)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2238 = 0.9526196166066165)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2237 = 0.1215965720455857)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2237 = -0.04253361215288998)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.879397,0.106594)
RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465474 0.412926 0.8784 -> 0.465475 0.412926 0.8784(R,m,v=1,1,0)
=>WM: (15743: S1 ^operator O2240)

  1120:    O: O2240 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1120 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1119 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15744: I3 ^predict-no N1120)
<=WM: (15731: N1119 ^status complete)
<=WM: (15730: I3 ^predict-yes N1119)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isR
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15748: I2 ^dir R)
=>WM: (15747: I2 ^reward 1)
=>WM: (15746: I2 ^see 0)
=>WM: (15745: N1120 ^status complete)
<=WM: (15734: I2 ^dir R)
<=WM: (15733: I2 ^reward 1)
<=WM: (15732: I2 ^see 1)
=>WM: (15749: I2 ^level-1 R0-root)
<=WM: (15735: I2 ^level-1 R1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2239 = -0.1512366769350551)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1124 ^value 1 +)
 (R1 ^reward R1124 +)
Firing propose*predict-yes
 -->
 (O2241 ^name predict-yes +)
 (S1 ^operator O2241 +)
Firing propose*predict-no
 -->
 (O2242 ^name predict-no +)
 (S1 ^operator O2242 +)
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2240 = 0.9526196166066165)
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2239 = 0.1215968294322646)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2240 ^name predict-no +)
 (S1 ^operator O2240 +)
Retracting propose*predict-yes
 -->
 (O2239 ^name predict-yes +)
 (S1 ^operator O2239 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1123 ^value 1 +)
 (R1 ^reward R1123 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2240 = 0.9526196166066165)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
 -->
 (S1 ^operator O2239 = -0.04253361215288998)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2239 = 0.1215968294322646)
=>WM: (15756: S1 ^operator O2242 +)
=>WM: (15755: S1 ^operator O2241 +)
=>WM: (15754: O2242 ^name predict-no)
=>WM: (15753: O2241 ^name predict-yes)
=>WM: (15752: R1124 ^value 1)
=>WM: (15751: R1 ^reward R1124)
=>WM: (15750: I3 ^see 0)
<=WM: (15741: S1 ^operator O2239 +)
<=WM: (15742: S1 ^operator O2240 +)
<=WM: (15743: S1 ^operator O2240)
<=WM: (15737: R1 ^reward R1123)
<=WM: (15736: I3 ^see 1)
<=WM: (15740: O2240 ^name predict-no)
<=WM: (15739: O2239 ^name predict-yes)
<=WM: (15738: R1123 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2241 = 0.1215968294322646)
Firing prefer*rvt*predict-yes*H0*5*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2241 = -0.1512366769350551)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2242 = 0.9526196166066165)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2240 = 0.9526196166066165)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2239 = 0.1215968294322646)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2239 = -0.1512366769350551)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.95262 0 0.95262 -> 0.960173 0 0.960173(R,m,v=1,0.938462,0.0580492)
=>WM: (15757: S1 ^operator O2242)

  1121:    O: O2242 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1121 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1120 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15758: I3 ^predict-no N1121)
<=WM: (15745: N1120 ^status complete)
<=WM: (15744: I3 ^predict-no N1120)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction R in state State-B
In  State-B moving R
ENV: (next state, see, prediction correct?) = (State-B, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
---- Input Phase --- 
=>WM: (15762: I2 ^dir L)
=>WM: (15761: I2 ^reward 1)
=>WM: (15760: I2 ^see 0)
=>WM: (15759: N1121 ^status complete)
<=WM: (15748: I2 ^dir R)
<=WM: (15747: I2 ^reward 1)
<=WM: (15746: I2 ^see 0)
=>WM: (15763: I2 ^level-1 R0-root)
<=WM: (15749: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2242 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2241 = 0.6091799658293192)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1125 ^value 1 +)
 (R1 ^reward R1125 +)
Firing propose*predict-yes
 -->
 (O2243 ^name predict-yes +)
 (S1 ^operator O2243 +)
Firing propose*predict-no
 -->
 (O2244 ^name predict-no +)
 (S1 ^operator O2244 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2242 = 0.3145132909791186)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2241 = 0.3907797844980353)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2242 ^name predict-no +)
 (S1 ^operator O2242 +)
Retracting propose*predict-yes
 -->
 (O2241 ^name predict-yes +)
 (S1 ^operator O2241 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1124 ^value 1 +)
 (R1 ^reward R1124 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir R +)
Retracting rl*prefer*rvt*predict-no*H0*6
 -->
 (S1 ^operator O2242 = 0.9601726831979733)
Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
 -->
 (S1 ^operator O2241 = -0.1512366769350551)
Retracting rl*prefer*rvt*predict-yes*H0*5
 -->
 (S1 ^operator O2241 = 0.1215968294322646)
=>WM: (15770: S1 ^operator O2244 +)
=>WM: (15769: S1 ^operator O2243 +)
=>WM: (15768: I3 ^dir L)
=>WM: (15767: O2244 ^name predict-no)
=>WM: (15766: O2243 ^name predict-yes)
=>WM: (15765: R1125 ^value 1)
=>WM: (15764: R1 ^reward R1125)
<=WM: (15755: S1 ^operator O2241 +)
<=WM: (15756: S1 ^operator O2242 +)
<=WM: (15757: S1 ^operator O2242)
<=WM: (15726: I3 ^dir R)
<=WM: (15751: R1 ^reward R1124)
<=WM: (15754: O2242 ^name predict-no)
<=WM: (15753: O2241 ^name predict-yes)
<=WM: (15752: R1124 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2243 = 0.6091799658293192)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2243 = 0.3907797844980353)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2244 = -0.1984300550322165)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2244 = 0.3145132909791186)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2242 = 0.3145132909791186)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2242 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2241 = 0.3907797844980353)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2241 = 0.6091799658293192)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-no*H0*6 0.960173 0 0.960173 -> 0.966517 0 0.966517(R,m,v=1,0.938776,0.0577708)
=>WM: (15771: S1 ^operator O2243)

  1122:    O: O2243 (predict-yes)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-yes N1122 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-no N1121 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15772: I3 ^predict-yes N1122)
<=WM: (15759: N1121 ^status complete)
<=WM: (15758: I3 ^predict-no N1121)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-yes for direction L in state State-B
In  State-B moving L
ENV: (next state, see, prediction correct?) = (State-A, 1, True)
predict error 0
dir: dir isL
--- END Output Phase ---
/|\--- Input Phase --- 
=>WM: (15776: I2 ^dir L)
=>WM: (15775: I2 ^reward 1)
=>WM: (15774: I2 ^see 1)
=>WM: (15773: N1122 ^status complete)
<=WM: (15762: I2 ^dir L)
<=WM: (15761: I2 ^reward 1)
<=WM: (15760: I2 ^see 0)
=>WM: (15777: I2 ^level-1 L1-root)
<=WM: (15763: I2 ^level-1 R0-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2243 = -0.2062723012911647)
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2244 = 0.6855078088135349)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1126 ^value 1 +)
 (R1 ^reward R1126 +)
Firing propose*predict-yes
 -->
 (O2245 ^name predict-yes +)
 (S1 ^operator O2245 +)
Firing propose*predict-no
 -->
 (O2246 ^name predict-no +)
 (S1 ^operator O2246 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2244 = 0.3145132909791186)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2243 = 0.3907797844980353)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Retracting propose*predict-no
 -->
 (O2244 ^name predict-no +)
 (S1 ^operator O2244 +)
Retracting propose*predict-yes
 -->
 (O2243 ^name predict-yes +)
 (S1 ^operator O2243 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1125 ^value 1 +)
 (R1 ^reward R1125 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2244 = 0.3145132909791186)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
 -->
 (S1 ^operator O2244 = -0.1984300550322165)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2243 = 0.3907797844980353)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
 -->
 (S1 ^operator O2243 = 0.6091799658293192)
=>WM: (15784: S1 ^operator O2246 +)
=>WM: (15783: S1 ^operator O2245 +)
=>WM: (15782: O2246 ^name predict-no)
=>WM: (15781: O2245 ^name predict-yes)
=>WM: (15780: R1126 ^value 1)
=>WM: (15779: R1 ^reward R1126)
=>WM: (15778: I3 ^see 1)
<=WM: (15769: S1 ^operator O2243 +)
<=WM: (15771: S1 ^operator O2243)
<=WM: (15770: S1 ^operator O2244 +)
<=WM: (15764: R1 ^reward R1125)
<=WM: (15750: I3 ^see 0)
<=WM: (15767: O2244 ^name predict-no)
<=WM: (15766: O2243 ^name predict-yes)
<=WM: (15765: R1125 ^value 1)

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing prefer*rvt*predict-yes*H0
 -->
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2245 = 0.3907797844980353)
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2245 = -0.2062723012911647)
Firing prefer*rvt*predict-no*H0
 -->
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2246 = 0.3145132909791186)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2246 = 0.6855078088135349)
 inner elaboration loop at bottom goal.
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2244 = 0.3145132909791186)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2244 = 0.6855078088135349)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2243 = 0.3907797844980353)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2243 = -0.2062723012911647)

--- END Proposal Phase ---

--- Decision Phase ---
RL update rl*prefer*rvt*predict-yes*H0*3 0.472325 -0.0815456 0.39078 -> 0.472328 -0.0815451 0.390783(R,m,v=1,0.951087,0.0467748)
RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.52764 0.0815397 0.60918 -> 0.527643 0.0815402 0.609184(R,m,v=1,1,0)
=>WM: (15785: S1 ^operator O2246)

  1123:    O: O2246 (predict-no)
--- END Decision Phase ---

--- Application Phase ---
	--- Firing Productions (PE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing apply*operator
 -->
 (I3 ^predict-no N1123 +  :O )
Firing apply*operator*complete
 -->
 (I3 ^predict-yes N1122 -  :O )
 inner elaboration loop at bottom goal.
	--- Change Working Memory (PE) ---
=>WM: (15786: I3 ^predict-no N1123)
<=WM: (15773: N1122 ^status complete)
<=WM: (15772: I3 ^predict-yes N1122)
	--- Firing Productions (IE) For State At Depth 1 ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing monitor*world
 -->

I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
	--- Change Working Memory (IE) ---

--- END Application Phase ---
--- Output Phase ---
ENV: Agent did: predict-no for direction L in state State-A
In  State-A moving L
ENV: (next state, see, prediction correct?) = (State-A, 0, True)
predict error 0
dir: dir isL
--- END Output Phase ---
-/|--- Input Phase --- 
=>WM: (15790: I2 ^dir L)
=>WM: (15789: I2 ^reward 1)
=>WM: (15788: I2 ^see 0)
=>WM: (15787: N1123 ^status complete)
<=WM: (15776: I2 ^dir L)
<=WM: (15775: I2 ^reward 1)
<=WM: (15774: I2 ^see 1)
=>WM: (15791: I2 ^level-1 L0-root)
<=WM: (15777: I2 ^level-1 L1-root)

--- END Input Phase --- 

--- Proposal Phase ---

--- Inner Elaboration Phase, active level 1 (S1) ---
Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
 -->
 (S1 ^operator O2245 = -0.208713043145708)
Firing rl*prefer*rvt*predict-no*H0*4*H1*12
 -->
 (S1 ^operator O2246 = 0.6854688057424099)
Firing prefer*rvt*predict-no*H0*4*H1
 -->
Firing prefer*rvt*predict-yes*H0*3*H1
 -->
Firing elaborate*copy-see-to-output-link
 -->
 (I3 ^see 0 +)
Firing elaborate*reward*based*on*reward
 -->
 (R1127 ^value 1 +)
 (R1 ^reward R1127 +)
Firing propose*predict-yes
 -->
 (O2247 ^name predict-yes +)
 (S1 ^operator O2247 +)
Firing propose*predict-no
 -->
 (O2248 ^name predict-no +)
 (S1 ^operator O2248 +)
Firing rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2246 = 0.3145132909791186)
Firing rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2245 = 0.3907830226387189)
Firing prefer*rvt*predict-yes*H0
 -->
Firing prefer*rvt*predict-no*H0
 -->
Firing elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
 inner elaboration loop at bottom goal.
Retracting elaborate*copy-see-to-output-link
 -->
 (I3 ^see 1 +)
Retracting propose*predict-no
 -->
 (O2246 ^name predict-no +)
 (S1 ^operator O2246 +)
Retracting propose*predict-yes
 -->
 (O2245 ^name predict-yes +)
 (S1 ^operator O2245 +)
Retracting elaborate*reward*based*on*reward
 -->
 (R1126 ^value 1 +)
 (R1 ^reward R1126 +)
Retracting elaborate*copy-dir-to-output-link
 -->
 (I3 ^dir L +)
Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
 -->
 (S1 ^operator O2246 = 0.6855078088135349)
Retracting rl*prefer*rvt*predict-no*H0*4
 -->
 (S1 ^operator O2246 = 0.3145132909791186)
Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
 -->
 (S1 ^operator O2245 = -0.2062723012911647)
Retracting rl*prefer*rvt*predict-yes*H0*3
 -->
 (S1 ^operator O2245 = 0.3907830226387189)
=>WM: (15798: S1 ^operator O2248 +)
=>WM: (15797: S1 ^operator O2247 +)
=>WM: (15796: O2248 ^name predict-no)
=>WM: (15795: O2247 ^name predict-yes)
=>WM: (15794: R1127 ^value 1)
=>WM: (15793: R1 ^reward R1127)
=>WM: (15792: I3 ^see 0)
<=WM: (15783: S1 ^operator O2245 +)
<=WM: (15784: S1 ^operator O2246 +)
<=WM: (15785: S1 ^operator O2246)
<=WM: (15779: R1 ^reward R1126)
<=WM: (15778: I3 ^see 1)
<=WM: (15782: O2246 ^name predict-no)
<=WM: (15781: O2245 ^name predict-yes)
<=WM: (15780: R1126 ^value 1)

--- Inner Elaborati