/flipv2/20121113-091142-2.5K-ReLST-asneeded_epochs_50_5runs_noalphadecay/stdout-flip-2.5K_0.txt
Plain Text | 16417 lines | 15665 code | 752 blank | 0 comment | 0 complexity | 5d7ebd7b63172960c957e153706b02e3 MD5 | raw file
Possible License(s): BSD-3-Clause
- Seeding... 0
- dir: dir isU
- Python-Soar Flip environment.
- To accept commands from an external sml process, you'll need to
- type 'slave <log file> <n decisons>' at the prompt...
- sourcing 'flip_predict.soar'
- ***********
- Total: 11 productions sourced.
- seeding Soar with 0 ...
- soar> Entering slave mode:
- - log file 'rl-slave-2.5K_0.log'....
- - will exit slave mode after 2500 decisions
- waiting for commands from an externally connected sml process...
- -/|sleeping...
- \sleeping...
- -sleeping...
- /sleeping...
- |sleeping...
- \-/|\-/|\-/|sleeping...
- \-/|\-/sleeping...
- |1: O: O1 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isU
- rule alias: '*'
- rule alias: '*'
- \-/|\-/2: O: O4 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-3: O: O5 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /|\4: O: O7 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -/|5: O: O9 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-/6: O: O11 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isU
- |\7: O: O14 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|8: O: O15 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-9: O: O17 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- /|\10: O: O19 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isU
- -/|11: O: O22 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- \12: O: O24 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|13: O: O26 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isU
- \-14: O: O28 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|15: O: O29 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- \-/16: O: O31 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\-17: O: O34 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\18: O: O36 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|19: O: O38 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/20: O: O40 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\-21: O: O41 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isU
- /22: O: O44 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-23: O: O46 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\24: O: O48 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|25: O: O50 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- \-/26: O: O51 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\27: O: O53 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- -/|28: O: O55 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isU
- \-/29: O: O57 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isU
- |\-/30: O: O60 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-31: O: O61 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isU
- /32: O: O64 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\-33: O: O65 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- /|\34: O: O68 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|35: O: O69 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- \-/36: O: O71 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\37: O: O74 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|38: O: O75 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \-39: O: O77 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isU
- /|40: O: O80 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/41: O: O81 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |42: O: O83 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \-/43: O: O86 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\-44: O: O87 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- /|\45: O: O89 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isU
- -/|46: O: O92 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/47: O: O93 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isR
- |\-48: O: O96 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- /|49: O: O97 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/50: O: O100 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-/|\-sleeping...
- /sleeping...
- |51: O: O102 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \52: O: O104 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- -/|53: O: O106 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isL
- \-/54: O: O107 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isR
- |\-55: O: O109 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|\56: O: O112 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|57: O: O114 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isR
- \-/58: O: O115 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- |\-59: O: O118 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\60: O: O119 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isU
- -/|61: O: O122 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- \62: O: O123 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isU
- -/63: O: O126 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\64: O: O127 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isR
- -65: O: O129 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isR
- /|\66: O: O131 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isR
- -/67: O: O133 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isR
- |68: O: O135 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isR
- \-69: O: O137 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isL
- /|70: O: O139 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-/71: O: O141 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isL
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- |72: O: O143 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isR
- \-/73: O: O146 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isR
- |\-74: O: O147 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isR
- /|\75: O: O150 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/76: O: O151 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\77: O: O154 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|78: O: O156 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/79: O: O158 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-80: O: O160 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|81: O: O162 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- \82: O: O164 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|83: O: O165 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-/84: O: O167 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isU
- |\-85: O: O169 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isL
- /|\86: O: O172 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isU
- -/|87: O: O174 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/88: O: O176 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-89: O: O178 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\90: O: O179 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- -/|91: O: O182 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- \92: O: O184 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/|93: O: O186 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/94: O: O188 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-95: O: O189 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isU
- /96: O: O191 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isU
- |\97: O: O194 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|98: O: O195 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-/99: O: O197 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- |\100: O: O199 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isR
- -/|101: O: O202 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- rule alias: '*'
- \-/|\-/|\-/|\-/|\-/|\-/|\-/|\-/|\sleeping...
- -102: O: O204 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\103: O: O205 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- -/104: O: O208 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\-105: O: O209 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isL
- /|\106: O: O211 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isU
- -/|107: O: O214 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/108: O: O216 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-109: O: O218 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\110: O: O220 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|111: O: O222 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- \112: O: O224 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|113: O: O226 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-/114: O: O227 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- |\115: O: O230 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/|116: O: O232 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/117: O: O234 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\-118: O: O236 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isR
- /|\119: O: O238 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isR
- -/|120: O: O240 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/121: O: O241 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isR
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- rule alias: '*'
- |122: O: O244 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/123: O: O246 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\124: O: O247 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isL
- -/125: O: O250 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isL
- |\-126: O: O252 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\127: O: O254 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|128: O: O256 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/129: O: O257 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isL
- |\-130: O: O260 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\131: O: O262 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- rule alias: '*'
- -132: O: O264 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\133: O: O266 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/134: O: O268 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- |\-135: O: O270 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isL
- /|136: O: O272 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/137: O: O274 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\138: O: O276 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isR
- -/|139: O: O278 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/140: O: O280 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isR
- |\-141: O: O282 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- rule alias: '*'
- rule alias: '*'
- /142: O: O284 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isL
- |\143: O: O285 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isU
- -/|144: O: O288 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-145: O: O290 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\146: O: O292 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isU
- -/|147: O: O294 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/148: O: O296 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-149: O: O298 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\150: O: O300 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isU
- -/|151: O: O302 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \152: O: O304 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|153: O: O306 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-154: O: O308 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\155: O: O310 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/156: O: O312 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- |\-157: O: O314 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isR
- /|158: O: O316 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isR
- \-/159: O: O318 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\160: O: O319 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -/|161: O: O322 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isR
- \162: O: O324 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/|163: O: O326 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/164: O: O328 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\165: O: O329 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -/|166: O: O332 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isU
- \-167: O: O334 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\168: O: O335 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -/|169: O: O338 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- \-/170: O: O339 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\-171: O: O342 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /172: O: O344 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- |\-173: O: O345 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- /|\174: O: O348 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|175: O: O350 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/176: O: O352 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-177: O: O354 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- /|178: O: O355 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-179: O: O358 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isU
- /|\180: O: O360 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/181: O: O362 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |182: O: O364 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/183: O: O366 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\184: O: O368 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/185: O: O370 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-186: O: O372 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\187: O: O373 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -/|188: O: O376 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-/189: O: O378 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- |\-190: O: O379 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- /|191: O: O382 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isR
- \192: O: O384 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/193: O: O386 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |194: O: O388 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/195: O: O390 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-196: O: O392 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|\197: O: O394 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/|198: O: O396 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/199: O: O398 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\-200: O: O399 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- /|\201: O: O402 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- -/202: O: O403 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\-203: O: O406 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\204: O: O408 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isR
- -/205: O: O410 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\206: O: O412 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|207: O: O414 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/208: O: O416 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\209: O: O418 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|210: O: O419 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-211: O: O422 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isU
- /212: O: O424 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-213: O: O426 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|\214: O: O428 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|215: O: O429 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/216: O: O432 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-217: O: O434 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isL
- /|218: O: O435 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-219: O: O437 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isU
- /|\220: O: O440 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|221: O: O441 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \222: O: O444 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|223: O: O445 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-/224: O: O448 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-225: O: O450 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|226: O: O452 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/227: O: O454 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-228: O: O455 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /|229: O: O457 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-230: O: O460 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\231: O: O461 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- -232: O: O464 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|233: O: O466 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isU
- \-/234: O: O468 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\-235: O: O470 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\236: O: O472 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|237: O: O473 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- \-/238: O: O475 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\-239: O: O478 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\240: O: O479 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- -/|241: O: O482 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \242: O: O483 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isL
- -/|243: O: O485 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-/244: O: O487 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- |\-245: O: O490 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\246: O: O492 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|247: O: O494 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/248: O: O495 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\249: O: O498 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|250: O: O500 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-251: O: O502 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /252: O: O503 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- |\253: O: O506 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|254: O: O508 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/255: O: O509 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isL
- |\256: O: O511 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- -/|257: O: O514 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/258: O: O516 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-259: O: O517 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|\260: O: O520 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/261: O: O521 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isR
- |262: O: O523 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isR
- \-263: O: O526 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|264: O: O528 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/265: O: O529 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\-266: O: O531 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /|267: O: O533 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-268: O: O536 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|269: O: O538 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, False)
- predict error 1
- dir: dir isU
- \-/270: O: O540 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-271: O: O542 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /272: O: O544 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-273: O: O546 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|274: O: O547 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-/275: O: O550 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-276: O: O552 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\277: O: O554 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/278: O: O555 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- |\-279: O: O558 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|280: O: O559 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-/281: O: O561 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- |282: O: O563 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-/283: O: O565 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isL
- |\-284: O: O568 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\-sleeping...
- /285: O: O569 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- |\-286: O: O572 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isR
- /|\287: O: O573 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/288: O: O575 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\289: O: O577 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/290: O: O579 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |291: O: O582 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \292: O: O583 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- -/293: O: O586 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-294: O: O588 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\295: O: O590 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/296: O: O592 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-297: O: O593 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, False)
- predict error 1
- dir: dir isR
- /|298: O: O596 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-299: O: O597 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- /|300: O: O600 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/|\-301: O: O602 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /302: O: O604 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-303: O: O605 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- /|\-304: O: O608 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|305: O: O610 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/306: O: O612 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |307: O: O613 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-/308: O: O616 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-309: O: O618 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\310: O: O620 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|311: O: O622 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \312: O: O623 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- -/|313: O: O626 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/314: O: O628 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-315: O: O630 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\316: O: O632 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/317: O: O634 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-318: O: O636 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\319: O: O638 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/320: O: O640 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\-321: O: O641 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- /322: O: O644 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-323: O: O645 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- /|324: O: O648 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/325: O: O649 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\-326: O: O652 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\327: O: O654 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|328: O: O656 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-/329: O: O657 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- |\-330: O: O660 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\331: O: O661 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -332: O: O663 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /|\333: O: O666 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isL
- -/|334: O: O668 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/|335: O: O670 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-336: O: O672 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\337: O: O674 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|338: O: O676 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-/339: O: O677 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- |\340: O: O680 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|341: O: O681 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \342: O: O684 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|343: O: O686 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/344: O: O687 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isR
- |\-345: O: O689 (predict-yes)
- I see 0 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|346: O: O692 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \347: O: O694 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/|348: O: O696 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/349: O: O698 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\350: O: O699 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -/|351: O: O701 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \352: O: O704 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/353: O: O706 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\354: O: O707 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -/|355: O: O709 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- \-/356: O: O711 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\-357: O: O713 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|\358: O: O716 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -359: O: O718 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|\360: O: O720 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/361: O: O722 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, False)
- predict error 1
- dir: dir isL
- |362: O: O724 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/363: O: O726 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-/364: O: O728 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-365: O: O730 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\366: O: O731 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- -/|367: O: O734 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/368: O: O736 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\-369: O: O737 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- /|\370: O: O740 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|\371: O: O742 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -372: O: O744 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\373: O: O745 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, False)
- predict error 1
- dir: dir isL
- -/|374: O: O748 (predict-no)
- I see 0 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/375: O: O750 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\376: O: O752 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|377: O: O754 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-378: O: O756 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\379: O: O758 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|380: O: O759 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \-/381: O: O762 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |382: O: O764 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/383: O: O766 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-384: O: O768 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\385: O: O770 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/386: O: O772 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-387: O: O774 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|\388: O: O776 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/389: O: O778 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-/390: O: O780 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-391: O: O782 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /392: O: O783 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\-393: O: O785 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- /|\394: O: O788 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|395: O: O790 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/396: O: O792 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-397: O: O794 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|398: O: O796 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-399: O: O798 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|400: O: O800 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/401: O: O802 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |402: O: O804 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/403: O: O805 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\-404: O: O808 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|405: O: O809 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- \-/406: O: O811 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\-407: O: O814 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\408: O: O816 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|409: O: O818 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/410: O: O820 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-411: O: O821 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /412: O: O824 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\-413: O: O825 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- /|\414: O: O827 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|415: O: O829 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-416: O: O832 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|417: O: O834 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-418: O: O836 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\419: O: O838 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|420: O: O839 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-/421: O: O842 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |422: O: O844 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/423: O: O846 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-424: O: O848 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\425: O: O849 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- -/|426: O: O852 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-427: O: O853 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- /|\428: O: O856 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/|429: O: O858 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-430: O: O859 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- /|431: O: O861 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- \432: O: O863 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -/|433: O: O866 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-/434: O: O867 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- |\-435: O: O870 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\436: O: O871 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -/|437: O: O873 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-438: O: O876 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\439: O: O878 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|440: O: O880 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/441: O: O882 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |442: O: O884 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/443: O: O886 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |444: O: O888 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/445: O: O890 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-446: O: O892 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|447: O: O893 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/448: O: O896 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-449: O: O897 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|\450: O: O900 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|451: O: O901 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \452: O: O904 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|453: O: O906 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/|454: O: O908 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-455: O: O910 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\456: O: O912 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|457: O: O914 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-458: O: O915 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|\459: O: O918 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|460: O: O919 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/461: O: O922 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |462: O: O923 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \-/463: O: O926 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\464: O: O928 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|465: O: O930 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/466: O: O931 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\467: O: O934 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|468: O: O936 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-/469: O: O937 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- |\-470: O: O940 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|\471: O: O942 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -472: O: O944 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\473: O: O946 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|\474: O: O947 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -/|475: O: O950 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-476: O: O952 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\477: O: O954 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|478: O: O956 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-479: O: O958 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\480: O: O959 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|481: O: O961 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \482: O: O964 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|\483: O: O965 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- -484: O: O968 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|\485: O: O970 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|486: O: O972 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-487: O: O974 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\488: O: O975 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- -/489: O: O978 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-/490: O: O980 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\-491: O: O982 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /492: O: O983 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- |\-493: O: O986 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\494: O: O987 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- -/|495: O: O990 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/496: O: O992 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\497: O: O994 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|498: O: O996 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/499: O: O998 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-500: O: O999 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /|\-/|501: O: O1001 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \502: O: O1003 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|503: O: O1005 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/|504: O: O1008 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/505: O: O1010 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\-506: O: O1012 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|507: O: O1014 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/|508: O: O1016 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/509: O: O1018 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-510: O: O1020 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\511: O: O1022 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -512: O: O1024 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\513: O: O1026 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|514: O: O1027 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- \-/515: O: O1029 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\-516: O: O1031 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|517: O: O1034 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/518: O: O1035 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\-519: O: O1038 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|520: O: O1039 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \-/521: O: O1042 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |522: O: O1043 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/523: O: O1046 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-524: O: O1047 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- /|\525: O: O1050 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/526: O: O1051 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\-527: O: O1054 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\528: O: O1056 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|\529: O: O1057 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|530: O: O1059 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-531: O: O1062 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /532: O: O1064 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\533: O: O1065 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|534: O: O1067 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \535: O: O1070 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/536: O: O1072 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-537: O: O1074 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /538: O: O1076 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\-539: O: O1078 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\540: O: O1080 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|541: O: O1082 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \542: O: O1083 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -543: O: O1085 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- /|\544: O: O1088 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/545: O: O1090 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\546: O: O1092 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|547: O: O1094 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-548: O: O1095 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- /|\549: O: O1098 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|\sleeping...
- -550: O: O1100 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\551: O: O1101 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -552: O: O1103 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- /|\553: O: O1106 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|554: O: O1107 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-/555: O: O1109 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- |\556: O: O1112 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|557: O: O1114 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/558: O: O1115 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\-559: O: O1117 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- /|\560: O: O1120 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|561: O: O1122 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \562: O: O1123 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -/|563: O: O1126 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-564: O: O1128 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\565: O: O1129 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- -/|566: O: O1132 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/567: O: O1134 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\568: O: O1135 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -/|569: O: O1137 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \-/570: O: O1140 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-571: O: O1142 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /572: O: O1144 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-573: O: O1146 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|\574: O: O1148 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/|575: O: O1150 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/576: O: O1151 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\577: O: O1153 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- -/|578: O: O1156 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-579: O: O1157 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- /|\580: O: O1159 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- -/581: O: O1162 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |582: O: O1164 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/583: O: O1165 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\-584: O: O1168 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\585: O: O1170 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|586: O: O1171 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- \-/587: O: O1173 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\-588: O: O1176 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\589: O: O1178 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|590: O: O1179 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-/591: O: O1182 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |592: O: O1183 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/593: O: O1186 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-594: O: O1187 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|\595: O: O1190 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|596: O: O1192 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/597: O: O1193 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\-598: O: O1196 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\599: O: O1198 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|600: O: O1200 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/601: O: O1202 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |602: O: O1204 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/603: O: O1206 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\604: O: O1208 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|605: O: O1209 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-606: O: O1212 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\607: O: O1214 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/608: O: O1215 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\-609: O: O1218 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\610: O: O1220 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|611: O: O1222 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \612: O: O1224 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|613: O: O1225 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- \-614: O: O1227 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- /|615: O: O1230 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/616: O: O1232 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\617: O: O1233 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- -/618: O: O1236 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\-619: O: O1237 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- /|\620: O: O1240 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|621: O: O1242 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \622: O: O1243 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|623: O: O1245 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-/624: O: O1247 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- |\-625: O: O1250 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\626: O: O1252 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/|627: O: O1254 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/628: O: O1256 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-629: O: O1258 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|630: O: O1260 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/631: O: O1262 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |632: O: O1264 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/633: O: O1265 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\634: O: O1267 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- -/|635: O: O1270 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/636: O: O1271 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\637: O: O1274 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|638: O: O1275 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-/639: O: O1278 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\640: O: O1279 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- -/|641: O: O1282 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \642: O: O1283 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- -/|643: O: O1286 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/644: O: O1288 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\645: O: O1290 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|646: O: O1292 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/647: O: O1293 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\648: O: O1295 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|649: O: O1297 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-650: O: O1300 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|651: O: O1302 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \652: O: O1304 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|653: O: O1305 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- \-/654: O: O1307 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\655: O: O1309 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- -/656: O: O1312 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\657: O: O1313 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -/658: O: O1315 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- |\-659: O: O1317 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- /|\660: O: O1320 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|661: O: O1322 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \662: O: O1324 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|663: O: O1326 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-664: O: O1328 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /665: O: O1330 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-666: O: O1331 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|\667: O: O1334 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -668: O: O1336 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|\669: O: O1338 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|670: O: O1340 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/|671: O: O1341 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \672: O: O1344 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|673: O: O1346 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/674: O: O1348 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-675: O: O1349 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /|\676: O: O1351 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -677: O: O1354 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|678: O: O1355 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \-/679: O: O1358 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\680: O: O1360 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/681: O: O1362 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |682: O: O1364 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \683: O: O1366 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|684: O: O1367 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/685: O: O1370 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-686: O: O1371 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|687: O: O1374 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/688: O: O1376 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-689: O: O1378 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\690: O: O1380 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/691: O: O1381 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |692: O: O1384 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-693: O: O1386 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\694: O: O1387 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|695: O: O1389 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-/696: O: O1391 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- |\-697: O: O1393 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- /|\698: O: O1396 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|699: O: O1398 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/700: O: O1400 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\701: O: O1401 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -702: O: O1403 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- /|703: O: O1405 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-/704: O: O1408 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\705: O: O1410 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -/|706: O: O1412 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/707: O: O1413 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\-708: O: O1416 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\709: O: O1417 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- -/|710: O: O1420 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/|711: O: O1422 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \712: O: O1424 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|713: O: O1426 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/714: O: O1428 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\715: O: O1430 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|716: O: O1432 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/717: O: O1434 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-718: O: O1436 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|\719: O: O1438 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|720: O: O1439 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-/721: O: O1442 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |722: O: O1444 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/723: O: O1446 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\-724: O: O1448 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\725: O: O1449 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|726: O: O1451 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/727: O: O1454 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-728: O: O1456 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\729: O: O1458 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|730: O: O1459 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \-/731: O: O1462 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |732: O: O1464 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/|733: O: O1466 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/734: O: O1467 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\735: O: O1470 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|736: O: O1472 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/737: O: O1474 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-738: O: O1475 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /|\739: O: O1477 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -/740: O: O1480 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\-741: O: O1482 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /742: O: O1483 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- |\-743: O: O1485 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- /|\744: O: O1487 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|745: O: O1489 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-/746: O: O1492 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-747: O: O1494 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\748: O: O1496 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|749: O: O1498 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-750: O: O1500 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\751: O: O1502 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -752: O: O1503 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|753: O: O1506 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \754: O: O1507 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- -/|755: O: O1510 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/756: O: O1512 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-757: O: O1513 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|758: O: O1516 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/759: O: O1517 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\-760: O: O1520 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\761: O: O1522 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -762: O: O1523 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /|\-763: O: O1525 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- /764: O: O1528 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\-765: O: O1530 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|766: O: O1532 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-/767: O: O1533 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- |\-768: O: O1536 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\769: O: O1538 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|770: O: O1539 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-/|771: O: O1541 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \772: O: O1544 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|773: O: O1546 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/774: O: O1547 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\-775: O: O1550 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|776: O: O1551 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- \-/777: O: O1553 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\778: O: O1556 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/|779: O: O1558 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/780: O: O1560 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-/781: O: O1561 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- |782: O: O1564 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/783: O: O1565 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\784: O: O1567 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|\785: O: O1569 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -/|786: O: O1572 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-/|sleeping...
- \787: O: O1573 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- -/|788: O: O1576 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \789: O: O1578 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/790: O: O1579 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\-791: O: O1582 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /792: O: O1584 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\793: O: O1586 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|794: O: O1588 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-795: O: O1590 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\796: O: O1592 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/797: O: O1594 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\798: O: O1596 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -799: O: O1597 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|800: O: O1600 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \-/801: O: O1602 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |802: O: O1604 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/803: O: O1605 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\-804: O: O1607 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /|805: O: O1609 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/806: O: O1612 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\807: O: O1613 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- -/|808: O: O1616 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/809: O: O1618 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\810: O: O1620 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|811: O: O1621 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \812: O: O1624 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -813: O: O1626 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\-sleeping...
- /814: O: O1627 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- |\815: O: O1630 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|816: O: O1631 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-817: O: O1633 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /|\818: O: O1635 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -/|819: O: O1638 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/|820: O: O1640 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-821: O: O1641 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /822: O: O1643 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\-823: O: O1645 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- /|\824: O: O1647 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -825: O: O1650 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|826: O: O1651 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \-/827: O: O1654 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\828: O: O1656 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/829: O: O1657 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\830: O: O1660 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -/831: O: O1662 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |832: O: O1664 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-833: O: O1665 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /834: O: O1668 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |\835: O: O1669 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- -/|836: O: O1672 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-837: O: O1674 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\838: O: O1676 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/839: O: O1677 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- |\-840: O: O1680 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|841: O: O1682 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \842: O: O1684 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/843: O: O1685 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |\844: O: O1688 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/|845: O: O1689 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-846: O: O1692 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- /|\847: O: O1694 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/848: O: O1695 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |\-849: O: O1698 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|850: O: O1699 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-/851: O: O1702 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |852: O: O1704 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/853: O: O1706 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-854: O: O1708 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\-855: O: O1709 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- /856: O: O1712 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |\-857: O: O1714 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\858: O: O1715 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- -/859: O: O1718 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- |\-860: O: O1720 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- /|\861: O: O1722 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- -862: O: O1724 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\863: O: O1725 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -/|864: O: O1728 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/865: O: O1730 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-866: O: O1732 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|867: O: O1733 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-/|868: O: O1736 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- \-/869: O: O1738 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\870: O: O1740 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/871: O: O1741 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- |872: O: O1744 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/873: O: O1746 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |874: O: O1747 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \875: O: O1750 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -876: O: O1752 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\877: O: O1753 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -878: O: O1756 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\879: O: O1757 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- -/|880: O: O1759 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-/881: O: O1762 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- |882: O: O1764 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- \-/883: O: O1765 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- |\884: O: O1767 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- -/|885: O: O1770 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-886: O: O1772 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|887: O: O1773 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- \-888: O: O1776 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|889: O: O1777 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- \-/|890: O: O1779 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-/891: O: O1782 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |892: O: O1783 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-893: O: O1786 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\894: O: O1788 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -895: O: O1790 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|\896: O: O1791 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- -/|897: O: O1794 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/898: O: O1795 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- |\-899: O: O1797 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|\900: O: O1800 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/|901: O: O1802 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- \902: O: O1804 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- -/|903: O: O1805 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/904: O: O1808 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-905: O: O1809 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- /|906: O: O1812 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-/907: O: O1813 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- |908: O: O1816 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-/909: O: O1818 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-910: O: O1819 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /911: O: O1822 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- |912: O: O1823 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- \-/913: O: O1826 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-914: O: O1828 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\-915: O: O1830 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\916: O: O1832 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -/|917: O: O1834 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-918: O: O1836 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\919: O: O1838 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- -920: O: O1840 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- /|\921: O: O1842 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- -922: O: O1844 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|923: O: O1846 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- \-924: O: O1848 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- /|925: O: O1849 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-/926: O: O1852 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-927: O: O1854 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|\928: O: O1855 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- -/929: O: O1857 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- |\930: O: O1859 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- -/931: O: O1862 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |932: O: O1864 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-/933: O: O1866 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- |\-934: O: O1868 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|935: O: O1870 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-936: O: O1872 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|937: O: O1874 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- \-938: O: O1876 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- /|\939: O: O1878 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- -/940: O: O1879 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- |\-/941: O: O1882 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |942: O: O1884 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- \-943: O: O1885 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- /|944: O: O1887 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- \-945: O: O1890 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- /|946: O: O1891 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- \-/947: O: O1894 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- |\-948: O: O1895 (predict-yes)
- I see 1 and I'm going to do: predict-yes
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- /|\949: O: O1898 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- -/950: O: O1900 (predict-no)
- I see 1 and I'm going to do: predict-no
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- |\-/|\-/|\-/|--- Input Phase ---
- =>WM: (13382: I2 ^dir R)
- =>WM: (13381: I2 ^reward 1)
- =>WM: (13380: I2 ^see 0)
- =>WM: (13379: N950 ^status complete)
- <=WM: (13368: I2 ^dir U)
- <=WM: (13367: I2 ^reward 1)
- <=WM: (13366: I2 ^see 0)
- =>WM: (13383: I2 ^level-1 R1-root)
- <=WM: (13369: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1899 = -0.3011268063455669)
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1900 = 0.7427516277634807)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R954 ^value 1 +)
- (R1 ^reward R954 +)
- Firing propose*predict-yes
- -->
- (O1901 ^name predict-yes +)
- (S1 ^operator O1901 +)
- Firing propose*predict-no
- -->
- (O1902 ^name predict-no +)
- (S1 ^operator O1902 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1900 = 0.2572472160770417)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1899 = 0.736829027581098)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1900 ^name predict-no +)
- (S1 ^operator O1900 +)
- Retracting propose*predict-yes
- -->
- (O1899 ^name predict-yes +)
- (S1 ^operator O1899 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R953 ^value 1 +)
- (R1 ^reward R953 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1900 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1899 = 0.)
- =>WM: (13390: S1 ^operator O1902 +)
- =>WM: (13389: S1 ^operator O1901 +)
- =>WM: (13388: I3 ^dir R)
- =>WM: (13387: O1902 ^name predict-no)
- =>WM: (13386: O1901 ^name predict-yes)
- =>WM: (13385: R954 ^value 1)
- =>WM: (13384: R1 ^reward R954)
- <=WM: (13375: S1 ^operator O1899 +)
- <=WM: (13376: S1 ^operator O1900 +)
- <=WM: (13377: S1 ^operator O1900)
- <=WM: (13360: I3 ^dir U)
- <=WM: (13371: R1 ^reward R953)
- <=WM: (13374: O1900 ^name predict-no)
- <=WM: (13373: O1899 ^name predict-yes)
- <=WM: (13372: R953 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1901 = -0.3011268063455669)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1901 = 0.736829027581098)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1902 = 0.7427516277634807)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1902 = 0.2572472160770417)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1900 = 0.2572472160770417)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1900 = 0.7427516277634807)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1899 = 0.736829027581098)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1899 = -0.3011268063455669)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13391: S1 ^operator O1902)
- 951: O: O1902 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N951 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N950 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13392: I3 ^predict-no N951)
- <=WM: (13379: N950 ^status complete)
- <=WM: (13378: I3 ^predict-no N950)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- \--- Input Phase ---
- =>WM: (13396: I2 ^dir L)
- =>WM: (13395: I2 ^reward 1)
- =>WM: (13394: I2 ^see 0)
- =>WM: (13393: N951 ^status complete)
- <=WM: (13382: I2 ^dir R)
- <=WM: (13381: I2 ^reward 1)
- <=WM: (13380: I2 ^see 0)
- =>WM: (13397: I2 ^level-1 R0-root)
- <=WM: (13383: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1902 = 0.04178081990804111)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1901 = 0.5681127864180794)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R955 ^value 1 +)
- (R1 ^reward R955 +)
- Firing propose*predict-yes
- -->
- (O1903 ^name predict-yes +)
- (S1 ^operator O1903 +)
- Firing propose*predict-no
- -->
- (O1904 ^name predict-no +)
- (S1 ^operator O1904 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1902 = 0.3289450941277776)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1901 = 0.43188926143453)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1902 ^name predict-no +)
- (S1 ^operator O1902 +)
- Retracting propose*predict-yes
- -->
- (O1901 ^name predict-yes +)
- (S1 ^operator O1901 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R954 ^value 1 +)
- (R1 ^reward R954 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1902 = 0.2572472160770417)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1902 = 0.7427516277634807)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1901 = 0.736829027581098)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1901 = -0.3011268063455669)
- =>WM: (13404: S1 ^operator O1904 +)
- =>WM: (13403: S1 ^operator O1903 +)
- =>WM: (13402: I3 ^dir L)
- =>WM: (13401: O1904 ^name predict-no)
- =>WM: (13400: O1903 ^name predict-yes)
- =>WM: (13399: R955 ^value 1)
- =>WM: (13398: R1 ^reward R955)
- <=WM: (13389: S1 ^operator O1901 +)
- <=WM: (13390: S1 ^operator O1902 +)
- <=WM: (13391: S1 ^operator O1902)
- <=WM: (13388: I3 ^dir R)
- <=WM: (13384: R1 ^reward R954)
- <=WM: (13387: O1902 ^name predict-no)
- <=WM: (13386: O1901 ^name predict-yes)
- <=WM: (13385: R954 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1903 = 0.43188926143453)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1903 = 0.5681127864180794)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1904 = 0.3289450941277776)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1904 = 0.04178081990804111)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1902 = 0.3289450941277776)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1902 = 0.04178081990804111)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1901 = 0.43188926143453)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1901 = 0.5681127864180794)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*4 0.586137 -0.32889 0.257247 -> 0.586137 -0.32889 0.257247(R,m,v=1,0.854545,0.125055)
- RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*36 0.413862 0.32889 0.742752 -> 0.413862 0.32889 0.742752(R,m,v=1,1,0)
- =>WM: (13405: S1 ^operator O1903)
- 952: O: O1903 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N952 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N951 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13406: I3 ^predict-yes N952)
- <=WM: (13393: N951 ^status complete)
- <=WM: (13392: I3 ^predict-no N951)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- -/|--- Input Phase ---
- =>WM: (13410: I2 ^dir U)
- =>WM: (13409: I2 ^reward 1)
- =>WM: (13408: I2 ^see 1)
- =>WM: (13407: N952 ^status complete)
- <=WM: (13396: I2 ^dir L)
- <=WM: (13395: I2 ^reward 1)
- <=WM: (13394: I2 ^see 0)
- =>WM: (13411: I2 ^level-1 L1-root)
- <=WM: (13397: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R956 ^value 1 +)
- (R1 ^reward R956 +)
- Firing propose*predict-yes
- -->
- (O1905 ^name predict-yes +)
- (S1 ^operator O1905 +)
- Firing propose*predict-no
- -->
- (O1906 ^name predict-no +)
- (S1 ^operator O1906 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1904 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1903 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1904 ^name predict-no +)
- (S1 ^operator O1904 +)
- Retracting propose*predict-yes
- -->
- (O1903 ^name predict-yes +)
- (S1 ^operator O1903 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R955 ^value 1 +)
- (R1 ^reward R955 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1904 = 0.04178081990804111)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1904 = 0.3289450941277776)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1903 = 0.5681127864180794)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1903 = 0.43188926143453)
- =>WM: (13419: S1 ^operator O1906 +)
- =>WM: (13418: S1 ^operator O1905 +)
- =>WM: (13417: I3 ^dir U)
- =>WM: (13416: O1906 ^name predict-no)
- =>WM: (13415: O1905 ^name predict-yes)
- =>WM: (13414: R956 ^value 1)
- =>WM: (13413: R1 ^reward R956)
- =>WM: (13412: I3 ^see 1)
- <=WM: (13403: S1 ^operator O1903 +)
- <=WM: (13405: S1 ^operator O1903)
- <=WM: (13404: S1 ^operator O1904 +)
- <=WM: (13402: I3 ^dir L)
- <=WM: (13398: R1 ^reward R955)
- <=WM: (13370: I3 ^see 0)
- <=WM: (13401: O1904 ^name predict-no)
- <=WM: (13400: O1903 ^name predict-yes)
- <=WM: (13399: R955 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1905 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1906 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1904 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1903 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*5 0.683775 -0.251886 0.431889 -> 0.683775 -0.251886 0.431889(R,m,v=1,0.919753,0.0742658)
- RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.316226 0.251886 0.568113 -> 0.316226 0.251886 0.568112(R,m,v=1,1,0)
- =>WM: (13420: S1 ^operator O1906)
- 953: O: O1906 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N953 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N952 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13421: I3 ^predict-no N953)
- <=WM: (13407: N952 ^status complete)
- <=WM: (13406: I3 ^predict-yes N952)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- \---- Input Phase ---
- =>WM: (13425: I2 ^dir R)
- =>WM: (13424: I2 ^reward 1)
- =>WM: (13423: I2 ^see 0)
- =>WM: (13422: N953 ^status complete)
- <=WM: (13410: I2 ^dir U)
- <=WM: (13409: I2 ^reward 1)
- <=WM: (13408: I2 ^see 1)
- =>WM: (13426: I2 ^level-1 L1-root)
- <=WM: (13411: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1906 = -0.1377248055371832)
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1905 = 0.2631666904115852)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R957 ^value 1 +)
- (R1 ^reward R957 +)
- Firing propose*predict-yes
- -->
- (O1907 ^name predict-yes +)
- (S1 ^operator O1907 +)
- Firing propose*predict-no
- -->
- (O1908 ^name predict-no +)
- (S1 ^operator O1908 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1906 = 0.2572473895009633)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1905 = 0.736829027581098)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1906 ^name predict-no +)
- (S1 ^operator O1906 +)
- Retracting propose*predict-yes
- -->
- (O1905 ^name predict-yes +)
- (S1 ^operator O1905 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R956 ^value 1 +)
- (R1 ^reward R956 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1906 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1905 = 0.)
- =>WM: (13434: S1 ^operator O1908 +)
- =>WM: (13433: S1 ^operator O1907 +)
- =>WM: (13432: I3 ^dir R)
- =>WM: (13431: O1908 ^name predict-no)
- =>WM: (13430: O1907 ^name predict-yes)
- =>WM: (13429: R957 ^value 1)
- =>WM: (13428: R1 ^reward R957)
- =>WM: (13427: I3 ^see 0)
- <=WM: (13418: S1 ^operator O1905 +)
- <=WM: (13419: S1 ^operator O1906 +)
- <=WM: (13420: S1 ^operator O1906)
- <=WM: (13417: I3 ^dir U)
- <=WM: (13413: R1 ^reward R956)
- <=WM: (13412: I3 ^see 1)
- <=WM: (13416: O1906 ^name predict-no)
- <=WM: (13415: O1905 ^name predict-yes)
- <=WM: (13414: R956 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1907 = 0.2631666904115852)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1907 = 0.736829027581098)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1908 = -0.1377248055371832)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1908 = 0.2572473895009633)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1906 = 0.2572473895009633)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1906 = -0.1377248055371832)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1905 = 0.736829027581098)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1905 = 0.2631666904115852)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13435: S1 ^operator O1907)
- 954: O: O1907 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N954 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N953 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13436: I3 ^predict-yes N954)
- <=WM: (13422: N953 ^status complete)
- <=WM: (13421: I3 ^predict-no N953)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- /|\---- Input Phase ---
- =>WM: (13440: I2 ^dir U)
- =>WM: (13439: I2 ^reward 1)
- =>WM: (13438: I2 ^see 1)
- =>WM: (13437: N954 ^status complete)
- <=WM: (13425: I2 ^dir R)
- <=WM: (13424: I2 ^reward 1)
- <=WM: (13423: I2 ^see 0)
- =>WM: (13441: I2 ^level-1 R1-root)
- <=WM: (13426: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R958 ^value 1 +)
- (R1 ^reward R958 +)
- Firing propose*predict-yes
- -->
- (O1909 ^name predict-yes +)
- (S1 ^operator O1909 +)
- Firing propose*predict-no
- -->
- (O1910 ^name predict-no +)
- (S1 ^operator O1910 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1908 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1907 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1908 ^name predict-no +)
- (S1 ^operator O1908 +)
- Retracting propose*predict-yes
- -->
- (O1907 ^name predict-yes +)
- (S1 ^operator O1907 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R957 ^value 1 +)
- (R1 ^reward R957 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1908 = 0.2572473895009633)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1908 = -0.1377248055371832)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1907 = 0.736829027581098)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1907 = 0.2631666904115852)
- =>WM: (13449: S1 ^operator O1910 +)
- =>WM: (13448: S1 ^operator O1909 +)
- =>WM: (13447: I3 ^dir U)
- =>WM: (13446: O1910 ^name predict-no)
- =>WM: (13445: O1909 ^name predict-yes)
- =>WM: (13444: R958 ^value 1)
- =>WM: (13443: R1 ^reward R958)
- =>WM: (13442: I3 ^see 1)
- <=WM: (13433: S1 ^operator O1907 +)
- <=WM: (13435: S1 ^operator O1907)
- <=WM: (13434: S1 ^operator O1908 +)
- <=WM: (13432: I3 ^dir R)
- <=WM: (13428: R1 ^reward R957)
- <=WM: (13427: I3 ^see 0)
- <=WM: (13431: O1908 ^name predict-no)
- <=WM: (13430: O1907 ^name predict-yes)
- <=WM: (13429: R957 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1909 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1910 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1908 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1907 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*3 0.748236 -0.0114073 0.736829 -> 0.748237 -0.0114068 0.73683(R,m,v=1,0.892405,0.0966298)
- RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*40 0.251763 0.0114042 0.263167 -> 0.251763 0.0114046 0.263167(R,m,v=1,1,0)
- =>WM: (13450: S1 ^operator O1910)
- 955: O: O1910 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N955 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N954 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13451: I3 ^predict-no N955)
- <=WM: (13437: N954 ^status complete)
- <=WM: (13436: I3 ^predict-yes N954)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- /|\--- Input Phase ---
- =>WM: (13455: I2 ^dir R)
- =>WM: (13454: I2 ^reward 1)
- =>WM: (13453: I2 ^see 0)
- =>WM: (13452: N955 ^status complete)
- <=WM: (13440: I2 ^dir U)
- <=WM: (13439: I2 ^reward 1)
- <=WM: (13438: I2 ^see 1)
- =>WM: (13456: I2 ^level-1 R1-root)
- <=WM: (13441: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1909 = -0.3011268063455669)
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1910 = 0.7427518011874024)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R959 ^value 1 +)
- (R1 ^reward R959 +)
- Firing propose*predict-yes
- -->
- (O1911 ^name predict-yes +)
- (S1 ^operator O1911 +)
- Firing propose*predict-no
- -->
- (O1912 ^name predict-no +)
- (S1 ^operator O1912 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1910 = 0.2572473895009633)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1909 = 0.7368296698821956)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1910 ^name predict-no +)
- (S1 ^operator O1910 +)
- Retracting propose*predict-yes
- -->
- (O1909 ^name predict-yes +)
- (S1 ^operator O1909 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R958 ^value 1 +)
- (R1 ^reward R958 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1910 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1909 = 0.)
- =>WM: (13464: S1 ^operator O1912 +)
- =>WM: (13463: S1 ^operator O1911 +)
- =>WM: (13462: I3 ^dir R)
- =>WM: (13461: O1912 ^name predict-no)
- =>WM: (13460: O1911 ^name predict-yes)
- =>WM: (13459: R959 ^value 1)
- =>WM: (13458: R1 ^reward R959)
- =>WM: (13457: I3 ^see 0)
- <=WM: (13448: S1 ^operator O1909 +)
- <=WM: (13449: S1 ^operator O1910 +)
- <=WM: (13450: S1 ^operator O1910)
- <=WM: (13447: I3 ^dir U)
- <=WM: (13443: R1 ^reward R958)
- <=WM: (13442: I3 ^see 1)
- <=WM: (13446: O1910 ^name predict-no)
- <=WM: (13445: O1909 ^name predict-yes)
- <=WM: (13444: R958 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1911 = -0.3011268063455669)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1911 = 0.7368296698821956)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1912 = 0.7427518011874024)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1912 = 0.2572473895009633)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1910 = 0.2572473895009633)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1910 = 0.7427518011874024)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1909 = 0.7368296698821956)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1909 = -0.3011268063455669)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13465: S1 ^operator O1912)
- 956: O: O1912 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N956 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N955 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13466: I3 ^predict-no N956)
- <=WM: (13452: N955 ^status complete)
- <=WM: (13451: I3 ^predict-no N955)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- -/|--- Input Phase ---
- =>WM: (13470: I2 ^dir R)
- =>WM: (13469: I2 ^reward 1)
- =>WM: (13468: I2 ^see 0)
- =>WM: (13467: N956 ^status complete)
- <=WM: (13455: I2 ^dir R)
- <=WM: (13454: I2 ^reward 1)
- <=WM: (13453: I2 ^see 0)
- =>WM: (13471: I2 ^level-1 R0-root)
- <=WM: (13456: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
- -->
- (S1 ^operator O1912 = 0.7427606592568701)
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
- -->
- (S1 ^operator O1911 = -0.1989581826229297)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R960 ^value 1 +)
- (R1 ^reward R960 +)
- Firing propose*predict-yes
- -->
- (O1913 ^name predict-yes +)
- (S1 ^operator O1913 +)
- Firing propose*predict-no
- -->
- (O1914 ^name predict-no +)
- (S1 ^operator O1914 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1912 = 0.2572473895009633)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1911 = 0.7368296698821956)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1912 ^name predict-no +)
- (S1 ^operator O1912 +)
- Retracting propose*predict-yes
- -->
- (O1911 ^name predict-yes +)
- (S1 ^operator O1911 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R959 ^value 1 +)
- (R1 ^reward R959 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1912 = 0.2572473895009633)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1912 = 0.7427518011874024)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1911 = 0.7368296698821956)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1911 = -0.3011268063455669)
- =>WM: (13477: S1 ^operator O1914 +)
- =>WM: (13476: S1 ^operator O1913 +)
- =>WM: (13475: O1914 ^name predict-no)
- =>WM: (13474: O1913 ^name predict-yes)
- =>WM: (13473: R960 ^value 1)
- =>WM: (13472: R1 ^reward R960)
- <=WM: (13463: S1 ^operator O1911 +)
- <=WM: (13464: S1 ^operator O1912 +)
- <=WM: (13465: S1 ^operator O1912)
- <=WM: (13458: R1 ^reward R959)
- <=WM: (13461: O1912 ^name predict-no)
- <=WM: (13460: O1911 ^name predict-yes)
- <=WM: (13459: R959 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1913 = 0.7368296698821956)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
- -->
- (S1 ^operator O1913 = -0.1989581826229297)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1914 = 0.2572473895009633)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
- -->
- (S1 ^operator O1914 = 0.7427606592568701)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1912 = 0.2572473895009633)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
- -->
- (S1 ^operator O1912 = 0.7427606592568701)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1911 = 0.7368296698821956)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
- -->
- (S1 ^operator O1911 = -0.1989581826229297)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*4 0.586137 -0.32889 0.257247 -> 0.586137 -0.32889 0.257248(R,m,v=1,0.855422,0.124425)
- RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*36 0.413862 0.32889 0.742752 -> 0.413862 0.32889 0.742752(R,m,v=1,1,0)
- =>WM: (13478: S1 ^operator O1914)
- 957: O: O1914 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N957 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N956 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13479: I3 ^predict-no N957)
- <=WM: (13467: N956 ^status complete)
- <=WM: (13466: I3 ^predict-no N956)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- \---- Input Phase ---
- =>WM: (13483: I2 ^dir L)
- =>WM: (13482: I2 ^reward 1)
- =>WM: (13481: I2 ^see 0)
- =>WM: (13480: N957 ^status complete)
- <=WM: (13470: I2 ^dir R)
- <=WM: (13469: I2 ^reward 1)
- <=WM: (13468: I2 ^see 0)
- =>WM: (13484: I2 ^level-1 R0-root)
- <=WM: (13471: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1914 = 0.04178081990804111)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1913 = 0.5681124792401879)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R961 ^value 1 +)
- (R1 ^reward R961 +)
- Firing propose*predict-yes
- -->
- (O1915 ^name predict-yes +)
- (S1 ^operator O1915 +)
- Firing propose*predict-no
- -->
- (O1916 ^name predict-no +)
- (S1 ^operator O1916 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1914 = 0.3289450941277776)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1913 = 0.4318889542566386)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1914 ^name predict-no +)
- (S1 ^operator O1914 +)
- Retracting propose*predict-yes
- -->
- (O1913 ^name predict-yes +)
- (S1 ^operator O1913 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R960 ^value 1 +)
- (R1 ^reward R960 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
- -->
- (S1 ^operator O1914 = 0.7427606592568701)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1914 = 0.2572475108977085)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
- -->
- (S1 ^operator O1913 = -0.1989581826229297)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1913 = 0.7368296698821956)
- =>WM: (13491: S1 ^operator O1916 +)
- =>WM: (13490: S1 ^operator O1915 +)
- =>WM: (13489: I3 ^dir L)
- =>WM: (13488: O1916 ^name predict-no)
- =>WM: (13487: O1915 ^name predict-yes)
- =>WM: (13486: R961 ^value 1)
- =>WM: (13485: R1 ^reward R961)
- <=WM: (13476: S1 ^operator O1913 +)
- <=WM: (13477: S1 ^operator O1914 +)
- <=WM: (13478: S1 ^operator O1914)
- <=WM: (13462: I3 ^dir R)
- <=WM: (13472: R1 ^reward R960)
- <=WM: (13475: O1914 ^name predict-no)
- <=WM: (13474: O1913 ^name predict-yes)
- <=WM: (13473: R960 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1915 = 0.5681124792401879)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1915 = 0.4318889542566386)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1916 = 0.04178081990804111)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1916 = 0.3289450941277776)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1914 = 0.3289450941277776)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1914 = 0.04178081990804111)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1913 = 0.4318889542566386)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1913 = 0.5681124792401879)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*4 0.586137 -0.32889 0.257248 -> 0.586136 -0.32889 0.257246(R,m,v=1,0.856287,0.123801)
- RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.413869 0.328891 0.742761 -> 0.413868 0.328891 0.742759(R,m,v=1,1,0)
- =>WM: (13492: S1 ^operator O1915)
- 958: O: O1915 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N958 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N957 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13493: I3 ^predict-yes N958)
- <=WM: (13480: N957 ^status complete)
- <=WM: (13479: I3 ^predict-no N957)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- /|--- Input Phase ---
- =>WM: (13497: I2 ^dir U)
- =>WM: (13496: I2 ^reward 1)
- =>WM: (13495: I2 ^see 1)
- =>WM: (13494: N958 ^status complete)
- <=WM: (13483: I2 ^dir L)
- <=WM: (13482: I2 ^reward 1)
- <=WM: (13481: I2 ^see 0)
- =>WM: (13498: I2 ^level-1 L1-root)
- <=WM: (13484: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R962 ^value 1 +)
- (R1 ^reward R962 +)
- Firing propose*predict-yes
- -->
- (O1917 ^name predict-yes +)
- (S1 ^operator O1917 +)
- Firing propose*predict-no
- -->
- (O1918 ^name predict-no +)
- (S1 ^operator O1918 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1916 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1915 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1916 ^name predict-no +)
- (S1 ^operator O1916 +)
- Retracting propose*predict-yes
- -->
- (O1915 ^name predict-yes +)
- (S1 ^operator O1915 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R961 ^value 1 +)
- (R1 ^reward R961 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1916 = 0.3289450941277776)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1916 = 0.04178081990804111)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1915 = 0.4318889542566386)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1915 = 0.5681124792401879)
- =>WM: (13506: S1 ^operator O1918 +)
- =>WM: (13505: S1 ^operator O1917 +)
- =>WM: (13504: I3 ^dir U)
- =>WM: (13503: O1918 ^name predict-no)
- =>WM: (13502: O1917 ^name predict-yes)
- =>WM: (13501: R962 ^value 1)
- =>WM: (13500: R1 ^reward R962)
- =>WM: (13499: I3 ^see 1)
- <=WM: (13490: S1 ^operator O1915 +)
- <=WM: (13492: S1 ^operator O1915)
- <=WM: (13491: S1 ^operator O1916 +)
- <=WM: (13489: I3 ^dir L)
- <=WM: (13485: R1 ^reward R961)
- <=WM: (13457: I3 ^see 0)
- <=WM: (13488: O1916 ^name predict-no)
- <=WM: (13487: O1915 ^name predict-yes)
- <=WM: (13486: R961 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1917 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1918 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1916 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1915 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*5 0.683775 -0.251886 0.431889 -> 0.683775 -0.251886 0.431889(R,m,v=1,0.920245,0.0738469)
- RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.316226 0.251886 0.568112 -> 0.316226 0.251886 0.568112(R,m,v=1,1,0)
- =>WM: (13507: S1 ^operator O1918)
- 959: O: O1918 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N959 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N958 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13508: I3 ^predict-no N959)
- <=WM: (13494: N958 ^status complete)
- <=WM: (13493: I3 ^predict-yes N958)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- \-/--- Input Phase ---
- =>WM: (13512: I2 ^dir L)
- =>WM: (13511: I2 ^reward 1)
- =>WM: (13510: I2 ^see 0)
- =>WM: (13509: N959 ^status complete)
- <=WM: (13497: I2 ^dir U)
- <=WM: (13496: I2 ^reward 1)
- <=WM: (13495: I2 ^see 1)
- =>WM: (13513: I2 ^level-1 L1-root)
- <=WM: (13498: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1918 = 0.671051122743914)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1917 = -0.06092862110810815)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R963 ^value 1 +)
- (R1 ^reward R963 +)
- Firing propose*predict-yes
- -->
- (O1919 ^name predict-yes +)
- (S1 ^operator O1919 +)
- Firing propose*predict-no
- -->
- (O1920 ^name predict-no +)
- (S1 ^operator O1920 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1918 = 0.3289450941277776)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1917 = 0.4318887392321146)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1918 ^name predict-no +)
- (S1 ^operator O1918 +)
- Retracting propose*predict-yes
- -->
- (O1917 ^name predict-yes +)
- (S1 ^operator O1917 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R962 ^value 1 +)
- (R1 ^reward R962 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1918 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1917 = 0.)
- =>WM: (13521: S1 ^operator O1920 +)
- =>WM: (13520: S1 ^operator O1919 +)
- =>WM: (13519: I3 ^dir L)
- =>WM: (13518: O1920 ^name predict-no)
- =>WM: (13517: O1919 ^name predict-yes)
- =>WM: (13516: R963 ^value 1)
- =>WM: (13515: R1 ^reward R963)
- =>WM: (13514: I3 ^see 0)
- <=WM: (13505: S1 ^operator O1917 +)
- <=WM: (13506: S1 ^operator O1918 +)
- <=WM: (13507: S1 ^operator O1918)
- <=WM: (13504: I3 ^dir U)
- <=WM: (13500: R1 ^reward R962)
- <=WM: (13499: I3 ^see 1)
- <=WM: (13503: O1918 ^name predict-no)
- <=WM: (13502: O1917 ^name predict-yes)
- <=WM: (13501: R962 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1919 = -0.06092862110810815)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1919 = 0.4318887392321146)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1920 = 0.671051122743914)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1920 = 0.3289450941277776)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1918 = 0.3289450941277776)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1918 = 0.671051122743914)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1917 = 0.4318887392321146)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1917 = -0.06092862110810815)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13522: S1 ^operator O1920)
- 960: O: O1920 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N960 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N959 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13523: I3 ^predict-no N960)
- <=WM: (13509: N959 ^status complete)
- <=WM: (13508: I3 ^predict-no N959)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- |\---- Input Phase ---
- =>WM: (13527: I2 ^dir U)
- =>WM: (13526: I2 ^reward 1)
- =>WM: (13525: I2 ^see 0)
- =>WM: (13524: N960 ^status complete)
- <=WM: (13512: I2 ^dir L)
- <=WM: (13511: I2 ^reward 1)
- <=WM: (13510: I2 ^see 0)
- =>WM: (13528: I2 ^level-1 L0-root)
- <=WM: (13513: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R964 ^value 1 +)
- (R1 ^reward R964 +)
- Firing propose*predict-yes
- -->
- (O1921 ^name predict-yes +)
- (S1 ^operator O1921 +)
- Firing propose*predict-no
- -->
- (O1922 ^name predict-no +)
- (S1 ^operator O1922 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1920 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1919 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1920 ^name predict-no +)
- (S1 ^operator O1920 +)
- Retracting propose*predict-yes
- -->
- (O1919 ^name predict-yes +)
- (S1 ^operator O1919 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R963 ^value 1 +)
- (R1 ^reward R963 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1920 = 0.3289450941277776)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1920 = 0.671051122743914)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1919 = 0.4318887392321146)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1919 = -0.06092862110810815)
- =>WM: (13535: S1 ^operator O1922 +)
- =>WM: (13534: S1 ^operator O1921 +)
- =>WM: (13533: I3 ^dir U)
- =>WM: (13532: O1922 ^name predict-no)
- =>WM: (13531: O1921 ^name predict-yes)
- =>WM: (13530: R964 ^value 1)
- =>WM: (13529: R1 ^reward R964)
- <=WM: (13520: S1 ^operator O1919 +)
- <=WM: (13521: S1 ^operator O1920 +)
- <=WM: (13522: S1 ^operator O1920)
- <=WM: (13519: I3 ^dir L)
- <=WM: (13515: R1 ^reward R963)
- <=WM: (13518: O1920 ^name predict-no)
- <=WM: (13517: O1919 ^name predict-yes)
- <=WM: (13516: R963 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1921 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1922 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1920 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1919 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*6 0.565402 -0.236456 0.328945 -> 0.565403 -0.236457 0.328946(R,m,v=1,0.903226,0.0879765)
- RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*43 0.434591 0.23646 0.671051 -> 0.434592 0.23646 0.671052(R,m,v=1,1,0)
- =>WM: (13536: S1 ^operator O1922)
- 961: O: O1922 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N961 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N960 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13537: I3 ^predict-no N961)
- <=WM: (13524: N960 ^status complete)
- <=WM: (13523: I3 ^predict-no N960)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- /--- Input Phase ---
- =>WM: (13541: I2 ^dir R)
- =>WM: (13540: I2 ^reward 1)
- =>WM: (13539: I2 ^see 0)
- =>WM: (13538: N961 ^status complete)
- <=WM: (13527: I2 ^dir U)
- <=WM: (13526: I2 ^reward 1)
- <=WM: (13525: I2 ^see 0)
- =>WM: (13542: I2 ^level-1 L0-root)
- <=WM: (13528: I2 ^level-1 L0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1922 = -0.07401383653737587)
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1921 = 0.2631774632268827)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R965 ^value 1 +)
- (R1 ^reward R965 +)
- Firing propose*predict-yes
- -->
- (O1923 ^name predict-yes +)
- (S1 ^operator O1923 +)
- Firing propose*predict-no
- -->
- (O1924 ^name predict-no +)
- (S1 ^operator O1924 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1922 = 0.2572462853745217)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1921 = 0.7368296698821956)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1922 ^name predict-no +)
- (S1 ^operator O1922 +)
- Retracting propose*predict-yes
- -->
- (O1921 ^name predict-yes +)
- (S1 ^operator O1921 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R964 ^value 1 +)
- (R1 ^reward R964 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1922 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1921 = 0.)
- =>WM: (13549: S1 ^operator O1924 +)
- =>WM: (13548: S1 ^operator O1923 +)
- =>WM: (13547: I3 ^dir R)
- =>WM: (13546: O1924 ^name predict-no)
- =>WM: (13545: O1923 ^name predict-yes)
- =>WM: (13544: R965 ^value 1)
- =>WM: (13543: R1 ^reward R965)
- <=WM: (13534: S1 ^operator O1921 +)
- <=WM: (13535: S1 ^operator O1922 +)
- <=WM: (13536: S1 ^operator O1922)
- <=WM: (13533: I3 ^dir U)
- <=WM: (13529: R1 ^reward R964)
- <=WM: (13532: O1922 ^name predict-no)
- <=WM: (13531: O1921 ^name predict-yes)
- <=WM: (13530: R964 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1923 = 0.2631774632268827)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1923 = 0.7368296698821956)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1924 = -0.07401383653737587)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1924 = 0.2572462853745217)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1922 = 0.2572462853745217)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1922 = -0.07401383653737587)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1921 = 0.7368296698821956)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1921 = 0.2631774632268827)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13550: S1 ^operator O1923)
- 962: O: O1923 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N962 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N961 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13551: I3 ^predict-yes N962)
- <=WM: (13538: N961 ^status complete)
- <=WM: (13537: I3 ^predict-no N961)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- |\---- Input Phase ---
- =>WM: (13555: I2 ^dir U)
- =>WM: (13554: I2 ^reward 1)
- =>WM: (13553: I2 ^see 1)
- =>WM: (13552: N962 ^status complete)
- <=WM: (13541: I2 ^dir R)
- <=WM: (13540: I2 ^reward 1)
- <=WM: (13539: I2 ^see 0)
- =>WM: (13556: I2 ^level-1 R1-root)
- <=WM: (13542: I2 ^level-1 L0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R966 ^value 1 +)
- (R1 ^reward R966 +)
- Firing propose*predict-yes
- -->
- (O1925 ^name predict-yes +)
- (S1 ^operator O1925 +)
- Firing propose*predict-no
- -->
- (O1926 ^name predict-no +)
- (S1 ^operator O1926 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1924 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1923 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1924 ^name predict-no +)
- (S1 ^operator O1924 +)
- Retracting propose*predict-yes
- -->
- (O1923 ^name predict-yes +)
- (S1 ^operator O1923 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R965 ^value 1 +)
- (R1 ^reward R965 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1924 = 0.2572462853745217)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1924 = -0.07401383653737587)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1923 = 0.7368296698821956)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1923 = 0.2631774632268827)
- =>WM: (13564: S1 ^operator O1926 +)
- =>WM: (13563: S1 ^operator O1925 +)
- =>WM: (13562: I3 ^dir U)
- =>WM: (13561: O1926 ^name predict-no)
- =>WM: (13560: O1925 ^name predict-yes)
- =>WM: (13559: R966 ^value 1)
- =>WM: (13558: R1 ^reward R966)
- =>WM: (13557: I3 ^see 1)
- <=WM: (13548: S1 ^operator O1923 +)
- <=WM: (13550: S1 ^operator O1923)
- <=WM: (13549: S1 ^operator O1924 +)
- <=WM: (13547: I3 ^dir R)
- <=WM: (13543: R1 ^reward R965)
- <=WM: (13514: I3 ^see 0)
- <=WM: (13546: O1924 ^name predict-no)
- <=WM: (13545: O1923 ^name predict-yes)
- <=WM: (13544: R965 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1925 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1926 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1924 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1923 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*3 0.748237 -0.0114068 0.73683 -> 0.748236 -0.0114076 0.736829(R,m,v=1,0.893082,0.0960911)
- RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*35 0.251765 0.0114121 0.263177 -> 0.251765 0.0114113 0.263176(R,m,v=1,1,0)
- =>WM: (13565: S1 ^operator O1926)
- 963: O: O1926 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N963 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N962 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13566: I3 ^predict-no N963)
- <=WM: (13552: N962 ^status complete)
- <=WM: (13551: I3 ^predict-yes N962)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- /|\--- Input Phase ---
- =>WM: (13570: I2 ^dir L)
- =>WM: (13569: I2 ^reward 1)
- =>WM: (13568: I2 ^see 0)
- =>WM: (13567: N963 ^status complete)
- <=WM: (13555: I2 ^dir U)
- <=WM: (13554: I2 ^reward 1)
- <=WM: (13553: I2 ^see 1)
- =>WM: (13571: I2 ^level-1 R1-root)
- <=WM: (13556: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1925 = 0.5681037396512361)
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1926 = -0.1549421060161498)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R967 ^value 1 +)
- (R1 ^reward R967 +)
- Firing propose*predict-yes
- -->
- (O1927 ^name predict-yes +)
- (S1 ^operator O1927 +)
- Firing propose*predict-no
- -->
- (O1928 ^name predict-no +)
- (S1 ^operator O1928 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1926 = 0.3289456615970239)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1925 = 0.4318887392321146)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1926 ^name predict-no +)
- (S1 ^operator O1926 +)
- Retracting propose*predict-yes
- -->
- (O1925 ^name predict-yes +)
- (S1 ^operator O1925 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R966 ^value 1 +)
- (R1 ^reward R966 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1926 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1925 = 0.)
- =>WM: (13579: S1 ^operator O1928 +)
- =>WM: (13578: S1 ^operator O1927 +)
- =>WM: (13577: I3 ^dir L)
- =>WM: (13576: O1928 ^name predict-no)
- =>WM: (13575: O1927 ^name predict-yes)
- =>WM: (13574: R967 ^value 1)
- =>WM: (13573: R1 ^reward R967)
- =>WM: (13572: I3 ^see 0)
- <=WM: (13563: S1 ^operator O1925 +)
- <=WM: (13564: S1 ^operator O1926 +)
- <=WM: (13565: S1 ^operator O1926)
- <=WM: (13562: I3 ^dir U)
- <=WM: (13558: R1 ^reward R966)
- <=WM: (13557: I3 ^see 1)
- <=WM: (13561: O1926 ^name predict-no)
- <=WM: (13560: O1925 ^name predict-yes)
- <=WM: (13559: R966 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1927 = 0.5681037396512361)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1927 = 0.4318887392321146)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1928 = -0.1549421060161498)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1928 = 0.3289456615970239)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1926 = 0.3289456615970239)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1926 = -0.1549421060161498)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1925 = 0.4318887392321146)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1925 = 0.5681037396512361)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13580: S1 ^operator O1927)
- 964: O: O1927 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N964 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N963 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13581: I3 ^predict-yes N964)
- <=WM: (13567: N963 ^status complete)
- <=WM: (13566: I3 ^predict-no N963)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- -/--- Input Phase ---
- =>WM: (13585: I2 ^dir U)
- =>WM: (13584: I2 ^reward 1)
- =>WM: (13583: I2 ^see 1)
- =>WM: (13582: N964 ^status complete)
- <=WM: (13570: I2 ^dir L)
- <=WM: (13569: I2 ^reward 1)
- <=WM: (13568: I2 ^see 0)
- =>WM: (13586: I2 ^level-1 L1-root)
- <=WM: (13571: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R968 ^value 1 +)
- (R1 ^reward R968 +)
- Firing propose*predict-yes
- -->
- (O1929 ^name predict-yes +)
- (S1 ^operator O1929 +)
- Firing propose*predict-no
- -->
- (O1930 ^name predict-no +)
- (S1 ^operator O1930 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1928 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1927 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1928 ^name predict-no +)
- (S1 ^operator O1928 +)
- Retracting propose*predict-yes
- -->
- (O1927 ^name predict-yes +)
- (S1 ^operator O1927 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R967 ^value 1 +)
- (R1 ^reward R967 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1928 = 0.3289456615970239)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1928 = -0.1549421060161498)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1927 = 0.4318887392321146)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1927 = 0.5681037396512361)
- =>WM: (13594: S1 ^operator O1930 +)
- =>WM: (13593: S1 ^operator O1929 +)
- =>WM: (13592: I3 ^dir U)
- =>WM: (13591: O1930 ^name predict-no)
- =>WM: (13590: O1929 ^name predict-yes)
- =>WM: (13589: R968 ^value 1)
- =>WM: (13588: R1 ^reward R968)
- =>WM: (13587: I3 ^see 1)
- <=WM: (13578: S1 ^operator O1927 +)
- <=WM: (13580: S1 ^operator O1927)
- <=WM: (13579: S1 ^operator O1928 +)
- <=WM: (13577: I3 ^dir L)
- <=WM: (13573: R1 ^reward R967)
- <=WM: (13572: I3 ^see 0)
- <=WM: (13576: O1928 ^name predict-no)
- <=WM: (13575: O1927 ^name predict-yes)
- <=WM: (13574: R967 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1929 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1930 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1928 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1927 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*5 0.683775 -0.251886 0.431889 -> 0.683776 -0.251886 0.43189(R,m,v=1,0.920732,0.0734326)
- RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*45 0.316218 0.251886 0.568104 -> 0.316219 0.251886 0.568105(R,m,v=1,1,0)
- =>WM: (13595: S1 ^operator O1930)
- 965: O: O1930 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N965 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N964 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13596: I3 ^predict-no N965)
- <=WM: (13582: N964 ^status complete)
- <=WM: (13581: I3 ^predict-yes N964)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- |\--- Input Phase ---
- =>WM: (13600: I2 ^dir L)
- =>WM: (13599: I2 ^reward 1)
- =>WM: (13598: I2 ^see 0)
- =>WM: (13597: N965 ^status complete)
- <=WM: (13585: I2 ^dir U)
- <=WM: (13584: I2 ^reward 1)
- <=WM: (13583: I2 ^see 1)
- =>WM: (13601: I2 ^level-1 L1-root)
- <=WM: (13586: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1930 = 0.6710516902131602)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1929 = -0.06092862110810815)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R969 ^value 1 +)
- (R1 ^reward R969 +)
- Firing propose*predict-yes
- -->
- (O1931 ^name predict-yes +)
- (S1 ^operator O1931 +)
- Firing propose*predict-no
- -->
- (O1932 ^name predict-no +)
- (S1 ^operator O1932 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1930 = 0.3289456615970239)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1929 = 0.431889867399612)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1930 ^name predict-no +)
- (S1 ^operator O1930 +)
- Retracting propose*predict-yes
- -->
- (O1929 ^name predict-yes +)
- (S1 ^operator O1929 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R968 ^value 1 +)
- (R1 ^reward R968 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1930 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1929 = 0.)
- =>WM: (13609: S1 ^operator O1932 +)
- =>WM: (13608: S1 ^operator O1931 +)
- =>WM: (13607: I3 ^dir L)
- =>WM: (13606: O1932 ^name predict-no)
- =>WM: (13605: O1931 ^name predict-yes)
- =>WM: (13604: R969 ^value 1)
- =>WM: (13603: R1 ^reward R969)
- =>WM: (13602: I3 ^see 0)
- <=WM: (13593: S1 ^operator O1929 +)
- <=WM: (13594: S1 ^operator O1930 +)
- <=WM: (13595: S1 ^operator O1930)
- <=WM: (13592: I3 ^dir U)
- <=WM: (13588: R1 ^reward R968)
- <=WM: (13587: I3 ^see 1)
- <=WM: (13591: O1930 ^name predict-no)
- <=WM: (13590: O1929 ^name predict-yes)
- <=WM: (13589: R968 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1931 = -0.06092862110810815)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1931 = 0.431889867399612)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1932 = 0.6710516902131602)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1932 = 0.3289456615970239)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1930 = 0.3289456615970239)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1930 = 0.6710516902131602)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1929 = 0.431889867399612)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1929 = -0.06092862110810815)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13610: S1 ^operator O1932)
- 966: O: O1932 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N966 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N965 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13611: I3 ^predict-no N966)
- <=WM: (13597: N965 ^status complete)
- <=WM: (13596: I3 ^predict-no N965)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- -/|--- Input Phase ---
- =>WM: (13615: I2 ^dir R)
- =>WM: (13614: I2 ^reward 1)
- =>WM: (13613: I2 ^see 0)
- =>WM: (13612: N966 ^status complete)
- <=WM: (13600: I2 ^dir L)
- <=WM: (13599: I2 ^reward 1)
- <=WM: (13598: I2 ^see 0)
- =>WM: (13616: I2 ^level-1 L0-root)
- <=WM: (13601: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1932 = -0.07401383653737587)
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1931 = 0.2631763932605209)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R970 ^value 1 +)
- (R1 ^reward R970 +)
- Firing propose*predict-yes
- -->
- (O1933 ^name predict-yes +)
- (S1 ^operator O1933 +)
- Firing propose*predict-no
- -->
- (O1934 ^name predict-no +)
- (S1 ^operator O1934 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1932 = 0.2572462853745217)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1931 = 0.7368285999158338)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1932 ^name predict-no +)
- (S1 ^operator O1932 +)
- Retracting propose*predict-yes
- -->
- (O1931 ^name predict-yes +)
- (S1 ^operator O1931 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R969 ^value 1 +)
- (R1 ^reward R969 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1932 = 0.3289456615970239)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1932 = 0.6710516902131602)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1931 = 0.431889867399612)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1931 = -0.06092862110810815)
- =>WM: (13623: S1 ^operator O1934 +)
- =>WM: (13622: S1 ^operator O1933 +)
- =>WM: (13621: I3 ^dir R)
- =>WM: (13620: O1934 ^name predict-no)
- =>WM: (13619: O1933 ^name predict-yes)
- =>WM: (13618: R970 ^value 1)
- =>WM: (13617: R1 ^reward R970)
- <=WM: (13608: S1 ^operator O1931 +)
- <=WM: (13609: S1 ^operator O1932 +)
- <=WM: (13610: S1 ^operator O1932)
- <=WM: (13607: I3 ^dir L)
- <=WM: (13603: R1 ^reward R969)
- <=WM: (13606: O1932 ^name predict-no)
- <=WM: (13605: O1931 ^name predict-yes)
- <=WM: (13604: R969 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1933 = 0.7368285999158338)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1933 = 0.2631763932605209)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1934 = 0.2572462853745217)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1934 = -0.07401383653737587)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1932 = 0.2572462853745217)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1932 = -0.07401383653737587)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1931 = 0.7368285999158338)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1931 = 0.2631763932605209)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*6 0.565403 -0.236457 0.328946 -> 0.565403 -0.236457 0.328946(R,m,v=1,0.903846,0.087469)
- RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*43 0.434592 0.23646 0.671052 -> 0.434593 0.236459 0.671052(R,m,v=1,1,0)
- =>WM: (13624: S1 ^operator O1933)
- 967: O: O1933 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N967 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N966 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13625: I3 ^predict-yes N967)
- <=WM: (13612: N966 ^status complete)
- <=WM: (13611: I3 ^predict-no N966)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- \---- Input Phase ---
- =>WM: (13629: I2 ^dir R)
- =>WM: (13628: I2 ^reward 1)
- =>WM: (13627: I2 ^see 1)
- =>WM: (13626: N967 ^status complete)
- <=WM: (13615: I2 ^dir R)
- <=WM: (13614: I2 ^reward 1)
- <=WM: (13613: I2 ^see 0)
- =>WM: (13630: I2 ^level-1 R1-root)
- <=WM: (13616: I2 ^level-1 L0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1933 = -0.3011268063455669)
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1934 = 0.7427519225841476)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R971 ^value 1 +)
- (R1 ^reward R971 +)
- Firing propose*predict-yes
- -->
- (O1935 ^name predict-yes +)
- (S1 ^operator O1935 +)
- Firing propose*predict-no
- -->
- (O1936 ^name predict-no +)
- (S1 ^operator O1936 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1934 = 0.2572462853745217)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1933 = 0.7368285999158338)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1934 ^name predict-no +)
- (S1 ^operator O1934 +)
- Retracting propose*predict-yes
- -->
- (O1933 ^name predict-yes +)
- (S1 ^operator O1933 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R970 ^value 1 +)
- (R1 ^reward R970 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1934 = -0.07401383653737587)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1934 = 0.2572462853745217)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1933 = 0.2631763932605209)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1933 = 0.7368285999158338)
- =>WM: (13637: S1 ^operator O1936 +)
- =>WM: (13636: S1 ^operator O1935 +)
- =>WM: (13635: O1936 ^name predict-no)
- =>WM: (13634: O1935 ^name predict-yes)
- =>WM: (13633: R971 ^value 1)
- =>WM: (13632: R1 ^reward R971)
- =>WM: (13631: I3 ^see 1)
- <=WM: (13622: S1 ^operator O1933 +)
- <=WM: (13624: S1 ^operator O1933)
- <=WM: (13623: S1 ^operator O1934 +)
- <=WM: (13617: R1 ^reward R970)
- <=WM: (13602: I3 ^see 0)
- <=WM: (13620: O1934 ^name predict-no)
- <=WM: (13619: O1933 ^name predict-yes)
- <=WM: (13618: R970 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1935 = 0.7368285999158338)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1935 = -0.3011268063455669)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1936 = 0.2572462853745217)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1936 = 0.7427519225841476)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1934 = 0.2572462853745217)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1934 = 0.7427519225841476)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1933 = 0.7368285999158338)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1933 = -0.3011268063455669)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*3 0.748236 -0.0114076 0.736829 -> 0.748236 -0.0114082 0.736828(R,m,v=1,0.89375,0.0955582)
- RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*35 0.251765 0.0114113 0.263176 -> 0.251765 0.0114107 0.263176(R,m,v=1,1,0)
- =>WM: (13638: S1 ^operator O1936)
- 968: O: O1936 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N968 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N967 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13639: I3 ^predict-no N968)
- <=WM: (13626: N967 ^status complete)
- <=WM: (13625: I3 ^predict-yes N967)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- /|\--- Input Phase ---
- =>WM: (13643: I2 ^dir U)
- =>WM: (13642: I2 ^reward 1)
- =>WM: (13641: I2 ^see 0)
- =>WM: (13640: N968 ^status complete)
- <=WM: (13629: I2 ^dir R)
- <=WM: (13628: I2 ^reward 1)
- <=WM: (13627: I2 ^see 1)
- =>WM: (13644: I2 ^level-1 R0-root)
- <=WM: (13630: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R972 ^value 1 +)
- (R1 ^reward R972 +)
- Firing propose*predict-yes
- -->
- (O1937 ^name predict-yes +)
- (S1 ^operator O1937 +)
- Firing propose*predict-no
- -->
- (O1938 ^name predict-no +)
- (S1 ^operator O1938 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1936 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1935 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1936 ^name predict-no +)
- (S1 ^operator O1936 +)
- Retracting propose*predict-yes
- -->
- (O1935 ^name predict-yes +)
- (S1 ^operator O1935 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R971 ^value 1 +)
- (R1 ^reward R971 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1936 = 0.7427519225841476)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1936 = 0.2572462853745217)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1935 = -0.3011268063455669)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1935 = 0.7368278509393806)
- =>WM: (13652: S1 ^operator O1938 +)
- =>WM: (13651: S1 ^operator O1937 +)
- =>WM: (13650: I3 ^dir U)
- =>WM: (13649: O1938 ^name predict-no)
- =>WM: (13648: O1937 ^name predict-yes)
- =>WM: (13647: R972 ^value 1)
- =>WM: (13646: R1 ^reward R972)
- =>WM: (13645: I3 ^see 0)
- <=WM: (13636: S1 ^operator O1935 +)
- <=WM: (13637: S1 ^operator O1936 +)
- <=WM: (13638: S1 ^operator O1936)
- <=WM: (13621: I3 ^dir R)
- <=WM: (13632: R1 ^reward R971)
- <=WM: (13631: I3 ^see 1)
- <=WM: (13635: O1936 ^name predict-no)
- <=WM: (13634: O1935 ^name predict-yes)
- <=WM: (13633: R971 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1937 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1938 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1936 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1935 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*4 0.586136 -0.32889 0.257246 -> 0.586136 -0.32889 0.257247(R,m,v=1,0.857143,0.123182)
- RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*36 0.413862 0.32889 0.742752 -> 0.413863 0.32889 0.742752(R,m,v=1,1,0)
- =>WM: (13653: S1 ^operator O1938)
- 969: O: O1938 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N969 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N968 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13654: I3 ^predict-no N969)
- <=WM: (13640: N968 ^status complete)
- <=WM: (13639: I3 ^predict-no N968)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- -/|--- Input Phase ---
- =>WM: (13658: I2 ^dir U)
- =>WM: (13657: I2 ^reward 1)
- =>WM: (13656: I2 ^see 0)
- =>WM: (13655: N969 ^status complete)
- <=WM: (13643: I2 ^dir U)
- <=WM: (13642: I2 ^reward 1)
- <=WM: (13641: I2 ^see 0)
- =>WM: (13659: I2 ^level-1 R0-root)
- <=WM: (13644: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R973 ^value 1 +)
- (R1 ^reward R973 +)
- Firing propose*predict-yes
- -->
- (O1939 ^name predict-yes +)
- (S1 ^operator O1939 +)
- Firing propose*predict-no
- -->
- (O1940 ^name predict-no +)
- (S1 ^operator O1940 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1938 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1937 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1938 ^name predict-no +)
- (S1 ^operator O1938 +)
- Retracting propose*predict-yes
- -->
- (O1937 ^name predict-yes +)
- (S1 ^operator O1937 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R972 ^value 1 +)
- (R1 ^reward R972 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1938 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1937 = 0.)
- =>WM: (13665: S1 ^operator O1940 +)
- =>WM: (13664: S1 ^operator O1939 +)
- =>WM: (13663: O1940 ^name predict-no)
- =>WM: (13662: O1939 ^name predict-yes)
- =>WM: (13661: R973 ^value 1)
- =>WM: (13660: R1 ^reward R973)
- <=WM: (13651: S1 ^operator O1937 +)
- <=WM: (13652: S1 ^operator O1938 +)
- <=WM: (13653: S1 ^operator O1938)
- <=WM: (13646: R1 ^reward R972)
- <=WM: (13649: O1938 ^name predict-no)
- <=WM: (13648: O1937 ^name predict-yes)
- <=WM: (13647: R972 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1939 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1940 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1938 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1937 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13666: S1 ^operator O1940)
- 970: O: O1940 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N970 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N969 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13667: I3 ^predict-no N970)
- <=WM: (13655: N969 ^status complete)
- <=WM: (13654: I3 ^predict-no N969)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- \---- Input Phase ---
- =>WM: (13671: I2 ^dir L)
- =>WM: (13670: I2 ^reward 1)
- =>WM: (13669: I2 ^see 0)
- =>WM: (13668: N970 ^status complete)
- <=WM: (13658: I2 ^dir U)
- <=WM: (13657: I2 ^reward 1)
- <=WM: (13656: I2 ^see 0)
- =>WM: (13672: I2 ^level-1 R0-root)
- <=WM: (13659: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1940 = 0.04178081990804111)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1939 = 0.568112264215664)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R974 ^value 1 +)
- (R1 ^reward R974 +)
- Firing propose*predict-yes
- -->
- (O1941 ^name predict-yes +)
- (S1 ^operator O1941 +)
- Firing propose*predict-no
- -->
- (O1942 ^name predict-no +)
- (S1 ^operator O1942 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1940 = 0.3289460588254962)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1939 = 0.431889867399612)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1940 ^name predict-no +)
- (S1 ^operator O1940 +)
- Retracting propose*predict-yes
- -->
- (O1939 ^name predict-yes +)
- (S1 ^operator O1939 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R973 ^value 1 +)
- (R1 ^reward R973 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1940 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1939 = 0.)
- =>WM: (13679: S1 ^operator O1942 +)
- =>WM: (13678: S1 ^operator O1941 +)
- =>WM: (13677: I3 ^dir L)
- =>WM: (13676: O1942 ^name predict-no)
- =>WM: (13675: O1941 ^name predict-yes)
- =>WM: (13674: R974 ^value 1)
- =>WM: (13673: R1 ^reward R974)
- <=WM: (13664: S1 ^operator O1939 +)
- <=WM: (13665: S1 ^operator O1940 +)
- <=WM: (13666: S1 ^operator O1940)
- <=WM: (13650: I3 ^dir U)
- <=WM: (13660: R1 ^reward R973)
- <=WM: (13663: O1940 ^name predict-no)
- <=WM: (13662: O1939 ^name predict-yes)
- <=WM: (13661: R973 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1941 = 0.568112264215664)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1941 = 0.431889867399612)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1942 = 0.04178081990804111)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1942 = 0.3289460588254962)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1940 = 0.3289460588254962)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1940 = 0.04178081990804111)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1939 = 0.431889867399612)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1939 = 0.568112264215664)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13680: S1 ^operator O1941)
- 971: O: O1941 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N971 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N970 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13681: I3 ^predict-yes N971)
- <=WM: (13668: N970 ^status complete)
- <=WM: (13667: I3 ^predict-no N970)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- /--- Input Phase ---
- =>WM: (13685: I2 ^dir R)
- =>WM: (13684: I2 ^reward 1)
- =>WM: (13683: I2 ^see 1)
- =>WM: (13682: N971 ^status complete)
- <=WM: (13671: I2 ^dir L)
- <=WM: (13670: I2 ^reward 1)
- <=WM: (13669: I2 ^see 0)
- =>WM: (13686: I2 ^level-1 L1-root)
- <=WM: (13672: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1942 = -0.1377248055371832)
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1941 = 0.2631673327126827)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R975 ^value 1 +)
- (R1 ^reward R975 +)
- Firing propose*predict-yes
- -->
- (O1943 ^name predict-yes +)
- (S1 ^operator O1943 +)
- Firing propose*predict-no
- -->
- (O1944 ^name predict-no +)
- (S1 ^operator O1944 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1942 = 0.2572465541807213)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1941 = 0.7368278509393806)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1942 ^name predict-no +)
- (S1 ^operator O1942 +)
- Retracting propose*predict-yes
- -->
- (O1941 ^name predict-yes +)
- (S1 ^operator O1941 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R974 ^value 1 +)
- (R1 ^reward R974 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1942 = 0.3289460588254962)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1942 = 0.04178081990804111)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1941 = 0.431889867399612)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1941 = 0.568112264215664)
- =>WM: (13694: S1 ^operator O1944 +)
- =>WM: (13693: S1 ^operator O1943 +)
- =>WM: (13692: I3 ^dir R)
- =>WM: (13691: O1944 ^name predict-no)
- =>WM: (13690: O1943 ^name predict-yes)
- =>WM: (13689: R975 ^value 1)
- =>WM: (13688: R1 ^reward R975)
- =>WM: (13687: I3 ^see 1)
- <=WM: (13678: S1 ^operator O1941 +)
- <=WM: (13680: S1 ^operator O1941)
- <=WM: (13679: S1 ^operator O1942 +)
- <=WM: (13677: I3 ^dir L)
- <=WM: (13673: R1 ^reward R974)
- <=WM: (13645: I3 ^see 0)
- <=WM: (13676: O1942 ^name predict-no)
- <=WM: (13675: O1941 ^name predict-yes)
- <=WM: (13674: R974 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1943 = 0.7368278509393806)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1943 = 0.2631673327126827)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1944 = 0.2572465541807213)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1944 = -0.1377248055371832)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1942 = 0.2572465541807213)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1942 = -0.1377248055371832)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1941 = 0.7368278509393806)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1941 = 0.2631673327126827)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*5 0.683776 -0.251886 0.43189 -> 0.683776 -0.251886 0.43189(R,m,v=1,0.921212,0.0730229)
- RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.316226 0.251886 0.568112 -> 0.316226 0.251886 0.568112(R,m,v=1,1,0)
- =>WM: (13695: S1 ^operator O1943)
- 972: O: O1943 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N972 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N971 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13696: I3 ^predict-yes N972)
- <=WM: (13682: N971 ^status complete)
- <=WM: (13681: I3 ^predict-yes N971)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- |\--- Input Phase ---
- =>WM: (13700: I2 ^dir L)
- =>WM: (13699: I2 ^reward 1)
- =>WM: (13698: I2 ^see 1)
- =>WM: (13697: N972 ^status complete)
- <=WM: (13685: I2 ^dir R)
- <=WM: (13684: I2 ^reward 1)
- <=WM: (13683: I2 ^see 1)
- =>WM: (13701: I2 ^level-1 R1-root)
- <=WM: (13686: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1943 = 0.5681048678187335)
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1944 = -0.1549421060161498)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R976 ^value 1 +)
- (R1 ^reward R976 +)
- Firing propose*predict-yes
- -->
- (O1945 ^name predict-yes +)
- (S1 ^operator O1945 +)
- Firing propose*predict-no
- -->
- (O1946 ^name predict-no +)
- (S1 ^operator O1946 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1944 = 0.3289460588254962)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1943 = 0.4318895476573206)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1944 ^name predict-no +)
- (S1 ^operator O1944 +)
- Retracting propose*predict-yes
- -->
- (O1943 ^name predict-yes +)
- (S1 ^operator O1943 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R975 ^value 1 +)
- (R1 ^reward R975 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1944 = -0.1377248055371832)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1944 = 0.2572465541807213)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1943 = 0.2631673327126827)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1943 = 0.7368278509393806)
- =>WM: (13708: S1 ^operator O1946 +)
- =>WM: (13707: S1 ^operator O1945 +)
- =>WM: (13706: I3 ^dir L)
- =>WM: (13705: O1946 ^name predict-no)
- =>WM: (13704: O1945 ^name predict-yes)
- =>WM: (13703: R976 ^value 1)
- =>WM: (13702: R1 ^reward R976)
- <=WM: (13693: S1 ^operator O1943 +)
- <=WM: (13695: S1 ^operator O1943)
- <=WM: (13694: S1 ^operator O1944 +)
- <=WM: (13692: I3 ^dir R)
- <=WM: (13688: R1 ^reward R975)
- <=WM: (13691: O1944 ^name predict-no)
- <=WM: (13690: O1943 ^name predict-yes)
- <=WM: (13689: R975 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1945 = 0.4318895476573206)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1945 = 0.5681048678187335)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1946 = 0.3289460588254962)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1946 = -0.1549421060161498)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1944 = 0.3289460588254962)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1944 = -0.1549421060161498)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1943 = 0.4318895476573206)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1943 = 0.5681048678187335)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*3 0.748236 -0.0114082 0.736828 -> 0.748236 -0.0114076 0.736829(R,m,v=1,0.89441,0.0950311)
- RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*40 0.251763 0.0114046 0.263167 -> 0.251763 0.0114052 0.263168(R,m,v=1,1,0)
- =>WM: (13709: S1 ^operator O1945)
- 973: O: O1945 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N973 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N972 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13710: I3 ^predict-yes N973)
- <=WM: (13697: N972 ^status complete)
- <=WM: (13696: I3 ^predict-yes N972)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- -/|--- Input Phase ---
- =>WM: (13714: I2 ^dir U)
- =>WM: (13713: I2 ^reward 1)
- =>WM: (13712: I2 ^see 1)
- =>WM: (13711: N973 ^status complete)
- <=WM: (13700: I2 ^dir L)
- <=WM: (13699: I2 ^reward 1)
- <=WM: (13698: I2 ^see 1)
- =>WM: (13715: I2 ^level-1 L1-root)
- <=WM: (13701: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R977 ^value 1 +)
- (R1 ^reward R977 +)
- Firing propose*predict-yes
- -->
- (O1947 ^name predict-yes +)
- (S1 ^operator O1947 +)
- Firing propose*predict-no
- -->
- (O1948 ^name predict-no +)
- (S1 ^operator O1948 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1946 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1945 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1946 ^name predict-no +)
- (S1 ^operator O1946 +)
- Retracting propose*predict-yes
- -->
- (O1945 ^name predict-yes +)
- (S1 ^operator O1945 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R976 ^value 1 +)
- (R1 ^reward R976 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1946 = -0.1549421060161498)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1946 = 0.3289460588254962)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1945 = 0.5681048678187335)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1945 = 0.4318895476573206)
- =>WM: (13722: S1 ^operator O1948 +)
- =>WM: (13721: S1 ^operator O1947 +)
- =>WM: (13720: I3 ^dir U)
- =>WM: (13719: O1948 ^name predict-no)
- =>WM: (13718: O1947 ^name predict-yes)
- =>WM: (13717: R977 ^value 1)
- =>WM: (13716: R1 ^reward R977)
- <=WM: (13707: S1 ^operator O1945 +)
- <=WM: (13709: S1 ^operator O1945)
- <=WM: (13708: S1 ^operator O1946 +)
- <=WM: (13706: I3 ^dir L)
- <=WM: (13702: R1 ^reward R976)
- <=WM: (13705: O1946 ^name predict-no)
- <=WM: (13704: O1945 ^name predict-yes)
- <=WM: (13703: R976 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1947 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1948 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1946 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1945 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*5 0.683776 -0.251886 0.43189 -> 0.683777 -0.251886 0.43189(R,m,v=1,0.921687,0.0726177)
- RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*45 0.316219 0.251886 0.568105 -> 0.31622 0.251886 0.568106(R,m,v=1,1,0)
- =>WM: (13723: S1 ^operator O1948)
- 974: O: O1948 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N974 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N973 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13724: I3 ^predict-no N974)
- <=WM: (13711: N973 ^status complete)
- <=WM: (13710: I3 ^predict-yes N973)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- \-/--- Input Phase ---
- =>WM: (13728: I2 ^dir U)
- =>WM: (13727: I2 ^reward 1)
- =>WM: (13726: I2 ^see 0)
- =>WM: (13725: N974 ^status complete)
- <=WM: (13714: I2 ^dir U)
- <=WM: (13713: I2 ^reward 1)
- <=WM: (13712: I2 ^see 1)
- =>WM: (13729: I2 ^level-1 L1-root)
- <=WM: (13715: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R978 ^value 1 +)
- (R1 ^reward R978 +)
- Firing propose*predict-yes
- -->
- (O1949 ^name predict-yes +)
- (S1 ^operator O1949 +)
- Firing propose*predict-no
- -->
- (O1950 ^name predict-no +)
- (S1 ^operator O1950 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1948 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1947 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1948 ^name predict-no +)
- (S1 ^operator O1948 +)
- Retracting propose*predict-yes
- -->
- (O1947 ^name predict-yes +)
- (S1 ^operator O1947 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R977 ^value 1 +)
- (R1 ^reward R977 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1948 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1947 = 0.)
- =>WM: (13736: S1 ^operator O1950 +)
- =>WM: (13735: S1 ^operator O1949 +)
- =>WM: (13734: O1950 ^name predict-no)
- =>WM: (13733: O1949 ^name predict-yes)
- =>WM: (13732: R978 ^value 1)
- =>WM: (13731: R1 ^reward R978)
- =>WM: (13730: I3 ^see 0)
- <=WM: (13721: S1 ^operator O1947 +)
- <=WM: (13722: S1 ^operator O1948 +)
- <=WM: (13723: S1 ^operator O1948)
- <=WM: (13716: R1 ^reward R977)
- <=WM: (13687: I3 ^see 1)
- <=WM: (13719: O1948 ^name predict-no)
- <=WM: (13718: O1947 ^name predict-yes)
- <=WM: (13717: R977 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1949 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1950 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1948 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1947 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13737: S1 ^operator O1950)
- 975: O: O1950 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N975 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N974 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13738: I3 ^predict-no N975)
- <=WM: (13725: N974 ^status complete)
- <=WM: (13724: I3 ^predict-no N974)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- |\---- Input Phase ---
- =>WM: (13742: I2 ^dir R)
- =>WM: (13741: I2 ^reward 1)
- =>WM: (13740: I2 ^see 0)
- =>WM: (13739: N975 ^status complete)
- <=WM: (13728: I2 ^dir U)
- <=WM: (13727: I2 ^reward 1)
- <=WM: (13726: I2 ^see 0)
- =>WM: (13743: I2 ^level-1 L1-root)
- <=WM: (13729: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1950 = -0.1377248055371832)
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1949 = 0.2631680551648732)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R979 ^value 1 +)
- (R1 ^reward R979 +)
- Firing propose*predict-yes
- -->
- (O1951 ^name predict-yes +)
- (S1 ^operator O1951 +)
- Firing propose*predict-no
- -->
- (O1952 ^name predict-no +)
- (S1 ^operator O1952 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1950 = 0.2572465541807213)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1949 = 0.7368285733915712)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1950 ^name predict-no +)
- (S1 ^operator O1950 +)
- Retracting propose*predict-yes
- -->
- (O1949 ^name predict-yes +)
- (S1 ^operator O1949 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R978 ^value 1 +)
- (R1 ^reward R978 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1950 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1949 = 0.)
- =>WM: (13750: S1 ^operator O1952 +)
- =>WM: (13749: S1 ^operator O1951 +)
- =>WM: (13748: I3 ^dir R)
- =>WM: (13747: O1952 ^name predict-no)
- =>WM: (13746: O1951 ^name predict-yes)
- =>WM: (13745: R979 ^value 1)
- =>WM: (13744: R1 ^reward R979)
- <=WM: (13735: S1 ^operator O1949 +)
- <=WM: (13736: S1 ^operator O1950 +)
- <=WM: (13737: S1 ^operator O1950)
- <=WM: (13720: I3 ^dir U)
- <=WM: (13731: R1 ^reward R978)
- <=WM: (13734: O1950 ^name predict-no)
- <=WM: (13733: O1949 ^name predict-yes)
- <=WM: (13732: R978 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1951 = 0.2631680551648732)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1951 = 0.7368285733915712)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1952 = -0.1377248055371832)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1952 = 0.2572465541807213)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1950 = 0.2572465541807213)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1950 = -0.1377248055371832)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1949 = 0.7368285733915712)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1949 = 0.2631680551648732)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13751: S1 ^operator O1951)
- 976: O: O1951 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N976 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N975 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13752: I3 ^predict-yes N976)
- <=WM: (13739: N975 ^status complete)
- <=WM: (13738: I3 ^predict-no N975)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- /|\--- Input Phase ---
- =>WM: (13756: I2 ^dir U)
- =>WM: (13755: I2 ^reward 1)
- =>WM: (13754: I2 ^see 1)
- =>WM: (13753: N976 ^status complete)
- <=WM: (13742: I2 ^dir R)
- <=WM: (13741: I2 ^reward 1)
- <=WM: (13740: I2 ^see 0)
- =>WM: (13757: I2 ^level-1 R1-root)
- <=WM: (13743: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R980 ^value 1 +)
- (R1 ^reward R980 +)
- Firing propose*predict-yes
- -->
- (O1953 ^name predict-yes +)
- (S1 ^operator O1953 +)
- Firing propose*predict-no
- -->
- (O1954 ^name predict-no +)
- (S1 ^operator O1954 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1952 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1951 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1952 ^name predict-no +)
- (S1 ^operator O1952 +)
- Retracting propose*predict-yes
- -->
- (O1951 ^name predict-yes +)
- (S1 ^operator O1951 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R979 ^value 1 +)
- (R1 ^reward R979 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1952 = 0.2572465541807213)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1952 = -0.1377248055371832)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1951 = 0.7368285733915712)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1951 = 0.2631680551648732)
- =>WM: (13765: S1 ^operator O1954 +)
- =>WM: (13764: S1 ^operator O1953 +)
- =>WM: (13763: I3 ^dir U)
- =>WM: (13762: O1954 ^name predict-no)
- =>WM: (13761: O1953 ^name predict-yes)
- =>WM: (13760: R980 ^value 1)
- =>WM: (13759: R1 ^reward R980)
- =>WM: (13758: I3 ^see 1)
- <=WM: (13749: S1 ^operator O1951 +)
- <=WM: (13751: S1 ^operator O1951)
- <=WM: (13750: S1 ^operator O1952 +)
- <=WM: (13748: I3 ^dir R)
- <=WM: (13744: R1 ^reward R979)
- <=WM: (13730: I3 ^see 0)
- <=WM: (13747: O1952 ^name predict-no)
- <=WM: (13746: O1951 ^name predict-yes)
- <=WM: (13745: R979 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1953 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1954 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1952 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1951 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*3 0.748236 -0.0114076 0.736829 -> 0.748236 -0.0114073 0.736829(R,m,v=1,0.895062,0.0945096)
- RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*40 0.251763 0.0114052 0.263168 -> 0.251763 0.0114055 0.263169(R,m,v=1,1,0)
- =>WM: (13766: S1 ^operator O1954)
- 977: O: O1954 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N977 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N976 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13767: I3 ^predict-no N977)
- <=WM: (13753: N976 ^status complete)
- <=WM: (13752: I3 ^predict-yes N976)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- -/|--- Input Phase ---
- =>WM: (13771: I2 ^dir U)
- =>WM: (13770: I2 ^reward 1)
- =>WM: (13769: I2 ^see 0)
- =>WM: (13768: N977 ^status complete)
- <=WM: (13756: I2 ^dir U)
- <=WM: (13755: I2 ^reward 1)
- <=WM: (13754: I2 ^see 1)
- =>WM: (13772: I2 ^level-1 R1-root)
- <=WM: (13757: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R981 ^value 1 +)
- (R1 ^reward R981 +)
- Firing propose*predict-yes
- -->
- (O1955 ^name predict-yes +)
- (S1 ^operator O1955 +)
- Firing propose*predict-no
- -->
- (O1956 ^name predict-no +)
- (S1 ^operator O1956 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1954 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1953 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1954 ^name predict-no +)
- (S1 ^operator O1954 +)
- Retracting propose*predict-yes
- -->
- (O1953 ^name predict-yes +)
- (S1 ^operator O1953 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R980 ^value 1 +)
- (R1 ^reward R980 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1954 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1953 = 0.)
- =>WM: (13779: S1 ^operator O1956 +)
- =>WM: (13778: S1 ^operator O1955 +)
- =>WM: (13777: O1956 ^name predict-no)
- =>WM: (13776: O1955 ^name predict-yes)
- =>WM: (13775: R981 ^value 1)
- =>WM: (13774: R1 ^reward R981)
- =>WM: (13773: I3 ^see 0)
- <=WM: (13764: S1 ^operator O1953 +)
- <=WM: (13765: S1 ^operator O1954 +)
- <=WM: (13766: S1 ^operator O1954)
- <=WM: (13759: R1 ^reward R980)
- <=WM: (13758: I3 ^see 1)
- <=WM: (13762: O1954 ^name predict-no)
- <=WM: (13761: O1953 ^name predict-yes)
- <=WM: (13760: R980 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1955 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1956 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1954 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1953 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13780: S1 ^operator O1956)
- 978: O: O1956 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N978 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N977 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13781: I3 ^predict-no N978)
- <=WM: (13768: N977 ^status complete)
- <=WM: (13767: I3 ^predict-no N977)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- \-/--- Input Phase ---
- =>WM: (13785: I2 ^dir R)
- =>WM: (13784: I2 ^reward 1)
- =>WM: (13783: I2 ^see 0)
- =>WM: (13782: N978 ^status complete)
- <=WM: (13771: I2 ^dir U)
- <=WM: (13770: I2 ^reward 1)
- <=WM: (13769: I2 ^see 0)
- =>WM: (13786: I2 ^level-1 R1-root)
- <=WM: (13772: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1955 = -0.3011268063455669)
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1956 = 0.7427521913903472)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R982 ^value 1 +)
- (R1 ^reward R982 +)
- Firing propose*predict-yes
- -->
- (O1957 ^name predict-yes +)
- (S1 ^operator O1957 +)
- Firing propose*predict-no
- -->
- (O1958 ^name predict-no +)
- (S1 ^operator O1958 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1956 = 0.2572465541807213)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1955 = 0.7368290791081045)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1956 ^name predict-no +)
- (S1 ^operator O1956 +)
- Retracting propose*predict-yes
- -->
- (O1955 ^name predict-yes +)
- (S1 ^operator O1955 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R981 ^value 1 +)
- (R1 ^reward R981 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1956 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1955 = 0.)
- =>WM: (13793: S1 ^operator O1958 +)
- =>WM: (13792: S1 ^operator O1957 +)
- =>WM: (13791: I3 ^dir R)
- =>WM: (13790: O1958 ^name predict-no)
- =>WM: (13789: O1957 ^name predict-yes)
- =>WM: (13788: R982 ^value 1)
- =>WM: (13787: R1 ^reward R982)
- <=WM: (13778: S1 ^operator O1955 +)
- <=WM: (13779: S1 ^operator O1956 +)
- <=WM: (13780: S1 ^operator O1956)
- <=WM: (13763: I3 ^dir U)
- <=WM: (13774: R1 ^reward R981)
- <=WM: (13777: O1956 ^name predict-no)
- <=WM: (13776: O1955 ^name predict-yes)
- <=WM: (13775: R981 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1957 = -0.3011268063455669)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1957 = 0.7368290791081045)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1958 = 0.7427521913903472)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1958 = 0.2572465541807213)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1956 = 0.2572465541807213)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1956 = 0.7427521913903472)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1955 = 0.7368290791081045)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1955 = -0.3011268063455669)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13794: S1 ^operator O1958)
- 979: O: O1958 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N979 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N978 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13795: I3 ^predict-no N979)
- <=WM: (13782: N978 ^status complete)
- <=WM: (13781: I3 ^predict-no N978)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- |--- Input Phase ---
- =>WM: (13799: I2 ^dir U)
- =>WM: (13798: I2 ^reward 1)
- =>WM: (13797: I2 ^see 0)
- =>WM: (13796: N979 ^status complete)
- <=WM: (13785: I2 ^dir R)
- <=WM: (13784: I2 ^reward 1)
- <=WM: (13783: I2 ^see 0)
- =>WM: (13800: I2 ^level-1 R0-root)
- <=WM: (13786: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R983 ^value 1 +)
- (R1 ^reward R983 +)
- Firing propose*predict-yes
- -->
- (O1959 ^name predict-yes +)
- (S1 ^operator O1959 +)
- Firing propose*predict-no
- -->
- (O1960 ^name predict-no +)
- (S1 ^operator O1960 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1958 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1957 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1958 ^name predict-no +)
- (S1 ^operator O1958 +)
- Retracting propose*predict-yes
- -->
- (O1957 ^name predict-yes +)
- (S1 ^operator O1957 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R982 ^value 1 +)
- (R1 ^reward R982 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1958 = 0.2572465541807213)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1958 = 0.7427521913903472)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1957 = 0.7368290791081045)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1957 = -0.3011268063455669)
- =>WM: (13807: S1 ^operator O1960 +)
- =>WM: (13806: S1 ^operator O1959 +)
- =>WM: (13805: I3 ^dir U)
- =>WM: (13804: O1960 ^name predict-no)
- =>WM: (13803: O1959 ^name predict-yes)
- =>WM: (13802: R983 ^value 1)
- =>WM: (13801: R1 ^reward R983)
- <=WM: (13792: S1 ^operator O1957 +)
- <=WM: (13793: S1 ^operator O1958 +)
- <=WM: (13794: S1 ^operator O1958)
- <=WM: (13791: I3 ^dir R)
- <=WM: (13787: R1 ^reward R982)
- <=WM: (13790: O1958 ^name predict-no)
- <=WM: (13789: O1957 ^name predict-yes)
- <=WM: (13788: R982 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1959 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1960 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1958 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1957 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*4 0.586136 -0.32889 0.257247 -> 0.586137 -0.32889 0.257247(R,m,v=1,0.857988,0.12257)
- RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*36 0.413863 0.32889 0.742752 -> 0.413863 0.32889 0.742752(R,m,v=1,1,0)
- =>WM: (13808: S1 ^operator O1960)
- 980: O: O1960 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N980 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N979 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13809: I3 ^predict-no N980)
- <=WM: (13796: N979 ^status complete)
- <=WM: (13795: I3 ^predict-no N979)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- \-/--- Input Phase ---
- =>WM: (13813: I2 ^dir U)
- =>WM: (13812: I2 ^reward 1)
- =>WM: (13811: I2 ^see 0)
- =>WM: (13810: N980 ^status complete)
- <=WM: (13799: I2 ^dir U)
- <=WM: (13798: I2 ^reward 1)
- <=WM: (13797: I2 ^see 0)
- =>WM: (13814: I2 ^level-1 R0-root)
- <=WM: (13800: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R984 ^value 1 +)
- (R1 ^reward R984 +)
- Firing propose*predict-yes
- -->
- (O1961 ^name predict-yes +)
- (S1 ^operator O1961 +)
- Firing propose*predict-no
- -->
- (O1962 ^name predict-no +)
- (S1 ^operator O1962 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1960 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1959 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1960 ^name predict-no +)
- (S1 ^operator O1960 +)
- Retracting propose*predict-yes
- -->
- (O1959 ^name predict-yes +)
- (S1 ^operator O1959 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R983 ^value 1 +)
- (R1 ^reward R983 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1960 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1959 = 0.)
- =>WM: (13820: S1 ^operator O1962 +)
- =>WM: (13819: S1 ^operator O1961 +)
- =>WM: (13818: O1962 ^name predict-no)
- =>WM: (13817: O1961 ^name predict-yes)
- =>WM: (13816: R984 ^value 1)
- =>WM: (13815: R1 ^reward R984)
- <=WM: (13806: S1 ^operator O1959 +)
- <=WM: (13807: S1 ^operator O1960 +)
- <=WM: (13808: S1 ^operator O1960)
- <=WM: (13801: R1 ^reward R983)
- <=WM: (13804: O1960 ^name predict-no)
- <=WM: (13803: O1959 ^name predict-yes)
- <=WM: (13802: R983 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1961 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1962 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1960 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1959 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13821: S1 ^operator O1962)
- 981: O: O1962 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N981 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N980 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13822: I3 ^predict-no N981)
- <=WM: (13810: N980 ^status complete)
- <=WM: (13809: I3 ^predict-no N980)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- |--- Input Phase ---
- =>WM: (13826: I2 ^dir L)
- =>WM: (13825: I2 ^reward 1)
- =>WM: (13824: I2 ^see 0)
- =>WM: (13823: N981 ^status complete)
- <=WM: (13813: I2 ^dir U)
- <=WM: (13812: I2 ^reward 1)
- <=WM: (13811: I2 ^see 0)
- =>WM: (13827: I2 ^level-1 R0-root)
- <=WM: (13814: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1962 = 0.04178081990804111)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1961 = 0.5681119444733725)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R985 ^value 1 +)
- (R1 ^reward R985 +)
- Firing propose*predict-yes
- -->
- (O1963 ^name predict-yes +)
- (S1 ^operator O1963 +)
- Firing propose*predict-no
- -->
- (O1964 ^name predict-no +)
- (S1 ^operator O1964 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1962 = 0.3289460588254962)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1961 = 0.4318903853359125)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1962 ^name predict-no +)
- (S1 ^operator O1962 +)
- Retracting propose*predict-yes
- -->
- (O1961 ^name predict-yes +)
- (S1 ^operator O1961 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R984 ^value 1 +)
- (R1 ^reward R984 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1962 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1961 = 0.)
- =>WM: (13834: S1 ^operator O1964 +)
- =>WM: (13833: S1 ^operator O1963 +)
- =>WM: (13832: I3 ^dir L)
- =>WM: (13831: O1964 ^name predict-no)
- =>WM: (13830: O1963 ^name predict-yes)
- =>WM: (13829: R985 ^value 1)
- =>WM: (13828: R1 ^reward R985)
- <=WM: (13819: S1 ^operator O1961 +)
- <=WM: (13820: S1 ^operator O1962 +)
- <=WM: (13821: S1 ^operator O1962)
- <=WM: (13805: I3 ^dir U)
- <=WM: (13815: R1 ^reward R984)
- <=WM: (13818: O1962 ^name predict-no)
- <=WM: (13817: O1961 ^name predict-yes)
- <=WM: (13816: R984 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1963 = 0.5681119444733725)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1963 = 0.4318903853359125)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1964 = 0.04178081990804111)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1964 = 0.3289460588254962)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1962 = 0.3289460588254962)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1962 = 0.04178081990804111)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1961 = 0.4318903853359125)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1961 = 0.5681119444733725)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13835: S1 ^operator O1963)
- 982: O: O1963 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N982 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N981 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13836: I3 ^predict-yes N982)
- <=WM: (13823: N981 ^status complete)
- <=WM: (13822: I3 ^predict-no N981)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- \--- Input Phase ---
- =>WM: (13840: I2 ^dir U)
- =>WM: (13839: I2 ^reward 1)
- =>WM: (13838: I2 ^see 1)
- =>WM: (13837: N982 ^status complete)
- <=WM: (13826: I2 ^dir L)
- <=WM: (13825: I2 ^reward 1)
- <=WM: (13824: I2 ^see 0)
- =>WM: (13841: I2 ^level-1 L1-root)
- <=WM: (13827: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R986 ^value 1 +)
- (R1 ^reward R986 +)
- Firing propose*predict-yes
- -->
- (O1965 ^name predict-yes +)
- (S1 ^operator O1965 +)
- Firing propose*predict-no
- -->
- (O1966 ^name predict-no +)
- (S1 ^operator O1966 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1964 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1963 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1964 ^name predict-no +)
- (S1 ^operator O1964 +)
- Retracting propose*predict-yes
- -->
- (O1963 ^name predict-yes +)
- (S1 ^operator O1963 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R985 ^value 1 +)
- (R1 ^reward R985 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1964 = 0.3289460588254962)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1964 = 0.04178081990804111)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1963 = 0.4318903853359125)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1963 = 0.5681119444733725)
- =>WM: (13849: S1 ^operator O1966 +)
- =>WM: (13848: S1 ^operator O1965 +)
- =>WM: (13847: I3 ^dir U)
- =>WM: (13846: O1966 ^name predict-no)
- =>WM: (13845: O1965 ^name predict-yes)
- =>WM: (13844: R986 ^value 1)
- =>WM: (13843: R1 ^reward R986)
- =>WM: (13842: I3 ^see 1)
- <=WM: (13833: S1 ^operator O1963 +)
- <=WM: (13835: S1 ^operator O1963)
- <=WM: (13834: S1 ^operator O1964 +)
- <=WM: (13832: I3 ^dir L)
- <=WM: (13828: R1 ^reward R985)
- <=WM: (13773: I3 ^see 0)
- <=WM: (13831: O1964 ^name predict-no)
- <=WM: (13830: O1963 ^name predict-yes)
- <=WM: (13829: R985 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1965 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1966 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1964 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1963 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*5 0.683777 -0.251886 0.43189 -> 0.683776 -0.251886 0.43189(R,m,v=1,0.922156,0.072217)
- RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.316226 0.251886 0.568112 -> 0.316225 0.251886 0.568112(R,m,v=1,1,0)
- =>WM: (13850: S1 ^operator O1966)
- 983: O: O1966 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N983 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N982 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13851: I3 ^predict-no N983)
- <=WM: (13837: N982 ^status complete)
- <=WM: (13836: I3 ^predict-yes N982)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- -/|--- Input Phase ---
- =>WM: (13855: I2 ^dir L)
- =>WM: (13854: I2 ^reward 1)
- =>WM: (13853: I2 ^see 0)
- =>WM: (13852: N983 ^status complete)
- <=WM: (13840: I2 ^dir U)
- <=WM: (13839: I2 ^reward 1)
- <=WM: (13838: I2 ^see 1)
- =>WM: (13856: I2 ^level-1 L1-root)
- <=WM: (13841: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1966 = 0.6710520874416326)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1965 = -0.06092862110810815)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R987 ^value 1 +)
- (R1 ^reward R987 +)
- Firing propose*predict-yes
- -->
- (O1967 ^name predict-yes +)
- (S1 ^operator O1967 +)
- Firing propose*predict-no
- -->
- (O1968 ^name predict-no +)
- (S1 ^operator O1968 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1966 = 0.3289460588254962)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1965 = 0.4318900358645197)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1966 ^name predict-no +)
- (S1 ^operator O1966 +)
- Retracting propose*predict-yes
- -->
- (O1965 ^name predict-yes +)
- (S1 ^operator O1965 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R986 ^value 1 +)
- (R1 ^reward R986 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1966 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1965 = 0.)
- =>WM: (13864: S1 ^operator O1968 +)
- =>WM: (13863: S1 ^operator O1967 +)
- =>WM: (13862: I3 ^dir L)
- =>WM: (13861: O1968 ^name predict-no)
- =>WM: (13860: O1967 ^name predict-yes)
- =>WM: (13859: R987 ^value 1)
- =>WM: (13858: R1 ^reward R987)
- =>WM: (13857: I3 ^see 0)
- <=WM: (13848: S1 ^operator O1965 +)
- <=WM: (13849: S1 ^operator O1966 +)
- <=WM: (13850: S1 ^operator O1966)
- <=WM: (13847: I3 ^dir U)
- <=WM: (13843: R1 ^reward R986)
- <=WM: (13842: I3 ^see 1)
- <=WM: (13846: O1966 ^name predict-no)
- <=WM: (13845: O1965 ^name predict-yes)
- <=WM: (13844: R986 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1967 = -0.06092862110810815)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1967 = 0.4318900358645197)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1968 = 0.6710520874416326)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1968 = 0.3289460588254962)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1966 = 0.3289460588254962)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1966 = 0.6710520874416326)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1965 = 0.4318900358645197)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1965 = -0.06092862110810815)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13865: S1 ^operator O1968)
- 984: O: O1968 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N984 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N983 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13866: I3 ^predict-no N984)
- <=WM: (13852: N983 ^status complete)
- <=WM: (13851: I3 ^predict-no N983)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- \---- Input Phase ---
- =>WM: (13870: I2 ^dir U)
- =>WM: (13869: I2 ^reward 1)
- =>WM: (13868: I2 ^see 0)
- =>WM: (13867: N984 ^status complete)
- <=WM: (13855: I2 ^dir L)
- <=WM: (13854: I2 ^reward 1)
- <=WM: (13853: I2 ^see 0)
- =>WM: (13871: I2 ^level-1 L0-root)
- <=WM: (13856: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R988 ^value 1 +)
- (R1 ^reward R988 +)
- Firing propose*predict-yes
- -->
- (O1969 ^name predict-yes +)
- (S1 ^operator O1969 +)
- Firing propose*predict-no
- -->
- (O1970 ^name predict-no +)
- (S1 ^operator O1970 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1968 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1967 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1968 ^name predict-no +)
- (S1 ^operator O1968 +)
- Retracting propose*predict-yes
- -->
- (O1967 ^name predict-yes +)
- (S1 ^operator O1967 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R987 ^value 1 +)
- (R1 ^reward R987 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1968 = 0.3289460588254962)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1968 = 0.6710520874416326)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1967 = 0.4318900358645197)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1967 = -0.06092862110810815)
- =>WM: (13878: S1 ^operator O1970 +)
- =>WM: (13877: S1 ^operator O1969 +)
- =>WM: (13876: I3 ^dir U)
- =>WM: (13875: O1970 ^name predict-no)
- =>WM: (13874: O1969 ^name predict-yes)
- =>WM: (13873: R988 ^value 1)
- =>WM: (13872: R1 ^reward R988)
- <=WM: (13863: S1 ^operator O1967 +)
- <=WM: (13864: S1 ^operator O1968 +)
- <=WM: (13865: S1 ^operator O1968)
- <=WM: (13862: I3 ^dir L)
- <=WM: (13858: R1 ^reward R987)
- <=WM: (13861: O1968 ^name predict-no)
- <=WM: (13860: O1967 ^name predict-yes)
- <=WM: (13859: R987 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1969 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1970 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1968 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1967 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*6 0.565403 -0.236457 0.328946 -> 0.565404 -0.236458 0.328946(R,m,v=1,0.904459,0.0869672)
- RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*43 0.434593 0.236459 0.671052 -> 0.434593 0.236459 0.671052(R,m,v=1,1,0)
- =>WM: (13879: S1 ^operator O1970)
- 985: O: O1970 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N985 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N984 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13880: I3 ^predict-no N985)
- <=WM: (13867: N984 ^status complete)
- <=WM: (13866: I3 ^predict-no N984)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- /|\-sleeping...
- /--- Input Phase ---
- =>WM: (13884: I2 ^dir R)
- =>WM: (13883: I2 ^reward 1)
- =>WM: (13882: I2 ^see 0)
- =>WM: (13881: N985 ^status complete)
- <=WM: (13870: I2 ^dir U)
- <=WM: (13869: I2 ^reward 1)
- <=WM: (13868: I2 ^see 0)
- =>WM: (13885: I2 ^level-1 L0-root)
- <=WM: (13871: I2 ^level-1 L0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1970 = -0.07401383653737587)
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1969 = 0.2631756442840678)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R989 ^value 1 +)
- (R1 ^reward R989 +)
- Firing propose*predict-yes
- -->
- (O1971 ^name predict-yes +)
- (S1 ^operator O1971 +)
- Firing propose*predict-no
- -->
- (O1972 ^name predict-no +)
- (S1 ^operator O1972 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1970 = 0.257246742345061)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1969 = 0.7368290791081045)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1970 ^name predict-no +)
- (S1 ^operator O1970 +)
- Retracting propose*predict-yes
- -->
- (O1969 ^name predict-yes +)
- (S1 ^operator O1969 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R988 ^value 1 +)
- (R1 ^reward R988 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1970 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1969 = 0.)
- =>WM: (13892: S1 ^operator O1972 +)
- =>WM: (13891: S1 ^operator O1971 +)
- =>WM: (13890: I3 ^dir R)
- =>WM: (13889: O1972 ^name predict-no)
- =>WM: (13888: O1971 ^name predict-yes)
- =>WM: (13887: R989 ^value 1)
- =>WM: (13886: R1 ^reward R989)
- <=WM: (13877: S1 ^operator O1969 +)
- <=WM: (13878: S1 ^operator O1970 +)
- <=WM: (13879: S1 ^operator O1970)
- <=WM: (13876: I3 ^dir U)
- <=WM: (13872: R1 ^reward R988)
- <=WM: (13875: O1970 ^name predict-no)
- <=WM: (13874: O1969 ^name predict-yes)
- <=WM: (13873: R988 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1971 = 0.2631756442840678)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1971 = 0.7368290791081045)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1972 = -0.07401383653737587)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1972 = 0.257246742345061)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1970 = 0.257246742345061)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1970 = -0.07401383653737587)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1969 = 0.7368290791081045)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1969 = 0.2631756442840678)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13893: S1 ^operator O1971)
- 986: O: O1971 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N986 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N985 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13894: I3 ^predict-yes N986)
- <=WM: (13881: N985 ^status complete)
- <=WM: (13880: I3 ^predict-no N985)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- |\--- Input Phase ---
- =>WM: (13898: I2 ^dir U)
- =>WM: (13897: I2 ^reward 1)
- =>WM: (13896: I2 ^see 1)
- =>WM: (13895: N986 ^status complete)
- <=WM: (13884: I2 ^dir R)
- <=WM: (13883: I2 ^reward 1)
- <=WM: (13882: I2 ^see 0)
- =>WM: (13899: I2 ^level-1 R1-root)
- <=WM: (13885: I2 ^level-1 L0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R990 ^value 1 +)
- (R1 ^reward R990 +)
- Firing propose*predict-yes
- -->
- (O1973 ^name predict-yes +)
- (S1 ^operator O1973 +)
- Firing propose*predict-no
- -->
- (O1974 ^name predict-no +)
- (S1 ^operator O1974 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1972 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1971 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1972 ^name predict-no +)
- (S1 ^operator O1972 +)
- Retracting propose*predict-yes
- -->
- (O1971 ^name predict-yes +)
- (S1 ^operator O1971 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R989 ^value 1 +)
- (R1 ^reward R989 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1972 = 0.257246742345061)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1972 = -0.07401383653737587)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1971 = 0.7368290791081045)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1971 = 0.2631756442840678)
- =>WM: (13907: S1 ^operator O1974 +)
- =>WM: (13906: S1 ^operator O1973 +)
- =>WM: (13905: I3 ^dir U)
- =>WM: (13904: O1974 ^name predict-no)
- =>WM: (13903: O1973 ^name predict-yes)
- =>WM: (13902: R990 ^value 1)
- =>WM: (13901: R1 ^reward R990)
- =>WM: (13900: I3 ^see 1)
- <=WM: (13891: S1 ^operator O1971 +)
- <=WM: (13893: S1 ^operator O1971)
- <=WM: (13892: S1 ^operator O1972 +)
- <=WM: (13890: I3 ^dir R)
- <=WM: (13886: R1 ^reward R989)
- <=WM: (13857: I3 ^see 0)
- <=WM: (13889: O1972 ^name predict-no)
- <=WM: (13888: O1971 ^name predict-yes)
- <=WM: (13887: R989 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1973 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1974 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1972 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1971 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*3 0.748236 -0.0114073 0.736829 -> 0.748236 -0.0114078 0.736828(R,m,v=1,0.895706,0.0939938)
- RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*35 0.251765 0.0114107 0.263176 -> 0.251765 0.0114102 0.263175(R,m,v=1,1,0)
- =>WM: (13908: S1 ^operator O1974)
- 987: O: O1974 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N987 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N986 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13909: I3 ^predict-no N987)
- <=WM: (13895: N986 ^status complete)
- <=WM: (13894: I3 ^predict-yes N986)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- -/|--- Input Phase ---
- =>WM: (13913: I2 ^dir R)
- =>WM: (13912: I2 ^reward 1)
- =>WM: (13911: I2 ^see 0)
- =>WM: (13910: N987 ^status complete)
- <=WM: (13898: I2 ^dir U)
- <=WM: (13897: I2 ^reward 1)
- <=WM: (13896: I2 ^see 1)
- =>WM: (13914: I2 ^level-1 R1-root)
- <=WM: (13899: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1973 = -0.3011268063455669)
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1974 = 0.7427523795546869)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R991 ^value 1 +)
- (R1 ^reward R991 +)
- Firing propose*predict-yes
- -->
- (O1975 ^name predict-yes +)
- (S1 ^operator O1975 +)
- Firing propose*predict-no
- -->
- (O1976 ^name predict-no +)
- (S1 ^operator O1976 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1974 = 0.257246742345061)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1973 = 0.7368283705992786)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1974 ^name predict-no +)
- (S1 ^operator O1974 +)
- Retracting propose*predict-yes
- -->
- (O1973 ^name predict-yes +)
- (S1 ^operator O1973 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R990 ^value 1 +)
- (R1 ^reward R990 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1974 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1973 = 0.)
- =>WM: (13922: S1 ^operator O1976 +)
- =>WM: (13921: S1 ^operator O1975 +)
- =>WM: (13920: I3 ^dir R)
- =>WM: (13919: O1976 ^name predict-no)
- =>WM: (13918: O1975 ^name predict-yes)
- =>WM: (13917: R991 ^value 1)
- =>WM: (13916: R1 ^reward R991)
- =>WM: (13915: I3 ^see 0)
- <=WM: (13906: S1 ^operator O1973 +)
- <=WM: (13907: S1 ^operator O1974 +)
- <=WM: (13908: S1 ^operator O1974)
- <=WM: (13905: I3 ^dir U)
- <=WM: (13901: R1 ^reward R990)
- <=WM: (13900: I3 ^see 1)
- <=WM: (13904: O1974 ^name predict-no)
- <=WM: (13903: O1973 ^name predict-yes)
- <=WM: (13902: R990 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1975 = -0.3011268063455669)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1975 = 0.7368283705992786)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1976 = 0.7427523795546869)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1976 = 0.257246742345061)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1974 = 0.257246742345061)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1974 = 0.7427523795546869)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1973 = 0.7368283705992786)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1973 = -0.3011268063455669)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13923: S1 ^operator O1976)
- 988: O: O1976 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N988 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N987 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13924: I3 ^predict-no N988)
- <=WM: (13910: N987 ^status complete)
- <=WM: (13909: I3 ^predict-no N987)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- \-/--- Input Phase ---
- =>WM: (13928: I2 ^dir R)
- =>WM: (13927: I2 ^reward 1)
- =>WM: (13926: I2 ^see 0)
- =>WM: (13925: N988 ^status complete)
- <=WM: (13913: I2 ^dir R)
- <=WM: (13912: I2 ^reward 1)
- <=WM: (13911: I2 ^see 0)
- =>WM: (13929: I2 ^level-1 R0-root)
- <=WM: (13914: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
- -->
- (S1 ^operator O1976 = 0.7427594337336832)
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
- -->
- (S1 ^operator O1975 = -0.1989581826229297)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R992 ^value 1 +)
- (R1 ^reward R992 +)
- Firing propose*predict-yes
- -->
- (O1977 ^name predict-yes +)
- (S1 ^operator O1977 +)
- Firing propose*predict-no
- -->
- (O1978 ^name predict-no +)
- (S1 ^operator O1978 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1976 = 0.257246742345061)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1975 = 0.7368283705992786)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1976 ^name predict-no +)
- (S1 ^operator O1976 +)
- Retracting propose*predict-yes
- -->
- (O1975 ^name predict-yes +)
- (S1 ^operator O1975 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R991 ^value 1 +)
- (R1 ^reward R991 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1976 = 0.257246742345061)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O1976 = 0.7427523795546869)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1975 = 0.7368283705992786)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O1975 = -0.3011268063455669)
- =>WM: (13935: S1 ^operator O1978 +)
- =>WM: (13934: S1 ^operator O1977 +)
- =>WM: (13933: O1978 ^name predict-no)
- =>WM: (13932: O1977 ^name predict-yes)
- =>WM: (13931: R992 ^value 1)
- =>WM: (13930: R1 ^reward R992)
- <=WM: (13921: S1 ^operator O1975 +)
- <=WM: (13922: S1 ^operator O1976 +)
- <=WM: (13923: S1 ^operator O1976)
- <=WM: (13916: R1 ^reward R991)
- <=WM: (13919: O1976 ^name predict-no)
- <=WM: (13918: O1975 ^name predict-yes)
- <=WM: (13917: R991 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1977 = 0.7368283705992786)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
- -->
- (S1 ^operator O1977 = -0.1989581826229297)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1978 = 0.257246742345061)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
- -->
- (S1 ^operator O1978 = 0.7427594337336832)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1976 = 0.257246742345061)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
- -->
- (S1 ^operator O1976 = 0.7427594337336832)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1975 = 0.7368283705992786)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
- -->
- (S1 ^operator O1975 = -0.1989581826229297)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*4 0.586137 -0.32889 0.257247 -> 0.586137 -0.32889 0.257247(R,m,v=1,0.858824,0.121963)
- RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*36 0.413863 0.32889 0.742752 -> 0.413863 0.32889 0.742753(R,m,v=1,1,0)
- =>WM: (13936: S1 ^operator O1978)
- 989: O: O1978 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N989 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N988 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13937: I3 ^predict-no N989)
- <=WM: (13925: N988 ^status complete)
- <=WM: (13924: I3 ^predict-no N988)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- |\---- Input Phase ---
- =>WM: (13941: I2 ^dir L)
- =>WM: (13940: I2 ^reward 1)
- =>WM: (13939: I2 ^see 0)
- =>WM: (13938: N989 ^status complete)
- <=WM: (13928: I2 ^dir R)
- <=WM: (13927: I2 ^reward 1)
- <=WM: (13926: I2 ^see 0)
- =>WM: (13942: I2 ^level-1 R0-root)
- <=WM: (13929: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1978 = 0.04178081990804111)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1977 = 0.5681115950019797)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R993 ^value 1 +)
- (R1 ^reward R993 +)
- Firing propose*predict-yes
- -->
- (O1979 ^name predict-yes +)
- (S1 ^operator O1979 +)
- Firing propose*predict-no
- -->
- (O1980 ^name predict-no +)
- (S1 ^operator O1980 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1978 = 0.3289463368854268)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1977 = 0.4318900358645197)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1978 ^name predict-no +)
- (S1 ^operator O1978 +)
- Retracting propose*predict-yes
- -->
- (O1977 ^name predict-yes +)
- (S1 ^operator O1977 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R992 ^value 1 +)
- (R1 ^reward R992 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
- -->
- (S1 ^operator O1978 = 0.7427594337336832)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1978 = 0.2572468740600988)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
- -->
- (S1 ^operator O1977 = -0.1989581826229297)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1977 = 0.7368283705992786)
- =>WM: (13949: S1 ^operator O1980 +)
- =>WM: (13948: S1 ^operator O1979 +)
- =>WM: (13947: I3 ^dir L)
- =>WM: (13946: O1980 ^name predict-no)
- =>WM: (13945: O1979 ^name predict-yes)
- =>WM: (13944: R993 ^value 1)
- =>WM: (13943: R1 ^reward R993)
- <=WM: (13934: S1 ^operator O1977 +)
- <=WM: (13935: S1 ^operator O1978 +)
- <=WM: (13936: S1 ^operator O1978)
- <=WM: (13920: I3 ^dir R)
- <=WM: (13930: R1 ^reward R992)
- <=WM: (13933: O1978 ^name predict-no)
- <=WM: (13932: O1977 ^name predict-yes)
- <=WM: (13931: R992 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1979 = 0.5681115950019797)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1979 = 0.4318900358645197)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1980 = 0.04178081990804111)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1980 = 0.3289463368854268)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1978 = 0.3289463368854268)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1978 = 0.04178081990804111)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1977 = 0.4318900358645197)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1977 = 0.5681115950019797)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*4 0.586137 -0.32889 0.257247 -> 0.586136 -0.32889 0.257246(R,m,v=1,0.859649,0.121362)
- RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.413868 0.328891 0.742759 -> 0.413868 0.328891 0.742758(R,m,v=1,1,0)
- =>WM: (13950: S1 ^operator O1979)
- 990: O: O1979 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N990 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N989 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13951: I3 ^predict-yes N990)
- <=WM: (13938: N989 ^status complete)
- <=WM: (13937: I3 ^predict-no N989)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- /|\--- Input Phase ---
- =>WM: (13955: I2 ^dir U)
- =>WM: (13954: I2 ^reward 1)
- =>WM: (13953: I2 ^see 1)
- =>WM: (13952: N990 ^status complete)
- <=WM: (13941: I2 ^dir L)
- <=WM: (13940: I2 ^reward 1)
- <=WM: (13939: I2 ^see 0)
- =>WM: (13956: I2 ^level-1 L1-root)
- <=WM: (13942: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R994 ^value 1 +)
- (R1 ^reward R994 +)
- Firing propose*predict-yes
- -->
- (O1981 ^name predict-yes +)
- (S1 ^operator O1981 +)
- Firing propose*predict-no
- -->
- (O1982 ^name predict-no +)
- (S1 ^operator O1982 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1980 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1979 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1980 ^name predict-no +)
- (S1 ^operator O1980 +)
- Retracting propose*predict-yes
- -->
- (O1979 ^name predict-yes +)
- (S1 ^operator O1979 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R993 ^value 1 +)
- (R1 ^reward R993 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1980 = 0.3289463368854268)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O1980 = 0.04178081990804111)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1979 = 0.4318900358645197)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O1979 = 0.5681115950019797)
- =>WM: (13964: S1 ^operator O1982 +)
- =>WM: (13963: S1 ^operator O1981 +)
- =>WM: (13962: I3 ^dir U)
- =>WM: (13961: O1982 ^name predict-no)
- =>WM: (13960: O1981 ^name predict-yes)
- =>WM: (13959: R994 ^value 1)
- =>WM: (13958: R1 ^reward R994)
- =>WM: (13957: I3 ^see 1)
- <=WM: (13948: S1 ^operator O1979 +)
- <=WM: (13950: S1 ^operator O1979)
- <=WM: (13949: S1 ^operator O1980 +)
- <=WM: (13947: I3 ^dir L)
- <=WM: (13943: R1 ^reward R993)
- <=WM: (13915: I3 ^see 0)
- <=WM: (13946: O1980 ^name predict-no)
- <=WM: (13945: O1979 ^name predict-yes)
- <=WM: (13944: R993 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1981 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1982 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1980 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1979 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*5 0.683776 -0.251886 0.43189 -> 0.683776 -0.251886 0.43189(R,m,v=1,0.922619,0.0718206)
- RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.316225 0.251886 0.568112 -> 0.316225 0.251886 0.568111(R,m,v=1,1,0)
- =>WM: (13965: S1 ^operator O1982)
- 991: O: O1982 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N991 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N990 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13966: I3 ^predict-no N991)
- <=WM: (13952: N990 ^status complete)
- <=WM: (13951: I3 ^predict-yes N990)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- ---- Input Phase ---
- =>WM: (13970: I2 ^dir R)
- =>WM: (13969: I2 ^reward 1)
- =>WM: (13968: I2 ^see 0)
- =>WM: (13967: N991 ^status complete)
- <=WM: (13955: I2 ^dir U)
- <=WM: (13954: I2 ^reward 1)
- <=WM: (13953: I2 ^see 1)
- =>WM: (13971: I2 ^level-1 L1-root)
- <=WM: (13956: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1982 = -0.1377248055371832)
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1981 = 0.2631685608814066)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R995 ^value 1 +)
- (R1 ^reward R995 +)
- Firing propose*predict-yes
- -->
- (O1983 ^name predict-yes +)
- (S1 ^operator O1983 +)
- Firing propose*predict-no
- -->
- (O1984 ^name predict-no +)
- (S1 ^operator O1984 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1982 = 0.2572459278910315)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1981 = 0.7368283705992786)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1982 ^name predict-no +)
- (S1 ^operator O1982 +)
- Retracting propose*predict-yes
- -->
- (O1981 ^name predict-yes +)
- (S1 ^operator O1981 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R994 ^value 1 +)
- (R1 ^reward R994 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1982 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1981 = 0.)
- =>WM: (13979: S1 ^operator O1984 +)
- =>WM: (13978: S1 ^operator O1983 +)
- =>WM: (13977: I3 ^dir R)
- =>WM: (13976: O1984 ^name predict-no)
- =>WM: (13975: O1983 ^name predict-yes)
- =>WM: (13974: R995 ^value 1)
- =>WM: (13973: R1 ^reward R995)
- =>WM: (13972: I3 ^see 0)
- <=WM: (13963: S1 ^operator O1981 +)
- <=WM: (13964: S1 ^operator O1982 +)
- <=WM: (13965: S1 ^operator O1982)
- <=WM: (13962: I3 ^dir U)
- <=WM: (13958: R1 ^reward R994)
- <=WM: (13957: I3 ^see 1)
- <=WM: (13961: O1982 ^name predict-no)
- <=WM: (13960: O1981 ^name predict-yes)
- <=WM: (13959: R994 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1983 = 0.2631685608814066)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1983 = 0.7368283705992786)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1984 = -0.1377248055371832)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1984 = 0.2572459278910315)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1982 = 0.2572459278910315)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1982 = -0.1377248055371832)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1981 = 0.7368283705992786)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1981 = 0.2631685608814066)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (13980: S1 ^operator O1983)
- 992: O: O1983 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N992 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N991 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13981: I3 ^predict-yes N992)
- <=WM: (13967: N991 ^status complete)
- <=WM: (13966: I3 ^predict-no N991)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- /|\--- Input Phase ---
- =>WM: (13985: I2 ^dir U)
- =>WM: (13984: I2 ^reward 1)
- =>WM: (13983: I2 ^see 1)
- =>WM: (13982: N992 ^status complete)
- <=WM: (13970: I2 ^dir R)
- <=WM: (13969: I2 ^reward 1)
- <=WM: (13968: I2 ^see 0)
- =>WM: (13986: I2 ^level-1 R1-root)
- <=WM: (13971: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R996 ^value 1 +)
- (R1 ^reward R996 +)
- Firing propose*predict-yes
- -->
- (O1985 ^name predict-yes +)
- (S1 ^operator O1985 +)
- Firing propose*predict-no
- -->
- (O1986 ^name predict-no +)
- (S1 ^operator O1986 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1984 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1983 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1984 ^name predict-no +)
- (S1 ^operator O1984 +)
- Retracting propose*predict-yes
- -->
- (O1983 ^name predict-yes +)
- (S1 ^operator O1983 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R995 ^value 1 +)
- (R1 ^reward R995 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1984 = 0.2572459278910315)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O1984 = -0.1377248055371832)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1983 = 0.7368283705992786)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O1983 = 0.2631685608814066)
- =>WM: (13994: S1 ^operator O1986 +)
- =>WM: (13993: S1 ^operator O1985 +)
- =>WM: (13992: I3 ^dir U)
- =>WM: (13991: O1986 ^name predict-no)
- =>WM: (13990: O1985 ^name predict-yes)
- =>WM: (13989: R996 ^value 1)
- =>WM: (13988: R1 ^reward R996)
- =>WM: (13987: I3 ^see 1)
- <=WM: (13978: S1 ^operator O1983 +)
- <=WM: (13980: S1 ^operator O1983)
- <=WM: (13979: S1 ^operator O1984 +)
- <=WM: (13977: I3 ^dir R)
- <=WM: (13973: R1 ^reward R995)
- <=WM: (13972: I3 ^see 0)
- <=WM: (13976: O1984 ^name predict-no)
- <=WM: (13975: O1983 ^name predict-yes)
- <=WM: (13974: R995 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1985 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1986 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1984 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1983 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*3 0.748236 -0.0114078 0.736828 -> 0.748236 -0.0114074 0.736829(R,m,v=1,0.896341,0.0934835)
- RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*40 0.251763 0.0114055 0.263169 -> 0.251763 0.0114059 0.263169(R,m,v=1,1,0)
- =>WM: (13995: S1 ^operator O1986)
- 993: O: O1986 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N993 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N992 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (13996: I3 ^predict-no N993)
- <=WM: (13982: N992 ^status complete)
- <=WM: (13981: I3 ^predict-yes N992)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- -/--- Input Phase ---
- =>WM: (14000: I2 ^dir L)
- =>WM: (13999: I2 ^reward 1)
- =>WM: (13998: I2 ^see 0)
- =>WM: (13997: N993 ^status complete)
- <=WM: (13985: I2 ^dir U)
- <=WM: (13984: I2 ^reward 1)
- <=WM: (13983: I2 ^see 1)
- =>WM: (14001: I2 ^level-1 R1-root)
- <=WM: (13986: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1985 = 0.5681057054973254)
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1986 = -0.1549421060161498)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R997 ^value 1 +)
- (R1 ^reward R997 +)
- Firing propose*predict-yes
- -->
- (O1987 ^name predict-yes +)
- (S1 ^operator O1987 +)
- Firing propose*predict-no
- -->
- (O1988 ^name predict-no +)
- (S1 ^operator O1988 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1986 = 0.3289463368854268)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1985 = 0.4318897912345449)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1986 ^name predict-no +)
- (S1 ^operator O1986 +)
- Retracting propose*predict-yes
- -->
- (O1985 ^name predict-yes +)
- (S1 ^operator O1985 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R996 ^value 1 +)
- (R1 ^reward R996 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1986 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1985 = 0.)
- =>WM: (14009: S1 ^operator O1988 +)
- =>WM: (14008: S1 ^operator O1987 +)
- =>WM: (14007: I3 ^dir L)
- =>WM: (14006: O1988 ^name predict-no)
- =>WM: (14005: O1987 ^name predict-yes)
- =>WM: (14004: R997 ^value 1)
- =>WM: (14003: R1 ^reward R997)
- =>WM: (14002: I3 ^see 0)
- <=WM: (13993: S1 ^operator O1985 +)
- <=WM: (13994: S1 ^operator O1986 +)
- <=WM: (13995: S1 ^operator O1986)
- <=WM: (13992: I3 ^dir U)
- <=WM: (13988: R1 ^reward R996)
- <=WM: (13987: I3 ^see 1)
- <=WM: (13991: O1986 ^name predict-no)
- <=WM: (13990: O1985 ^name predict-yes)
- <=WM: (13989: R996 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1987 = 0.5681057054973254)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1987 = 0.4318897912345449)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1988 = -0.1549421060161498)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1988 = 0.3289463368854268)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1986 = 0.3289463368854268)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1986 = -0.1549421060161498)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1985 = 0.4318897912345449)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1985 = 0.5681057054973254)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (14010: S1 ^operator O1987)
- 994: O: O1987 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N994 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N993 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14011: I3 ^predict-yes N994)
- <=WM: (13997: N993 ^status complete)
- <=WM: (13996: I3 ^predict-no N993)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- |\---- Input Phase ---
- =>WM: (14015: I2 ^dir L)
- =>WM: (14014: I2 ^reward 1)
- =>WM: (14013: I2 ^see 1)
- =>WM: (14012: N994 ^status complete)
- <=WM: (14000: I2 ^dir L)
- <=WM: (13999: I2 ^reward 1)
- <=WM: (13998: I2 ^see 0)
- =>WM: (14016: I2 ^level-1 L1-root)
- <=WM: (14001: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1988 = 0.6710523655015633)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1987 = -0.06092862110810815)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R998 ^value 1 +)
- (R1 ^reward R998 +)
- Firing propose*predict-yes
- -->
- (O1989 ^name predict-yes +)
- (S1 ^operator O1989 +)
- Firing propose*predict-no
- -->
- (O1990 ^name predict-no +)
- (S1 ^operator O1990 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1988 = 0.3289463368854268)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1987 = 0.4318897912345449)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1988 ^name predict-no +)
- (S1 ^operator O1988 +)
- Retracting propose*predict-yes
- -->
- (O1987 ^name predict-yes +)
- (S1 ^operator O1987 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R997 ^value 1 +)
- (R1 ^reward R997 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1988 = 0.3289463368854268)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O1988 = -0.1549421060161498)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1987 = 0.4318897912345449)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O1987 = 0.5681057054973254)
- =>WM: (14023: S1 ^operator O1990 +)
- =>WM: (14022: S1 ^operator O1989 +)
- =>WM: (14021: O1990 ^name predict-no)
- =>WM: (14020: O1989 ^name predict-yes)
- =>WM: (14019: R998 ^value 1)
- =>WM: (14018: R1 ^reward R998)
- =>WM: (14017: I3 ^see 1)
- <=WM: (14008: S1 ^operator O1987 +)
- <=WM: (14010: S1 ^operator O1987)
- <=WM: (14009: S1 ^operator O1988 +)
- <=WM: (14003: R1 ^reward R997)
- <=WM: (14002: I3 ^see 0)
- <=WM: (14006: O1988 ^name predict-no)
- <=WM: (14005: O1987 ^name predict-yes)
- <=WM: (14004: R997 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1989 = 0.4318897912345449)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1989 = -0.06092862110810815)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1990 = 0.3289463368854268)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1990 = 0.6710523655015633)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1988 = 0.3289463368854268)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1988 = 0.6710523655015633)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1987 = 0.4318897912345449)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1987 = -0.06092862110810815)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*5 0.683776 -0.251886 0.43189 -> 0.683777 -0.251886 0.43189(R,m,v=1,0.923077,0.0714286)
- RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*45 0.31622 0.251886 0.568106 -> 0.31622 0.251886 0.568106(R,m,v=1,1,0)
- =>WM: (14024: S1 ^operator O1990)
- 995: O: O1990 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N995 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N994 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14025: I3 ^predict-no N995)
- <=WM: (14012: N994 ^status complete)
- <=WM: (14011: I3 ^predict-yes N994)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- /|\--- Input Phase ---
- =>WM: (14029: I2 ^dir L)
- =>WM: (14028: I2 ^reward 1)
- =>WM: (14027: I2 ^see 0)
- =>WM: (14026: N995 ^status complete)
- <=WM: (14015: I2 ^dir L)
- <=WM: (14014: I2 ^reward 1)
- <=WM: (14013: I2 ^see 1)
- =>WM: (14030: I2 ^level-1 L0-root)
- <=WM: (14016: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*33
- -->
- (S1 ^operator O1990 = 0.6710552574919724)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
- -->
- (S1 ^operator O1989 = 0.02602968095631553)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R999 ^value 1 +)
- (R1 ^reward R999 +)
- Firing propose*predict-yes
- -->
- (O1991 ^name predict-yes +)
- (S1 ^operator O1991 +)
- Firing propose*predict-no
- -->
- (O1992 ^name predict-no +)
- (S1 ^operator O1992 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1990 = 0.3289463368854268)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1989 = 0.4318904667247643)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O1990 ^name predict-no +)
- (S1 ^operator O1990 +)
- Retracting propose*predict-yes
- -->
- (O1989 ^name predict-yes +)
- (S1 ^operator O1989 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R998 ^value 1 +)
- (R1 ^reward R998 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*43
- -->
- (S1 ^operator O1990 = 0.6710523655015633)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1990 = 0.3289463368854268)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
- -->
- (S1 ^operator O1989 = -0.06092862110810815)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1989 = 0.4318904667247643)
- =>WM: (14037: S1 ^operator O1992 +)
- =>WM: (14036: S1 ^operator O1991 +)
- =>WM: (14035: O1992 ^name predict-no)
- =>WM: (14034: O1991 ^name predict-yes)
- =>WM: (14033: R999 ^value 1)
- =>WM: (14032: R1 ^reward R999)
- =>WM: (14031: I3 ^see 0)
- <=WM: (14022: S1 ^operator O1989 +)
- <=WM: (14023: S1 ^operator O1990 +)
- <=WM: (14024: S1 ^operator O1990)
- <=WM: (14018: R1 ^reward R998)
- <=WM: (14017: I3 ^see 1)
- <=WM: (14021: O1990 ^name predict-no)
- <=WM: (14020: O1989 ^name predict-yes)
- <=WM: (14019: R998 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1991 = 0.4318904667247643)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
- -->
- (S1 ^operator O1991 = 0.02602968095631553)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1992 = 0.3289463368854268)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*33
- -->
- (S1 ^operator O1992 = 0.6710552574919724)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1990 = 0.3289463368854268)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*33
- -->
- (S1 ^operator O1990 = 0.6710552574919724)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1989 = 0.4318904667247643)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
- -->
- (S1 ^operator O1989 = 0.02602968095631553)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*6 0.565404 -0.236458 0.328946 -> 0.565404 -0.236458 0.328947(R,m,v=1,0.905063,0.086471)
- RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*43 0.434593 0.236459 0.671052 -> 0.434594 0.236459 0.671053(R,m,v=1,1,0)
- =>WM: (14038: S1 ^operator O1992)
- 996: O: O1992 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N996 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N995 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14039: I3 ^predict-no N996)
- <=WM: (14026: N995 ^status complete)
- <=WM: (14025: I3 ^predict-no N995)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- -/|--- Input Phase ---
- =>WM: (14043: I2 ^dir L)
- =>WM: (14042: I2 ^reward 1)
- =>WM: (14041: I2 ^see 0)
- =>WM: (14040: N996 ^status complete)
- <=WM: (14029: I2 ^dir L)
- <=WM: (14028: I2 ^reward 1)
- <=WM: (14027: I2 ^see 0)
- =>WM: (14044: I2 ^level-1 L0-root)
- <=WM: (14030: I2 ^level-1 L0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*33
- -->
- (S1 ^operator O1992 = 0.6710552574919724)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
- -->
- (S1 ^operator O1991 = 0.02602968095631553)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1000 ^value 1 +)
- (R1 ^reward R1000 +)
- Firing propose*predict-yes
- -->
- (O1993 ^name predict-yes +)
- (S1 ^operator O1993 +)
- Firing propose*predict-no
- -->
- (O1994 ^name predict-no +)
- (S1 ^operator O1994 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1992 = 0.3289465315273784)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1991 = 0.4318904667247643)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1992 ^name predict-no +)
- (S1 ^operator O1992 +)
- Retracting propose*predict-yes
- -->
- (O1991 ^name predict-yes +)
- (S1 ^operator O1991 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R999 ^value 1 +)
- (R1 ^reward R999 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*33
- -->
- (S1 ^operator O1992 = 0.6710552574919724)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1992 = 0.3289465315273784)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
- -->
- (S1 ^operator O1991 = 0.02602968095631553)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1991 = 0.4318904667247643)
- =>WM: (14050: S1 ^operator O1994 +)
- =>WM: (14049: S1 ^operator O1993 +)
- =>WM: (14048: O1994 ^name predict-no)
- =>WM: (14047: O1993 ^name predict-yes)
- =>WM: (14046: R1000 ^value 1)
- =>WM: (14045: R1 ^reward R1000)
- <=WM: (14036: S1 ^operator O1991 +)
- <=WM: (14037: S1 ^operator O1992 +)
- <=WM: (14038: S1 ^operator O1992)
- <=WM: (14032: R1 ^reward R999)
- <=WM: (14035: O1992 ^name predict-no)
- <=WM: (14034: O1991 ^name predict-yes)
- <=WM: (14033: R999 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1993 = 0.4318904667247643)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
- -->
- (S1 ^operator O1993 = 0.02602968095631553)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1994 = 0.3289465315273784)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*33
- -->
- (S1 ^operator O1994 = 0.6710552574919724)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1992 = 0.3289465315273784)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*33
- -->
- (S1 ^operator O1992 = 0.6710552574919724)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1991 = 0.4318904667247643)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
- -->
- (S1 ^operator O1991 = 0.02602968095631553)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*6 0.565404 -0.236458 0.328947 -> 0.565404 -0.236458 0.328946(R,m,v=1,0.90566,0.0859804)
- RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*33 0.434599 0.236456 0.671055 -> 0.434599 0.236456 0.671055(R,m,v=1,1,0)
- =>WM: (14051: S1 ^operator O1994)
- 997: O: O1994 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N997 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N996 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14052: I3 ^predict-no N997)
- <=WM: (14040: N996 ^status complete)
- <=WM: (14039: I3 ^predict-no N996)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction L in state State-A
- In State-A moving L
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- \---- Input Phase ---
- =>WM: (14056: I2 ^dir U)
- =>WM: (14055: I2 ^reward 1)
- =>WM: (14054: I2 ^see 0)
- =>WM: (14053: N997 ^status complete)
- <=WM: (14043: I2 ^dir L)
- <=WM: (14042: I2 ^reward 1)
- <=WM: (14041: I2 ^see 0)
- =>WM: (14057: I2 ^level-1 L0-root)
- <=WM: (14044: I2 ^level-1 L0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1001 ^value 1 +)
- (R1 ^reward R1001 +)
- Firing propose*predict-yes
- -->
- (O1995 ^name predict-yes +)
- (S1 ^operator O1995 +)
- Firing propose*predict-no
- -->
- (O1996 ^name predict-no +)
- (S1 ^operator O1996 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1994 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1993 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1994 ^name predict-no +)
- (S1 ^operator O1994 +)
- Retracting propose*predict-yes
- -->
- (O1993 ^name predict-yes +)
- (S1 ^operator O1993 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1000 ^value 1 +)
- (R1 ^reward R1000 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*33
- -->
- (S1 ^operator O1994 = 0.6710549891390698)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O1994 = 0.3289462631744757)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
- -->
- (S1 ^operator O1993 = 0.02602968095631553)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O1993 = 0.4318904667247643)
- =>WM: (14064: S1 ^operator O1996 +)
- =>WM: (14063: S1 ^operator O1995 +)
- =>WM: (14062: I3 ^dir U)
- =>WM: (14061: O1996 ^name predict-no)
- =>WM: (14060: O1995 ^name predict-yes)
- =>WM: (14059: R1001 ^value 1)
- =>WM: (14058: R1 ^reward R1001)
- <=WM: (14049: S1 ^operator O1993 +)
- <=WM: (14050: S1 ^operator O1994 +)
- <=WM: (14051: S1 ^operator O1994)
- <=WM: (14007: I3 ^dir L)
- <=WM: (14045: R1 ^reward R1000)
- <=WM: (14048: O1994 ^name predict-no)
- <=WM: (14047: O1993 ^name predict-yes)
- <=WM: (14046: R1000 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1995 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1996 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1994 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1993 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*6 0.565404 -0.236458 0.328946 -> 0.565404 -0.236457 0.328946(R,m,v=1,0.90625,0.0854953)
- RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*33 0.434599 0.236456 0.671055 -> 0.434598 0.236457 0.671055(R,m,v=1,1,0)
- =>WM: (14065: S1 ^operator O1996)
- 998: O: O1996 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N998 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N997 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14066: I3 ^predict-no N998)
- <=WM: (14053: N997 ^status complete)
- <=WM: (14052: I3 ^predict-no N997)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- /|\--- Input Phase ---
- =>WM: (14070: I2 ^dir U)
- =>WM: (14069: I2 ^reward 1)
- =>WM: (14068: I2 ^see 0)
- =>WM: (14067: N998 ^status complete)
- <=WM: (14056: I2 ^dir U)
- <=WM: (14055: I2 ^reward 1)
- <=WM: (14054: I2 ^see 0)
- =>WM: (14071: I2 ^level-1 L0-root)
- <=WM: (14057: I2 ^level-1 L0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1002 ^value 1 +)
- (R1 ^reward R1002 +)
- Firing propose*predict-yes
- -->
- (O1997 ^name predict-yes +)
- (S1 ^operator O1997 +)
- Firing propose*predict-no
- -->
- (O1998 ^name predict-no +)
- (S1 ^operator O1998 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1996 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1995 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1996 ^name predict-no +)
- (S1 ^operator O1996 +)
- Retracting propose*predict-yes
- -->
- (O1995 ^name predict-yes +)
- (S1 ^operator O1995 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1001 ^value 1 +)
- (R1 ^reward R1001 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1996 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1995 = 0.)
- =>WM: (14077: S1 ^operator O1998 +)
- =>WM: (14076: S1 ^operator O1997 +)
- =>WM: (14075: O1998 ^name predict-no)
- =>WM: (14074: O1997 ^name predict-yes)
- =>WM: (14073: R1002 ^value 1)
- =>WM: (14072: R1 ^reward R1002)
- <=WM: (14063: S1 ^operator O1995 +)
- <=WM: (14064: S1 ^operator O1996 +)
- <=WM: (14065: S1 ^operator O1996)
- <=WM: (14058: R1 ^reward R1001)
- <=WM: (14061: O1996 ^name predict-no)
- <=WM: (14060: O1995 ^name predict-yes)
- <=WM: (14059: R1001 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1997 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1998 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1996 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1995 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (14078: S1 ^operator O1998)
- 999: O: O1998 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N999 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N998 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14079: I3 ^predict-no N999)
- <=WM: (14067: N998 ^status complete)
- <=WM: (14066: I3 ^predict-no N998)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-A
- In State-A moving U
- ENV: (next state, see, prediction correct?) = (State-A, 0, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- ---- Input Phase ---
- =>WM: (14083: I2 ^dir R)
- =>WM: (14082: I2 ^reward 1)
- =>WM: (14081: I2 ^see 0)
- =>WM: (14080: N999 ^status complete)
- <=WM: (14070: I2 ^dir U)
- <=WM: (14069: I2 ^reward 1)
- <=WM: (14068: I2 ^see 0)
- =>WM: (14084: I2 ^level-1 L0-root)
- <=WM: (14071: I2 ^level-1 L0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1998 = -0.07401383653737587)
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1997 = 0.263174935775242)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1003 ^value 1 +)
- (R1 ^reward R1003 +)
- Firing propose*predict-yes
- -->
- (O1999 ^name predict-yes +)
- (S1 ^operator O1999 +)
- Firing propose*predict-no
- -->
- (O2000 ^name predict-no +)
- (S1 ^operator O2000 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1998 = 0.2572459278910315)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1997 = 0.7368288308771758)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O1998 ^name predict-no +)
- (S1 ^operator O1998 +)
- Retracting propose*predict-yes
- -->
- (O1997 ^name predict-yes +)
- (S1 ^operator O1997 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1002 ^value 1 +)
- (R1 ^reward R1002 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O1998 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1997 = 0.)
- =>WM: (14091: S1 ^operator O2000 +)
- =>WM: (14090: S1 ^operator O1999 +)
- =>WM: (14089: I3 ^dir R)
- =>WM: (14088: O2000 ^name predict-no)
- =>WM: (14087: O1999 ^name predict-yes)
- =>WM: (14086: R1003 ^value 1)
- =>WM: (14085: R1 ^reward R1003)
- <=WM: (14076: S1 ^operator O1997 +)
- <=WM: (14077: S1 ^operator O1998 +)
- <=WM: (14078: S1 ^operator O1998)
- <=WM: (14062: I3 ^dir U)
- <=WM: (14072: R1 ^reward R1002)
- <=WM: (14075: O1998 ^name predict-no)
- <=WM: (14074: O1997 ^name predict-yes)
- <=WM: (14073: R1002 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1999 = 0.263174935775242)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1999 = 0.7368288308771758)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O2000 = -0.07401383653737587)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O2000 = 0.2572459278910315)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O1998 = 0.2572459278910315)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O1998 = -0.07401383653737587)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1997 = 0.7368288308771758)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1997 = 0.263174935775242)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (14092: S1 ^operator O1999)
- 1000: O: O1999 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N1000 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N999 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14093: I3 ^predict-yes N1000)
- <=WM: (14080: N999 ^status complete)
- <=WM: (14079: I3 ^predict-no N999)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- /|\-/|\-/|\--- Input Phase ---
- =>WM: (14097: I2 ^dir U)
- =>WM: (14096: I2 ^reward 1)
- =>WM: (14095: I2 ^see 1)
- =>WM: (14094: N1000 ^status complete)
- <=WM: (14083: I2 ^dir R)
- <=WM: (14082: I2 ^reward 1)
- <=WM: (14081: I2 ^see 0)
- =>WM: (14098: I2 ^level-1 R1-root)
- <=WM: (14084: I2 ^level-1 L0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1004 ^value 1 +)
- (R1 ^reward R1004 +)
- Firing propose*predict-yes
- -->
- (O2001 ^name predict-yes +)
- (S1 ^operator O2001 +)
- Firing propose*predict-no
- -->
- (O2002 ^name predict-no +)
- (S1 ^operator O2002 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2000 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1999 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O2000 ^name predict-no +)
- (S1 ^operator O2000 +)
- Retracting propose*predict-yes
- -->
- (O1999 ^name predict-yes +)
- (S1 ^operator O1999 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1003 ^value 1 +)
- (R1 ^reward R1003 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O2000 = 0.2572459278910315)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*34
- -->
- (S1 ^operator O2000 = -0.07401383653737587)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O1999 = 0.7368288308771758)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*35
- -->
- (S1 ^operator O1999 = 0.263174935775242)
- =>WM: (14106: S1 ^operator O2002 +)
- =>WM: (14105: S1 ^operator O2001 +)
- =>WM: (14104: I3 ^dir U)
- =>WM: (14103: O2002 ^name predict-no)
- =>WM: (14102: O2001 ^name predict-yes)
- =>WM: (14101: R1004 ^value 1)
- =>WM: (14100: R1 ^reward R1004)
- =>WM: (14099: I3 ^see 1)
- <=WM: (14090: S1 ^operator O1999 +)
- <=WM: (14092: S1 ^operator O1999)
- <=WM: (14091: S1 ^operator O2000 +)
- <=WM: (14089: I3 ^dir R)
- <=WM: (14085: R1 ^reward R1003)
- <=WM: (14031: I3 ^see 0)
- <=WM: (14088: O2000 ^name predict-no)
- <=WM: (14087: O1999 ^name predict-yes)
- <=WM: (14086: R1003 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2001 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2002 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2000 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O1999 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*3 0.748236 -0.0114074 0.736829 -> 0.748236 -0.0114079 0.736828(R,m,v=1,0.89697,0.0929786)
- RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*35 0.251765 0.0114102 0.263175 -> 0.251765 0.0114098 0.263174(R,m,v=1,1,0)
- =>WM: (14107: S1 ^operator O2002)
- 1001: O: O2002 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N1001 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N1000 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14108: I3 ^predict-no N1001)
- <=WM: (14094: N1000 ^status complete)
- <=WM: (14093: I3 ^predict-yes N1000)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- ---- Input Phase ---
- =>WM: (14112: I2 ^dir U)
- =>WM: (14111: I2 ^reward 1)
- =>WM: (14110: I2 ^see 0)
- =>WM: (14109: N1001 ^status complete)
- <=WM: (14097: I2 ^dir U)
- <=WM: (14096: I2 ^reward 1)
- <=WM: (14095: I2 ^see 1)
- =>WM: (14113: I2 ^level-1 R1-root)
- <=WM: (14098: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1005 ^value 1 +)
- (R1 ^reward R1005 +)
- Firing propose*predict-yes
- -->
- (O2003 ^name predict-yes +)
- (S1 ^operator O2003 +)
- Firing propose*predict-no
- -->
- (O2004 ^name predict-no +)
- (S1 ^operator O2004 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2002 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2001 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O2002 ^name predict-no +)
- (S1 ^operator O2002 +)
- Retracting propose*predict-yes
- -->
- (O2001 ^name predict-yes +)
- (S1 ^operator O2001 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1004 ^value 1 +)
- (R1 ^reward R1004 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2002 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2001 = 0.)
- =>WM: (14120: S1 ^operator O2004 +)
- =>WM: (14119: S1 ^operator O2003 +)
- =>WM: (14118: O2004 ^name predict-no)
- =>WM: (14117: O2003 ^name predict-yes)
- =>WM: (14116: R1005 ^value 1)
- =>WM: (14115: R1 ^reward R1005)
- =>WM: (14114: I3 ^see 0)
- <=WM: (14105: S1 ^operator O2001 +)
- <=WM: (14106: S1 ^operator O2002 +)
- <=WM: (14107: S1 ^operator O2002)
- <=WM: (14100: R1 ^reward R1004)
- <=WM: (14099: I3 ^see 1)
- <=WM: (14103: O2002 ^name predict-no)
- <=WM: (14102: O2001 ^name predict-yes)
- <=WM: (14101: R1004 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2003 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2004 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2002 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2001 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (14121: S1 ^operator O2004)
- 1002: O: O2004 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N1002 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N1001 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14122: I3 ^predict-no N1002)
- <=WM: (14109: N1001 ^status complete)
- <=WM: (14108: I3 ^predict-no N1001)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- /|\--- Input Phase ---
- =>WM: (14126: I2 ^dir U)
- =>WM: (14125: I2 ^reward 1)
- =>WM: (14124: I2 ^see 0)
- =>WM: (14123: N1002 ^status complete)
- <=WM: (14112: I2 ^dir U)
- <=WM: (14111: I2 ^reward 1)
- <=WM: (14110: I2 ^see 0)
- =>WM: (14127: I2 ^level-1 R1-root)
- <=WM: (14113: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1006 ^value 1 +)
- (R1 ^reward R1006 +)
- Firing propose*predict-yes
- -->
- (O2005 ^name predict-yes +)
- (S1 ^operator O2005 +)
- Firing propose*predict-no
- -->
- (O2006 ^name predict-no +)
- (S1 ^operator O2006 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2004 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2003 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O2004 ^name predict-no +)
- (S1 ^operator O2004 +)
- Retracting propose*predict-yes
- -->
- (O2003 ^name predict-yes +)
- (S1 ^operator O2003 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1005 ^value 1 +)
- (R1 ^reward R1005 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2004 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2003 = 0.)
- =>WM: (14133: S1 ^operator O2006 +)
- =>WM: (14132: S1 ^operator O2005 +)
- =>WM: (14131: O2006 ^name predict-no)
- =>WM: (14130: O2005 ^name predict-yes)
- =>WM: (14129: R1006 ^value 1)
- =>WM: (14128: R1 ^reward R1006)
- <=WM: (14119: S1 ^operator O2003 +)
- <=WM: (14120: S1 ^operator O2004 +)
- <=WM: (14121: S1 ^operator O2004)
- <=WM: (14115: R1 ^reward R1005)
- <=WM: (14118: O2004 ^name predict-no)
- <=WM: (14117: O2003 ^name predict-yes)
- <=WM: (14116: R1005 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2005 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2006 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2004 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2003 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (14134: S1 ^operator O2006)
- 1003: O: O2006 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N1003 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N1002 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14135: I3 ^predict-no N1003)
- <=WM: (14123: N1002 ^status complete)
- <=WM: (14122: I3 ^predict-no N1002)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- -/|--- Input Phase ---
- =>WM: (14139: I2 ^dir U)
- =>WM: (14138: I2 ^reward 1)
- =>WM: (14137: I2 ^see 0)
- =>WM: (14136: N1003 ^status complete)
- <=WM: (14126: I2 ^dir U)
- <=WM: (14125: I2 ^reward 1)
- <=WM: (14124: I2 ^see 0)
- =>WM: (14140: I2 ^level-1 R1-root)
- <=WM: (14127: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1007 ^value 1 +)
- (R1 ^reward R1007 +)
- Firing propose*predict-yes
- -->
- (O2007 ^name predict-yes +)
- (S1 ^operator O2007 +)
- Firing propose*predict-no
- -->
- (O2008 ^name predict-no +)
- (S1 ^operator O2008 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2006 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2005 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O2006 ^name predict-no +)
- (S1 ^operator O2006 +)
- Retracting propose*predict-yes
- -->
- (O2005 ^name predict-yes +)
- (S1 ^operator O2005 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1006 ^value 1 +)
- (R1 ^reward R1006 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2006 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2005 = 0.)
- =>WM: (14146: S1 ^operator O2008 +)
- =>WM: (14145: S1 ^operator O2007 +)
- =>WM: (14144: O2008 ^name predict-no)
- =>WM: (14143: O2007 ^name predict-yes)
- =>WM: (14142: R1007 ^value 1)
- =>WM: (14141: R1 ^reward R1007)
- <=WM: (14132: S1 ^operator O2005 +)
- <=WM: (14133: S1 ^operator O2006 +)
- <=WM: (14134: S1 ^operator O2006)
- <=WM: (14128: R1 ^reward R1006)
- <=WM: (14131: O2006 ^name predict-no)
- <=WM: (14130: O2005 ^name predict-yes)
- <=WM: (14129: R1006 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2007 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2008 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2006 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2005 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (14147: S1 ^operator O2008)
- 1004: O: O2008 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N1004 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N1003 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14148: I3 ^predict-no N1004)
- <=WM: (14136: N1003 ^status complete)
- <=WM: (14135: I3 ^predict-no N1003)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- \---- Input Phase ---
- =>WM: (14152: I2 ^dir L)
- =>WM: (14151: I2 ^reward 1)
- =>WM: (14150: I2 ^see 0)
- =>WM: (14149: N1004 ^status complete)
- <=WM: (14139: I2 ^dir U)
- <=WM: (14138: I2 ^reward 1)
- <=WM: (14137: I2 ^see 0)
- =>WM: (14153: I2 ^level-1 R1-root)
- <=WM: (14140: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O2007 = 0.5681063809875448)
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O2008 = -0.1549421060161498)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1008 ^value 1 +)
- (R1 ^reward R1008 +)
- Firing propose*predict-yes
- -->
- (O2009 ^name predict-yes +)
- (S1 ^operator O2009 +)
- Firing propose*predict-no
- -->
- (O2010 ^name predict-no +)
- (S1 ^operator O2010 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O2008 = 0.3289460753274439)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O2007 = 0.4318904667247643)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O2008 ^name predict-no +)
- (S1 ^operator O2008 +)
- Retracting propose*predict-yes
- -->
- (O2007 ^name predict-yes +)
- (S1 ^operator O2007 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1007 ^value 1 +)
- (R1 ^reward R1007 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2008 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2007 = 0.)
- =>WM: (14160: S1 ^operator O2010 +)
- =>WM: (14159: S1 ^operator O2009 +)
- =>WM: (14158: I3 ^dir L)
- =>WM: (14157: O2010 ^name predict-no)
- =>WM: (14156: O2009 ^name predict-yes)
- =>WM: (14155: R1008 ^value 1)
- =>WM: (14154: R1 ^reward R1008)
- <=WM: (14145: S1 ^operator O2007 +)
- <=WM: (14146: S1 ^operator O2008 +)
- <=WM: (14147: S1 ^operator O2008)
- <=WM: (14104: I3 ^dir U)
- <=WM: (14141: R1 ^reward R1007)
- <=WM: (14144: O2008 ^name predict-no)
- <=WM: (14143: O2007 ^name predict-yes)
- <=WM: (14142: R1007 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O2009 = 0.5681063809875448)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O2009 = 0.4318904667247643)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O2010 = -0.1549421060161498)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O2010 = 0.3289460753274439)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O2008 = 0.3289460753274439)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O2008 = -0.1549421060161498)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O2007 = 0.4318904667247643)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O2007 = 0.5681063809875448)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (14161: S1 ^operator O2009)
- 1005: O: O2009 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N1005 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N1004 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14162: I3 ^predict-yes N1005)
- <=WM: (14149: N1004 ^status complete)
- <=WM: (14148: I3 ^predict-no N1004)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction L in state State-B
- In State-B moving L
- ENV: (next state, see, prediction correct?) = (State-A, 1, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- /--- Input Phase ---
- =>WM: (14166: I2 ^dir R)
- =>WM: (14165: I2 ^reward 1)
- =>WM: (14164: I2 ^see 1)
- =>WM: (14163: N1005 ^status complete)
- <=WM: (14152: I2 ^dir L)
- <=WM: (14151: I2 ^reward 1)
- <=WM: (14150: I2 ^see 0)
- =>WM: (14167: I2 ^level-1 L1-root)
- <=WM: (14153: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O2010 = -0.1377248055371832)
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O2009 = 0.2631690211593038)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1009 ^value 1 +)
- (R1 ^reward R1009 +)
- Firing propose*predict-yes
- -->
- (O2011 ^name predict-yes +)
- (S1 ^operator O2011 +)
- Firing propose*predict-no
- -->
- (O2012 ^name predict-no +)
- (S1 ^operator O2012 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O2010 = 0.2572459278910315)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O2009 = 0.7368282658793132)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O2010 ^name predict-no +)
- (S1 ^operator O2010 +)
- Retracting propose*predict-yes
- -->
- (O2009 ^name predict-yes +)
- (S1 ^operator O2009 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1008 ^value 1 +)
- (R1 ^reward R1008 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O2010 = 0.3289460753274439)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*44
- -->
- (S1 ^operator O2010 = -0.1549421060161498)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O2009 = 0.4318904667247643)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*45
- -->
- (S1 ^operator O2009 = 0.5681063809875448)
- =>WM: (14175: S1 ^operator O2012 +)
- =>WM: (14174: S1 ^operator O2011 +)
- =>WM: (14173: I3 ^dir R)
- =>WM: (14172: O2012 ^name predict-no)
- =>WM: (14171: O2011 ^name predict-yes)
- =>WM: (14170: R1009 ^value 1)
- =>WM: (14169: R1 ^reward R1009)
- =>WM: (14168: I3 ^see 1)
- <=WM: (14159: S1 ^operator O2009 +)
- <=WM: (14161: S1 ^operator O2009)
- <=WM: (14160: S1 ^operator O2010 +)
- <=WM: (14158: I3 ^dir L)
- <=WM: (14154: R1 ^reward R1008)
- <=WM: (14114: I3 ^see 0)
- <=WM: (14157: O2010 ^name predict-no)
- <=WM: (14156: O2009 ^name predict-yes)
- <=WM: (14155: R1008 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O2011 = 0.7368282658793132)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O2011 = 0.2631690211593038)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O2012 = 0.2572459278910315)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O2012 = -0.1377248055371832)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O2010 = 0.2572459278910315)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O2010 = -0.1377248055371832)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O2009 = 0.7368282658793132)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O2009 = 0.2631690211593038)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*5 0.683777 -0.251886 0.43189 -> 0.683777 -0.251886 0.431891(R,m,v=1,0.923529,0.0710407)
- RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*45 0.31622 0.251886 0.568106 -> 0.316221 0.251886 0.568107(R,m,v=1,1,0)
- =>WM: (14176: S1 ^operator O2011)
- 1006: O: O2011 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N1006 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N1005 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14177: I3 ^predict-yes N1006)
- <=WM: (14163: N1005 ^status complete)
- <=WM: (14162: I3 ^predict-yes N1005)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-yes for direction R in state State-A
- In State-A moving R
- ENV: (next state, see, prediction correct?) = (State-B, 1, True)
- predict error 0
- dir: dir isR
- --- END Output Phase ---
- |\--- Input Phase ---
- =>WM: (14181: I2 ^dir R)
- =>WM: (14180: I2 ^reward 1)
- =>WM: (14179: I2 ^see 1)
- =>WM: (14178: N1006 ^status complete)
- <=WM: (14166: I2 ^dir R)
- <=WM: (14165: I2 ^reward 1)
- <=WM: (14164: I2 ^see 1)
- =>WM: (14182: I2 ^level-1 R1-root)
- <=WM: (14167: I2 ^level-1 L1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O2011 = -0.3011268063455669)
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O2012 = 0.7427525112697247)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1010 ^value 1 +)
- (R1 ^reward R1010 +)
- Firing propose*predict-yes
- -->
- (O2013 ^name predict-yes +)
- (S1 ^operator O2013 +)
- Firing propose*predict-no
- -->
- (O2014 ^name predict-no +)
- (S1 ^operator O2014 +)
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O2012 = 0.2572459278910315)
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O2011 = 0.7368282658793132)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O2012 ^name predict-no +)
- (S1 ^operator O2012 +)
- Retracting propose*predict-yes
- -->
- (O2011 ^name predict-yes +)
- (S1 ^operator O2011 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1009 ^value 1 +)
- (R1 ^reward R1009 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
- -->
- (S1 ^operator O2012 = -0.1377248055371832)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O2012 = 0.2572459278910315)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
- -->
- (S1 ^operator O2011 = 0.2631690211593038)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O2011 = 0.7368282658793132)
- =>WM: (14188: S1 ^operator O2014 +)
- =>WM: (14187: S1 ^operator O2013 +)
- =>WM: (14186: O2014 ^name predict-no)
- =>WM: (14185: O2013 ^name predict-yes)
- =>WM: (14184: R1010 ^value 1)
- =>WM: (14183: R1 ^reward R1010)
- <=WM: (14174: S1 ^operator O2011 +)
- <=WM: (14176: S1 ^operator O2011)
- <=WM: (14175: S1 ^operator O2012 +)
- <=WM: (14169: R1 ^reward R1009)
- <=WM: (14172: O2012 ^name predict-no)
- <=WM: (14171: O2011 ^name predict-yes)
- <=WM: (14170: R1009 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O2013 = 0.7368282658793132)
- Firing prefer*rvt*predict-yes*H0*3*v1*H1
- -->
- Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O2013 = -0.3011268063455669)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O2014 = 0.2572459278910315)
- Firing prefer*rvt*predict-no*H0*4*v1*H1
- -->
- Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O2014 = 0.7427525112697247)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O2012 = 0.2572459278910315)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O2012 = 0.7427525112697247)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O2011 = 0.7368282658793132)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O2011 = -0.3011268063455669)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-yes*H0*3 0.748236 -0.0114079 0.736828 -> 0.748236 -0.0114076 0.736829(R,m,v=1,0.89759,0.092479)
- RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*40 0.251763 0.0114059 0.263169 -> 0.251763 0.0114062 0.263169(R,m,v=1,1,0)
- =>WM: (14189: S1 ^operator O2014)
- 1007: O: O2014 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N1007 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-yes N1006 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14190: I3 ^predict-no N1007)
- <=WM: (14178: N1006 ^status complete)
- <=WM: (14177: I3 ^predict-yes N1006)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction R in state State-B
- In State-B moving R
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isU
- --- END Output Phase ---
- -/|--- Input Phase ---
- =>WM: (14194: I2 ^dir U)
- =>WM: (14193: I2 ^reward 1)
- =>WM: (14192: I2 ^see 0)
- =>WM: (14191: N1007 ^status complete)
- <=WM: (14181: I2 ^dir R)
- <=WM: (14180: I2 ^reward 1)
- <=WM: (14179: I2 ^see 1)
- =>WM: (14195: I2 ^level-1 R0-root)
- <=WM: (14182: I2 ^level-1 R1-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1011 ^value 1 +)
- (R1 ^reward R1011 +)
- Firing propose*predict-yes
- -->
- (O2015 ^name predict-yes +)
- (S1 ^operator O2015 +)
- Firing propose*predict-no
- -->
- (O2016 ^name predict-no +)
- (S1 ^operator O2016 +)
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2014 = 0.9999999999999999)
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2013 = 0.)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 1 +)
- Retracting propose*predict-no
- -->
- (O2014 ^name predict-no +)
- (S1 ^operator O2014 +)
- Retracting propose*predict-yes
- -->
- (O2013 ^name predict-yes +)
- (S1 ^operator O2013 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1010 ^value 1 +)
- (R1 ^reward R1010 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir R +)
- Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*36
- -->
- (S1 ^operator O2014 = 0.7427525112697247)
- Retracting rl*prefer*rvt*predict-no*H0*4
- -->
- (S1 ^operator O2014 = 0.2572459278910315)
- Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*37
- -->
- (S1 ^operator O2013 = -0.3011268063455669)
- Retracting rl*prefer*rvt*predict-yes*H0*3
- -->
- (S1 ^operator O2013 = 0.7368286728235206)
- =>WM: (14203: S1 ^operator O2016 +)
- =>WM: (14202: S1 ^operator O2015 +)
- =>WM: (14201: I3 ^dir U)
- =>WM: (14200: O2016 ^name predict-no)
- =>WM: (14199: O2015 ^name predict-yes)
- =>WM: (14198: R1011 ^value 1)
- =>WM: (14197: R1 ^reward R1011)
- =>WM: (14196: I3 ^see 0)
- <=WM: (14187: S1 ^operator O2013 +)
- <=WM: (14188: S1 ^operator O2014 +)
- <=WM: (14189: S1 ^operator O2014)
- <=WM: (14173: I3 ^dir R)
- <=WM: (14183: R1 ^reward R1010)
- <=WM: (14168: I3 ^see 1)
- <=WM: (14186: O2014 ^name predict-no)
- <=WM: (14185: O2013 ^name predict-yes)
- <=WM: (14184: R1010 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2015 = 0.)
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2016 = 0.9999999999999999)
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2014 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2013 = 0.)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*4 0.586136 -0.32889 0.257246 -> 0.586136 -0.32889 0.257246(R,m,v=1,0.860465,0.120767)
- RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*36 0.413863 0.32889 0.742753 -> 0.413863 0.32889 0.742753(R,m,v=1,1,0)
- =>WM: (14204: S1 ^operator O2016)
- 1008: O: O2016 (predict-no)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-no N1008 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N1007 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Memory (PE) ---
- =>WM: (14205: I3 ^predict-no N1008)
- <=WM: (14191: N1007 ^status complete)
- <=WM: (14190: I3 ^predict-no N1007)
- --- Firing Productions (IE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing monitor*world
- -->
- I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
- --- Change Working Memory (IE) ---
- --- END Application Phase ---
- --- Output Phase ---
- ENV: Agent did: predict-no for direction U in state State-B
- In State-B moving U
- ENV: (next state, see, prediction correct?) = (State-B, 0, True)
- predict error 0
- dir: dir isL
- --- END Output Phase ---
- \---- Input Phase ---
- =>WM: (14209: I2 ^dir L)
- =>WM: (14208: I2 ^reward 1)
- =>WM: (14207: I2 ^see 0)
- =>WM: (14206: N1008 ^status complete)
- <=WM: (14194: I2 ^dir U)
- <=WM: (14193: I2 ^reward 1)
- <=WM: (14192: I2 ^see 0)
- =>WM: (14210: I2 ^level-1 R0-root)
- <=WM: (14195: I2 ^level-1 R0-root)
- --- END Input Phase ---
- --- Proposal Phase ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O2016 = 0.04178081990804111)
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O2015 = 0.5681113503720048)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Firing elaborate*reward*based*on*reward
- -->
- (R1012 ^value 1 +)
- (R1 ^reward R1012 +)
- Firing propose*predict-yes
- -->
- (O2017 ^name predict-yes +)
- (S1 ^operator O2017 +)
- Firing propose*predict-no
- -->
- (O2018 ^name predict-no +)
- (S1 ^operator O2018 +)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O2016 = 0.3289460753274439)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O2015 = 0.4318909395679179)
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir L +)
- inner elaboration loop at bottom goal.
- Retracting elaborate*copy-see-to-output-link
- -->
- (I3 ^see 0 +)
- Retracting propose*predict-no
- -->
- (O2016 ^name predict-no +)
- (S1 ^operator O2016 +)
- Retracting propose*predict-yes
- -->
- (O2015 ^name predict-yes +)
- (S1 ^operator O2015 +)
- Retracting elaborate*reward*based*on*reward
- -->
- (R1011 ^value 1 +)
- (R1 ^reward R1011 +)
- Retracting elaborate*copy-dir-to-output-link
- -->
- (I3 ^dir U +)
- Retracting rl*prefer*rvt*predict-no*H0*2
- -->
- (S1 ^operator O2016 = 0.9999999999999999)
- Retracting rl*prefer*rvt*predict-yes*H0*1
- -->
- (S1 ^operator O2015 = 0.)
- =>WM: (14217: S1 ^operator O2018 +)
- =>WM: (14216: S1 ^operator O2017 +)
- =>WM: (14215: I3 ^dir L)
- =>WM: (14214: O2018 ^name predict-no)
- =>WM: (14213: O2017 ^name predict-yes)
- =>WM: (14212: R1012 ^value 1)
- =>WM: (14211: R1 ^reward R1012)
- <=WM: (14202: S1 ^operator O2015 +)
- <=WM: (14203: S1 ^operator O2016 +)
- <=WM: (14204: S1 ^operator O2016)
- <=WM: (14201: I3 ^dir U)
- <=WM: (14197: R1 ^reward R1011)
- <=WM: (14200: O2016 ^name predict-no)
- <=WM: (14199: O2015 ^name predict-yes)
- <=WM: (14198: R1011 ^value 1)
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing prefer*rvt*predict-yes*H0
- -->
- Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O2017 = 0.5681113503720048)
- Firing rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O2017 = 0.4318909395679179)
- Firing prefer*rvt*predict-yes*H0*5*v1*H1
- -->
- Firing prefer*rvt*predict-no*H0
- -->
- Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O2018 = 0.04178081990804111)
- Firing rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O2018 = 0.3289460753274439)
- Firing prefer*rvt*predict-no*H0*6*v1*H1
- -->
- inner elaboration loop at bottom goal.
- Retracting rl*prefer*rvt*predict-no*H0*6
- -->
- (S1 ^operator O2016 = 0.3289460753274439)
- Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*38
- -->
- (S1 ^operator O2016 = 0.04178081990804111)
- Retracting rl*prefer*rvt*predict-yes*H0*5
- -->
- (S1 ^operator O2015 = 0.4318909395679179)
- Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
- -->
- (S1 ^operator O2015 = 0.5681113503720048)
- --- END Proposal Phase ---
- --- Decision Phase ---
- RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
- =>WM: (14218: S1 ^operator O2017)
- 1009: O: O2017 (predict-yes)
- --- END Decision Phase ---
- --- Application Phase ---
- --- Firing Productions (PE) For State At Depth 1 ---
- --- Inner Elaboration Phase, active level 1 (S1) ---
- Firing apply*operator
- -->
- (I3 ^predict-yes N1009 + :O )
- Firing apply*operator*complete
- -->
- (I3 ^predict-no N1008 - :O )
- inner elaboration loop at bottom goal.
- --- Change Working Me