PageRenderTime 143ms CodeModel.GetById 22ms RepoModel.GetById 0ms app.codeStats 1ms

/flipv2/20121112-101138-2.5K-ReLST-Evan/stdout-flip-2.5K_2.txt

https://bitbucket.org/evan13579b/soar-ziggurat
Plain Text | 16463 lines | 15731 code | 732 blank | 0 comment | 0 complexity | 89a52a67c906c4fc3a1a3d7f661a0621 MD5 | raw file
Possible License(s): BSD-3-Clause
  1. Seeding... 2
  2. dir: dir isU
  3. Python-Soar Flip environment.
  4. To accept commands from an external sml process, you'll need to
  5. type 'slave <log file> <n decisons>' at the prompt...
  6. sourcing 'flip_predict.soar'
  7. ***********
  8. Total: 11 productions sourced.
  9. seeding Soar with 2 ...
  10. soar> Entering slave mode:
  11. - log file 'rl-slave-2.5K_2.log'....
  12. - will exit slave mode after 2500 decisions
  13. waiting for commands from an externally connected sml process...
  14. -/|sleeping...
  15. \sleeping...
  16. -sleeping...
  17. /sleeping...
  18. |sleeping...
  19. \-/|\-/|\-sleeping...
  20. /|\-/|\sleeping...
  21. -1: O: O1 (predict-yes)
  22. I see 0 and I'm going to do: predict-yes
  23. ENV: Agent did: predict-yes for direction U in state State-A
  24. In State-A moving U
  25. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  26. predict error 1
  27. dir: dir isU
  28. rule alias: '*'
  29. rule alias: '*'
  30. /|\-/|\-2: O: O4 (predict-no)
  31. I see 0 and I'm going to do: predict-no
  32. ENV: Agent did: predict-no for direction U in state State-A
  33. In State-A moving U
  34. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  35. predict error 0
  36. dir: dir isL
  37. /|\-3: O: O5 (predict-yes)
  38. I see 1 and I'm going to do: predict-yes
  39. ENV: Agent did: predict-yes for direction L in state State-A
  40. In State-A moving L
  41. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  42. predict error 1
  43. dir: dir isL
  44. /|\4: O: O7 (predict-yes)
  45. I see 0 and I'm going to do: predict-yes
  46. ENV: Agent did: predict-yes for direction L in state State-A
  47. In State-A moving L
  48. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  49. predict error 1
  50. dir: dir isU
  51. -5: O: O10 (predict-no)
  52. I see 0 and I'm going to do: predict-no
  53. ENV: Agent did: predict-no for direction U in state State-A
  54. In State-A moving U
  55. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  56. predict error 0
  57. dir: dir isU
  58. /|\6: O: O12 (predict-no)
  59. I see 1 and I'm going to do: predict-no
  60. ENV: Agent did: predict-no for direction U in state State-A
  61. In State-A moving U
  62. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  63. predict error 0
  64. dir: dir isU
  65. -/|7: O: O14 (predict-no)
  66. I see 1 and I'm going to do: predict-no
  67. ENV: Agent did: predict-no for direction U in state State-A
  68. In State-A moving U
  69. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  70. predict error 0
  71. dir: dir isL
  72. \8: O: O15 (predict-yes)
  73. I see 1 and I'm going to do: predict-yes
  74. ENV: Agent did: predict-yes for direction L in state State-A
  75. In State-A moving L
  76. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  77. predict error 1
  78. dir: dir isR
  79. -/|9: O: O17 (predict-yes)
  80. I see 0 and I'm going to do: predict-yes
  81. ENV: Agent did: predict-yes for direction R in state State-A
  82. In State-A moving R
  83. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  84. predict error 0
  85. dir: dir isR
  86. \-/10: O: O19 (predict-yes)
  87. I see 1 and I'm going to do: predict-yes
  88. ENV: Agent did: predict-yes for direction R in state State-B
  89. In State-B moving R
  90. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  91. predict error 1
  92. dir: dir isR
  93. |\-11: O: O21 (predict-yes)
  94. I see 0 and I'm going to do: predict-yes
  95. ENV: Agent did: predict-yes for direction R in state State-B
  96. In State-B moving R
  97. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  98. predict error 1
  99. dir: dir isL
  100. rule alias: '*'
  101. rule alias: '*'
  102. rule alias: '*'
  103. rule alias: '*'
  104. /12: O: O24 (predict-no)
  105. I see 0 and I'm going to do: predict-no
  106. ENV: Agent did: predict-no for direction L in state State-B
  107. In State-B moving L
  108. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  109. predict error 1
  110. dir: dir isR
  111. |\-13: O: O25 (predict-yes)
  112. I see 0 and I'm going to do: predict-yes
  113. ENV: Agent did: predict-yes for direction R in state State-A
  114. In State-A moving R
  115. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  116. predict error 0
  117. dir: dir isR
  118. /|\14: O: O27 (predict-yes)
  119. I see 1 and I'm going to do: predict-yes
  120. ENV: Agent did: predict-yes for direction R in state State-B
  121. In State-B moving R
  122. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  123. predict error 1
  124. dir: dir isU
  125. -/15: O: O30 (predict-no)
  126. I see 0 and I'm going to do: predict-no
  127. ENV: Agent did: predict-no for direction U in state State-B
  128. In State-B moving U
  129. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  130. predict error 0
  131. dir: dir isU
  132. |\16: O: O32 (predict-no)
  133. I see 1 and I'm going to do: predict-no
  134. ENV: Agent did: predict-no for direction U in state State-B
  135. In State-B moving U
  136. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  137. predict error 0
  138. dir: dir isU
  139. -/|17: O: O34 (predict-no)
  140. I see 1 and I'm going to do: predict-no
  141. ENV: Agent did: predict-no for direction U in state State-B
  142. In State-B moving U
  143. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  144. predict error 0
  145. dir: dir isR
  146. \18: O: O35 (predict-yes)
  147. I see 1 and I'm going to do: predict-yes
  148. ENV: Agent did: predict-yes for direction R in state State-B
  149. In State-B moving R
  150. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  151. predict error 1
  152. dir: dir isR
  153. -/|19: O: O37 (predict-yes)
  154. I see 0 and I'm going to do: predict-yes
  155. ENV: Agent did: predict-yes for direction R in state State-B
  156. In State-B moving R
  157. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  158. predict error 1
  159. dir: dir isL
  160. \-/20: O: O39 (predict-yes)
  161. I see 0 and I'm going to do: predict-yes
  162. ENV: Agent did: predict-yes for direction L in state State-B
  163. In State-B moving L
  164. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  165. predict error 0
  166. dir: dir isL
  167. |\-21: O: O42 (predict-no)
  168. I see 1 and I'm going to do: predict-no
  169. ENV: Agent did: predict-no for direction L in state State-A
  170. In State-A moving L
  171. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  172. predict error 0
  173. dir: dir isL
  174. /22: O: O43 (predict-yes)
  175. I see 1 and I'm going to do: predict-yes
  176. ENV: Agent did: predict-yes for direction L in state State-A
  177. In State-A moving L
  178. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  179. predict error 1
  180. dir: dir isR
  181. |\-23: O: O45 (predict-yes)
  182. I see 0 and I'm going to do: predict-yes
  183. ENV: Agent did: predict-yes for direction R in state State-A
  184. In State-A moving R
  185. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  186. predict error 0
  187. dir: dir isL
  188. /|\24: O: O48 (predict-no)
  189. I see 1 and I'm going to do: predict-no
  190. ENV: Agent did: predict-no for direction L in state State-B
  191. In State-B moving L
  192. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  193. predict error 1
  194. dir: dir isR
  195. -/25: O: O49 (predict-yes)
  196. I see 0 and I'm going to do: predict-yes
  197. ENV: Agent did: predict-yes for direction R in state State-A
  198. In State-A moving R
  199. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  200. predict error 0
  201. dir: dir isU
  202. |\-26: O: O52 (predict-no)
  203. I see 1 and I'm going to do: predict-no
  204. ENV: Agent did: predict-no for direction U in state State-B
  205. In State-B moving U
  206. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  207. predict error 0
  208. dir: dir isR
  209. /|27: O: O53 (predict-yes)
  210. I see 1 and I'm going to do: predict-yes
  211. ENV: Agent did: predict-yes for direction R in state State-B
  212. In State-B moving R
  213. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  214. predict error 1
  215. dir: dir isR
  216. \-/28: O: O55 (predict-yes)
  217. I see 0 and I'm going to do: predict-yes
  218. ENV: Agent did: predict-yes for direction R in state State-B
  219. In State-B moving R
  220. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  221. predict error 1
  222. dir: dir isL
  223. |\-29: O: O58 (predict-no)
  224. I see 0 and I'm going to do: predict-no
  225. ENV: Agent did: predict-no for direction L in state State-B
  226. In State-B moving L
  227. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  228. predict error 1
  229. dir: dir isL
  230. /30: O: O60 (predict-no)
  231. I see 0 and I'm going to do: predict-no
  232. ENV: Agent did: predict-no for direction L in state State-A
  233. In State-A moving L
  234. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  235. predict error 0
  236. dir: dir isL
  237. |\31: O: O61 (predict-yes)
  238. I see 1 and I'm going to do: predict-yes
  239. ENV: Agent did: predict-yes for direction L in state State-A
  240. In State-A moving L
  241. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  242. predict error 1
  243. dir: dir isL
  244. -32: O: O64 (predict-no)
  245. I see 0 and I'm going to do: predict-no
  246. ENV: Agent did: predict-no for direction L in state State-A
  247. In State-A moving L
  248. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  249. predict error 0
  250. dir: dir isR
  251. /|\33: O: O65 (predict-yes)
  252. I see 1 and I'm going to do: predict-yes
  253. ENV: Agent did: predict-yes for direction R in state State-A
  254. In State-A moving R
  255. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  256. predict error 0
  257. dir: dir isU
  258. -34: O: O68 (predict-no)
  259. I see 1 and I'm going to do: predict-no
  260. ENV: Agent did: predict-no for direction U in state State-B
  261. In State-B moving U
  262. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  263. predict error 0
  264. dir: dir isU
  265. /|\35: O: O70 (predict-no)
  266. I see 1 and I'm going to do: predict-no
  267. ENV: Agent did: predict-no for direction U in state State-B
  268. In State-B moving U
  269. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  270. predict error 0
  271. dir: dir isL
  272. -36: O: O72 (predict-no)
  273. I see 1 and I'm going to do: predict-no
  274. ENV: Agent did: predict-no for direction L in state State-B
  275. In State-B moving L
  276. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  277. predict error 1
  278. dir: dir isU
  279. /|\37: O: O74 (predict-no)
  280. I see 0 and I'm going to do: predict-no
  281. ENV: Agent did: predict-no for direction U in state State-A
  282. In State-A moving U
  283. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  284. predict error 0
  285. dir: dir isU
  286. -/|38: O: O76 (predict-no)
  287. I see 1 and I'm going to do: predict-no
  288. ENV: Agent did: predict-no for direction U in state State-A
  289. In State-A moving U
  290. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  291. predict error 0
  292. dir: dir isU
  293. \-/39: O: O77 (predict-yes)
  294. I see 1 and I'm going to do: predict-yes
  295. ENV: Agent did: predict-yes for direction U in state State-A
  296. In State-A moving U
  297. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  298. predict error 1
  299. dir: dir isU
  300. |\-40: O: O79 (predict-yes)
  301. I see 0 and I'm going to do: predict-yes
  302. ENV: Agent did: predict-yes for direction U in state State-A
  303. In State-A moving U
  304. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  305. predict error 1
  306. dir: dir isU
  307. /|\41: O: O82 (predict-no)
  308. I see 0 and I'm going to do: predict-no
  309. ENV: Agent did: predict-no for direction U in state State-A
  310. In State-A moving U
  311. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  312. predict error 0
  313. dir: dir isU
  314. -42: O: O84 (predict-no)
  315. I see 1 and I'm going to do: predict-no
  316. ENV: Agent did: predict-no for direction U in state State-A
  317. In State-A moving U
  318. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  319. predict error 0
  320. dir: dir isR
  321. /|\43: O: O85 (predict-yes)
  322. I see 1 and I'm going to do: predict-yes
  323. ENV: Agent did: predict-yes for direction R in state State-A
  324. In State-A moving R
  325. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  326. predict error 0
  327. dir: dir isU
  328. -/|44: O: O87 (predict-yes)
  329. I see 1 and I'm going to do: predict-yes
  330. ENV: Agent did: predict-yes for direction U in state State-B
  331. In State-B moving U
  332. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  333. predict error 1
  334. dir: dir isU
  335. \-/45: O: O90 (predict-no)
  336. I see 0 and I'm going to do: predict-no
  337. ENV: Agent did: predict-no for direction U in state State-B
  338. In State-B moving U
  339. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  340. predict error 0
  341. dir: dir isL
  342. |\46: O: O92 (predict-no)
  343. I see 1 and I'm going to do: predict-no
  344. ENV: Agent did: predict-no for direction L in state State-B
  345. In State-B moving L
  346. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  347. predict error 1
  348. dir: dir isU
  349. -/|47: O: O94 (predict-no)
  350. I see 0 and I'm going to do: predict-no
  351. ENV: Agent did: predict-no for direction U in state State-A
  352. In State-A moving U
  353. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  354. predict error 0
  355. dir: dir isU
  356. \-48: O: O96 (predict-no)
  357. I see 1 and I'm going to do: predict-no
  358. ENV: Agent did: predict-no for direction U in state State-A
  359. In State-A moving U
  360. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  361. predict error 0
  362. dir: dir isR
  363. /|49: O: O97 (predict-yes)
  364. I see 1 and I'm going to do: predict-yes
  365. ENV: Agent did: predict-yes for direction R in state State-A
  366. In State-A moving R
  367. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  368. predict error 0
  369. dir: dir isR
  370. \-/50: O: O99 (predict-yes)
  371. I see 1 and I'm going to do: predict-yes
  372. ENV: Agent did: predict-yes for direction R in state State-B
  373. In State-B moving R
  374. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  375. predict error 1
  376. dir: dir isR
  377. |\-/|\-sleeping...
  378. /51: O: O101 (predict-yes)
  379. I see 0 and I'm going to do: predict-yes
  380. ENV: Agent did: predict-yes for direction R in state State-B
  381. In State-B moving R
  382. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  383. predict error 1
  384. dir: dir isU
  385. rule alias: '*'
  386. |52: O: O104 (predict-no)
  387. I see 0 and I'm going to do: predict-no
  388. ENV: Agent did: predict-no for direction U in state State-B
  389. In State-B moving U
  390. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  391. predict error 0
  392. dir: dir isR
  393. \53: O: O105 (predict-yes)
  394. I see 1 and I'm going to do: predict-yes
  395. ENV: Agent did: predict-yes for direction R in state State-B
  396. In State-B moving R
  397. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  398. predict error 1
  399. dir: dir isU
  400. -54: O: O108 (predict-no)
  401. I see 0 and I'm going to do: predict-no
  402. ENV: Agent did: predict-no for direction U in state State-B
  403. In State-B moving U
  404. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  405. predict error 0
  406. dir: dir isR
  407. /|\55: O: O109 (predict-yes)
  408. I see 1 and I'm going to do: predict-yes
  409. ENV: Agent did: predict-yes for direction R in state State-B
  410. In State-B moving R
  411. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  412. predict error 1
  413. dir: dir isU
  414. -/|\sleeping...
  415. -56: O: O111 (predict-yes)
  416. I see 0 and I'm going to do: predict-yes
  417. ENV: Agent did: predict-yes for direction U in state State-B
  418. In State-B moving U
  419. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  420. predict error 1
  421. dir: dir isU
  422. /|57: O: O114 (predict-no)
  423. I see 0 and I'm going to do: predict-no
  424. ENV: Agent did: predict-no for direction U in state State-B
  425. In State-B moving U
  426. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  427. predict error 0
  428. dir: dir isR
  429. \58: O: O115 (predict-yes)
  430. I see 1 and I'm going to do: predict-yes
  431. ENV: Agent did: predict-yes for direction R in state State-B
  432. In State-B moving R
  433. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  434. predict error 1
  435. dir: dir isR
  436. -/|59: O: O117 (predict-yes)
  437. I see 0 and I'm going to do: predict-yes
  438. ENV: Agent did: predict-yes for direction R in state State-B
  439. In State-B moving R
  440. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  441. predict error 1
  442. dir: dir isU
  443. \-/60: O: O120 (predict-no)
  444. I see 0 and I'm going to do: predict-no
  445. ENV: Agent did: predict-no for direction U in state State-B
  446. In State-B moving U
  447. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  448. predict error 0
  449. dir: dir isU
  450. |\-61: O: O122 (predict-no)
  451. I see 1 and I'm going to do: predict-no
  452. ENV: Agent did: predict-no for direction U in state State-B
  453. In State-B moving U
  454. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  455. predict error 0
  456. dir: dir isR
  457. /62: O: O123 (predict-yes)
  458. I see 1 and I'm going to do: predict-yes
  459. ENV: Agent did: predict-yes for direction R in state State-B
  460. In State-B moving R
  461. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  462. predict error 1
  463. dir: dir isL
  464. |\63: O: O126 (predict-no)
  465. I see 0 and I'm going to do: predict-no
  466. ENV: Agent did: predict-no for direction L in state State-B
  467. In State-B moving L
  468. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  469. predict error 1
  470. dir: dir isL
  471. -/64: O: O128 (predict-no)
  472. I see 0 and I'm going to do: predict-no
  473. ENV: Agent did: predict-no for direction L in state State-A
  474. In State-A moving L
  475. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  476. predict error 0
  477. dir: dir isU
  478. |\-65: O: O130 (predict-no)
  479. I see 1 and I'm going to do: predict-no
  480. ENV: Agent did: predict-no for direction U in state State-A
  481. In State-A moving U
  482. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  483. predict error 0
  484. dir: dir isL
  485. /66: O: O132 (predict-no)
  486. I see 1 and I'm going to do: predict-no
  487. ENV: Agent did: predict-no for direction L in state State-A
  488. In State-A moving L
  489. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  490. predict error 0
  491. dir: dir isU
  492. |67: O: O134 (predict-no)
  493. I see 1 and I'm going to do: predict-no
  494. ENV: Agent did: predict-no for direction U in state State-A
  495. In State-A moving U
  496. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  497. predict error 0
  498. dir: dir isL
  499. \68: O: O136 (predict-no)
  500. I see 1 and I'm going to do: predict-no
  501. ENV: Agent did: predict-no for direction L in state State-A
  502. In State-A moving L
  503. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  504. predict error 0
  505. dir: dir isU
  506. -/69: O: O138 (predict-no)
  507. I see 1 and I'm going to do: predict-no
  508. ENV: Agent did: predict-no for direction U in state State-A
  509. In State-A moving U
  510. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  511. predict error 0
  512. dir: dir isL
  513. |70: O: O140 (predict-no)
  514. I see 1 and I'm going to do: predict-no
  515. ENV: Agent did: predict-no for direction L in state State-A
  516. In State-A moving L
  517. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  518. predict error 0
  519. dir: dir isU
  520. \-71: O: O142 (predict-no)
  521. I see 1 and I'm going to do: predict-no
  522. ENV: Agent did: predict-no for direction U in state State-A
  523. In State-A moving U
  524. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  525. predict error 0
  526. dir: dir isU
  527. rule alias: '*'
  528. rule alias: '*'
  529. rule alias: '*'
  530. rule alias: '*'
  531. rule alias: '*'
  532. rule alias: '*'
  533. /72: O: O144 (predict-no)
  534. I see 1 and I'm going to do: predict-no
  535. ENV: Agent did: predict-no for direction U in state State-A
  536. In State-A moving U
  537. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  538. predict error 0
  539. dir: dir isR
  540. |\-73: O: O145 (predict-yes)
  541. I see 1 and I'm going to do: predict-yes
  542. ENV: Agent did: predict-yes for direction R in state State-A
  543. In State-A moving R
  544. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  545. predict error 0
  546. dir: dir isR
  547. /|74: O: O147 (predict-yes)
  548. I see 1 and I'm going to do: predict-yes
  549. ENV: Agent did: predict-yes for direction R in state State-B
  550. In State-B moving R
  551. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  552. predict error 1
  553. dir: dir isR
  554. \-75: O: O149 (predict-yes)
  555. I see 0 and I'm going to do: predict-yes
  556. ENV: Agent did: predict-yes for direction R in state State-B
  557. In State-B moving R
  558. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  559. predict error 1
  560. dir: dir isR
  561. /|\76: O: O152 (predict-no)
  562. I see 0 and I'm going to do: predict-no
  563. ENV: Agent did: predict-no for direction R in state State-B
  564. In State-B moving R
  565. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  566. predict error 0
  567. dir: dir isL
  568. -/|77: O: O154 (predict-no)
  569. I see 1 and I'm going to do: predict-no
  570. ENV: Agent did: predict-no for direction L in state State-B
  571. In State-B moving L
  572. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  573. predict error 1
  574. dir: dir isL
  575. \-/78: O: O156 (predict-no)
  576. I see 0 and I'm going to do: predict-no
  577. ENV: Agent did: predict-no for direction L in state State-A
  578. In State-A moving L
  579. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  580. predict error 0
  581. dir: dir isR
  582. |79: O: O158 (predict-no)
  583. I see 1 and I'm going to do: predict-no
  584. ENV: Agent did: predict-no for direction R in state State-A
  585. In State-A moving R
  586. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  587. predict error 1
  588. dir: dir isU
  589. \-80: O: O160 (predict-no)
  590. I see 0 and I'm going to do: predict-no
  591. ENV: Agent did: predict-no for direction U in state State-B
  592. In State-B moving U
  593. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  594. predict error 0
  595. dir: dir isR
  596. /|\81: O: O162 (predict-no)
  597. I see 1 and I'm going to do: predict-no
  598. ENV: Agent did: predict-no for direction R in state State-B
  599. In State-B moving R
  600. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  601. predict error 0
  602. dir: dir isL
  603. rule alias: '*'
  604. rule alias: '*'
  605. -82: O: O163 (predict-yes)
  606. I see 1 and I'm going to do: predict-yes
  607. ENV: Agent did: predict-yes for direction L in state State-B
  608. In State-B moving L
  609. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  610. predict error 0
  611. dir: dir isU
  612. /|\83: O: O166 (predict-no)
  613. I see 1 and I'm going to do: predict-no
  614. ENV: Agent did: predict-no for direction U in state State-A
  615. In State-A moving U
  616. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  617. predict error 0
  618. dir: dir isU
  619. -/|84: O: O168 (predict-no)
  620. I see 1 and I'm going to do: predict-no
  621. ENV: Agent did: predict-no for direction U in state State-A
  622. In State-A moving U
  623. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  624. predict error 0
  625. dir: dir isU
  626. \-85: O: O169 (predict-yes)
  627. I see 1 and I'm going to do: predict-yes
  628. ENV: Agent did: predict-yes for direction U in state State-A
  629. In State-A moving U
  630. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  631. predict error 1
  632. dir: dir isL
  633. /|\86: O: O172 (predict-no)
  634. I see 0 and I'm going to do: predict-no
  635. ENV: Agent did: predict-no for direction L in state State-A
  636. In State-A moving L
  637. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  638. predict error 0
  639. dir: dir isU
  640. -/|87: O: O173 (predict-yes)
  641. I see 1 and I'm going to do: predict-yes
  642. ENV: Agent did: predict-yes for direction U in state State-A
  643. In State-A moving U
  644. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  645. predict error 1
  646. dir: dir isL
  647. \-/88: O: O176 (predict-no)
  648. I see 0 and I'm going to do: predict-no
  649. ENV: Agent did: predict-no for direction L in state State-A
  650. In State-A moving L
  651. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  652. predict error 0
  653. dir: dir isR
  654. |\89: O: O177 (predict-yes)
  655. I see 1 and I'm going to do: predict-yes
  656. ENV: Agent did: predict-yes for direction R in state State-A
  657. In State-A moving R
  658. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  659. predict error 0
  660. dir: dir isL
  661. -/90: O: O180 (predict-no)
  662. I see 1 and I'm going to do: predict-no
  663. ENV: Agent did: predict-no for direction L in state State-B
  664. In State-B moving L
  665. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  666. predict error 1
  667. dir: dir isL
  668. |\-91: O: O182 (predict-no)
  669. I see 0 and I'm going to do: predict-no
  670. ENV: Agent did: predict-no for direction L in state State-A
  671. In State-A moving L
  672. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  673. predict error 0
  674. dir: dir isU
  675. rule alias: '*'
  676. rule alias: '*'
  677. /92: O: O184 (predict-no)
  678. I see 1 and I'm going to do: predict-no
  679. ENV: Agent did: predict-no for direction U in state State-A
  680. In State-A moving U
  681. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  682. predict error 0
  683. dir: dir isL
  684. |\93: O: O186 (predict-no)
  685. I see 1 and I'm going to do: predict-no
  686. ENV: Agent did: predict-no for direction L in state State-A
  687. In State-A moving L
  688. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  689. predict error 0
  690. dir: dir isR
  691. -/|94: O: O187 (predict-yes)
  692. I see 1 and I'm going to do: predict-yes
  693. ENV: Agent did: predict-yes for direction R in state State-A
  694. In State-A moving R
  695. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  696. predict error 0
  697. dir: dir isU
  698. \-/95: O: O190 (predict-no)
  699. I see 1 and I'm going to do: predict-no
  700. ENV: Agent did: predict-no for direction U in state State-B
  701. In State-B moving U
  702. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  703. predict error 0
  704. dir: dir isL
  705. |\-96: O: O191 (predict-yes)
  706. I see 1 and I'm going to do: predict-yes
  707. ENV: Agent did: predict-yes for direction L in state State-B
  708. In State-B moving L
  709. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  710. predict error 0
  711. dir: dir isL
  712. /97: O: O194 (predict-no)
  713. I see 1 and I'm going to do: predict-no
  714. ENV: Agent did: predict-no for direction L in state State-A
  715. In State-A moving L
  716. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  717. predict error 0
  718. dir: dir isU
  719. |\-98: O: O196 (predict-no)
  720. I see 1 and I'm going to do: predict-no
  721. ENV: Agent did: predict-no for direction U in state State-A
  722. In State-A moving U
  723. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  724. predict error 0
  725. dir: dir isR
  726. /|\99: O: O198 (predict-no)
  727. I see 1 and I'm going to do: predict-no
  728. ENV: Agent did: predict-no for direction R in state State-A
  729. In State-A moving R
  730. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  731. predict error 1
  732. dir: dir isU
  733. -100: O: O200 (predict-no)
  734. I see 0 and I'm going to do: predict-no
  735. ENV: Agent did: predict-no for direction U in state State-B
  736. In State-B moving U
  737. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  738. predict error 0
  739. dir: dir isL
  740. /|\101: O: O201 (predict-yes)
  741. I see 1 and I'm going to do: predict-yes
  742. ENV: Agent did: predict-yes for direction L in state State-B
  743. In State-B moving L
  744. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  745. predict error 0
  746. dir: dir isR
  747. -/102: O: O203 (predict-yes)
  748. I see 1 and I'm going to do: predict-yes
  749. ENV: Agent did: predict-yes for direction R in state State-A
  750. In State-A moving R
  751. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  752. predict error 0
  753. dir: dir isL
  754. |\-103: O: O205 (predict-yes)
  755. I see 1 and I'm going to do: predict-yes
  756. ENV: Agent did: predict-yes for direction L in state State-B
  757. In State-B moving L
  758. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  759. predict error 0
  760. dir: dir isU
  761. /|104: O: O208 (predict-no)
  762. I see 1 and I'm going to do: predict-no
  763. ENV: Agent did: predict-no for direction U in state State-A
  764. In State-A moving U
  765. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  766. predict error 0
  767. dir: dir isL
  768. \-/105: O: O210 (predict-no)
  769. I see 1 and I'm going to do: predict-no
  770. ENV: Agent did: predict-no for direction L in state State-A
  771. In State-A moving L
  772. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  773. predict error 0
  774. dir: dir isU
  775. |\106: O: O212 (predict-no)
  776. I see 1 and I'm going to do: predict-no
  777. ENV: Agent did: predict-no for direction U in state State-A
  778. In State-A moving U
  779. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  780. predict error 0
  781. dir: dir isL
  782. -/|107: O: O214 (predict-no)
  783. I see 1 and I'm going to do: predict-no
  784. ENV: Agent did: predict-no for direction L in state State-A
  785. In State-A moving L
  786. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  787. predict error 0
  788. dir: dir isU
  789. \-/108: O: O216 (predict-no)
  790. I see 1 and I'm going to do: predict-no
  791. ENV: Agent did: predict-no for direction U in state State-A
  792. In State-A moving U
  793. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  794. predict error 0
  795. dir: dir isL
  796. |\-109: O: O218 (predict-no)
  797. I see 1 and I'm going to do: predict-no
  798. ENV: Agent did: predict-no for direction L in state State-A
  799. In State-A moving L
  800. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  801. predict error 0
  802. dir: dir isL
  803. /|110: O: O220 (predict-no)
  804. I see 1 and I'm going to do: predict-no
  805. ENV: Agent did: predict-no for direction L in state State-A
  806. In State-A moving L
  807. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  808. predict error 0
  809. dir: dir isU
  810. \-/111: O: O222 (predict-no)
  811. I see 1 and I'm going to do: predict-no
  812. ENV: Agent did: predict-no for direction U in state State-A
  813. In State-A moving U
  814. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  815. predict error 0
  816. dir: dir isL
  817. rule alias: '*'
  818. |112: O: O224 (predict-no)
  819. I see 1 and I'm going to do: predict-no
  820. ENV: Agent did: predict-no for direction L in state State-A
  821. In State-A moving L
  822. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  823. predict error 0
  824. dir: dir isL
  825. \113: O: O226 (predict-no)
  826. I see 1 and I'm going to do: predict-no
  827. ENV: Agent did: predict-no for direction L in state State-A
  828. In State-A moving L
  829. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  830. predict error 0
  831. dir: dir isU
  832. -/114: O: O228 (predict-no)
  833. I see 1 and I'm going to do: predict-no
  834. ENV: Agent did: predict-no for direction U in state State-A
  835. In State-A moving U
  836. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  837. predict error 0
  838. dir: dir isR
  839. |\-115: O: O229 (predict-yes)
  840. I see 1 and I'm going to do: predict-yes
  841. ENV: Agent did: predict-yes for direction R in state State-A
  842. In State-A moving R
  843. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  844. predict error 0
  845. dir: dir isL
  846. /|\-116: O: O231 (predict-yes)
  847. I see 1 and I'm going to do: predict-yes
  848. ENV: Agent did: predict-yes for direction L in state State-B
  849. In State-B moving L
  850. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  851. predict error 0
  852. dir: dir isU
  853. /|117: O: O234 (predict-no)
  854. I see 1 and I'm going to do: predict-no
  855. ENV: Agent did: predict-no for direction U in state State-A
  856. In State-A moving U
  857. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  858. predict error 0
  859. dir: dir isL
  860. \-/118: O: O236 (predict-no)
  861. I see 1 and I'm going to do: predict-no
  862. ENV: Agent did: predict-no for direction L in state State-A
  863. In State-A moving L
  864. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  865. predict error 0
  866. dir: dir isL
  867. |119: O: O238 (predict-no)
  868. I see 1 and I'm going to do: predict-no
  869. ENV: Agent did: predict-no for direction L in state State-A
  870. In State-A moving L
  871. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  872. predict error 0
  873. dir: dir isR
  874. \-/120: O: O239 (predict-yes)
  875. I see 1 and I'm going to do: predict-yes
  876. ENV: Agent did: predict-yes for direction R in state State-A
  877. In State-A moving R
  878. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  879. predict error 0
  880. dir: dir isR
  881. |\-121: O: O241 (predict-yes)
  882. I see 1 and I'm going to do: predict-yes
  883. ENV: Agent did: predict-yes for direction R in state State-B
  884. In State-B moving R
  885. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  886. predict error 1
  887. dir: dir isU
  888. /122: O: O244 (predict-no)
  889. I see 0 and I'm going to do: predict-no
  890. ENV: Agent did: predict-no for direction U in state State-B
  891. In State-B moving U
  892. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  893. predict error 0
  894. dir: dir isL
  895. |\-123: O: O245 (predict-yes)
  896. I see 1 and I'm going to do: predict-yes
  897. ENV: Agent did: predict-yes for direction L in state State-B
  898. In State-B moving L
  899. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  900. predict error 0
  901. dir: dir isR
  902. /|124: O: O248 (predict-no)
  903. I see 1 and I'm going to do: predict-no
  904. ENV: Agent did: predict-no for direction R in state State-A
  905. In State-A moving R
  906. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  907. predict error 1
  908. dir: dir isL
  909. \-/125: O: O249 (predict-yes)
  910. I see 0 and I'm going to do: predict-yes
  911. ENV: Agent did: predict-yes for direction L in state State-B
  912. In State-B moving L
  913. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  914. predict error 0
  915. dir: dir isR
  916. |126: O: O251 (predict-yes)
  917. I see 1 and I'm going to do: predict-yes
  918. ENV: Agent did: predict-yes for direction R in state State-A
  919. In State-A moving R
  920. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  921. predict error 0
  922. dir: dir isU
  923. \-/127: O: O254 (predict-no)
  924. I see 1 and I'm going to do: predict-no
  925. ENV: Agent did: predict-no for direction U in state State-B
  926. In State-B moving U
  927. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  928. predict error 0
  929. dir: dir isU
  930. |\128: O: O256 (predict-no)
  931. I see 1 and I'm going to do: predict-no
  932. ENV: Agent did: predict-no for direction U in state State-B
  933. In State-B moving U
  934. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  935. predict error 0
  936. dir: dir isU
  937. -/|129: O: O258 (predict-no)
  938. I see 1 and I'm going to do: predict-no
  939. ENV: Agent did: predict-no for direction U in state State-B
  940. In State-B moving U
  941. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  942. predict error 0
  943. dir: dir isU
  944. \130: O: O259 (predict-yes)
  945. I see 1 and I'm going to do: predict-yes
  946. ENV: Agent did: predict-yes for direction U in state State-B
  947. In State-B moving U
  948. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  949. predict error 1
  950. dir: dir isU
  951. -/131: O: O262 (predict-no)
  952. I see 0 and I'm going to do: predict-no
  953. ENV: Agent did: predict-no for direction U in state State-B
  954. In State-B moving U
  955. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  956. predict error 0
  957. dir: dir isU
  958. |132: O: O264 (predict-no)
  959. I see 1 and I'm going to do: predict-no
  960. ENV: Agent did: predict-no for direction U in state State-B
  961. In State-B moving U
  962. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  963. predict error 0
  964. dir: dir isR
  965. \-/133: O: O265 (predict-yes)
  966. I see 1 and I'm going to do: predict-yes
  967. ENV: Agent did: predict-yes for direction R in state State-B
  968. In State-B moving R
  969. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  970. predict error 1
  971. dir: dir isL
  972. |\-134: O: O267 (predict-yes)
  973. I see 0 and I'm going to do: predict-yes
  974. ENV: Agent did: predict-yes for direction L in state State-B
  975. In State-B moving L
  976. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  977. predict error 0
  978. dir: dir isR
  979. /135: O: O269 (predict-yes)
  980. I see 1 and I'm going to do: predict-yes
  981. ENV: Agent did: predict-yes for direction R in state State-A
  982. In State-A moving R
  983. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  984. predict error 0
  985. dir: dir isL
  986. |\136: O: O271 (predict-yes)
  987. I see 1 and I'm going to do: predict-yes
  988. ENV: Agent did: predict-yes for direction L in state State-B
  989. In State-B moving L
  990. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  991. predict error 0
  992. dir: dir isL
  993. -137: O: O274 (predict-no)
  994. I see 1 and I'm going to do: predict-no
  995. ENV: Agent did: predict-no for direction L in state State-A
  996. In State-A moving L
  997. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  998. predict error 0
  999. dir: dir isR
  1000. /|\138: O: O275 (predict-yes)
  1001. I see 1 and I'm going to do: predict-yes
  1002. ENV: Agent did: predict-yes for direction R in state State-A
  1003. In State-A moving R
  1004. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1005. predict error 0
  1006. dir: dir isU
  1007. -/|139: O: O278 (predict-no)
  1008. I see 1 and I'm going to do: predict-no
  1009. ENV: Agent did: predict-no for direction U in state State-B
  1010. In State-B moving U
  1011. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1012. predict error 0
  1013. dir: dir isL
  1014. \-/140: O: O279 (predict-yes)
  1015. I see 1 and I'm going to do: predict-yes
  1016. ENV: Agent did: predict-yes for direction L in state State-B
  1017. In State-B moving L
  1018. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1019. predict error 0
  1020. dir: dir isR
  1021. |\-141: O: O281 (predict-yes)
  1022. I see 1 and I'm going to do: predict-yes
  1023. ENV: Agent did: predict-yes for direction R in state State-A
  1024. In State-A moving R
  1025. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1026. predict error 0
  1027. dir: dir isR
  1028. /142: O: O284 (predict-no)
  1029. I see 1 and I'm going to do: predict-no
  1030. ENV: Agent did: predict-no for direction R in state State-B
  1031. In State-B moving R
  1032. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1033. predict error 0
  1034. dir: dir isR
  1035. |\143: O: O286 (predict-no)
  1036. I see 1 and I'm going to do: predict-no
  1037. ENV: Agent did: predict-no for direction R in state State-B
  1038. In State-B moving R
  1039. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1040. predict error 0
  1041. dir: dir isL
  1042. -/144: O: O287 (predict-yes)
  1043. I see 1 and I'm going to do: predict-yes
  1044. ENV: Agent did: predict-yes for direction L in state State-B
  1045. In State-B moving L
  1046. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1047. predict error 0
  1048. dir: dir isU
  1049. |\-145: O: O290 (predict-no)
  1050. I see 1 and I'm going to do: predict-no
  1051. ENV: Agent did: predict-no for direction U in state State-A
  1052. In State-A moving U
  1053. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1054. predict error 0
  1055. dir: dir isL
  1056. /|146: O: O292 (predict-no)
  1057. I see 1 and I'm going to do: predict-no
  1058. ENV: Agent did: predict-no for direction L in state State-A
  1059. In State-A moving L
  1060. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1061. predict error 0
  1062. dir: dir isR
  1063. \-147: O: O293 (predict-yes)
  1064. I see 1 and I'm going to do: predict-yes
  1065. ENV: Agent did: predict-yes for direction R in state State-A
  1066. In State-A moving R
  1067. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1068. predict error 0
  1069. dir: dir isR
  1070. /|\148: O: O296 (predict-no)
  1071. I see 1 and I'm going to do: predict-no
  1072. ENV: Agent did: predict-no for direction R in state State-B
  1073. In State-B moving R
  1074. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1075. predict error 0
  1076. dir: dir isL
  1077. -/149: O: O297 (predict-yes)
  1078. I see 1 and I'm going to do: predict-yes
  1079. ENV: Agent did: predict-yes for direction L in state State-B
  1080. In State-B moving L
  1081. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1082. predict error 0
  1083. dir: dir isR
  1084. |\-150: O: O299 (predict-yes)
  1085. I see 1 and I'm going to do: predict-yes
  1086. ENV: Agent did: predict-yes for direction R in state State-A
  1087. In State-A moving R
  1088. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1089. predict error 0
  1090. dir: dir isL
  1091. /|\151: O: O301 (predict-yes)
  1092. I see 1 and I'm going to do: predict-yes
  1093. ENV: Agent did: predict-yes for direction L in state State-B
  1094. In State-B moving L
  1095. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1096. predict error 0
  1097. dir: dir isL
  1098. -152: O: O304 (predict-no)
  1099. I see 1 and I'm going to do: predict-no
  1100. ENV: Agent did: predict-no for direction L in state State-A
  1101. In State-A moving L
  1102. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1103. predict error 0
  1104. dir: dir isL
  1105. /|\153: O: O306 (predict-no)
  1106. I see 1 and I'm going to do: predict-no
  1107. ENV: Agent did: predict-no for direction L in state State-A
  1108. In State-A moving L
  1109. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1110. predict error 0
  1111. dir: dir isL
  1112. -/|154: O: O308 (predict-no)
  1113. I see 1 and I'm going to do: predict-no
  1114. ENV: Agent did: predict-no for direction L in state State-A
  1115. In State-A moving L
  1116. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1117. predict error 0
  1118. dir: dir isL
  1119. \-/155: O: O310 (predict-no)
  1120. I see 1 and I'm going to do: predict-no
  1121. ENV: Agent did: predict-no for direction L in state State-A
  1122. In State-A moving L
  1123. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1124. predict error 0
  1125. dir: dir isR
  1126. |156: O: O311 (predict-yes)
  1127. I see 1 and I'm going to do: predict-yes
  1128. ENV: Agent did: predict-yes for direction R in state State-A
  1129. In State-A moving R
  1130. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1131. predict error 0
  1132. dir: dir isR
  1133. \-157: O: O314 (predict-no)
  1134. I see 1 and I'm going to do: predict-no
  1135. ENV: Agent did: predict-no for direction R in state State-B
  1136. In State-B moving R
  1137. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1138. predict error 0
  1139. dir: dir isU
  1140. /|158: O: O316 (predict-no)
  1141. I see 1 and I'm going to do: predict-no
  1142. ENV: Agent did: predict-no for direction U in state State-B
  1143. In State-B moving U
  1144. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1145. predict error 0
  1146. dir: dir isU
  1147. \-159: O: O318 (predict-no)
  1148. I see 1 and I'm going to do: predict-no
  1149. ENV: Agent did: predict-no for direction U in state State-B
  1150. In State-B moving U
  1151. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1152. predict error 0
  1153. dir: dir isU
  1154. /|\160: O: O320 (predict-no)
  1155. I see 1 and I'm going to do: predict-no
  1156. ENV: Agent did: predict-no for direction U in state State-B
  1157. In State-B moving U
  1158. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1159. predict error 0
  1160. dir: dir isL
  1161. -/|161: O: O321 (predict-yes)
  1162. I see 1 and I'm going to do: predict-yes
  1163. ENV: Agent did: predict-yes for direction L in state State-B
  1164. In State-B moving L
  1165. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1166. predict error 0
  1167. dir: dir isR
  1168. \162: O: O323 (predict-yes)
  1169. I see 1 and I'm going to do: predict-yes
  1170. ENV: Agent did: predict-yes for direction R in state State-A
  1171. In State-A moving R
  1172. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1173. predict error 0
  1174. dir: dir isL
  1175. -/|163: O: O325 (predict-yes)
  1176. I see 1 and I'm going to do: predict-yes
  1177. ENV: Agent did: predict-yes for direction L in state State-B
  1178. In State-B moving L
  1179. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1180. predict error 0
  1181. dir: dir isR
  1182. \-164: O: O327 (predict-yes)
  1183. I see 1 and I'm going to do: predict-yes
  1184. ENV: Agent did: predict-yes for direction R in state State-A
  1185. In State-A moving R
  1186. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1187. predict error 0
  1188. dir: dir isR
  1189. /|\165: O: O329 (predict-yes)
  1190. I see 1 and I'm going to do: predict-yes
  1191. ENV: Agent did: predict-yes for direction R in state State-B
  1192. In State-B moving R
  1193. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1194. predict error 1
  1195. dir: dir isU
  1196. -/|166: O: O332 (predict-no)
  1197. I see 0 and I'm going to do: predict-no
  1198. ENV: Agent did: predict-no for direction U in state State-B
  1199. In State-B moving U
  1200. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1201. predict error 0
  1202. dir: dir isU
  1203. \-/167: O: O334 (predict-no)
  1204. I see 1 and I'm going to do: predict-no
  1205. ENV: Agent did: predict-no for direction U in state State-B
  1206. In State-B moving U
  1207. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1208. predict error 0
  1209. dir: dir isL
  1210. |\168: O: O335 (predict-yes)
  1211. I see 1 and I'm going to do: predict-yes
  1212. ENV: Agent did: predict-yes for direction L in state State-B
  1213. In State-B moving L
  1214. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1215. predict error 0
  1216. dir: dir isR
  1217. -/169: O: O337 (predict-yes)
  1218. I see 1 and I'm going to do: predict-yes
  1219. ENV: Agent did: predict-yes for direction R in state State-A
  1220. In State-A moving R
  1221. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1222. predict error 0
  1223. dir: dir isR
  1224. |\-170: O: O340 (predict-no)
  1225. I see 1 and I'm going to do: predict-no
  1226. ENV: Agent did: predict-no for direction R in state State-B
  1227. In State-B moving R
  1228. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1229. predict error 0
  1230. dir: dir isL
  1231. /|171: O: O342 (predict-no)
  1232. I see 1 and I'm going to do: predict-no
  1233. ENV: Agent did: predict-no for direction L in state State-B
  1234. In State-B moving L
  1235. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1236. predict error 1
  1237. dir: dir isL
  1238. \172: O: O344 (predict-no)
  1239. I see 0 and I'm going to do: predict-no
  1240. ENV: Agent did: predict-no for direction L in state State-A
  1241. In State-A moving L
  1242. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1243. predict error 0
  1244. dir: dir isR
  1245. -/173: O: O345 (predict-yes)
  1246. I see 1 and I'm going to do: predict-yes
  1247. ENV: Agent did: predict-yes for direction R in state State-A
  1248. In State-A moving R
  1249. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1250. predict error 0
  1251. dir: dir isL
  1252. |\174: O: O347 (predict-yes)
  1253. I see 1 and I'm going to do: predict-yes
  1254. ENV: Agent did: predict-yes for direction L in state State-B
  1255. In State-B moving L
  1256. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1257. predict error 0
  1258. dir: dir isU
  1259. -/|\175: O: O350 (predict-no)
  1260. I see 1 and I'm going to do: predict-no
  1261. ENV: Agent did: predict-no for direction U in state State-A
  1262. In State-A moving U
  1263. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1264. predict error 0
  1265. dir: dir isL
  1266. -/176: O: O352 (predict-no)
  1267. I see 1 and I'm going to do: predict-no
  1268. ENV: Agent did: predict-no for direction L in state State-A
  1269. In State-A moving L
  1270. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1271. predict error 0
  1272. dir: dir isL
  1273. |\-177: O: O354 (predict-no)
  1274. I see 1 and I'm going to do: predict-no
  1275. ENV: Agent did: predict-no for direction L in state State-A
  1276. In State-A moving L
  1277. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1278. predict error 0
  1279. dir: dir isL
  1280. /|178: O: O356 (predict-no)
  1281. I see 1 and I'm going to do: predict-no
  1282. ENV: Agent did: predict-no for direction L in state State-A
  1283. In State-A moving L
  1284. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1285. predict error 0
  1286. dir: dir isL
  1287. \179: O: O358 (predict-no)
  1288. I see 1 and I'm going to do: predict-no
  1289. ENV: Agent did: predict-no for direction L in state State-A
  1290. In State-A moving L
  1291. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1292. predict error 0
  1293. dir: dir isL
  1294. -/|180: O: O360 (predict-no)
  1295. I see 1 and I'm going to do: predict-no
  1296. ENV: Agent did: predict-no for direction L in state State-A
  1297. In State-A moving L
  1298. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1299. predict error 0
  1300. dir: dir isR
  1301. \181: O: O361 (predict-yes)
  1302. I see 1 and I'm going to do: predict-yes
  1303. ENV: Agent did: predict-yes for direction R in state State-A
  1304. In State-A moving R
  1305. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1306. predict error 0
  1307. dir: dir isR
  1308. -182: O: O364 (predict-no)
  1309. I see 1 and I'm going to do: predict-no
  1310. ENV: Agent did: predict-no for direction R in state State-B
  1311. In State-B moving R
  1312. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1313. predict error 0
  1314. dir: dir isL
  1315. /|\183: O: O365 (predict-yes)
  1316. I see 1 and I'm going to do: predict-yes
  1317. ENV: Agent did: predict-yes for direction L in state State-B
  1318. In State-B moving L
  1319. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1320. predict error 0
  1321. dir: dir isR
  1322. -184: O: O367 (predict-yes)
  1323. I see 1 and I'm going to do: predict-yes
  1324. ENV: Agent did: predict-yes for direction R in state State-A
  1325. In State-A moving R
  1326. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1327. predict error 0
  1328. dir: dir isL
  1329. /185: O: O369 (predict-yes)
  1330. I see 1 and I'm going to do: predict-yes
  1331. ENV: Agent did: predict-yes for direction L in state State-B
  1332. In State-B moving L
  1333. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1334. predict error 0
  1335. dir: dir isU
  1336. |\-186: O: O372 (predict-no)
  1337. I see 1 and I'm going to do: predict-no
  1338. ENV: Agent did: predict-no for direction U in state State-A
  1339. In State-A moving U
  1340. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1341. predict error 0
  1342. dir: dir isU
  1343. /|\187: O: O374 (predict-no)
  1344. I see 1 and I'm going to do: predict-no
  1345. ENV: Agent did: predict-no for direction U in state State-A
  1346. In State-A moving U
  1347. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1348. predict error 0
  1349. dir: dir isU
  1350. -/|188: O: O376 (predict-no)
  1351. I see 1 and I'm going to do: predict-no
  1352. ENV: Agent did: predict-no for direction U in state State-A
  1353. In State-A moving U
  1354. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1355. predict error 0
  1356. dir: dir isR
  1357. \-/189: O: O378 (predict-no)
  1358. I see 1 and I'm going to do: predict-no
  1359. ENV: Agent did: predict-no for direction R in state State-A
  1360. In State-A moving R
  1361. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1362. predict error 1
  1363. dir: dir isR
  1364. |\-190: O: O380 (predict-no)
  1365. I see 0 and I'm going to do: predict-no
  1366. ENV: Agent did: predict-no for direction R in state State-B
  1367. In State-B moving R
  1368. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1369. predict error 0
  1370. dir: dir isR
  1371. /|191: O: O382 (predict-no)
  1372. I see 1 and I'm going to do: predict-no
  1373. ENV: Agent did: predict-no for direction R in state State-B
  1374. In State-B moving R
  1375. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1376. predict error 0
  1377. dir: dir isL
  1378. \192: O: O383 (predict-yes)
  1379. I see 1 and I'm going to do: predict-yes
  1380. ENV: Agent did: predict-yes for direction L in state State-B
  1381. In State-B moving L
  1382. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1383. predict error 0
  1384. dir: dir isR
  1385. -/|193: O: O385 (predict-yes)
  1386. I see 1 and I'm going to do: predict-yes
  1387. ENV: Agent did: predict-yes for direction R in state State-A
  1388. In State-A moving R
  1389. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1390. predict error 0
  1391. dir: dir isR
  1392. \194: O: O388 (predict-no)
  1393. I see 1 and I'm going to do: predict-no
  1394. ENV: Agent did: predict-no for direction R in state State-B
  1395. In State-B moving R
  1396. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1397. predict error 0
  1398. dir: dir isL
  1399. -/|195: O: O389 (predict-yes)
  1400. I see 1 and I'm going to do: predict-yes
  1401. ENV: Agent did: predict-yes for direction L in state State-B
  1402. In State-B moving L
  1403. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1404. predict error 0
  1405. dir: dir isL
  1406. \-/196: O: O392 (predict-no)
  1407. I see 1 and I'm going to do: predict-no
  1408. ENV: Agent did: predict-no for direction L in state State-A
  1409. In State-A moving L
  1410. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1411. predict error 0
  1412. dir: dir isU
  1413. |\-197: O: O394 (predict-no)
  1414. I see 1 and I'm going to do: predict-no
  1415. ENV: Agent did: predict-no for direction U in state State-A
  1416. In State-A moving U
  1417. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1418. predict error 0
  1419. dir: dir isR
  1420. /|\198: O: O395 (predict-yes)
  1421. I see 1 and I'm going to do: predict-yes
  1422. ENV: Agent did: predict-yes for direction R in state State-A
  1423. In State-A moving R
  1424. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1425. predict error 0
  1426. dir: dir isR
  1427. -/199: O: O398 (predict-no)
  1428. I see 1 and I'm going to do: predict-no
  1429. ENV: Agent did: predict-no for direction R in state State-B
  1430. In State-B moving R
  1431. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1432. predict error 0
  1433. dir: dir isU
  1434. |\-200: O: O400 (predict-no)
  1435. I see 1 and I'm going to do: predict-no
  1436. ENV: Agent did: predict-no for direction U in state State-B
  1437. In State-B moving U
  1438. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1439. predict error 0
  1440. dir: dir isR
  1441. /|\-/|201: O: O402 (predict-no)
  1442. I see 1 and I'm going to do: predict-no
  1443. ENV: Agent did: predict-no for direction R in state State-B
  1444. In State-B moving R
  1445. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1446. predict error 0
  1447. dir: dir isL
  1448. \202: O: O403 (predict-yes)
  1449. I see 1 and I'm going to do: predict-yes
  1450. ENV: Agent did: predict-yes for direction L in state State-B
  1451. In State-B moving L
  1452. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1453. predict error 0
  1454. dir: dir isU
  1455. -203: O: O406 (predict-no)
  1456. I see 1 and I'm going to do: predict-no
  1457. ENV: Agent did: predict-no for direction U in state State-A
  1458. In State-A moving U
  1459. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1460. predict error 0
  1461. dir: dir isR
  1462. /|\204: O: O407 (predict-yes)
  1463. I see 1 and I'm going to do: predict-yes
  1464. ENV: Agent did: predict-yes for direction R in state State-A
  1465. In State-A moving R
  1466. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1467. predict error 0
  1468. dir: dir isL
  1469. -/|205: O: O409 (predict-yes)
  1470. I see 1 and I'm going to do: predict-yes
  1471. ENV: Agent did: predict-yes for direction L in state State-B
  1472. In State-B moving L
  1473. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1474. predict error 0
  1475. dir: dir isU
  1476. \206: O: O412 (predict-no)
  1477. I see 1 and I'm going to do: predict-no
  1478. ENV: Agent did: predict-no for direction U in state State-A
  1479. In State-A moving U
  1480. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1481. predict error 0
  1482. dir: dir isL
  1483. -/|207: O: O414 (predict-no)
  1484. I see 1 and I'm going to do: predict-no
  1485. ENV: Agent did: predict-no for direction L in state State-A
  1486. In State-A moving L
  1487. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1488. predict error 0
  1489. dir: dir isL
  1490. \-/208: O: O415 (predict-yes)
  1491. I see 1 and I'm going to do: predict-yes
  1492. ENV: Agent did: predict-yes for direction L in state State-A
  1493. In State-A moving L
  1494. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1495. predict error 1
  1496. dir: dir isU
  1497. |\209: O: O418 (predict-no)
  1498. I see 0 and I'm going to do: predict-no
  1499. ENV: Agent did: predict-no for direction U in state State-A
  1500. In State-A moving U
  1501. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1502. predict error 0
  1503. dir: dir isU
  1504. -/210: O: O420 (predict-no)
  1505. I see 1 and I'm going to do: predict-no
  1506. ENV: Agent did: predict-no for direction U in state State-A
  1507. In State-A moving U
  1508. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1509. predict error 0
  1510. dir: dir isL
  1511. |\211: O: O422 (predict-no)
  1512. I see 1 and I'm going to do: predict-no
  1513. ENV: Agent did: predict-no for direction L in state State-A
  1514. In State-A moving L
  1515. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1516. predict error 0
  1517. dir: dir isR
  1518. -212: O: O423 (predict-yes)
  1519. I see 1 and I'm going to do: predict-yes
  1520. ENV: Agent did: predict-yes for direction R in state State-A
  1521. In State-A moving R
  1522. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1523. predict error 0
  1524. dir: dir isR
  1525. /|\213: O: O426 (predict-no)
  1526. I see 1 and I'm going to do: predict-no
  1527. ENV: Agent did: predict-no for direction R in state State-B
  1528. In State-B moving R
  1529. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1530. predict error 0
  1531. dir: dir isR
  1532. -214: O: O428 (predict-no)
  1533. I see 1 and I'm going to do: predict-no
  1534. ENV: Agent did: predict-no for direction R in state State-B
  1535. In State-B moving R
  1536. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1537. predict error 0
  1538. dir: dir isL
  1539. /|\215: O: O429 (predict-yes)
  1540. I see 1 and I'm going to do: predict-yes
  1541. ENV: Agent did: predict-yes for direction L in state State-B
  1542. In State-B moving L
  1543. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1544. predict error 0
  1545. dir: dir isR
  1546. -/216: O: O431 (predict-yes)
  1547. I see 1 and I'm going to do: predict-yes
  1548. ENV: Agent did: predict-yes for direction R in state State-A
  1549. In State-A moving R
  1550. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1551. predict error 0
  1552. dir: dir isL
  1553. |\-217: O: O433 (predict-yes)
  1554. I see 1 and I'm going to do: predict-yes
  1555. ENV: Agent did: predict-yes for direction L in state State-B
  1556. In State-B moving L
  1557. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1558. predict error 0
  1559. dir: dir isL
  1560. /|\218: O: O436 (predict-no)
  1561. I see 1 and I'm going to do: predict-no
  1562. ENV: Agent did: predict-no for direction L in state State-A
  1563. In State-A moving L
  1564. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1565. predict error 0
  1566. dir: dir isR
  1567. -/|219: O: O437 (predict-yes)
  1568. I see 1 and I'm going to do: predict-yes
  1569. ENV: Agent did: predict-yes for direction R in state State-A
  1570. In State-A moving R
  1571. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1572. predict error 0
  1573. dir: dir isU
  1574. \-220: O: O440 (predict-no)
  1575. I see 1 and I'm going to do: predict-no
  1576. ENV: Agent did: predict-no for direction U in state State-B
  1577. In State-B moving U
  1578. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1579. predict error 0
  1580. dir: dir isL
  1581. /|\221: O: O441 (predict-yes)
  1582. I see 1 and I'm going to do: predict-yes
  1583. ENV: Agent did: predict-yes for direction L in state State-B
  1584. In State-B moving L
  1585. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1586. predict error 0
  1587. dir: dir isU
  1588. -222: O: O444 (predict-no)
  1589. I see 1 and I'm going to do: predict-no
  1590. ENV: Agent did: predict-no for direction U in state State-A
  1591. In State-A moving U
  1592. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1593. predict error 0
  1594. dir: dir isL
  1595. /|223: O: O446 (predict-no)
  1596. I see 1 and I'm going to do: predict-no
  1597. ENV: Agent did: predict-no for direction L in state State-A
  1598. In State-A moving L
  1599. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1600. predict error 0
  1601. dir: dir isR
  1602. \-224: O: O447 (predict-yes)
  1603. I see 1 and I'm going to do: predict-yes
  1604. ENV: Agent did: predict-yes for direction R in state State-A
  1605. In State-A moving R
  1606. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1607. predict error 0
  1608. dir: dir isR
  1609. /|225: O: O449 (predict-yes)
  1610. I see 1 and I'm going to do: predict-yes
  1611. ENV: Agent did: predict-yes for direction R in state State-B
  1612. In State-B moving R
  1613. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1614. predict error 1
  1615. dir: dir isR
  1616. \-/226: O: O452 (predict-no)
  1617. I see 0 and I'm going to do: predict-no
  1618. ENV: Agent did: predict-no for direction R in state State-B
  1619. In State-B moving R
  1620. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1621. predict error 0
  1622. dir: dir isU
  1623. |\227: O: O454 (predict-no)
  1624. I see 1 and I'm going to do: predict-no
  1625. ENV: Agent did: predict-no for direction U in state State-B
  1626. In State-B moving U
  1627. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1628. predict error 0
  1629. dir: dir isR
  1630. -/|228: O: O456 (predict-no)
  1631. I see 1 and I'm going to do: predict-no
  1632. ENV: Agent did: predict-no for direction R in state State-B
  1633. In State-B moving R
  1634. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1635. predict error 0
  1636. dir: dir isL
  1637. \229: O: O457 (predict-yes)
  1638. I see 1 and I'm going to do: predict-yes
  1639. ENV: Agent did: predict-yes for direction L in state State-B
  1640. In State-B moving L
  1641. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1642. predict error 0
  1643. dir: dir isL
  1644. -/|230: O: O460 (predict-no)
  1645. I see 1 and I'm going to do: predict-no
  1646. ENV: Agent did: predict-no for direction L in state State-A
  1647. In State-A moving L
  1648. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1649. predict error 0
  1650. dir: dir isL
  1651. \-231: O: O462 (predict-no)
  1652. I see 1 and I'm going to do: predict-no
  1653. ENV: Agent did: predict-no for direction L in state State-A
  1654. In State-A moving L
  1655. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1656. predict error 0
  1657. dir: dir isU
  1658. /232: O: O464 (predict-no)
  1659. I see 1 and I'm going to do: predict-no
  1660. ENV: Agent did: predict-no for direction U in state State-A
  1661. In State-A moving U
  1662. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1663. predict error 0
  1664. dir: dir isR
  1665. |\233: O: O465 (predict-yes)
  1666. I see 1 and I'm going to do: predict-yes
  1667. ENV: Agent did: predict-yes for direction R in state State-A
  1668. In State-A moving R
  1669. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1670. predict error 0
  1671. dir: dir isU
  1672. -/|234: O: O468 (predict-no)
  1673. I see 1 and I'm going to do: predict-no
  1674. ENV: Agent did: predict-no for direction U in state State-B
  1675. In State-B moving U
  1676. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1677. predict error 0
  1678. dir: dir isU
  1679. \-235: O: O470 (predict-no)
  1680. I see 1 and I'm going to do: predict-no
  1681. ENV: Agent did: predict-no for direction U in state State-B
  1682. In State-B moving U
  1683. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1684. predict error 0
  1685. dir: dir isL
  1686. /|236: O: O471 (predict-yes)
  1687. I see 1 and I'm going to do: predict-yes
  1688. ENV: Agent did: predict-yes for direction L in state State-B
  1689. In State-B moving L
  1690. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1691. predict error 0
  1692. dir: dir isR
  1693. \-/237: O: O474 (predict-no)
  1694. I see 1 and I'm going to do: predict-no
  1695. ENV: Agent did: predict-no for direction R in state State-A
  1696. In State-A moving R
  1697. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1698. predict error 1
  1699. dir: dir isU
  1700. |\238: O: O476 (predict-no)
  1701. I see 0 and I'm going to do: predict-no
  1702. ENV: Agent did: predict-no for direction U in state State-B
  1703. In State-B moving U
  1704. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1705. predict error 0
  1706. dir: dir isU
  1707. -/|\239: O: O478 (predict-no)
  1708. I see 1 and I'm going to do: predict-no
  1709. ENV: Agent did: predict-no for direction U in state State-B
  1710. In State-B moving U
  1711. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1712. predict error 0
  1713. dir: dir isR
  1714. -/240: O: O480 (predict-no)
  1715. I see 1 and I'm going to do: predict-no
  1716. ENV: Agent did: predict-no for direction R in state State-B
  1717. In State-B moving R
  1718. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1719. predict error 0
  1720. dir: dir isR
  1721. |\-241: O: O481 (predict-yes)
  1722. I see 1 and I'm going to do: predict-yes
  1723. ENV: Agent did: predict-yes for direction R in state State-B
  1724. In State-B moving R
  1725. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1726. predict error 1
  1727. dir: dir isR
  1728. /242: O: O484 (predict-no)
  1729. I see 0 and I'm going to do: predict-no
  1730. ENV: Agent did: predict-no for direction R in state State-B
  1731. In State-B moving R
  1732. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1733. predict error 0
  1734. dir: dir isU
  1735. |\-243: O: O486 (predict-no)
  1736. I see 1 and I'm going to do: predict-no
  1737. ENV: Agent did: predict-no for direction U in state State-B
  1738. In State-B moving U
  1739. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1740. predict error 0
  1741. dir: dir isL
  1742. /|\244: O: O487 (predict-yes)
  1743. I see 1 and I'm going to do: predict-yes
  1744. ENV: Agent did: predict-yes for direction L in state State-B
  1745. In State-B moving L
  1746. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1747. predict error 0
  1748. dir: dir isR
  1749. -/245: O: O489 (predict-yes)
  1750. I see 1 and I'm going to do: predict-yes
  1751. ENV: Agent did: predict-yes for direction R in state State-A
  1752. In State-A moving R
  1753. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1754. predict error 0
  1755. dir: dir isR
  1756. |\-246: O: O491 (predict-yes)
  1757. I see 1 and I'm going to do: predict-yes
  1758. ENV: Agent did: predict-yes for direction R in state State-B
  1759. In State-B moving R
  1760. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1761. predict error 1
  1762. dir: dir isU
  1763. /|247: O: O494 (predict-no)
  1764. I see 0 and I'm going to do: predict-no
  1765. ENV: Agent did: predict-no for direction U in state State-B
  1766. In State-B moving U
  1767. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1768. predict error 0
  1769. dir: dir isU
  1770. \-/248: O: O496 (predict-no)
  1771. I see 1 and I'm going to do: predict-no
  1772. ENV: Agent did: predict-no for direction U in state State-B
  1773. In State-B moving U
  1774. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1775. predict error 0
  1776. dir: dir isU
  1777. |\-249: O: O498 (predict-no)
  1778. I see 1 and I'm going to do: predict-no
  1779. ENV: Agent did: predict-no for direction U in state State-B
  1780. In State-B moving U
  1781. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1782. predict error 0
  1783. dir: dir isU
  1784. /|\250: O: O500 (predict-no)
  1785. I see 1 and I'm going to do: predict-no
  1786. ENV: Agent did: predict-no for direction U in state State-B
  1787. In State-B moving U
  1788. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1789. predict error 0
  1790. dir: dir isL
  1791. -/|251: O: O501 (predict-yes)
  1792. I see 1 and I'm going to do: predict-yes
  1793. ENV: Agent did: predict-yes for direction L in state State-B
  1794. In State-B moving L
  1795. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1796. predict error 0
  1797. dir: dir isL
  1798. \252: O: O504 (predict-no)
  1799. I see 1 and I'm going to do: predict-no
  1800. ENV: Agent did: predict-no for direction L in state State-A
  1801. In State-A moving L
  1802. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1803. predict error 0
  1804. dir: dir isR
  1805. -/|253: O: O506 (predict-no)
  1806. I see 1 and I'm going to do: predict-no
  1807. ENV: Agent did: predict-no for direction R in state State-A
  1808. In State-A moving R
  1809. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1810. predict error 1
  1811. dir: dir isL
  1812. \-/254: O: O508 (predict-no)
  1813. I see 0 and I'm going to do: predict-no
  1814. ENV: Agent did: predict-no for direction L in state State-B
  1815. In State-B moving L
  1816. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1817. predict error 1
  1818. dir: dir isR
  1819. |\255: O: O509 (predict-yes)
  1820. I see 0 and I'm going to do: predict-yes
  1821. ENV: Agent did: predict-yes for direction R in state State-A
  1822. In State-A moving R
  1823. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1824. predict error 0
  1825. dir: dir isU
  1826. -/256: O: O511 (predict-yes)
  1827. I see 1 and I'm going to do: predict-yes
  1828. ENV: Agent did: predict-yes for direction U in state State-B
  1829. In State-B moving U
  1830. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1831. predict error 1
  1832. dir: dir isU
  1833. |\-257: O: O514 (predict-no)
  1834. I see 0 and I'm going to do: predict-no
  1835. ENV: Agent did: predict-no for direction U in state State-B
  1836. In State-B moving U
  1837. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1838. predict error 0
  1839. dir: dir isR
  1840. /|\258: O: O516 (predict-no)
  1841. I see 1 and I'm going to do: predict-no
  1842. ENV: Agent did: predict-no for direction R in state State-B
  1843. In State-B moving R
  1844. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1845. predict error 0
  1846. dir: dir isU
  1847. -/|259: O: O517 (predict-yes)
  1848. I see 1 and I'm going to do: predict-yes
  1849. ENV: Agent did: predict-yes for direction U in state State-B
  1850. In State-B moving U
  1851. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1852. predict error 1
  1853. dir: dir isL
  1854. \-/260: O: O519 (predict-yes)
  1855. I see 0 and I'm going to do: predict-yes
  1856. ENV: Agent did: predict-yes for direction L in state State-B
  1857. In State-B moving L
  1858. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1859. predict error 0
  1860. dir: dir isR
  1861. |261: O: O521 (predict-yes)
  1862. I see 1 and I'm going to do: predict-yes
  1863. ENV: Agent did: predict-yes for direction R in state State-A
  1864. In State-A moving R
  1865. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1866. predict error 0
  1867. dir: dir isU
  1868. \262: O: O524 (predict-no)
  1869. I see 1 and I'm going to do: predict-no
  1870. ENV: Agent did: predict-no for direction U in state State-B
  1871. In State-B moving U
  1872. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1873. predict error 0
  1874. dir: dir isL
  1875. -/|263: O: O525 (predict-yes)
  1876. I see 1 and I'm going to do: predict-yes
  1877. ENV: Agent did: predict-yes for direction L in state State-B
  1878. In State-B moving L
  1879. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1880. predict error 0
  1881. dir: dir isR
  1882. \-/264: O: O527 (predict-yes)
  1883. I see 1 and I'm going to do: predict-yes
  1884. ENV: Agent did: predict-yes for direction R in state State-A
  1885. In State-A moving R
  1886. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1887. predict error 0
  1888. dir: dir isL
  1889. |\-265: O: O529 (predict-yes)
  1890. I see 1 and I'm going to do: predict-yes
  1891. ENV: Agent did: predict-yes for direction L in state State-B
  1892. In State-B moving L
  1893. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1894. predict error 0
  1895. dir: dir isL
  1896. /266: O: O532 (predict-no)
  1897. I see 1 and I'm going to do: predict-no
  1898. ENV: Agent did: predict-no for direction L in state State-A
  1899. In State-A moving L
  1900. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1901. predict error 0
  1902. dir: dir isU
  1903. |\-267: O: O534 (predict-no)
  1904. I see 1 and I'm going to do: predict-no
  1905. ENV: Agent did: predict-no for direction U in state State-A
  1906. In State-A moving U
  1907. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1908. predict error 0
  1909. dir: dir isU
  1910. /|\268: O: O536 (predict-no)
  1911. I see 1 and I'm going to do: predict-no
  1912. ENV: Agent did: predict-no for direction U in state State-A
  1913. In State-A moving U
  1914. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1915. predict error 0
  1916. dir: dir isR
  1917. -/|269: O: O537 (predict-yes)
  1918. I see 1 and I'm going to do: predict-yes
  1919. ENV: Agent did: predict-yes for direction R in state State-A
  1920. In State-A moving R
  1921. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1922. predict error 0
  1923. dir: dir isL
  1924. \-270: O: O539 (predict-yes)
  1925. I see 1 and I'm going to do: predict-yes
  1926. ENV: Agent did: predict-yes for direction L in state State-B
  1927. In State-B moving L
  1928. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1929. predict error 0
  1930. dir: dir isL
  1931. /|\271: O: O542 (predict-no)
  1932. I see 1 and I'm going to do: predict-no
  1933. ENV: Agent did: predict-no for direction L in state State-A
  1934. In State-A moving L
  1935. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1936. predict error 0
  1937. dir: dir isL
  1938. -272: O: O544 (predict-no)
  1939. I see 1 and I'm going to do: predict-no
  1940. ENV: Agent did: predict-no for direction L in state State-A
  1941. In State-A moving L
  1942. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1943. predict error 0
  1944. dir: dir isU
  1945. /|\273: O: O546 (predict-no)
  1946. I see 1 and I'm going to do: predict-no
  1947. ENV: Agent did: predict-no for direction U in state State-A
  1948. In State-A moving U
  1949. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1950. predict error 0
  1951. dir: dir isU
  1952. -/|274: O: O548 (predict-no)
  1953. I see 1 and I'm going to do: predict-no
  1954. ENV: Agent did: predict-no for direction U in state State-A
  1955. In State-A moving U
  1956. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1957. predict error 0
  1958. dir: dir isR
  1959. \-/275: O: O549 (predict-yes)
  1960. I see 1 and I'm going to do: predict-yes
  1961. ENV: Agent did: predict-yes for direction R in state State-A
  1962. In State-A moving R
  1963. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1964. predict error 0
  1965. dir: dir isR
  1966. |\-276: O: O552 (predict-no)
  1967. I see 1 and I'm going to do: predict-no
  1968. ENV: Agent did: predict-no for direction R in state State-B
  1969. In State-B moving R
  1970. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1971. predict error 0
  1972. dir: dir isL
  1973. /277: O: O553 (predict-yes)
  1974. I see 1 and I'm going to do: predict-yes
  1975. ENV: Agent did: predict-yes for direction L in state State-B
  1976. In State-B moving L
  1977. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1978. predict error 0
  1979. dir: dir isR
  1980. |\278: O: O555 (predict-yes)
  1981. I see 1 and I'm going to do: predict-yes
  1982. ENV: Agent did: predict-yes for direction R in state State-A
  1983. In State-A moving R
  1984. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1985. predict error 0
  1986. dir: dir isU
  1987. -279: O: O558 (predict-no)
  1988. I see 1 and I'm going to do: predict-no
  1989. ENV: Agent did: predict-no for direction U in state State-B
  1990. In State-B moving U
  1991. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1992. predict error 0
  1993. dir: dir isU
  1994. /|\280: O: O560 (predict-no)
  1995. I see 1 and I'm going to do: predict-no
  1996. ENV: Agent did: predict-no for direction U in state State-B
  1997. In State-B moving U
  1998. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1999. predict error 0
  2000. dir: dir isL
  2001. -/|281: O: O561 (predict-yes)
  2002. I see 1 and I'm going to do: predict-yes
  2003. ENV: Agent did: predict-yes for direction L in state State-B
  2004. In State-B moving L
  2005. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2006. predict error 0
  2007. dir: dir isR
  2008. \282: O: O563 (predict-yes)
  2009. I see 1 and I'm going to do: predict-yes
  2010. ENV: Agent did: predict-yes for direction R in state State-A
  2011. In State-A moving R
  2012. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2013. predict error 0
  2014. dir: dir isU
  2015. -283: O: O565 (predict-yes)
  2016. I see 1 and I'm going to do: predict-yes
  2017. ENV: Agent did: predict-yes for direction U in state State-B
  2018. In State-B moving U
  2019. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2020. predict error 1
  2021. dir: dir isL
  2022. /|\284: O: O567 (predict-yes)
  2023. I see 0 and I'm going to do: predict-yes
  2024. ENV: Agent did: predict-yes for direction L in state State-B
  2025. In State-B moving L
  2026. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2027. predict error 0
  2028. dir: dir isU
  2029. -/|285: O: O569 (predict-yes)
  2030. I see 1 and I'm going to do: predict-yes
  2031. ENV: Agent did: predict-yes for direction U in state State-A
  2032. In State-A moving U
  2033. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2034. predict error 1
  2035. dir: dir isR
  2036. \-/286: O: O572 (predict-no)
  2037. I see 0 and I'm going to do: predict-no
  2038. ENV: Agent did: predict-no for direction R in state State-A
  2039. In State-A moving R
  2040. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2041. predict error 1
  2042. dir: dir isU
  2043. |\-287: O: O574 (predict-no)
  2044. I see 0 and I'm going to do: predict-no
  2045. ENV: Agent did: predict-no for direction U in state State-B
  2046. In State-B moving U
  2047. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2048. predict error 0
  2049. dir: dir isR
  2050. /|\288: O: O576 (predict-no)
  2051. I see 1 and I'm going to do: predict-no
  2052. ENV: Agent did: predict-no for direction R in state State-B
  2053. In State-B moving R
  2054. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2055. predict error 0
  2056. dir: dir isU
  2057. -289: O: O578 (predict-no)
  2058. I see 1 and I'm going to do: predict-no
  2059. ENV: Agent did: predict-no for direction U in state State-B
  2060. In State-B moving U
  2061. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2062. predict error 0
  2063. dir: dir isU
  2064. /|\290: O: O580 (predict-no)
  2065. I see 1 and I'm going to do: predict-no
  2066. ENV: Agent did: predict-no for direction U in state State-B
  2067. In State-B moving U
  2068. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2069. predict error 0
  2070. dir: dir isR
  2071. -/|291: O: O582 (predict-no)
  2072. I see 1 and I'm going to do: predict-no
  2073. ENV: Agent did: predict-no for direction R in state State-B
  2074. In State-B moving R
  2075. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2076. predict error 0
  2077. dir: dir isL
  2078. \292: O: O583 (predict-yes)
  2079. I see 1 and I'm going to do: predict-yes
  2080. ENV: Agent did: predict-yes for direction L in state State-B
  2081. In State-B moving L
  2082. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2083. predict error 0
  2084. dir: dir isR
  2085. -293: O: O585 (predict-yes)
  2086. I see 1 and I'm going to do: predict-yes
  2087. ENV: Agent did: predict-yes for direction R in state State-A
  2088. In State-A moving R
  2089. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2090. predict error 0
  2091. dir: dir isL
  2092. /|\294: O: O587 (predict-yes)
  2093. I see 1 and I'm going to do: predict-yes
  2094. ENV: Agent did: predict-yes for direction L in state State-B
  2095. In State-B moving L
  2096. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2097. predict error 0
  2098. dir: dir isR
  2099. -/295: O: O589 (predict-yes)
  2100. I see 1 and I'm going to do: predict-yes
  2101. ENV: Agent did: predict-yes for direction R in state State-A
  2102. In State-A moving R
  2103. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2104. predict error 0
  2105. dir: dir isU
  2106. |296: O: O592 (predict-no)
  2107. I see 1 and I'm going to do: predict-no
  2108. ENV: Agent did: predict-no for direction U in state State-B
  2109. In State-B moving U
  2110. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2111. predict error 0
  2112. dir: dir isL
  2113. \-/297: O: O593 (predict-yes)
  2114. I see 1 and I'm going to do: predict-yes
  2115. ENV: Agent did: predict-yes for direction L in state State-B
  2116. In State-B moving L
  2117. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2118. predict error 0
  2119. dir: dir isR
  2120. |\-298: O: O595 (predict-yes)
  2121. I see 1 and I'm going to do: predict-yes
  2122. ENV: Agent did: predict-yes for direction R in state State-A
  2123. In State-A moving R
  2124. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2125. predict error 0
  2126. dir: dir isR
  2127. /299: O: O598 (predict-no)
  2128. I see 1 and I'm going to do: predict-no
  2129. ENV: Agent did: predict-no for direction R in state State-B
  2130. In State-B moving R
  2131. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2132. predict error 0
  2133. dir: dir isR
  2134. |\-300: O: O599 (predict-yes)
  2135. I see 1 and I'm going to do: predict-yes
  2136. ENV: Agent did: predict-yes for direction R in state State-B
  2137. In State-B moving R
  2138. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2139. predict error 1
  2140. dir: dir isU
  2141. /|\301: O: O602 (predict-no)
  2142. I see 0 and I'm going to do: predict-no
  2143. ENV: Agent did: predict-no for direction U in state State-B
  2144. In State-B moving U
  2145. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2146. predict error 0
  2147. dir: dir isU
  2148. -302: O: O604 (predict-no)
  2149. I see 1 and I'm going to do: predict-no
  2150. ENV: Agent did: predict-no for direction U in state State-B
  2151. In State-B moving U
  2152. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2153. predict error 0
  2154. dir: dir isU
  2155. /303: O: O606 (predict-no)
  2156. I see 1 and I'm going to do: predict-no
  2157. ENV: Agent did: predict-no for direction U in state State-B
  2158. In State-B moving U
  2159. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2160. predict error 0
  2161. dir: dir isR
  2162. |\-304: O: O608 (predict-no)
  2163. I see 1 and I'm going to do: predict-no
  2164. ENV: Agent did: predict-no for direction R in state State-B
  2165. In State-B moving R
  2166. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2167. predict error 0
  2168. dir: dir isU
  2169. /|\305: O: O610 (predict-no)
  2170. I see 1 and I'm going to do: predict-no
  2171. ENV: Agent did: predict-no for direction U in state State-B
  2172. In State-B moving U
  2173. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2174. predict error 0
  2175. dir: dir isL
  2176. -/306: O: O611 (predict-yes)
  2177. I see 1 and I'm going to do: predict-yes
  2178. ENV: Agent did: predict-yes for direction L in state State-B
  2179. In State-B moving L
  2180. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2181. predict error 0
  2182. dir: dir isU
  2183. |\-307: O: O614 (predict-no)
  2184. I see 1 and I'm going to do: predict-no
  2185. ENV: Agent did: predict-no for direction U in state State-A
  2186. In State-A moving U
  2187. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2188. predict error 0
  2189. dir: dir isR
  2190. /|\308: O: O615 (predict-yes)
  2191. I see 1 and I'm going to do: predict-yes
  2192. ENV: Agent did: predict-yes for direction R in state State-A
  2193. In State-A moving R
  2194. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2195. predict error 0
  2196. dir: dir isU
  2197. -/309: O: O617 (predict-yes)
  2198. I see 1 and I'm going to do: predict-yes
  2199. ENV: Agent did: predict-yes for direction U in state State-B
  2200. In State-B moving U
  2201. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2202. predict error 1
  2203. dir: dir isU
  2204. |\310: O: O620 (predict-no)
  2205. I see 0 and I'm going to do: predict-no
  2206. ENV: Agent did: predict-no for direction U in state State-B
  2207. In State-B moving U
  2208. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2209. predict error 0
  2210. dir: dir isL
  2211. -/|311: O: O621 (predict-yes)
  2212. I see 1 and I'm going to do: predict-yes
  2213. ENV: Agent did: predict-yes for direction L in state State-B
  2214. In State-B moving L
  2215. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2216. predict error 0
  2217. dir: dir isR
  2218. \312: O: O624 (predict-no)
  2219. I see 1 and I'm going to do: predict-no
  2220. ENV: Agent did: predict-no for direction R in state State-A
  2221. In State-A moving R
  2222. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2223. predict error 1
  2224. dir: dir isR
  2225. -/|313: O: O626 (predict-no)
  2226. I see 0 and I'm going to do: predict-no
  2227. ENV: Agent did: predict-no for direction R in state State-B
  2228. In State-B moving R
  2229. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2230. predict error 0
  2231. dir: dir isU
  2232. \-314: O: O628 (predict-no)
  2233. I see 1 and I'm going to do: predict-no
  2234. ENV: Agent did: predict-no for direction U in state State-B
  2235. In State-B moving U
  2236. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2237. predict error 0
  2238. dir: dir isU
  2239. /|\315: O: O630 (predict-no)
  2240. I see 1 and I'm going to do: predict-no
  2241. ENV: Agent did: predict-no for direction U in state State-B
  2242. In State-B moving U
  2243. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2244. predict error 0
  2245. dir: dir isR
  2246. -316: O: O632 (predict-no)
  2247. I see 1 and I'm going to do: predict-no
  2248. ENV: Agent did: predict-no for direction R in state State-B
  2249. In State-B moving R
  2250. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2251. predict error 0
  2252. dir: dir isL
  2253. /|\317: O: O634 (predict-no)
  2254. I see 1 and I'm going to do: predict-no
  2255. ENV: Agent did: predict-no for direction L in state State-B
  2256. In State-B moving L
  2257. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  2258. predict error 1
  2259. dir: dir isR
  2260. -/318: O: O635 (predict-yes)
  2261. I see 0 and I'm going to do: predict-yes
  2262. ENV: Agent did: predict-yes for direction R in state State-A
  2263. In State-A moving R
  2264. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2265. predict error 0
  2266. dir: dir isU
  2267. |\-319: O: O638 (predict-no)
  2268. I see 1 and I'm going to do: predict-no
  2269. ENV: Agent did: predict-no for direction U in state State-B
  2270. In State-B moving U
  2271. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2272. predict error 0
  2273. dir: dir isU
  2274. /|320: O: O640 (predict-no)
  2275. I see 1 and I'm going to do: predict-no
  2276. ENV: Agent did: predict-no for direction U in state State-B
  2277. In State-B moving U
  2278. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2279. predict error 0
  2280. dir: dir isR
  2281. \-/321: O: O642 (predict-no)
  2282. I see 1 and I'm going to do: predict-no
  2283. ENV: Agent did: predict-no for direction R in state State-B
  2284. In State-B moving R
  2285. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2286. predict error 0
  2287. dir: dir isU
  2288. |322: O: O644 (predict-no)
  2289. I see 1 and I'm going to do: predict-no
  2290. ENV: Agent did: predict-no for direction U in state State-B
  2291. In State-B moving U
  2292. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2293. predict error 0
  2294. dir: dir isL
  2295. \-/323: O: O645 (predict-yes)
  2296. I see 1 and I'm going to do: predict-yes
  2297. ENV: Agent did: predict-yes for direction L in state State-B
  2298. In State-B moving L
  2299. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2300. predict error 0
  2301. dir: dir isU
  2302. |324: O: O648 (predict-no)
  2303. I see 1 and I'm going to do: predict-no
  2304. ENV: Agent did: predict-no for direction U in state State-A
  2305. In State-A moving U
  2306. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2307. predict error 0
  2308. dir: dir isU
  2309. \-/325: O: O650 (predict-no)
  2310. I see 1 and I'm going to do: predict-no
  2311. ENV: Agent did: predict-no for direction U in state State-A
  2312. In State-A moving U
  2313. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2314. predict error 0
  2315. dir: dir isR
  2316. |326: O: O651 (predict-yes)
  2317. I see 1 and I'm going to do: predict-yes
  2318. ENV: Agent did: predict-yes for direction R in state State-A
  2319. In State-A moving R
  2320. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2321. predict error 0
  2322. dir: dir isU
  2323. \-/327: O: O654 (predict-no)
  2324. I see 1 and I'm going to do: predict-no
  2325. ENV: Agent did: predict-no for direction U in state State-B
  2326. In State-B moving U
  2327. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2328. predict error 0
  2329. dir: dir isU
  2330. |\328: O: O656 (predict-no)
  2331. I see 1 and I'm going to do: predict-no
  2332. ENV: Agent did: predict-no for direction U in state State-B
  2333. In State-B moving U
  2334. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2335. predict error 0
  2336. dir: dir isL
  2337. -/|329: O: O657 (predict-yes)
  2338. I see 1 and I'm going to do: predict-yes
  2339. ENV: Agent did: predict-yes for direction L in state State-B
  2340. In State-B moving L
  2341. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2342. predict error 0
  2343. dir: dir isU
  2344. \-/330: O: O660 (predict-no)
  2345. I see 1 and I'm going to do: predict-no
  2346. ENV: Agent did: predict-no for direction U in state State-A
  2347. In State-A moving U
  2348. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2349. predict error 0
  2350. dir: dir isU
  2351. |\-331: O: O662 (predict-no)
  2352. I see 1 and I'm going to do: predict-no
  2353. ENV: Agent did: predict-no for direction U in state State-A
  2354. In State-A moving U
  2355. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2356. predict error 0
  2357. dir: dir isL
  2358. /332: O: O664 (predict-no)
  2359. I see 1 and I'm going to do: predict-no
  2360. ENV: Agent did: predict-no for direction L in state State-A
  2361. In State-A moving L
  2362. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2363. predict error 0
  2364. dir: dir isU
  2365. |\-333: O: O665 (predict-yes)
  2366. I see 1 and I'm going to do: predict-yes
  2367. ENV: Agent did: predict-yes for direction U in state State-A
  2368. In State-A moving U
  2369. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2370. predict error 1
  2371. dir: dir isR
  2372. /|\334: O: O667 (predict-yes)
  2373. I see 0 and I'm going to do: predict-yes
  2374. ENV: Agent did: predict-yes for direction R in state State-A
  2375. In State-A moving R
  2376. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2377. predict error 0
  2378. dir: dir isL
  2379. -/|335: O: O669 (predict-yes)
  2380. I see 1 and I'm going to do: predict-yes
  2381. ENV: Agent did: predict-yes for direction L in state State-B
  2382. In State-B moving L
  2383. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2384. predict error 0
  2385. dir: dir isU
  2386. \-/336: O: O672 (predict-no)
  2387. I see 1 and I'm going to do: predict-no
  2388. ENV: Agent did: predict-no for direction U in state State-A
  2389. In State-A moving U
  2390. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2391. predict error 0
  2392. dir: dir isL
  2393. |\-337: O: O674 (predict-no)
  2394. I see 1 and I'm going to do: predict-no
  2395. ENV: Agent did: predict-no for direction L in state State-A
  2396. In State-A moving L
  2397. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2398. predict error 0
  2399. dir: dir isR
  2400. /|338: O: O675 (predict-yes)
  2401. I see 1 and I'm going to do: predict-yes
  2402. ENV: Agent did: predict-yes for direction R in state State-A
  2403. In State-A moving R
  2404. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2405. predict error 0
  2406. dir: dir isR
  2407. \-339: O: O678 (predict-no)
  2408. I see 1 and I'm going to do: predict-no
  2409. ENV: Agent did: predict-no for direction R in state State-B
  2410. In State-B moving R
  2411. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2412. predict error 0
  2413. dir: dir isL
  2414. /|340: O: O679 (predict-yes)
  2415. I see 1 and I'm going to do: predict-yes
  2416. ENV: Agent did: predict-yes for direction L in state State-B
  2417. In State-B moving L
  2418. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2419. predict error 0
  2420. dir: dir isR
  2421. \341: O: O681 (predict-yes)
  2422. I see 1 and I'm going to do: predict-yes
  2423. ENV: Agent did: predict-yes for direction R in state State-A
  2424. In State-A moving R
  2425. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2426. predict error 0
  2427. dir: dir isR
  2428. -342: O: O684 (predict-no)
  2429. I see 1 and I'm going to do: predict-no
  2430. ENV: Agent did: predict-no for direction R in state State-B
  2431. In State-B moving R
  2432. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2433. predict error 0
  2434. dir: dir isL
  2435. /|\343: O: O685 (predict-yes)
  2436. I see 1 and I'm going to do: predict-yes
  2437. ENV: Agent did: predict-yes for direction L in state State-B
  2438. In State-B moving L
  2439. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2440. predict error 0
  2441. dir: dir isU
  2442. -/|344: O: O688 (predict-no)
  2443. I see 1 and I'm going to do: predict-no
  2444. ENV: Agent did: predict-no for direction U in state State-A
  2445. In State-A moving U
  2446. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2447. predict error 0
  2448. dir: dir isL
  2449. \-/345: O: O690 (predict-no)
  2450. I see 1 and I'm going to do: predict-no
  2451. ENV: Agent did: predict-no for direction L in state State-A
  2452. In State-A moving L
  2453. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2454. predict error 0
  2455. dir: dir isL
  2456. |\-346: O: O692 (predict-no)
  2457. I see 1 and I'm going to do: predict-no
  2458. ENV: Agent did: predict-no for direction L in state State-A
  2459. In State-A moving L
  2460. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2461. predict error 0
  2462. dir: dir isR
  2463. /|347: O: O693 (predict-yes)
  2464. I see 1 and I'm going to do: predict-yes
  2465. ENV: Agent did: predict-yes for direction R in state State-A
  2466. In State-A moving R
  2467. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2468. predict error 0
  2469. dir: dir isU
  2470. \-/348: O: O696 (predict-no)
  2471. I see 1 and I'm going to do: predict-no
  2472. ENV: Agent did: predict-no for direction U in state State-B
  2473. In State-B moving U
  2474. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2475. predict error 0
  2476. dir: dir isR
  2477. |\-349: O: O698 (predict-no)
  2478. I see 1 and I'm going to do: predict-no
  2479. ENV: Agent did: predict-no for direction R in state State-B
  2480. In State-B moving R
  2481. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2482. predict error 0
  2483. dir: dir isU
  2484. /|\350: O: O700 (predict-no)
  2485. I see 1 and I'm going to do: predict-no
  2486. ENV: Agent did: predict-no for direction U in state State-B
  2487. In State-B moving U
  2488. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2489. predict error 0
  2490. dir: dir isR
  2491. -351: O: O702 (predict-no)
  2492. I see 1 and I'm going to do: predict-no
  2493. ENV: Agent did: predict-no for direction R in state State-B
  2494. In State-B moving R
  2495. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2496. predict error 0
  2497. dir: dir isU
  2498. /352: O: O703 (predict-yes)
  2499. I see 1 and I'm going to do: predict-yes
  2500. ENV: Agent did: predict-yes for direction U in state State-B
  2501. In State-B moving U
  2502. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2503. predict error 1
  2504. dir: dir isR
  2505. |\-353: O: O706 (predict-no)
  2506. I see 0 and I'm going to do: predict-no
  2507. ENV: Agent did: predict-no for direction R in state State-B
  2508. In State-B moving R
  2509. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2510. predict error 0
  2511. dir: dir isL
  2512. /|\354: O: O707 (predict-yes)
  2513. I see 1 and I'm going to do: predict-yes
  2514. ENV: Agent did: predict-yes for direction L in state State-B
  2515. In State-B moving L
  2516. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2517. predict error 0
  2518. dir: dir isR
  2519. -/|355: O: O709 (predict-yes)
  2520. I see 1 and I'm going to do: predict-yes
  2521. ENV: Agent did: predict-yes for direction R in state State-A
  2522. In State-A moving R
  2523. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2524. predict error 0
  2525. dir: dir isL
  2526. \-356: O: O711 (predict-yes)
  2527. I see 1 and I'm going to do: predict-yes
  2528. ENV: Agent did: predict-yes for direction L in state State-B
  2529. In State-B moving L
  2530. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2531. predict error 0
  2532. dir: dir isL
  2533. /357: O: O714 (predict-no)
  2534. I see 1 and I'm going to do: predict-no
  2535. ENV: Agent did: predict-no for direction L in state State-A
  2536. In State-A moving L
  2537. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2538. predict error 0
  2539. dir: dir isU
  2540. |\-358: O: O716 (predict-no)
  2541. I see 1 and I'm going to do: predict-no
  2542. ENV: Agent did: predict-no for direction U in state State-A
  2543. In State-A moving U
  2544. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2545. predict error 0
  2546. dir: dir isL
  2547. /|359: O: O718 (predict-no)
  2548. I see 1 and I'm going to do: predict-no
  2549. ENV: Agent did: predict-no for direction L in state State-A
  2550. In State-A moving L
  2551. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2552. predict error 0
  2553. dir: dir isU
  2554. \360: O: O720 (predict-no)
  2555. I see 1 and I'm going to do: predict-no
  2556. ENV: Agent did: predict-no for direction U in state State-A
  2557. In State-A moving U
  2558. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2559. predict error 0
  2560. dir: dir isU
  2561. -/|361: O: O721 (predict-yes)
  2562. I see 1 and I'm going to do: predict-yes
  2563. ENV: Agent did: predict-yes for direction U in state State-A
  2564. In State-A moving U
  2565. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2566. predict error 1
  2567. dir: dir isR
  2568. \362: O: O723 (predict-yes)
  2569. I see 0 and I'm going to do: predict-yes
  2570. ENV: Agent did: predict-yes for direction R in state State-A
  2571. In State-A moving R
  2572. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2573. predict error 0
  2574. dir: dir isU
  2575. -/363: O: O726 (predict-no)
  2576. I see 1 and I'm going to do: predict-no
  2577. ENV: Agent did: predict-no for direction U in state State-B
  2578. In State-B moving U
  2579. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2580. predict error 0
  2581. dir: dir isU
  2582. |\364: O: O728 (predict-no)
  2583. I see 1 and I'm going to do: predict-no
  2584. ENV: Agent did: predict-no for direction U in state State-B
  2585. In State-B moving U
  2586. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2587. predict error 0
  2588. dir: dir isU
  2589. -/|365: O: O730 (predict-no)
  2590. I see 1 and I'm going to do: predict-no
  2591. ENV: Agent did: predict-no for direction U in state State-B
  2592. In State-B moving U
  2593. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2594. predict error 0
  2595. dir: dir isL
  2596. \366: O: O731 (predict-yes)
  2597. I see 1 and I'm going to do: predict-yes
  2598. ENV: Agent did: predict-yes for direction L in state State-B
  2599. In State-B moving L
  2600. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2601. predict error 0
  2602. dir: dir isL
  2603. -/|367: O: O734 (predict-no)
  2604. I see 1 and I'm going to do: predict-no
  2605. ENV: Agent did: predict-no for direction L in state State-A
  2606. In State-A moving L
  2607. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2608. predict error 0
  2609. dir: dir isR
  2610. \-/368: O: O735 (predict-yes)
  2611. I see 1 and I'm going to do: predict-yes
  2612. ENV: Agent did: predict-yes for direction R in state State-A
  2613. In State-A moving R
  2614. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2615. predict error 0
  2616. dir: dir isR
  2617. |369: O: O737 (predict-yes)
  2618. I see 1 and I'm going to do: predict-yes
  2619. ENV: Agent did: predict-yes for direction R in state State-B
  2620. In State-B moving R
  2621. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2622. predict error 1
  2623. dir: dir isL
  2624. \370: O: O739 (predict-yes)
  2625. I see 0 and I'm going to do: predict-yes
  2626. ENV: Agent did: predict-yes for direction L in state State-B
  2627. In State-B moving L
  2628. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2629. predict error 0
  2630. dir: dir isL
  2631. -/371: O: O742 (predict-no)
  2632. I see 1 and I'm going to do: predict-no
  2633. ENV: Agent did: predict-no for direction L in state State-A
  2634. In State-A moving L
  2635. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2636. predict error 0
  2637. dir: dir isR
  2638. |372: O: O743 (predict-yes)
  2639. I see 1 and I'm going to do: predict-yes
  2640. ENV: Agent did: predict-yes for direction R in state State-A
  2641. In State-A moving R
  2642. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2643. predict error 0
  2644. dir: dir isU
  2645. \373: O: O746 (predict-no)
  2646. I see 1 and I'm going to do: predict-no
  2647. ENV: Agent did: predict-no for direction U in state State-B
  2648. In State-B moving U
  2649. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2650. predict error 0
  2651. dir: dir isR
  2652. -/374: O: O748 (predict-no)
  2653. I see 1 and I'm going to do: predict-no
  2654. ENV: Agent did: predict-no for direction R in state State-B
  2655. In State-B moving R
  2656. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2657. predict error 0
  2658. dir: dir isR
  2659. |\375: O: O750 (predict-no)
  2660. I see 1 and I'm going to do: predict-no
  2661. ENV: Agent did: predict-no for direction R in state State-B
  2662. In State-B moving R
  2663. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2664. predict error 0
  2665. dir: dir isR
  2666. -/376: O: O752 (predict-no)
  2667. I see 1 and I'm going to do: predict-no
  2668. ENV: Agent did: predict-no for direction R in state State-B
  2669. In State-B moving R
  2670. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2671. predict error 0
  2672. dir: dir isR
  2673. |\-377: O: O754 (predict-no)
  2674. I see 1 and I'm going to do: predict-no
  2675. ENV: Agent did: predict-no for direction R in state State-B
  2676. In State-B moving R
  2677. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2678. predict error 0
  2679. dir: dir isU
  2680. /|378: O: O756 (predict-no)
  2681. I see 1 and I'm going to do: predict-no
  2682. ENV: Agent did: predict-no for direction U in state State-B
  2683. In State-B moving U
  2684. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2685. predict error 0
  2686. dir: dir isL
  2687. \-/379: O: O757 (predict-yes)
  2688. I see 1 and I'm going to do: predict-yes
  2689. ENV: Agent did: predict-yes for direction L in state State-B
  2690. In State-B moving L
  2691. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2692. predict error 0
  2693. dir: dir isR
  2694. |\-380: O: O759 (predict-yes)
  2695. I see 1 and I'm going to do: predict-yes
  2696. ENV: Agent did: predict-yes for direction R in state State-A
  2697. In State-A moving R
  2698. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2699. predict error 0
  2700. dir: dir isR
  2701. /|\381: O: O762 (predict-no)
  2702. I see 1 and I'm going to do: predict-no
  2703. ENV: Agent did: predict-no for direction R in state State-B
  2704. In State-B moving R
  2705. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2706. predict error 0
  2707. dir: dir isR
  2708. -382: O: O764 (predict-no)
  2709. I see 1 and I'm going to do: predict-no
  2710. ENV: Agent did: predict-no for direction R in state State-B
  2711. In State-B moving R
  2712. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2713. predict error 0
  2714. dir: dir isU
  2715. /|\383: O: O766 (predict-no)
  2716. I see 1 and I'm going to do: predict-no
  2717. ENV: Agent did: predict-no for direction U in state State-B
  2718. In State-B moving U
  2719. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2720. predict error 0
  2721. dir: dir isL
  2722. -/|384: O: O767 (predict-yes)
  2723. I see 1 and I'm going to do: predict-yes
  2724. ENV: Agent did: predict-yes for direction L in state State-B
  2725. In State-B moving L
  2726. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2727. predict error 0
  2728. dir: dir isU
  2729. \-/385: O: O770 (predict-no)
  2730. I see 1 and I'm going to do: predict-no
  2731. ENV: Agent did: predict-no for direction U in state State-A
  2732. In State-A moving U
  2733. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2734. predict error 0
  2735. dir: dir isL
  2736. |\-386: O: O772 (predict-no)
  2737. I see 1 and I'm going to do: predict-no
  2738. ENV: Agent did: predict-no for direction L in state State-A
  2739. In State-A moving L
  2740. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2741. predict error 0
  2742. dir: dir isR
  2743. /|387: O: O773 (predict-yes)
  2744. I see 1 and I'm going to do: predict-yes
  2745. ENV: Agent did: predict-yes for direction R in state State-A
  2746. In State-A moving R
  2747. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2748. predict error 0
  2749. dir: dir isU
  2750. \-388: O: O776 (predict-no)
  2751. I see 1 and I'm going to do: predict-no
  2752. ENV: Agent did: predict-no for direction U in state State-B
  2753. In State-B moving U
  2754. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2755. predict error 0
  2756. dir: dir isR
  2757. /|389: O: O778 (predict-no)
  2758. I see 1 and I'm going to do: predict-no
  2759. ENV: Agent did: predict-no for direction R in state State-B
  2760. In State-B moving R
  2761. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2762. predict error 0
  2763. dir: dir isR
  2764. \-/390: O: O780 (predict-no)
  2765. I see 1 and I'm going to do: predict-no
  2766. ENV: Agent did: predict-no for direction R in state State-B
  2767. In State-B moving R
  2768. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2769. predict error 0
  2770. dir: dir isU
  2771. |\-391: O: O782 (predict-no)
  2772. I see 1 and I'm going to do: predict-no
  2773. ENV: Agent did: predict-no for direction U in state State-B
  2774. In State-B moving U
  2775. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2776. predict error 0
  2777. dir: dir isL
  2778. /392: O: O783 (predict-yes)
  2779. I see 1 and I'm going to do: predict-yes
  2780. ENV: Agent did: predict-yes for direction L in state State-B
  2781. In State-B moving L
  2782. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2783. predict error 0
  2784. dir: dir isR
  2785. |\393: O: O785 (predict-yes)
  2786. I see 1 and I'm going to do: predict-yes
  2787. ENV: Agent did: predict-yes for direction R in state State-A
  2788. In State-A moving R
  2789. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2790. predict error 0
  2791. dir: dir isR
  2792. -/|394: O: O788 (predict-no)
  2793. I see 1 and I'm going to do: predict-no
  2794. ENV: Agent did: predict-no for direction R in state State-B
  2795. In State-B moving R
  2796. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2797. predict error 0
  2798. dir: dir isR
  2799. \-/395: O: O790 (predict-no)
  2800. I see 1 and I'm going to do: predict-no
  2801. ENV: Agent did: predict-no for direction R in state State-B
  2802. In State-B moving R
  2803. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2804. predict error 0
  2805. dir: dir isU
  2806. |\396: O: O792 (predict-no)
  2807. I see 1 and I'm going to do: predict-no
  2808. ENV: Agent did: predict-no for direction U in state State-B
  2809. In State-B moving U
  2810. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2811. predict error 0
  2812. dir: dir isU
  2813. -/|397: O: O794 (predict-no)
  2814. I see 1 and I'm going to do: predict-no
  2815. ENV: Agent did: predict-no for direction U in state State-B
  2816. In State-B moving U
  2817. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2818. predict error 0
  2819. dir: dir isR
  2820. \-/398: O: O796 (predict-no)
  2821. I see 1 and I'm going to do: predict-no
  2822. ENV: Agent did: predict-no for direction R in state State-B
  2823. In State-B moving R
  2824. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2825. predict error 0
  2826. dir: dir isL
  2827. |\-399: O: O797 (predict-yes)
  2828. I see 1 and I'm going to do: predict-yes
  2829. ENV: Agent did: predict-yes for direction L in state State-B
  2830. In State-B moving L
  2831. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2832. predict error 0
  2833. dir: dir isL
  2834. /|\400: O: O800 (predict-no)
  2835. I see 1 and I'm going to do: predict-no
  2836. ENV: Agent did: predict-no for direction L in state State-A
  2837. In State-A moving L
  2838. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2839. predict error 0
  2840. dir: dir isR
  2841. -/401: O: O802 (predict-no)
  2842. I see 1 and I'm going to do: predict-no
  2843. ENV: Agent did: predict-no for direction R in state State-A
  2844. In State-A moving R
  2845. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2846. predict error 1
  2847. dir: dir isL
  2848. |402: O: O803 (predict-yes)
  2849. I see 0 and I'm going to do: predict-yes
  2850. ENV: Agent did: predict-yes for direction L in state State-B
  2851. In State-B moving L
  2852. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2853. predict error 0
  2854. dir: dir isL
  2855. \-/403: O: O806 (predict-no)
  2856. I see 1 and I'm going to do: predict-no
  2857. ENV: Agent did: predict-no for direction L in state State-A
  2858. In State-A moving L
  2859. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2860. predict error 0
  2861. dir: dir isU
  2862. |\-404: O: O808 (predict-no)
  2863. I see 1 and I'm going to do: predict-no
  2864. ENV: Agent did: predict-no for direction U in state State-A
  2865. In State-A moving U
  2866. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2867. predict error 0
  2868. dir: dir isL
  2869. /405: O: O810 (predict-no)
  2870. I see 1 and I'm going to do: predict-no
  2871. ENV: Agent did: predict-no for direction L in state State-A
  2872. In State-A moving L
  2873. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2874. predict error 0
  2875. dir: dir isU
  2876. |\406: O: O812 (predict-no)
  2877. I see 1 and I'm going to do: predict-no
  2878. ENV: Agent did: predict-no for direction U in state State-A
  2879. In State-A moving U
  2880. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2881. predict error 0
  2882. dir: dir isR
  2883. -407: O: O813 (predict-yes)
  2884. I see 1 and I'm going to do: predict-yes
  2885. ENV: Agent did: predict-yes for direction R in state State-A
  2886. In State-A moving R
  2887. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2888. predict error 0
  2889. dir: dir isR
  2890. /|408: O: O816 (predict-no)
  2891. I see 1 and I'm going to do: predict-no
  2892. ENV: Agent did: predict-no for direction R in state State-B
  2893. In State-B moving R
  2894. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2895. predict error 0
  2896. dir: dir isL
  2897. \-409: O: O817 (predict-yes)
  2898. I see 1 and I'm going to do: predict-yes
  2899. ENV: Agent did: predict-yes for direction L in state State-B
  2900. In State-B moving L
  2901. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2902. predict error 0
  2903. dir: dir isL
  2904. /|\410: O: O820 (predict-no)
  2905. I see 1 and I'm going to do: predict-no
  2906. ENV: Agent did: predict-no for direction L in state State-A
  2907. In State-A moving L
  2908. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2909. predict error 0
  2910. dir: dir isR
  2911. -/|411: O: O821 (predict-yes)
  2912. I see 1 and I'm going to do: predict-yes
  2913. ENV: Agent did: predict-yes for direction R in state State-A
  2914. In State-A moving R
  2915. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2916. predict error 0
  2917. dir: dir isL
  2918. \412: O: O823 (predict-yes)
  2919. I see 1 and I'm going to do: predict-yes
  2920. ENV: Agent did: predict-yes for direction L in state State-B
  2921. In State-B moving L
  2922. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2923. predict error 0
  2924. dir: dir isR
  2925. -/413: O: O825 (predict-yes)
  2926. I see 1 and I'm going to do: predict-yes
  2927. ENV: Agent did: predict-yes for direction R in state State-A
  2928. In State-A moving R
  2929. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2930. predict error 0
  2931. dir: dir isL
  2932. |\-414: O: O827 (predict-yes)
  2933. I see 1 and I'm going to do: predict-yes
  2934. ENV: Agent did: predict-yes for direction L in state State-B
  2935. In State-B moving L
  2936. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2937. predict error 0
  2938. dir: dir isU
  2939. /|415: O: O829 (predict-yes)
  2940. I see 1 and I'm going to do: predict-yes
  2941. ENV: Agent did: predict-yes for direction U in state State-A
  2942. In State-A moving U
  2943. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2944. predict error 1
  2945. dir: dir isU
  2946. \-/416: O: O832 (predict-no)
  2947. I see 0 and I'm going to do: predict-no
  2948. ENV: Agent did: predict-no for direction U in state State-A
  2949. In State-A moving U
  2950. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2951. predict error 0
  2952. dir: dir isR
  2953. |417: O: O833 (predict-yes)
  2954. I see 1 and I'm going to do: predict-yes
  2955. ENV: Agent did: predict-yes for direction R in state State-A
  2956. In State-A moving R
  2957. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2958. predict error 0
  2959. dir: dir isR
  2960. \-/418: O: O836 (predict-no)
  2961. I see 1 and I'm going to do: predict-no
  2962. ENV: Agent did: predict-no for direction R in state State-B
  2963. In State-B moving R
  2964. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2965. predict error 0
  2966. dir: dir isU
  2967. |\-419: O: O838 (predict-no)
  2968. I see 1 and I'm going to do: predict-no
  2969. ENV: Agent did: predict-no for direction U in state State-B
  2970. In State-B moving U
  2971. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2972. predict error 0
  2973. dir: dir isL
  2974. /420: O: O839 (predict-yes)
  2975. I see 1 and I'm going to do: predict-yes
  2976. ENV: Agent did: predict-yes for direction L in state State-B
  2977. In State-B moving L
  2978. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2979. predict error 0
  2980. dir: dir isL
  2981. |\-421: O: O842 (predict-no)
  2982. I see 1 and I'm going to do: predict-no
  2983. ENV: Agent did: predict-no for direction L in state State-A
  2984. In State-A moving L
  2985. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2986. predict error 0
  2987. dir: dir isL
  2988. /422: O: O844 (predict-no)
  2989. I see 1 and I'm going to do: predict-no
  2990. ENV: Agent did: predict-no for direction L in state State-A
  2991. In State-A moving L
  2992. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2993. predict error 0
  2994. dir: dir isL
  2995. |\423: O: O846 (predict-no)
  2996. I see 1 and I'm going to do: predict-no
  2997. ENV: Agent did: predict-no for direction L in state State-A
  2998. In State-A moving L
  2999. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3000. predict error 0
  3001. dir: dir isU
  3002. -/424: O: O848 (predict-no)
  3003. I see 1 and I'm going to do: predict-no
  3004. ENV: Agent did: predict-no for direction U in state State-A
  3005. In State-A moving U
  3006. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3007. predict error 0
  3008. dir: dir isR
  3009. |\425: O: O849 (predict-yes)
  3010. I see 1 and I'm going to do: predict-yes
  3011. ENV: Agent did: predict-yes for direction R in state State-A
  3012. In State-A moving R
  3013. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3014. predict error 0
  3015. dir: dir isR
  3016. -/426: O: O852 (predict-no)
  3017. I see 1 and I'm going to do: predict-no
  3018. ENV: Agent did: predict-no for direction R in state State-B
  3019. In State-B moving R
  3020. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3021. predict error 0
  3022. dir: dir isU
  3023. |\427: O: O854 (predict-no)
  3024. I see 1 and I'm going to do: predict-no
  3025. ENV: Agent did: predict-no for direction U in state State-B
  3026. In State-B moving U
  3027. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3028. predict error 0
  3029. dir: dir isL
  3030. -/|428: O: O855 (predict-yes)
  3031. I see 1 and I'm going to do: predict-yes
  3032. ENV: Agent did: predict-yes for direction L in state State-B
  3033. In State-B moving L
  3034. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3035. predict error 0
  3036. dir: dir isU
  3037. \-429: O: O858 (predict-no)
  3038. I see 1 and I'm going to do: predict-no
  3039. ENV: Agent did: predict-no for direction U in state State-A
  3040. In State-A moving U
  3041. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3042. predict error 0
  3043. dir: dir isR
  3044. /|\430: O: O859 (predict-yes)
  3045. I see 1 and I'm going to do: predict-yes
  3046. ENV: Agent did: predict-yes for direction R in state State-A
  3047. In State-A moving R
  3048. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3049. predict error 0
  3050. dir: dir isR
  3051. -431: O: O862 (predict-no)
  3052. I see 1 and I'm going to do: predict-no
  3053. ENV: Agent did: predict-no for direction R in state State-B
  3054. In State-B moving R
  3055. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3056. predict error 0
  3057. dir: dir isU
  3058. /432: O: O864 (predict-no)
  3059. I see 1 and I'm going to do: predict-no
  3060. ENV: Agent did: predict-no for direction U in state State-B
  3061. In State-B moving U
  3062. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3063. predict error 0
  3064. dir: dir isR
  3065. |\-433: O: O866 (predict-no)
  3066. I see 1 and I'm going to do: predict-no
  3067. ENV: Agent did: predict-no for direction R in state State-B
  3068. In State-B moving R
  3069. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3070. predict error 0
  3071. dir: dir isU
  3072. /|434: O: O867 (predict-yes)
  3073. I see 1 and I'm going to do: predict-yes
  3074. ENV: Agent did: predict-yes for direction U in state State-B
  3075. In State-B moving U
  3076. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3077. predict error 1
  3078. dir: dir isU
  3079. \-/435: O: O870 (predict-no)
  3080. I see 0 and I'm going to do: predict-no
  3081. ENV: Agent did: predict-no for direction U in state State-B
  3082. In State-B moving U
  3083. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3084. predict error 0
  3085. dir: dir isR
  3086. |436: O: O872 (predict-no)
  3087. I see 1 and I'm going to do: predict-no
  3088. ENV: Agent did: predict-no for direction R in state State-B
  3089. In State-B moving R
  3090. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3091. predict error 0
  3092. dir: dir isU
  3093. \-/437: O: O873 (predict-yes)
  3094. I see 1 and I'm going to do: predict-yes
  3095. ENV: Agent did: predict-yes for direction U in state State-B
  3096. In State-B moving U
  3097. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3098. predict error 1
  3099. dir: dir isU
  3100. |\438: O: O876 (predict-no)
  3101. I see 0 and I'm going to do: predict-no
  3102. ENV: Agent did: predict-no for direction U in state State-B
  3103. In State-B moving U
  3104. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3105. predict error 0
  3106. dir: dir isU
  3107. -/|439: O: O878 (predict-no)
  3108. I see 1 and I'm going to do: predict-no
  3109. ENV: Agent did: predict-no for direction U in state State-B
  3110. In State-B moving U
  3111. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3112. predict error 0
  3113. dir: dir isU
  3114. \-440: O: O880 (predict-no)
  3115. I see 1 and I'm going to do: predict-no
  3116. ENV: Agent did: predict-no for direction U in state State-B
  3117. In State-B moving U
  3118. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3119. predict error 0
  3120. dir: dir isU
  3121. /|441: O: O882 (predict-no)
  3122. I see 1 and I'm going to do: predict-no
  3123. ENV: Agent did: predict-no for direction U in state State-B
  3124. In State-B moving U
  3125. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3126. predict error 0
  3127. dir: dir isU
  3128. \442: O: O884 (predict-no)
  3129. I see 1 and I'm going to do: predict-no
  3130. ENV: Agent did: predict-no for direction U in state State-B
  3131. In State-B moving U
  3132. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3133. predict error 0
  3134. dir: dir isU
  3135. -/443: O: O886 (predict-no)
  3136. I see 1 and I'm going to do: predict-no
  3137. ENV: Agent did: predict-no for direction U in state State-B
  3138. In State-B moving U
  3139. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3140. predict error 0
  3141. dir: dir isU
  3142. |\-444: O: O888 (predict-no)
  3143. I see 1 and I'm going to do: predict-no
  3144. ENV: Agent did: predict-no for direction U in state State-B
  3145. In State-B moving U
  3146. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3147. predict error 0
  3148. dir: dir isL
  3149. /|\445: O: O889 (predict-yes)
  3150. I see 1 and I'm going to do: predict-yes
  3151. ENV: Agent did: predict-yes for direction L in state State-B
  3152. In State-B moving L
  3153. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3154. predict error 0
  3155. dir: dir isR
  3156. -446: O: O891 (predict-yes)
  3157. I see 1 and I'm going to do: predict-yes
  3158. ENV: Agent did: predict-yes for direction R in state State-A
  3159. In State-A moving R
  3160. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3161. predict error 0
  3162. dir: dir isU
  3163. /|\-447: O: O894 (predict-no)
  3164. I see 1 and I'm going to do: predict-no
  3165. ENV: Agent did: predict-no for direction U in state State-B
  3166. In State-B moving U
  3167. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3168. predict error 0
  3169. dir: dir isR
  3170. /|\448: O: O896 (predict-no)
  3171. I see 1 and I'm going to do: predict-no
  3172. ENV: Agent did: predict-no for direction R in state State-B
  3173. In State-B moving R
  3174. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3175. predict error 0
  3176. dir: dir isR
  3177. -/|449: O: O898 (predict-no)
  3178. I see 1 and I'm going to do: predict-no
  3179. ENV: Agent did: predict-no for direction R in state State-B
  3180. In State-B moving R
  3181. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3182. predict error 0
  3183. dir: dir isL
  3184. \-/450: O: O899 (predict-yes)
  3185. I see 1 and I'm going to do: predict-yes
  3186. ENV: Agent did: predict-yes for direction L in state State-B
  3187. In State-B moving L
  3188. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3189. predict error 0
  3190. dir: dir isU
  3191. |\-451: O: O902 (predict-no)
  3192. I see 1 and I'm going to do: predict-no
  3193. ENV: Agent did: predict-no for direction U in state State-A
  3194. In State-A moving U
  3195. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3196. predict error 0
  3197. dir: dir isR
  3198. /452: O: O903 (predict-yes)
  3199. I see 1 and I'm going to do: predict-yes
  3200. ENV: Agent did: predict-yes for direction R in state State-A
  3201. In State-A moving R
  3202. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3203. predict error 0
  3204. dir: dir isR
  3205. |\-453: O: O906 (predict-no)
  3206. I see 1 and I'm going to do: predict-no
  3207. ENV: Agent did: predict-no for direction R in state State-B
  3208. In State-B moving R
  3209. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3210. predict error 0
  3211. dir: dir isU
  3212. /|454: O: O908 (predict-no)
  3213. I see 1 and I'm going to do: predict-no
  3214. ENV: Agent did: predict-no for direction U in state State-B
  3215. In State-B moving U
  3216. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3217. predict error 0
  3218. dir: dir isL
  3219. \455: O: O909 (predict-yes)
  3220. I see 1 and I'm going to do: predict-yes
  3221. ENV: Agent did: predict-yes for direction L in state State-B
  3222. In State-B moving L
  3223. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3224. predict error 0
  3225. dir: dir isU
  3226. -/456: O: O912 (predict-no)
  3227. I see 1 and I'm going to do: predict-no
  3228. ENV: Agent did: predict-no for direction U in state State-A
  3229. In State-A moving U
  3230. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3231. predict error 0
  3232. dir: dir isL
  3233. |457: O: O913 (predict-yes)
  3234. I see 1 and I'm going to do: predict-yes
  3235. ENV: Agent did: predict-yes for direction L in state State-A
  3236. In State-A moving L
  3237. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3238. predict error 1
  3239. dir: dir isL
  3240. \458: O: O916 (predict-no)
  3241. I see 0 and I'm going to do: predict-no
  3242. ENV: Agent did: predict-no for direction L in state State-A
  3243. In State-A moving L
  3244. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3245. predict error 0
  3246. dir: dir isR
  3247. -/|459: O: O917 (predict-yes)
  3248. I see 1 and I'm going to do: predict-yes
  3249. ENV: Agent did: predict-yes for direction R in state State-A
  3250. In State-A moving R
  3251. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3252. predict error 0
  3253. dir: dir isR
  3254. \-/460: O: O920 (predict-no)
  3255. I see 1 and I'm going to do: predict-no
  3256. ENV: Agent did: predict-no for direction R in state State-B
  3257. In State-B moving R
  3258. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3259. predict error 0
  3260. dir: dir isU
  3261. |\-461: O: O922 (predict-no)
  3262. I see 1 and I'm going to do: predict-no
  3263. ENV: Agent did: predict-no for direction U in state State-B
  3264. In State-B moving U
  3265. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3266. predict error 0
  3267. dir: dir isU
  3268. /462: O: O924 (predict-no)
  3269. I see 1 and I'm going to do: predict-no
  3270. ENV: Agent did: predict-no for direction U in state State-B
  3271. In State-B moving U
  3272. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3273. predict error 0
  3274. dir: dir isU
  3275. |463: O: O926 (predict-no)
  3276. I see 1 and I'm going to do: predict-no
  3277. ENV: Agent did: predict-no for direction U in state State-B
  3278. In State-B moving U
  3279. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3280. predict error 0
  3281. dir: dir isR
  3282. \-464: O: O928 (predict-no)
  3283. I see 1 and I'm going to do: predict-no
  3284. ENV: Agent did: predict-no for direction R in state State-B
  3285. In State-B moving R
  3286. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3287. predict error 0
  3288. dir: dir isR
  3289. /|\465: O: O930 (predict-no)
  3290. I see 1 and I'm going to do: predict-no
  3291. ENV: Agent did: predict-no for direction R in state State-B
  3292. In State-B moving R
  3293. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3294. predict error 0
  3295. dir: dir isL
  3296. -/466: O: O931 (predict-yes)
  3297. I see 1 and I'm going to do: predict-yes
  3298. ENV: Agent did: predict-yes for direction L in state State-B
  3299. In State-B moving L
  3300. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3301. predict error 0
  3302. dir: dir isU
  3303. |\-467: O: O934 (predict-no)
  3304. I see 1 and I'm going to do: predict-no
  3305. ENV: Agent did: predict-no for direction U in state State-A
  3306. In State-A moving U
  3307. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3308. predict error 0
  3309. dir: dir isR
  3310. /468: O: O935 (predict-yes)
  3311. I see 1 and I'm going to do: predict-yes
  3312. ENV: Agent did: predict-yes for direction R in state State-A
  3313. In State-A moving R
  3314. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3315. predict error 0
  3316. dir: dir isL
  3317. |\-469: O: O937 (predict-yes)
  3318. I see 1 and I'm going to do: predict-yes
  3319. ENV: Agent did: predict-yes for direction L in state State-B
  3320. In State-B moving L
  3321. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3322. predict error 0
  3323. dir: dir isR
  3324. /|\470: O: O939 (predict-yes)
  3325. I see 1 and I'm going to do: predict-yes
  3326. ENV: Agent did: predict-yes for direction R in state State-A
  3327. In State-A moving R
  3328. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3329. predict error 0
  3330. dir: dir isL
  3331. -/|471: O: O941 (predict-yes)
  3332. I see 1 and I'm going to do: predict-yes
  3333. ENV: Agent did: predict-yes for direction L in state State-B
  3334. In State-B moving L
  3335. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3336. predict error 0
  3337. dir: dir isL
  3338. \472: O: O943 (predict-yes)
  3339. I see 1 and I'm going to do: predict-yes
  3340. ENV: Agent did: predict-yes for direction L in state State-A
  3341. In State-A moving L
  3342. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3343. predict error 1
  3344. dir: dir isU
  3345. -/473: O: O946 (predict-no)
  3346. I see 0 and I'm going to do: predict-no
  3347. ENV: Agent did: predict-no for direction U in state State-A
  3348. In State-A moving U
  3349. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3350. predict error 0
  3351. dir: dir isL
  3352. |\474: O: O948 (predict-no)
  3353. I see 1 and I'm going to do: predict-no
  3354. ENV: Agent did: predict-no for direction L in state State-A
  3355. In State-A moving L
  3356. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3357. predict error 0
  3358. dir: dir isL
  3359. -/|475: O: O950 (predict-no)
  3360. I see 1 and I'm going to do: predict-no
  3361. ENV: Agent did: predict-no for direction L in state State-A
  3362. In State-A moving L
  3363. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3364. predict error 0
  3365. dir: dir isL
  3366. \-/476: O: O952 (predict-no)
  3367. I see 1 and I'm going to do: predict-no
  3368. ENV: Agent did: predict-no for direction L in state State-A
  3369. In State-A moving L
  3370. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3371. predict error 0
  3372. dir: dir isL
  3373. |\-477: O: O953 (predict-yes)
  3374. I see 1 and I'm going to do: predict-yes
  3375. ENV: Agent did: predict-yes for direction L in state State-A
  3376. In State-A moving L
  3377. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3378. predict error 1
  3379. dir: dir isR
  3380. /|\478: O: O955 (predict-yes)
  3381. I see 0 and I'm going to do: predict-yes
  3382. ENV: Agent did: predict-yes for direction R in state State-A
  3383. In State-A moving R
  3384. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3385. predict error 0
  3386. dir: dir isL
  3387. -/|479: O: O957 (predict-yes)
  3388. I see 1 and I'm going to do: predict-yes
  3389. ENV: Agent did: predict-yes for direction L in state State-B
  3390. In State-B moving L
  3391. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3392. predict error 0
  3393. dir: dir isR
  3394. \-/|480: O: O959 (predict-yes)
  3395. I see 1 and I'm going to do: predict-yes
  3396. ENV: Agent did: predict-yes for direction R in state State-A
  3397. In State-A moving R
  3398. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3399. predict error 0
  3400. dir: dir isL
  3401. \-481: O: O962 (predict-no)
  3402. I see 1 and I'm going to do: predict-no
  3403. ENV: Agent did: predict-no for direction L in state State-B
  3404. In State-B moving L
  3405. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  3406. predict error 1
  3407. dir: dir isL
  3408. /482: O: O964 (predict-no)
  3409. I see 0 and I'm going to do: predict-no
  3410. ENV: Agent did: predict-no for direction L in state State-A
  3411. In State-A moving L
  3412. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3413. predict error 0
  3414. dir: dir isR
  3415. |\483: O: O965 (predict-yes)
  3416. I see 1 and I'm going to do: predict-yes
  3417. ENV: Agent did: predict-yes for direction R in state State-A
  3418. In State-A moving R
  3419. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3420. predict error 0
  3421. dir: dir isR
  3422. -/|484: O: O968 (predict-no)
  3423. I see 1 and I'm going to do: predict-no
  3424. ENV: Agent did: predict-no for direction R in state State-B
  3425. In State-B moving R
  3426. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3427. predict error 0
  3428. dir: dir isR
  3429. \-485: O: O970 (predict-no)
  3430. I see 1 and I'm going to do: predict-no
  3431. ENV: Agent did: predict-no for direction R in state State-B
  3432. In State-B moving R
  3433. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3434. predict error 0
  3435. dir: dir isU
  3436. /|\486: O: O972 (predict-no)
  3437. I see 1 and I'm going to do: predict-no
  3438. ENV: Agent did: predict-no for direction U in state State-B
  3439. In State-B moving U
  3440. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3441. predict error 0
  3442. dir: dir isL
  3443. -/|487: O: O973 (predict-yes)
  3444. I see 1 and I'm going to do: predict-yes
  3445. ENV: Agent did: predict-yes for direction L in state State-B
  3446. In State-B moving L
  3447. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3448. predict error 0
  3449. dir: dir isL
  3450. \-488: O: O976 (predict-no)
  3451. I see 1 and I'm going to do: predict-no
  3452. ENV: Agent did: predict-no for direction L in state State-A
  3453. In State-A moving L
  3454. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3455. predict error 0
  3456. dir: dir isU
  3457. /|\489: O: O978 (predict-no)
  3458. I see 1 and I'm going to do: predict-no
  3459. ENV: Agent did: predict-no for direction U in state State-A
  3460. In State-A moving U
  3461. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3462. predict error 0
  3463. dir: dir isR
  3464. -/490: O: O979 (predict-yes)
  3465. I see 1 and I'm going to do: predict-yes
  3466. ENV: Agent did: predict-yes for direction R in state State-A
  3467. In State-A moving R
  3468. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3469. predict error 0
  3470. dir: dir isU
  3471. |\491: O: O982 (predict-no)
  3472. I see 1 and I'm going to do: predict-no
  3473. ENV: Agent did: predict-no for direction U in state State-B
  3474. In State-B moving U
  3475. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3476. predict error 0
  3477. dir: dir isL
  3478. -492: O: O983 (predict-yes)
  3479. I see 1 and I'm going to do: predict-yes
  3480. ENV: Agent did: predict-yes for direction L in state State-B
  3481. In State-B moving L
  3482. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3483. predict error 0
  3484. dir: dir isR
  3485. /|\493: O: O985 (predict-yes)
  3486. I see 1 and I'm going to do: predict-yes
  3487. ENV: Agent did: predict-yes for direction R in state State-A
  3488. In State-A moving R
  3489. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3490. predict error 0
  3491. dir: dir isU
  3492. -/494: O: O988 (predict-no)
  3493. I see 1 and I'm going to do: predict-no
  3494. ENV: Agent did: predict-no for direction U in state State-B
  3495. In State-B moving U
  3496. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3497. predict error 0
  3498. dir: dir isR
  3499. |\-495: O: O990 (predict-no)
  3500. I see 1 and I'm going to do: predict-no
  3501. ENV: Agent did: predict-no for direction R in state State-B
  3502. In State-B moving R
  3503. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3504. predict error 0
  3505. dir: dir isU
  3506. /|\496: O: O992 (predict-no)
  3507. I see 1 and I'm going to do: predict-no
  3508. ENV: Agent did: predict-no for direction U in state State-B
  3509. In State-B moving U
  3510. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3511. predict error 0
  3512. dir: dir isR
  3513. -/|497: O: O994 (predict-no)
  3514. I see 1 and I'm going to do: predict-no
  3515. ENV: Agent did: predict-no for direction R in state State-B
  3516. In State-B moving R
  3517. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3518. predict error 0
  3519. dir: dir isR
  3520. \-/498: O: O996 (predict-no)
  3521. I see 1 and I'm going to do: predict-no
  3522. ENV: Agent did: predict-no for direction R in state State-B
  3523. In State-B moving R
  3524. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3525. predict error 0
  3526. dir: dir isU
  3527. |\499: O: O998 (predict-no)
  3528. I see 1 and I'm going to do: predict-no
  3529. ENV: Agent did: predict-no for direction U in state State-B
  3530. In State-B moving U
  3531. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3532. predict error 0
  3533. dir: dir isR
  3534. -/500: O: O1000 (predict-no)
  3535. I see 1 and I'm going to do: predict-no
  3536. ENV: Agent did: predict-no for direction R in state State-B
  3537. In State-B moving R
  3538. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3539. predict error 0
  3540. dir: dir isR
  3541. |\-501: O: O1002 (predict-no)
  3542. I see 1 and I'm going to do: predict-no
  3543. ENV: Agent did: predict-no for direction R in state State-B
  3544. In State-B moving R
  3545. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3546. predict error 0
  3547. dir: dir isR
  3548. /502: O: O1004 (predict-no)
  3549. I see 1 and I'm going to do: predict-no
  3550. ENV: Agent did: predict-no for direction R in state State-B
  3551. In State-B moving R
  3552. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3553. predict error 0
  3554. dir: dir isL
  3555. |\-503: O: O1005 (predict-yes)
  3556. I see 1 and I'm going to do: predict-yes
  3557. ENV: Agent did: predict-yes for direction L in state State-B
  3558. In State-B moving L
  3559. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3560. predict error 0
  3561. dir: dir isU
  3562. /504: O: O1008 (predict-no)
  3563. I see 1 and I'm going to do: predict-no
  3564. ENV: Agent did: predict-no for direction U in state State-A
  3565. In State-A moving U
  3566. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3567. predict error 0
  3568. dir: dir isR
  3569. |\-505: O: O1009 (predict-yes)
  3570. I see 1 and I'm going to do: predict-yes
  3571. ENV: Agent did: predict-yes for direction R in state State-A
  3572. In State-A moving R
  3573. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3574. predict error 0
  3575. dir: dir isR
  3576. /|506: O: O1012 (predict-no)
  3577. I see 1 and I'm going to do: predict-no
  3578. ENV: Agent did: predict-no for direction R in state State-B
  3579. In State-B moving R
  3580. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3581. predict error 0
  3582. dir: dir isR
  3583. \-/507: O: O1014 (predict-no)
  3584. I see 1 and I'm going to do: predict-no
  3585. ENV: Agent did: predict-no for direction R in state State-B
  3586. In State-B moving R
  3587. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3588. predict error 0
  3589. dir: dir isU
  3590. |\-/508: O: O1016 (predict-no)
  3591. I see 1 and I'm going to do: predict-no
  3592. ENV: Agent did: predict-no for direction U in state State-B
  3593. In State-B moving U
  3594. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3595. predict error 0
  3596. dir: dir isU
  3597. |\-509: O: O1018 (predict-no)
  3598. I see 1 and I'm going to do: predict-no
  3599. ENV: Agent did: predict-no for direction U in state State-B
  3600. In State-B moving U
  3601. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3602. predict error 0
  3603. dir: dir isU
  3604. /|\510: O: O1020 (predict-no)
  3605. I see 1 and I'm going to do: predict-no
  3606. ENV: Agent did: predict-no for direction U in state State-B
  3607. In State-B moving U
  3608. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3609. predict error 0
  3610. dir: dir isR
  3611. -/|511: O: O1022 (predict-no)
  3612. I see 1 and I'm going to do: predict-no
  3613. ENV: Agent did: predict-no for direction R in state State-B
  3614. In State-B moving R
  3615. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3616. predict error 0
  3617. dir: dir isR
  3618. \512: O: O1024 (predict-no)
  3619. I see 1 and I'm going to do: predict-no
  3620. ENV: Agent did: predict-no for direction R in state State-B
  3621. In State-B moving R
  3622. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3623. predict error 0
  3624. dir: dir isR
  3625. -/|\513: O: O1026 (predict-no)
  3626. I see 1 and I'm going to do: predict-no
  3627. ENV: Agent did: predict-no for direction R in state State-B
  3628. In State-B moving R
  3629. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3630. predict error 0
  3631. dir: dir isL
  3632. -/514: O: O1027 (predict-yes)
  3633. I see 1 and I'm going to do: predict-yes
  3634. ENV: Agent did: predict-yes for direction L in state State-B
  3635. In State-B moving L
  3636. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3637. predict error 0
  3638. dir: dir isL
  3639. |\-515: O: O1030 (predict-no)
  3640. I see 1 and I'm going to do: predict-no
  3641. ENV: Agent did: predict-no for direction L in state State-A
  3642. In State-A moving L
  3643. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3644. predict error 0
  3645. dir: dir isU
  3646. /|\516: O: O1032 (predict-no)
  3647. I see 1 and I'm going to do: predict-no
  3648. ENV: Agent did: predict-no for direction U in state State-A
  3649. In State-A moving U
  3650. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3651. predict error 0
  3652. dir: dir isL
  3653. -/|517: O: O1034 (predict-no)
  3654. I see 1 and I'm going to do: predict-no
  3655. ENV: Agent did: predict-no for direction L in state State-A
  3656. In State-A moving L
  3657. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3658. predict error 0
  3659. dir: dir isU
  3660. \-/518: O: O1036 (predict-no)
  3661. I see 1 and I'm going to do: predict-no
  3662. ENV: Agent did: predict-no for direction U in state State-A
  3663. In State-A moving U
  3664. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3665. predict error 0
  3666. dir: dir isU
  3667. |\-519: O: O1038 (predict-no)
  3668. I see 1 and I'm going to do: predict-no
  3669. ENV: Agent did: predict-no for direction U in state State-A
  3670. In State-A moving U
  3671. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3672. predict error 0
  3673. dir: dir isU
  3674. /520: O: O1040 (predict-no)
  3675. I see 1 and I'm going to do: predict-no
  3676. ENV: Agent did: predict-no for direction U in state State-A
  3677. In State-A moving U
  3678. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3679. predict error 0
  3680. dir: dir isL
  3681. |\521: O: O1042 (predict-no)
  3682. I see 1 and I'm going to do: predict-no
  3683. ENV: Agent did: predict-no for direction L in state State-A
  3684. In State-A moving L
  3685. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3686. predict error 0
  3687. dir: dir isL
  3688. -522: O: O1044 (predict-no)
  3689. I see 1 and I'm going to do: predict-no
  3690. ENV: Agent did: predict-no for direction L in state State-A
  3691. In State-A moving L
  3692. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3693. predict error 0
  3694. dir: dir isU
  3695. /|523: O: O1046 (predict-no)
  3696. I see 1 and I'm going to do: predict-no
  3697. ENV: Agent did: predict-no for direction U in state State-A
  3698. In State-A moving U
  3699. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3700. predict error 0
  3701. dir: dir isL
  3702. \-524: O: O1048 (predict-no)
  3703. I see 1 and I'm going to do: predict-no
  3704. ENV: Agent did: predict-no for direction L in state State-A
  3705. In State-A moving L
  3706. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3707. predict error 0
  3708. dir: dir isL
  3709. /|525: O: O1050 (predict-no)
  3710. I see 1 and I'm going to do: predict-no
  3711. ENV: Agent did: predict-no for direction L in state State-A
  3712. In State-A moving L
  3713. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3714. predict error 0
  3715. dir: dir isR
  3716. \-526: O: O1052 (predict-no)
  3717. I see 1 and I'm going to do: predict-no
  3718. ENV: Agent did: predict-no for direction R in state State-A
  3719. In State-A moving R
  3720. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3721. predict error 1
  3722. dir: dir isL
  3723. /|\527: O: O1053 (predict-yes)
  3724. I see 0 and I'm going to do: predict-yes
  3725. ENV: Agent did: predict-yes for direction L in state State-B
  3726. In State-B moving L
  3727. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3728. predict error 0
  3729. dir: dir isL
  3730. -/528: O: O1056 (predict-no)
  3731. I see 1 and I'm going to do: predict-no
  3732. ENV: Agent did: predict-no for direction L in state State-A
  3733. In State-A moving L
  3734. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3735. predict error 0
  3736. dir: dir isU
  3737. |\529: O: O1058 (predict-no)
  3738. I see 1 and I'm going to do: predict-no
  3739. ENV: Agent did: predict-no for direction U in state State-A
  3740. In State-A moving U
  3741. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3742. predict error 0
  3743. dir: dir isL
  3744. -/|530: O: O1060 (predict-no)
  3745. I see 1 and I'm going to do: predict-no
  3746. ENV: Agent did: predict-no for direction L in state State-A
  3747. In State-A moving L
  3748. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3749. predict error 0
  3750. dir: dir isU
  3751. \-/531: O: O1062 (predict-no)
  3752. I see 1 and I'm going to do: predict-no
  3753. ENV: Agent did: predict-no for direction U in state State-A
  3754. In State-A moving U
  3755. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3756. predict error 0
  3757. dir: dir isR
  3758. |532: O: O1063 (predict-yes)
  3759. I see 1 and I'm going to do: predict-yes
  3760. ENV: Agent did: predict-yes for direction R in state State-A
  3761. In State-A moving R
  3762. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3763. predict error 0
  3764. dir: dir isL
  3765. \-/533: O: O1065 (predict-yes)
  3766. I see 1 and I'm going to do: predict-yes
  3767. ENV: Agent did: predict-yes for direction L in state State-B
  3768. In State-B moving L
  3769. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3770. predict error 0
  3771. dir: dir isU
  3772. |\-534: O: O1068 (predict-no)
  3773. I see 1 and I'm going to do: predict-no
  3774. ENV: Agent did: predict-no for direction U in state State-A
  3775. In State-A moving U
  3776. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3777. predict error 0
  3778. dir: dir isL
  3779. /|\535: O: O1070 (predict-no)
  3780. I see 1 and I'm going to do: predict-no
  3781. ENV: Agent did: predict-no for direction L in state State-A
  3782. In State-A moving L
  3783. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3784. predict error 0
  3785. dir: dir isR
  3786. -/|536: O: O1071 (predict-yes)
  3787. I see 1 and I'm going to do: predict-yes
  3788. ENV: Agent did: predict-yes for direction R in state State-A
  3789. In State-A moving R
  3790. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3791. predict error 0
  3792. dir: dir isR
  3793. \-/537: O: O1074 (predict-no)
  3794. I see 1 and I'm going to do: predict-no
  3795. ENV: Agent did: predict-no for direction R in state State-B
  3796. In State-B moving R
  3797. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3798. predict error 0
  3799. dir: dir isL
  3800. |538: O: O1075 (predict-yes)
  3801. I see 1 and I'm going to do: predict-yes
  3802. ENV: Agent did: predict-yes for direction L in state State-B
  3803. In State-B moving L
  3804. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3805. predict error 0
  3806. dir: dir isR
  3807. \-539: O: O1077 (predict-yes)
  3808. I see 1 and I'm going to do: predict-yes
  3809. ENV: Agent did: predict-yes for direction R in state State-A
  3810. In State-A moving R
  3811. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3812. predict error 0
  3813. dir: dir isL
  3814. /|\540: O: O1079 (predict-yes)
  3815. I see 1 and I'm going to do: predict-yes
  3816. ENV: Agent did: predict-yes for direction L in state State-B
  3817. In State-B moving L
  3818. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3819. predict error 0
  3820. dir: dir isL
  3821. -/|541: O: O1082 (predict-no)
  3822. I see 1 and I'm going to do: predict-no
  3823. ENV: Agent did: predict-no for direction L in state State-A
  3824. In State-A moving L
  3825. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3826. predict error 0
  3827. dir: dir isU
  3828. \542: O: O1084 (predict-no)
  3829. I see 1 and I'm going to do: predict-no
  3830. ENV: Agent did: predict-no for direction U in state State-A
  3831. In State-A moving U
  3832. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3833. predict error 0
  3834. dir: dir isU
  3835. -/|543: O: O1086 (predict-no)
  3836. I see 1 and I'm going to do: predict-no
  3837. ENV: Agent did: predict-no for direction U in state State-A
  3838. In State-A moving U
  3839. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3840. predict error 0
  3841. dir: dir isR
  3842. \-/544: O: O1087 (predict-yes)
  3843. I see 1 and I'm going to do: predict-yes
  3844. ENV: Agent did: predict-yes for direction R in state State-A
  3845. In State-A moving R
  3846. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3847. predict error 0
  3848. dir: dir isU
  3849. |\-545: O: O1090 (predict-no)
  3850. I see 1 and I'm going to do: predict-no
  3851. ENV: Agent did: predict-no for direction U in state State-B
  3852. In State-B moving U
  3853. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3854. predict error 0
  3855. dir: dir isU
  3856. /|\546: O: O1092 (predict-no)
  3857. I see 1 and I'm going to do: predict-no
  3858. ENV: Agent did: predict-no for direction U in state State-B
  3859. In State-B moving U
  3860. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3861. predict error 0
  3862. dir: dir isL
  3863. -547: O: O1093 (predict-yes)
  3864. I see 1 and I'm going to do: predict-yes
  3865. ENV: Agent did: predict-yes for direction L in state State-B
  3866. In State-B moving L
  3867. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3868. predict error 0
  3869. dir: dir isR
  3870. /|548: O: O1095 (predict-yes)
  3871. I see 1 and I'm going to do: predict-yes
  3872. ENV: Agent did: predict-yes for direction R in state State-A
  3873. In State-A moving R
  3874. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3875. predict error 0
  3876. dir: dir isL
  3877. \-/549: O: O1097 (predict-yes)
  3878. I see 1 and I'm going to do: predict-yes
  3879. ENV: Agent did: predict-yes for direction L in state State-B
  3880. In State-B moving L
  3881. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3882. predict error 0
  3883. dir: dir isL
  3884. |\-550: O: O1100 (predict-no)
  3885. I see 1 and I'm going to do: predict-no
  3886. ENV: Agent did: predict-no for direction L in state State-A
  3887. In State-A moving L
  3888. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3889. predict error 0
  3890. dir: dir isL
  3891. /|\551: O: O1102 (predict-no)
  3892. I see 1 and I'm going to do: predict-no
  3893. ENV: Agent did: predict-no for direction L in state State-A
  3894. In State-A moving L
  3895. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3896. predict error 0
  3897. dir: dir isL
  3898. -552: O: O1104 (predict-no)
  3899. I see 1 and I'm going to do: predict-no
  3900. ENV: Agent did: predict-no for direction L in state State-A
  3901. In State-A moving L
  3902. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3903. predict error 0
  3904. dir: dir isL
  3905. /|553: O: O1106 (predict-no)
  3906. I see 1 and I'm going to do: predict-no
  3907. ENV: Agent did: predict-no for direction L in state State-A
  3908. In State-A moving L
  3909. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3910. predict error 0
  3911. dir: dir isL
  3912. \-554: O: O1108 (predict-no)
  3913. I see 1 and I'm going to do: predict-no
  3914. ENV: Agent did: predict-no for direction L in state State-A
  3915. In State-A moving L
  3916. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3917. predict error 0
  3918. dir: dir isR
  3919. /|555: O: O1109 (predict-yes)
  3920. I see 1 and I'm going to do: predict-yes
  3921. ENV: Agent did: predict-yes for direction R in state State-A
  3922. In State-A moving R
  3923. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3924. predict error 0
  3925. dir: dir isR
  3926. \556: O: O1112 (predict-no)
  3927. I see 1 and I'm going to do: predict-no
  3928. ENV: Agent did: predict-no for direction R in state State-B
  3929. In State-B moving R
  3930. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3931. predict error 0
  3932. dir: dir isU
  3933. -557: O: O1114 (predict-no)
  3934. I see 1 and I'm going to do: predict-no
  3935. ENV: Agent did: predict-no for direction U in state State-B
  3936. In State-B moving U
  3937. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3938. predict error 0
  3939. dir: dir isU
  3940. /|558: O: O1116 (predict-no)
  3941. I see 1 and I'm going to do: predict-no
  3942. ENV: Agent did: predict-no for direction U in state State-B
  3943. In State-B moving U
  3944. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3945. predict error 0
  3946. dir: dir isU
  3947. \-/559: O: O1117 (predict-yes)
  3948. I see 1 and I'm going to do: predict-yes
  3949. ENV: Agent did: predict-yes for direction U in state State-B
  3950. In State-B moving U
  3951. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3952. predict error 1
  3953. dir: dir isR
  3954. |\560: O: O1120 (predict-no)
  3955. I see 0 and I'm going to do: predict-no
  3956. ENV: Agent did: predict-no for direction R in state State-B
  3957. In State-B moving R
  3958. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3959. predict error 0
  3960. dir: dir isU
  3961. -/561: O: O1122 (predict-no)
  3962. I see 1 and I'm going to do: predict-no
  3963. ENV: Agent did: predict-no for direction U in state State-B
  3964. In State-B moving U
  3965. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3966. predict error 0
  3967. dir: dir isL
  3968. |562: O: O1123 (predict-yes)
  3969. I see 1 and I'm going to do: predict-yes
  3970. ENV: Agent did: predict-yes for direction L in state State-B
  3971. In State-B moving L
  3972. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3973. predict error 0
  3974. dir: dir isL
  3975. \-/|563: O: O1126 (predict-no)
  3976. I see 1 and I'm going to do: predict-no
  3977. ENV: Agent did: predict-no for direction L in state State-A
  3978. In State-A moving L
  3979. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3980. predict error 0
  3981. dir: dir isU
  3982. \-/564: O: O1128 (predict-no)
  3983. I see 1 and I'm going to do: predict-no
  3984. ENV: Agent did: predict-no for direction U in state State-A
  3985. In State-A moving U
  3986. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3987. predict error 0
  3988. dir: dir isR
  3989. |565: O: O1129 (predict-yes)
  3990. I see 1 and I'm going to do: predict-yes
  3991. ENV: Agent did: predict-yes for direction R in state State-A
  3992. In State-A moving R
  3993. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3994. predict error 0
  3995. dir: dir isL
  3996. \-566: O: O1131 (predict-yes)
  3997. I see 1 and I'm going to do: predict-yes
  3998. ENV: Agent did: predict-yes for direction L in state State-B
  3999. In State-B moving L
  4000. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4001. predict error 0
  4002. dir: dir isR
  4003. /|\567: O: O1133 (predict-yes)
  4004. I see 1 and I'm going to do: predict-yes
  4005. ENV: Agent did: predict-yes for direction R in state State-A
  4006. In State-A moving R
  4007. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4008. predict error 0
  4009. dir: dir isL
  4010. -/|568: O: O1135 (predict-yes)
  4011. I see 1 and I'm going to do: predict-yes
  4012. ENV: Agent did: predict-yes for direction L in state State-B
  4013. In State-B moving L
  4014. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4015. predict error 0
  4016. dir: dir isR
  4017. \569: O: O1137 (predict-yes)
  4018. I see 1 and I'm going to do: predict-yes
  4019. ENV: Agent did: predict-yes for direction R in state State-A
  4020. In State-A moving R
  4021. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4022. predict error 0
  4023. dir: dir isR
  4024. -/|570: O: O1140 (predict-no)
  4025. I see 1 and I'm going to do: predict-no
  4026. ENV: Agent did: predict-no for direction R in state State-B
  4027. In State-B moving R
  4028. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4029. predict error 0
  4030. dir: dir isR
  4031. \-/571: O: O1142 (predict-no)
  4032. I see 1 and I'm going to do: predict-no
  4033. ENV: Agent did: predict-no for direction R in state State-B
  4034. In State-B moving R
  4035. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4036. predict error 0
  4037. dir: dir isU
  4038. |572: O: O1144 (predict-no)
  4039. I see 1 and I'm going to do: predict-no
  4040. ENV: Agent did: predict-no for direction U in state State-B
  4041. In State-B moving U
  4042. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4043. predict error 0
  4044. dir: dir isU
  4045. \573: O: O1146 (predict-no)
  4046. I see 1 and I'm going to do: predict-no
  4047. ENV: Agent did: predict-no for direction U in state State-B
  4048. In State-B moving U
  4049. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4050. predict error 0
  4051. dir: dir isR
  4052. -/574: O: O1148 (predict-no)
  4053. I see 1 and I'm going to do: predict-no
  4054. ENV: Agent did: predict-no for direction R in state State-B
  4055. In State-B moving R
  4056. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4057. predict error 0
  4058. dir: dir isR
  4059. |\575: O: O1150 (predict-no)
  4060. I see 1 and I'm going to do: predict-no
  4061. ENV: Agent did: predict-no for direction R in state State-B
  4062. In State-B moving R
  4063. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4064. predict error 0
  4065. dir: dir isL
  4066. -576: O: O1151 (predict-yes)
  4067. I see 1 and I'm going to do: predict-yes
  4068. ENV: Agent did: predict-yes for direction L in state State-B
  4069. In State-B moving L
  4070. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4071. predict error 0
  4072. dir: dir isU
  4073. /|577: O: O1154 (predict-no)
  4074. I see 1 and I'm going to do: predict-no
  4075. ENV: Agent did: predict-no for direction U in state State-A
  4076. In State-A moving U
  4077. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4078. predict error 0
  4079. dir: dir isU
  4080. \-/578: O: O1156 (predict-no)
  4081. I see 1 and I'm going to do: predict-no
  4082. ENV: Agent did: predict-no for direction U in state State-A
  4083. In State-A moving U
  4084. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4085. predict error 0
  4086. dir: dir isL
  4087. |\579: O: O1158 (predict-no)
  4088. I see 1 and I'm going to do: predict-no
  4089. ENV: Agent did: predict-no for direction L in state State-A
  4090. In State-A moving L
  4091. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4092. predict error 0
  4093. dir: dir isU
  4094. -/|580: O: O1160 (predict-no)
  4095. I see 1 and I'm going to do: predict-no
  4096. ENV: Agent did: predict-no for direction U in state State-A
  4097. In State-A moving U
  4098. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4099. predict error 0
  4100. dir: dir isU
  4101. \581: O: O1162 (predict-no)
  4102. I see 1 and I'm going to do: predict-no
  4103. ENV: Agent did: predict-no for direction U in state State-A
  4104. In State-A moving U
  4105. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4106. predict error 0
  4107. dir: dir isR
  4108. -582: O: O1163 (predict-yes)
  4109. I see 1 and I'm going to do: predict-yes
  4110. ENV: Agent did: predict-yes for direction R in state State-A
  4111. In State-A moving R
  4112. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4113. predict error 0
  4114. dir: dir isL
  4115. /|\583: O: O1165 (predict-yes)
  4116. I see 1 and I'm going to do: predict-yes
  4117. ENV: Agent did: predict-yes for direction L in state State-B
  4118. In State-B moving L
  4119. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4120. predict error 0
  4121. dir: dir isR
  4122. -/|584: O: O1167 (predict-yes)
  4123. I see 1 and I'm going to do: predict-yes
  4124. ENV: Agent did: predict-yes for direction R in state State-A
  4125. In State-A moving R
  4126. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4127. predict error 0
  4128. dir: dir isL
  4129. \-585: O: O1169 (predict-yes)
  4130. I see 1 and I'm going to do: predict-yes
  4131. ENV: Agent did: predict-yes for direction L in state State-B
  4132. In State-B moving L
  4133. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4134. predict error 0
  4135. dir: dir isL
  4136. /|586: O: O1172 (predict-no)
  4137. I see 1 and I'm going to do: predict-no
  4138. ENV: Agent did: predict-no for direction L in state State-A
  4139. In State-A moving L
  4140. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4141. predict error 0
  4142. dir: dir isR
  4143. \-/587: O: O1173 (predict-yes)
  4144. I see 1 and I'm going to do: predict-yes
  4145. ENV: Agent did: predict-yes for direction R in state State-A
  4146. In State-A moving R
  4147. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4148. predict error 0
  4149. dir: dir isR
  4150. |\588: O: O1176 (predict-no)
  4151. I see 1 and I'm going to do: predict-no
  4152. ENV: Agent did: predict-no for direction R in state State-B
  4153. In State-B moving R
  4154. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4155. predict error 0
  4156. dir: dir isR
  4157. -/589: O: O1178 (predict-no)
  4158. I see 1 and I'm going to do: predict-no
  4159. ENV: Agent did: predict-no for direction R in state State-B
  4160. In State-B moving R
  4161. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4162. predict error 0
  4163. dir: dir isU
  4164. |\590: O: O1180 (predict-no)
  4165. I see 1 and I'm going to do: predict-no
  4166. ENV: Agent did: predict-no for direction U in state State-B
  4167. In State-B moving U
  4168. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4169. predict error 0
  4170. dir: dir isU
  4171. -/|591: O: O1182 (predict-no)
  4172. I see 1 and I'm going to do: predict-no
  4173. ENV: Agent did: predict-no for direction U in state State-B
  4174. In State-B moving U
  4175. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4176. predict error 0
  4177. dir: dir isR
  4178. \592: O: O1184 (predict-no)
  4179. I see 1 and I'm going to do: predict-no
  4180. ENV: Agent did: predict-no for direction R in state State-B
  4181. In State-B moving R
  4182. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4183. predict error 0
  4184. dir: dir isL
  4185. -/|593: O: O1185 (predict-yes)
  4186. I see 1 and I'm going to do: predict-yes
  4187. ENV: Agent did: predict-yes for direction L in state State-B
  4188. In State-B moving L
  4189. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4190. predict error 0
  4191. dir: dir isU
  4192. \-/594: O: O1188 (predict-no)
  4193. I see 1 and I'm going to do: predict-no
  4194. ENV: Agent did: predict-no for direction U in state State-A
  4195. In State-A moving U
  4196. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4197. predict error 0
  4198. dir: dir isU
  4199. |\-595: O: O1190 (predict-no)
  4200. I see 1 and I'm going to do: predict-no
  4201. ENV: Agent did: predict-no for direction U in state State-A
  4202. In State-A moving U
  4203. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4204. predict error 0
  4205. dir: dir isU
  4206. /|\596: O: O1192 (predict-no)
  4207. I see 1 and I'm going to do: predict-no
  4208. ENV: Agent did: predict-no for direction U in state State-A
  4209. In State-A moving U
  4210. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4211. predict error 0
  4212. dir: dir isR
  4213. -/|597: O: O1193 (predict-yes)
  4214. I see 1 and I'm going to do: predict-yes
  4215. ENV: Agent did: predict-yes for direction R in state State-A
  4216. In State-A moving R
  4217. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4218. predict error 0
  4219. dir: dir isL
  4220. \-/598: O: O1195 (predict-yes)
  4221. I see 1 and I'm going to do: predict-yes
  4222. ENV: Agent did: predict-yes for direction L in state State-B
  4223. In State-B moving L
  4224. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4225. predict error 0
  4226. dir: dir isL
  4227. |\599: O: O1198 (predict-no)
  4228. I see 1 and I'm going to do: predict-no
  4229. ENV: Agent did: predict-no for direction L in state State-A
  4230. In State-A moving L
  4231. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4232. predict error 0
  4233. dir: dir isL
  4234. -/600: O: O1200 (predict-no)
  4235. I see 1 and I'm going to do: predict-no
  4236. ENV: Agent did: predict-no for direction L in state State-A
  4237. In State-A moving L
  4238. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4239. predict error 0
  4240. dir: dir isR
  4241. |\-601: O: O1201 (predict-yes)
  4242. I see 1 and I'm going to do: predict-yes
  4243. ENV: Agent did: predict-yes for direction R in state State-A
  4244. In State-A moving R
  4245. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4246. predict error 0
  4247. dir: dir isR
  4248. /602: O: O1204 (predict-no)
  4249. I see 1 and I'm going to do: predict-no
  4250. ENV: Agent did: predict-no for direction R in state State-B
  4251. In State-B moving R
  4252. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4253. predict error 0
  4254. dir: dir isL
  4255. |\-603: O: O1205 (predict-yes)
  4256. I see 1 and I'm going to do: predict-yes
  4257. ENV: Agent did: predict-yes for direction L in state State-B
  4258. In State-B moving L
  4259. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4260. predict error 0
  4261. dir: dir isU
  4262. /604: O: O1208 (predict-no)
  4263. I see 1 and I'm going to do: predict-no
  4264. ENV: Agent did: predict-no for direction U in state State-A
  4265. In State-A moving U
  4266. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4267. predict error 0
  4268. dir: dir isU
  4269. |605: O: O1209 (predict-yes)
  4270. I see 1 and I'm going to do: predict-yes
  4271. ENV: Agent did: predict-yes for direction U in state State-A
  4272. In State-A moving U
  4273. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  4274. predict error 1
  4275. dir: dir isU
  4276. \-606: O: O1212 (predict-no)
  4277. I see 0 and I'm going to do: predict-no
  4278. ENV: Agent did: predict-no for direction U in state State-A
  4279. In State-A moving U
  4280. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4281. predict error 0
  4282. dir: dir isR
  4283. /|\607: O: O1213 (predict-yes)
  4284. I see 1 and I'm going to do: predict-yes
  4285. ENV: Agent did: predict-yes for direction R in state State-A
  4286. In State-A moving R
  4287. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4288. predict error 0
  4289. dir: dir isL
  4290. -608: O: O1215 (predict-yes)
  4291. I see 1 and I'm going to do: predict-yes
  4292. ENV: Agent did: predict-yes for direction L in state State-B
  4293. In State-B moving L
  4294. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4295. predict error 0
  4296. dir: dir isL
  4297. /|609: O: O1218 (predict-no)
  4298. I see 1 and I'm going to do: predict-no
  4299. ENV: Agent did: predict-no for direction L in state State-A
  4300. In State-A moving L
  4301. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4302. predict error 0
  4303. dir: dir isU
  4304. \-/610: O: O1220 (predict-no)
  4305. I see 1 and I'm going to do: predict-no
  4306. ENV: Agent did: predict-no for direction U in state State-A
  4307. In State-A moving U
  4308. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4309. predict error 0
  4310. dir: dir isU
  4311. |\-611: O: O1222 (predict-no)
  4312. I see 1 and I'm going to do: predict-no
  4313. ENV: Agent did: predict-no for direction U in state State-A
  4314. In State-A moving U
  4315. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4316. predict error 0
  4317. dir: dir isL
  4318. /612: O: O1224 (predict-no)
  4319. I see 1 and I'm going to do: predict-no
  4320. ENV: Agent did: predict-no for direction L in state State-A
  4321. In State-A moving L
  4322. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4323. predict error 0
  4324. dir: dir isU
  4325. |\-613: O: O1226 (predict-no)
  4326. I see 1 and I'm going to do: predict-no
  4327. ENV: Agent did: predict-no for direction U in state State-A
  4328. In State-A moving U
  4329. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4330. predict error 0
  4331. dir: dir isR
  4332. /|614: O: O1227 (predict-yes)
  4333. I see 1 and I'm going to do: predict-yes
  4334. ENV: Agent did: predict-yes for direction R in state State-A
  4335. In State-A moving R
  4336. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4337. predict error 0
  4338. dir: dir isR
  4339. \-615: O: O1230 (predict-no)
  4340. I see 1 and I'm going to do: predict-no
  4341. ENV: Agent did: predict-no for direction R in state State-B
  4342. In State-B moving R
  4343. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4344. predict error 0
  4345. dir: dir isR
  4346. /|\616: O: O1232 (predict-no)
  4347. I see 1 and I'm going to do: predict-no
  4348. ENV: Agent did: predict-no for direction R in state State-B
  4349. In State-B moving R
  4350. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4351. predict error 0
  4352. dir: dir isU
  4353. -/|617: O: O1234 (predict-no)
  4354. I see 1 and I'm going to do: predict-no
  4355. ENV: Agent did: predict-no for direction U in state State-B
  4356. In State-B moving U
  4357. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4358. predict error 0
  4359. dir: dir isR
  4360. \618: O: O1236 (predict-no)
  4361. I see 1 and I'm going to do: predict-no
  4362. ENV: Agent did: predict-no for direction R in state State-B
  4363. In State-B moving R
  4364. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4365. predict error 0
  4366. dir: dir isR
  4367. -/|619: O: O1238 (predict-no)
  4368. I see 1 and I'm going to do: predict-no
  4369. ENV: Agent did: predict-no for direction R in state State-B
  4370. In State-B moving R
  4371. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4372. predict error 0
  4373. dir: dir isL
  4374. \-/620: O: O1239 (predict-yes)
  4375. I see 1 and I'm going to do: predict-yes
  4376. ENV: Agent did: predict-yes for direction L in state State-B
  4377. In State-B moving L
  4378. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4379. predict error 0
  4380. dir: dir isU
  4381. |\-621: O: O1242 (predict-no)
  4382. I see 1 and I'm going to do: predict-no
  4383. ENV: Agent did: predict-no for direction U in state State-A
  4384. In State-A moving U
  4385. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4386. predict error 0
  4387. dir: dir isL
  4388. /622: O: O1244 (predict-no)
  4389. I see 1 and I'm going to do: predict-no
  4390. ENV: Agent did: predict-no for direction L in state State-A
  4391. In State-A moving L
  4392. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4393. predict error 0
  4394. dir: dir isU
  4395. |\-623: O: O1246 (predict-no)
  4396. I see 1 and I'm going to do: predict-no
  4397. ENV: Agent did: predict-no for direction U in state State-A
  4398. In State-A moving U
  4399. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4400. predict error 0
  4401. dir: dir isL
  4402. /|624: O: O1248 (predict-no)
  4403. I see 1 and I'm going to do: predict-no
  4404. ENV: Agent did: predict-no for direction L in state State-A
  4405. In State-A moving L
  4406. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4407. predict error 0
  4408. dir: dir isL
  4409. \-/625: O: O1250 (predict-no)
  4410. I see 1 and I'm going to do: predict-no
  4411. ENV: Agent did: predict-no for direction L in state State-A
  4412. In State-A moving L
  4413. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4414. predict error 0
  4415. dir: dir isR
  4416. |\-/sleeping...
  4417. |626: O: O1251 (predict-yes)
  4418. I see 1 and I'm going to do: predict-yes
  4419. ENV: Agent did: predict-yes for direction R in state State-A
  4420. In State-A moving R
  4421. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4422. predict error 0
  4423. dir: dir isR
  4424. \-627: O: O1254 (predict-no)
  4425. I see 1 and I'm going to do: predict-no
  4426. ENV: Agent did: predict-no for direction R in state State-B
  4427. In State-B moving R
  4428. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4429. predict error 0
  4430. dir: dir isR
  4431. /|\628: O: O1256 (predict-no)
  4432. I see 1 and I'm going to do: predict-no
  4433. ENV: Agent did: predict-no for direction R in state State-B
  4434. In State-B moving R
  4435. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4436. predict error 0
  4437. dir: dir isU
  4438. -/629: O: O1258 (predict-no)
  4439. I see 1 and I'm going to do: predict-no
  4440. ENV: Agent did: predict-no for direction U in state State-B
  4441. In State-B moving U
  4442. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4443. predict error 0
  4444. dir: dir isL
  4445. |630: O: O1259 (predict-yes)
  4446. I see 1 and I'm going to do: predict-yes
  4447. ENV: Agent did: predict-yes for direction L in state State-B
  4448. In State-B moving L
  4449. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4450. predict error 0
  4451. dir: dir isU
  4452. \631: O: O1262 (predict-no)
  4453. I see 1 and I'm going to do: predict-no
  4454. ENV: Agent did: predict-no for direction U in state State-A
  4455. In State-A moving U
  4456. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4457. predict error 0
  4458. dir: dir isU
  4459. -632: O: O1264 (predict-no)
  4460. I see 1 and I'm going to do: predict-no
  4461. ENV: Agent did: predict-no for direction U in state State-A
  4462. In State-A moving U
  4463. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4464. predict error 0
  4465. dir: dir isU
  4466. /|633: O: O1266 (predict-no)
  4467. I see 1 and I'm going to do: predict-no
  4468. ENV: Agent did: predict-no for direction U in state State-A
  4469. In State-A moving U
  4470. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4471. predict error 0
  4472. dir: dir isR
  4473. \-634: O: O1267 (predict-yes)
  4474. I see 1 and I'm going to do: predict-yes
  4475. ENV: Agent did: predict-yes for direction R in state State-A
  4476. In State-A moving R
  4477. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4478. predict error 0
  4479. dir: dir isR
  4480. /|\635: O: O1270 (predict-no)
  4481. I see 1 and I'm going to do: predict-no
  4482. ENV: Agent did: predict-no for direction R in state State-B
  4483. In State-B moving R
  4484. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4485. predict error 0
  4486. dir: dir isR
  4487. -636: O: O1272 (predict-no)
  4488. I see 1 and I'm going to do: predict-no
  4489. ENV: Agent did: predict-no for direction R in state State-B
  4490. In State-B moving R
  4491. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4492. predict error 0
  4493. dir: dir isR
  4494. /637: O: O1274 (predict-no)
  4495. I see 1 and I'm going to do: predict-no
  4496. ENV: Agent did: predict-no for direction R in state State-B
  4497. In State-B moving R
  4498. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4499. predict error 0
  4500. dir: dir isL
  4501. |\-638: O: O1275 (predict-yes)
  4502. I see 1 and I'm going to do: predict-yes
  4503. ENV: Agent did: predict-yes for direction L in state State-B
  4504. In State-B moving L
  4505. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4506. predict error 0
  4507. dir: dir isL
  4508. /|639: O: O1278 (predict-no)
  4509. I see 1 and I'm going to do: predict-no
  4510. ENV: Agent did: predict-no for direction L in state State-A
  4511. In State-A moving L
  4512. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4513. predict error 0
  4514. dir: dir isR
  4515. \-/640: O: O1279 (predict-yes)
  4516. I see 1 and I'm going to do: predict-yes
  4517. ENV: Agent did: predict-yes for direction R in state State-A
  4518. In State-A moving R
  4519. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4520. predict error 0
  4521. dir: dir isL
  4522. |\641: O: O1281 (predict-yes)
  4523. I see 1 and I'm going to do: predict-yes
  4524. ENV: Agent did: predict-yes for direction L in state State-B
  4525. In State-B moving L
  4526. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4527. predict error 0
  4528. dir: dir isR
  4529. -642: O: O1283 (predict-yes)
  4530. I see 1 and I'm going to do: predict-yes
  4531. ENV: Agent did: predict-yes for direction R in state State-A
  4532. In State-A moving R
  4533. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4534. predict error 0
  4535. dir: dir isR
  4536. /643: O: O1286 (predict-no)
  4537. I see 1 and I'm going to do: predict-no
  4538. ENV: Agent did: predict-no for direction R in state State-B
  4539. In State-B moving R
  4540. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4541. predict error 0
  4542. dir: dir isL
  4543. |\644: O: O1287 (predict-yes)
  4544. I see 1 and I'm going to do: predict-yes
  4545. ENV: Agent did: predict-yes for direction L in state State-B
  4546. In State-B moving L
  4547. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4548. predict error 0
  4549. dir: dir isL
  4550. -/|645: O: O1290 (predict-no)
  4551. I see 1 and I'm going to do: predict-no
  4552. ENV: Agent did: predict-no for direction L in state State-A
  4553. In State-A moving L
  4554. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4555. predict error 0
  4556. dir: dir isR
  4557. \646: O: O1291 (predict-yes)
  4558. I see 1 and I'm going to do: predict-yes
  4559. ENV: Agent did: predict-yes for direction R in state State-A
  4560. In State-A moving R
  4561. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4562. predict error 0
  4563. dir: dir isU
  4564. -/|647: O: O1294 (predict-no)
  4565. I see 1 and I'm going to do: predict-no
  4566. ENV: Agent did: predict-no for direction U in state State-B
  4567. In State-B moving U
  4568. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4569. predict error 0
  4570. dir: dir isL
  4571. \-648: O: O1295 (predict-yes)
  4572. I see 1 and I'm going to do: predict-yes
  4573. ENV: Agent did: predict-yes for direction L in state State-B
  4574. In State-B moving L
  4575. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4576. predict error 0
  4577. dir: dir isR
  4578. /|\649: O: O1297 (predict-yes)
  4579. I see 1 and I'm going to do: predict-yes
  4580. ENV: Agent did: predict-yes for direction R in state State-A
  4581. In State-A moving R
  4582. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4583. predict error 0
  4584. dir: dir isR
  4585. -/|650: O: O1300 (predict-no)
  4586. I see 1 and I'm going to do: predict-no
  4587. ENV: Agent did: predict-no for direction R in state State-B
  4588. In State-B moving R
  4589. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4590. predict error 0
  4591. dir: dir isU
  4592. \-/651: O: O1302 (predict-no)
  4593. I see 1 and I'm going to do: predict-no
  4594. ENV: Agent did: predict-no for direction U in state State-B
  4595. In State-B moving U
  4596. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4597. predict error 0
  4598. dir: dir isU
  4599. |652: O: O1304 (predict-no)
  4600. I see 1 and I'm going to do: predict-no
  4601. ENV: Agent did: predict-no for direction U in state State-B
  4602. In State-B moving U
  4603. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4604. predict error 0
  4605. dir: dir isL
  4606. \-/653: O: O1305 (predict-yes)
  4607. I see 1 and I'm going to do: predict-yes
  4608. ENV: Agent did: predict-yes for direction L in state State-B
  4609. In State-B moving L
  4610. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4611. predict error 0
  4612. dir: dir isL
  4613. |\654: O: O1308 (predict-no)
  4614. I see 1 and I'm going to do: predict-no
  4615. ENV: Agent did: predict-no for direction L in state State-A
  4616. In State-A moving L
  4617. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4618. predict error 0
  4619. dir: dir isU
  4620. -/655: O: O1310 (predict-no)
  4621. I see 1 and I'm going to do: predict-no
  4622. ENV: Agent did: predict-no for direction U in state State-A
  4623. In State-A moving U
  4624. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4625. predict error 0
  4626. dir: dir isL
  4627. |\-656: O: O1312 (predict-no)
  4628. I see 1 and I'm going to do: predict-no
  4629. ENV: Agent did: predict-no for direction L in state State-A
  4630. In State-A moving L
  4631. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4632. predict error 0
  4633. dir: dir isL
  4634. /|657: O: O1314 (predict-no)
  4635. I see 1 and I'm going to do: predict-no
  4636. ENV: Agent did: predict-no for direction L in state State-A
  4637. In State-A moving L
  4638. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4639. predict error 0
  4640. dir: dir isU
  4641. \658: O: O1316 (predict-no)
  4642. I see 1 and I'm going to do: predict-no
  4643. ENV: Agent did: predict-no for direction U in state State-A
  4644. In State-A moving U
  4645. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4646. predict error 0
  4647. dir: dir isL
  4648. -659: O: O1318 (predict-no)
  4649. I see 1 and I'm going to do: predict-no
  4650. ENV: Agent did: predict-no for direction L in state State-A
  4651. In State-A moving L
  4652. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4653. predict error 0
  4654. dir: dir isL
  4655. /|\660: O: O1320 (predict-no)
  4656. I see 1 and I'm going to do: predict-no
  4657. ENV: Agent did: predict-no for direction L in state State-A
  4658. In State-A moving L
  4659. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4660. predict error 0
  4661. dir: dir isL
  4662. -/|661: O: O1322 (predict-no)
  4663. I see 1 and I'm going to do: predict-no
  4664. ENV: Agent did: predict-no for direction L in state State-A
  4665. In State-A moving L
  4666. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4667. predict error 0
  4668. dir: dir isL
  4669. \662: O: O1324 (predict-no)
  4670. I see 1 and I'm going to do: predict-no
  4671. ENV: Agent did: predict-no for direction L in state State-A
  4672. In State-A moving L
  4673. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4674. predict error 0
  4675. dir: dir isU
  4676. -/|663: O: O1326 (predict-no)
  4677. I see 1 and I'm going to do: predict-no
  4678. ENV: Agent did: predict-no for direction U in state State-A
  4679. In State-A moving U
  4680. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4681. predict error 0
  4682. dir: dir isR
  4683. \-664: O: O1327 (predict-yes)
  4684. I see 1 and I'm going to do: predict-yes
  4685. ENV: Agent did: predict-yes for direction R in state State-A
  4686. In State-A moving R
  4687. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4688. predict error 0
  4689. dir: dir isR
  4690. /|665: O: O1330 (predict-no)
  4691. I see 1 and I'm going to do: predict-no
  4692. ENV: Agent did: predict-no for direction R in state State-B
  4693. In State-B moving R
  4694. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4695. predict error 0
  4696. dir: dir isR
  4697. \-/666: O: O1332 (predict-no)
  4698. I see 1 and I'm going to do: predict-no
  4699. ENV: Agent did: predict-no for direction R in state State-B
  4700. In State-B moving R
  4701. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4702. predict error 0
  4703. dir: dir isU
  4704. |\-667: O: O1334 (predict-no)
  4705. I see 1 and I'm going to do: predict-no
  4706. ENV: Agent did: predict-no for direction U in state State-B
  4707. In State-B moving U
  4708. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4709. predict error 0
  4710. dir: dir isL
  4711. /|668: O: O1335 (predict-yes)
  4712. I see 1 and I'm going to do: predict-yes
  4713. ENV: Agent did: predict-yes for direction L in state State-B
  4714. In State-B moving L
  4715. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4716. predict error 0
  4717. dir: dir isR
  4718. \-/669: O: O1337 (predict-yes)
  4719. I see 1 and I'm going to do: predict-yes
  4720. ENV: Agent did: predict-yes for direction R in state State-A
  4721. In State-A moving R
  4722. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4723. predict error 0
  4724. dir: dir isL
  4725. |\670: O: O1339 (predict-yes)
  4726. I see 1 and I'm going to do: predict-yes
  4727. ENV: Agent did: predict-yes for direction L in state State-B
  4728. In State-B moving L
  4729. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4730. predict error 0
  4731. dir: dir isL
  4732. -/|671: O: O1342 (predict-no)
  4733. I see 1 and I'm going to do: predict-no
  4734. ENV: Agent did: predict-no for direction L in state State-A
  4735. In State-A moving L
  4736. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4737. predict error 0
  4738. dir: dir isR
  4739. \672: O: O1343 (predict-yes)
  4740. I see 1 and I'm going to do: predict-yes
  4741. ENV: Agent did: predict-yes for direction R in state State-A
  4742. In State-A moving R
  4743. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4744. predict error 0
  4745. dir: dir isR
  4746. -/|673: O: O1346 (predict-no)
  4747. I see 1 and I'm going to do: predict-no
  4748. ENV: Agent did: predict-no for direction R in state State-B
  4749. In State-B moving R
  4750. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4751. predict error 0
  4752. dir: dir isL
  4753. \-674: O: O1347 (predict-yes)
  4754. I see 1 and I'm going to do: predict-yes
  4755. ENV: Agent did: predict-yes for direction L in state State-B
  4756. In State-B moving L
  4757. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4758. predict error 0
  4759. dir: dir isR
  4760. /|675: O: O1349 (predict-yes)
  4761. I see 1 and I'm going to do: predict-yes
  4762. ENV: Agent did: predict-yes for direction R in state State-A
  4763. In State-A moving R
  4764. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4765. predict error 0
  4766. dir: dir isU
  4767. \-/676: O: O1352 (predict-no)
  4768. I see 1 and I'm going to do: predict-no
  4769. ENV: Agent did: predict-no for direction U in state State-B
  4770. In State-B moving U
  4771. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4772. predict error 0
  4773. dir: dir isR
  4774. |\-677: O: O1354 (predict-no)
  4775. I see 1 and I'm going to do: predict-no
  4776. ENV: Agent did: predict-no for direction R in state State-B
  4777. In State-B moving R
  4778. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4779. predict error 0
  4780. dir: dir isR
  4781. /|678: O: O1356 (predict-no)
  4782. I see 1 and I'm going to do: predict-no
  4783. ENV: Agent did: predict-no for direction R in state State-B
  4784. In State-B moving R
  4785. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4786. predict error 0
  4787. dir: dir isU
  4788. \-/679: O: O1358 (predict-no)
  4789. I see 1 and I'm going to do: predict-no
  4790. ENV: Agent did: predict-no for direction U in state State-B
  4791. In State-B moving U
  4792. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4793. predict error 0
  4794. dir: dir isU
  4795. |\-680: O: O1360 (predict-no)
  4796. I see 1 and I'm going to do: predict-no
  4797. ENV: Agent did: predict-no for direction U in state State-B
  4798. In State-B moving U
  4799. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4800. predict error 0
  4801. dir: dir isL
  4802. /681: O: O1361 (predict-yes)
  4803. I see 1 and I'm going to do: predict-yes
  4804. ENV: Agent did: predict-yes for direction L in state State-B
  4805. In State-B moving L
  4806. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4807. predict error 0
  4808. dir: dir isL
  4809. |682: O: O1364 (predict-no)
  4810. I see 1 and I'm going to do: predict-no
  4811. ENV: Agent did: predict-no for direction L in state State-A
  4812. In State-A moving L
  4813. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4814. predict error 0
  4815. dir: dir isU
  4816. \-683: O: O1366 (predict-no)
  4817. I see 1 and I'm going to do: predict-no
  4818. ENV: Agent did: predict-no for direction U in state State-A
  4819. In State-A moving U
  4820. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4821. predict error 0
  4822. dir: dir isU
  4823. /|684: O: O1368 (predict-no)
  4824. I see 1 and I'm going to do: predict-no
  4825. ENV: Agent did: predict-no for direction U in state State-A
  4826. In State-A moving U
  4827. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4828. predict error 0
  4829. dir: dir isL
  4830. \-/685: O: O1370 (predict-no)
  4831. I see 1 and I'm going to do: predict-no
  4832. ENV: Agent did: predict-no for direction L in state State-A
  4833. In State-A moving L
  4834. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4835. predict error 0
  4836. dir: dir isU
  4837. |\686: O: O1372 (predict-no)
  4838. I see 1 and I'm going to do: predict-no
  4839. ENV: Agent did: predict-no for direction U in state State-A
  4840. In State-A moving U
  4841. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4842. predict error 0
  4843. dir: dir isU
  4844. -/687: O: O1374 (predict-no)
  4845. I see 1 and I'm going to do: predict-no
  4846. ENV: Agent did: predict-no for direction U in state State-A
  4847. In State-A moving U
  4848. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4849. predict error 0
  4850. dir: dir isR
  4851. |\-688: O: O1375 (predict-yes)
  4852. I see 1 and I'm going to do: predict-yes
  4853. ENV: Agent did: predict-yes for direction R in state State-A
  4854. In State-A moving R
  4855. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4856. predict error 0
  4857. dir: dir isU
  4858. /|\689: O: O1378 (predict-no)
  4859. I see 1 and I'm going to do: predict-no
  4860. ENV: Agent did: predict-no for direction U in state State-B
  4861. In State-B moving U
  4862. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4863. predict error 0
  4864. dir: dir isR
  4865. -/|690: O: O1380 (predict-no)
  4866. I see 1 and I'm going to do: predict-no
  4867. ENV: Agent did: predict-no for direction R in state State-B
  4868. In State-B moving R
  4869. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4870. predict error 0
  4871. dir: dir isL
  4872. \-/691: O: O1381 (predict-yes)
  4873. I see 1 and I'm going to do: predict-yes
  4874. ENV: Agent did: predict-yes for direction L in state State-B
  4875. In State-B moving L
  4876. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4877. predict error 0
  4878. dir: dir isL
  4879. |692: O: O1384 (predict-no)
  4880. I see 1 and I'm going to do: predict-no
  4881. ENV: Agent did: predict-no for direction L in state State-A
  4882. In State-A moving L
  4883. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4884. predict error 0
  4885. dir: dir isL
  4886. \-/693: O: O1386 (predict-no)
  4887. I see 1 and I'm going to do: predict-no
  4888. ENV: Agent did: predict-no for direction L in state State-A
  4889. In State-A moving L
  4890. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4891. predict error 0
  4892. dir: dir isR
  4893. |\-694: O: O1387 (predict-yes)
  4894. I see 1 and I'm going to do: predict-yes
  4895. ENV: Agent did: predict-yes for direction R in state State-A
  4896. In State-A moving R
  4897. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4898. predict error 0
  4899. dir: dir isL
  4900. /|695: O: O1389 (predict-yes)
  4901. I see 1 and I'm going to do: predict-yes
  4902. ENV: Agent did: predict-yes for direction L in state State-B
  4903. In State-B moving L
  4904. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4905. predict error 0
  4906. dir: dir isR
  4907. \-696: O: O1391 (predict-yes)
  4908. I see 1 and I'm going to do: predict-yes
  4909. ENV: Agent did: predict-yes for direction R in state State-A
  4910. In State-A moving R
  4911. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4912. predict error 0
  4913. dir: dir isR
  4914. /|\697: O: O1394 (predict-no)
  4915. I see 1 and I'm going to do: predict-no
  4916. ENV: Agent did: predict-no for direction R in state State-B
  4917. In State-B moving R
  4918. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4919. predict error 0
  4920. dir: dir isR
  4921. -/698: O: O1396 (predict-no)
  4922. I see 1 and I'm going to do: predict-no
  4923. ENV: Agent did: predict-no for direction R in state State-B
  4924. In State-B moving R
  4925. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4926. predict error 0
  4927. dir: dir isU
  4928. |\-699: O: O1398 (predict-no)
  4929. I see 1 and I'm going to do: predict-no
  4930. ENV: Agent did: predict-no for direction U in state State-B
  4931. In State-B moving U
  4932. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4933. predict error 0
  4934. dir: dir isR
  4935. /|\700: O: O1400 (predict-no)
  4936. I see 1 and I'm going to do: predict-no
  4937. ENV: Agent did: predict-no for direction R in state State-B
  4938. In State-B moving R
  4939. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4940. predict error 0
  4941. dir: dir isR
  4942. -/|701: O: O1402 (predict-no)
  4943. I see 1 and I'm going to do: predict-no
  4944. ENV: Agent did: predict-no for direction R in state State-B
  4945. In State-B moving R
  4946. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4947. predict error 0
  4948. dir: dir isR
  4949. \702: O: O1404 (predict-no)
  4950. I see 1 and I'm going to do: predict-no
  4951. ENV: Agent did: predict-no for direction R in state State-B
  4952. In State-B moving R
  4953. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4954. predict error 0
  4955. dir: dir isL
  4956. -/703: O: O1405 (predict-yes)
  4957. I see 1 and I'm going to do: predict-yes
  4958. ENV: Agent did: predict-yes for direction L in state State-B
  4959. In State-B moving L
  4960. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4961. predict error 0
  4962. dir: dir isL
  4963. |\704: O: O1408 (predict-no)
  4964. I see 1 and I'm going to do: predict-no
  4965. ENV: Agent did: predict-no for direction L in state State-A
  4966. In State-A moving L
  4967. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4968. predict error 0
  4969. dir: dir isR
  4970. -/705: O: O1409 (predict-yes)
  4971. I see 1 and I'm going to do: predict-yes
  4972. ENV: Agent did: predict-yes for direction R in state State-A
  4973. In State-A moving R
  4974. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4975. predict error 0
  4976. dir: dir isU
  4977. |\-706: O: O1412 (predict-no)
  4978. I see 1 and I'm going to do: predict-no
  4979. ENV: Agent did: predict-no for direction U in state State-B
  4980. In State-B moving U
  4981. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4982. predict error 0
  4983. dir: dir isL
  4984. /|\707: O: O1413 (predict-yes)
  4985. I see 1 and I'm going to do: predict-yes
  4986. ENV: Agent did: predict-yes for direction L in state State-B
  4987. In State-B moving L
  4988. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4989. predict error 0
  4990. dir: dir isU
  4991. -/708: O: O1416 (predict-no)
  4992. I see 1 and I'm going to do: predict-no
  4993. ENV: Agent did: predict-no for direction U in state State-A
  4994. In State-A moving U
  4995. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4996. predict error 0
  4997. dir: dir isU
  4998. |\-709: O: O1418 (predict-no)
  4999. I see 1 and I'm going to do: predict-no
  5000. ENV: Agent did: predict-no for direction U in state State-A
  5001. In State-A moving U
  5002. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5003. predict error 0
  5004. dir: dir isL
  5005. /|\710: O: O1420 (predict-no)
  5006. I see 1 and I'm going to do: predict-no
  5007. ENV: Agent did: predict-no for direction L in state State-A
  5008. In State-A moving L
  5009. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5010. predict error 0
  5011. dir: dir isU
  5012. -/|711: O: O1422 (predict-no)
  5013. I see 1 and I'm going to do: predict-no
  5014. ENV: Agent did: predict-no for direction U in state State-A
  5015. In State-A moving U
  5016. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5017. predict error 0
  5018. dir: dir isR
  5019. \712: O: O1423 (predict-yes)
  5020. I see 1 and I'm going to do: predict-yes
  5021. ENV: Agent did: predict-yes for direction R in state State-A
  5022. In State-A moving R
  5023. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5024. predict error 0
  5025. dir: dir isR
  5026. -/|713: O: O1426 (predict-no)
  5027. I see 1 and I'm going to do: predict-no
  5028. ENV: Agent did: predict-no for direction R in state State-B
  5029. In State-B moving R
  5030. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5031. predict error 0
  5032. dir: dir isL
  5033. \-/714: O: O1427 (predict-yes)
  5034. I see 1 and I'm going to do: predict-yes
  5035. ENV: Agent did: predict-yes for direction L in state State-B
  5036. In State-B moving L
  5037. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5038. predict error 0
  5039. dir: dir isR
  5040. |\-715: O: O1429 (predict-yes)
  5041. I see 1 and I'm going to do: predict-yes
  5042. ENV: Agent did: predict-yes for direction R in state State-A
  5043. In State-A moving R
  5044. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5045. predict error 0
  5046. dir: dir isR
  5047. /|716: O: O1432 (predict-no)
  5048. I see 1 and I'm going to do: predict-no
  5049. ENV: Agent did: predict-no for direction R in state State-B
  5050. In State-B moving R
  5051. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5052. predict error 0
  5053. dir: dir isU
  5054. \-/717: O: O1434 (predict-no)
  5055. I see 1 and I'm going to do: predict-no
  5056. ENV: Agent did: predict-no for direction U in state State-B
  5057. In State-B moving U
  5058. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5059. predict error 0
  5060. dir: dir isR
  5061. |\-718: O: O1436 (predict-no)
  5062. I see 1 and I'm going to do: predict-no
  5063. ENV: Agent did: predict-no for direction R in state State-B
  5064. In State-B moving R
  5065. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5066. predict error 0
  5067. dir: dir isU
  5068. /719: O: O1438 (predict-no)
  5069. I see 1 and I'm going to do: predict-no
  5070. ENV: Agent did: predict-no for direction U in state State-B
  5071. In State-B moving U
  5072. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5073. predict error 0
  5074. dir: dir isU
  5075. |\720: O: O1440 (predict-no)
  5076. I see 1 and I'm going to do: predict-no
  5077. ENV: Agent did: predict-no for direction U in state State-B
  5078. In State-B moving U
  5079. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5080. predict error 0
  5081. dir: dir isL
  5082. -/721: O: O1441 (predict-yes)
  5083. I see 1 and I'm going to do: predict-yes
  5084. ENV: Agent did: predict-yes for direction L in state State-B
  5085. In State-B moving L
  5086. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5087. predict error 0
  5088. dir: dir isL
  5089. |722: O: O1444 (predict-no)
  5090. I see 1 and I'm going to do: predict-no
  5091. ENV: Agent did: predict-no for direction L in state State-A
  5092. In State-A moving L
  5093. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5094. predict error 0
  5095. dir: dir isR
  5096. \-/723: O: O1445 (predict-yes)
  5097. I see 1 and I'm going to do: predict-yes
  5098. ENV: Agent did: predict-yes for direction R in state State-A
  5099. In State-A moving R
  5100. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5101. predict error 0
  5102. dir: dir isL
  5103. |\724: O: O1447 (predict-yes)
  5104. I see 1 and I'm going to do: predict-yes
  5105. ENV: Agent did: predict-yes for direction L in state State-B
  5106. In State-B moving L
  5107. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5108. predict error 0
  5109. dir: dir isL
  5110. -/|725: O: O1450 (predict-no)
  5111. I see 1 and I'm going to do: predict-no
  5112. ENV: Agent did: predict-no for direction L in state State-A
  5113. In State-A moving L
  5114. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5115. predict error 0
  5116. dir: dir isL
  5117. \726: O: O1452 (predict-no)
  5118. I see 1 and I'm going to do: predict-no
  5119. ENV: Agent did: predict-no for direction L in state State-A
  5120. In State-A moving L
  5121. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5122. predict error 0
  5123. dir: dir isR
  5124. -/|727: O: O1453 (predict-yes)
  5125. I see 1 and I'm going to do: predict-yes
  5126. ENV: Agent did: predict-yes for direction R in state State-A
  5127. In State-A moving R
  5128. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5129. predict error 0
  5130. dir: dir isR
  5131. \-728: O: O1456 (predict-no)
  5132. I see 1 and I'm going to do: predict-no
  5133. ENV: Agent did: predict-no for direction R in state State-B
  5134. In State-B moving R
  5135. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5136. predict error 0
  5137. dir: dir isR
  5138. /|\729: O: O1458 (predict-no)
  5139. I see 1 and I'm going to do: predict-no
  5140. ENV: Agent did: predict-no for direction R in state State-B
  5141. In State-B moving R
  5142. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5143. predict error 0
  5144. dir: dir isU
  5145. -730: O: O1460 (predict-no)
  5146. I see 1 and I'm going to do: predict-no
  5147. ENV: Agent did: predict-no for direction U in state State-B
  5148. In State-B moving U
  5149. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5150. predict error 0
  5151. dir: dir isL
  5152. /|\731: O: O1461 (predict-yes)
  5153. I see 1 and I'm going to do: predict-yes
  5154. ENV: Agent did: predict-yes for direction L in state State-B
  5155. In State-B moving L
  5156. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5157. predict error 0
  5158. dir: dir isR
  5159. -732: O: O1463 (predict-yes)
  5160. I see 1 and I'm going to do: predict-yes
  5161. ENV: Agent did: predict-yes for direction R in state State-A
  5162. In State-A moving R
  5163. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5164. predict error 0
  5165. dir: dir isL
  5166. /|733: O: O1465 (predict-yes)
  5167. I see 1 and I'm going to do: predict-yes
  5168. ENV: Agent did: predict-yes for direction L in state State-B
  5169. In State-B moving L
  5170. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5171. predict error 0
  5172. dir: dir isU
  5173. \-/734: O: O1468 (predict-no)
  5174. I see 1 and I'm going to do: predict-no
  5175. ENV: Agent did: predict-no for direction U in state State-A
  5176. In State-A moving U
  5177. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5178. predict error 0
  5179. dir: dir isR
  5180. |\735: O: O1469 (predict-yes)
  5181. I see 1 and I'm going to do: predict-yes
  5182. ENV: Agent did: predict-yes for direction R in state State-A
  5183. In State-A moving R
  5184. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5185. predict error 0
  5186. dir: dir isL
  5187. -/|736: O: O1471 (predict-yes)
  5188. I see 1 and I'm going to do: predict-yes
  5189. ENV: Agent did: predict-yes for direction L in state State-B
  5190. In State-B moving L
  5191. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5192. predict error 0
  5193. dir: dir isL
  5194. \-/737: O: O1474 (predict-no)
  5195. I see 1 and I'm going to do: predict-no
  5196. ENV: Agent did: predict-no for direction L in state State-A
  5197. In State-A moving L
  5198. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5199. predict error 0
  5200. dir: dir isR
  5201. |\738: O: O1475 (predict-yes)
  5202. I see 1 and I'm going to do: predict-yes
  5203. ENV: Agent did: predict-yes for direction R in state State-A
  5204. In State-A moving R
  5205. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5206. predict error 0
  5207. dir: dir isR
  5208. -/|739: O: O1478 (predict-no)
  5209. I see 1 and I'm going to do: predict-no
  5210. ENV: Agent did: predict-no for direction R in state State-B
  5211. In State-B moving R
  5212. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5213. predict error 0
  5214. dir: dir isU
  5215. \-/740: O: O1480 (predict-no)
  5216. I see 1 and I'm going to do: predict-no
  5217. ENV: Agent did: predict-no for direction U in state State-B
  5218. In State-B moving U
  5219. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5220. predict error 0
  5221. dir: dir isR
  5222. |\-741: O: O1482 (predict-no)
  5223. I see 1 and I'm going to do: predict-no
  5224. ENV: Agent did: predict-no for direction R in state State-B
  5225. In State-B moving R
  5226. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5227. predict error 0
  5228. dir: dir isR
  5229. /742: O: O1484 (predict-no)
  5230. I see 1 and I'm going to do: predict-no
  5231. ENV: Agent did: predict-no for direction R in state State-B
  5232. In State-B moving R
  5233. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5234. predict error 0
  5235. dir: dir isU
  5236. |\-743: O: O1486 (predict-no)
  5237. I see 1 and I'm going to do: predict-no
  5238. ENV: Agent did: predict-no for direction U in state State-B
  5239. In State-B moving U
  5240. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5241. predict error 0
  5242. dir: dir isR
  5243. /|\744: O: O1488 (predict-no)
  5244. I see 1 and I'm going to do: predict-no
  5245. ENV: Agent did: predict-no for direction R in state State-B
  5246. In State-B moving R
  5247. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5248. predict error 0
  5249. dir: dir isL
  5250. -/|745: O: O1489 (predict-yes)
  5251. I see 1 and I'm going to do: predict-yes
  5252. ENV: Agent did: predict-yes for direction L in state State-B
  5253. In State-B moving L
  5254. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5255. predict error 0
  5256. dir: dir isU
  5257. \-/746: O: O1492 (predict-no)
  5258. I see 1 and I'm going to do: predict-no
  5259. ENV: Agent did: predict-no for direction U in state State-A
  5260. In State-A moving U
  5261. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5262. predict error 0
  5263. dir: dir isR
  5264. |\-747: O: O1493 (predict-yes)
  5265. I see 1 and I'm going to do: predict-yes
  5266. ENV: Agent did: predict-yes for direction R in state State-A
  5267. In State-A moving R
  5268. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5269. predict error 0
  5270. dir: dir isU
  5271. /|\748: O: O1496 (predict-no)
  5272. I see 1 and I'm going to do: predict-no
  5273. ENV: Agent did: predict-no for direction U in state State-B
  5274. In State-B moving U
  5275. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5276. predict error 0
  5277. dir: dir isR
  5278. -749: O: O1498 (predict-no)
  5279. I see 1 and I'm going to do: predict-no
  5280. ENV: Agent did: predict-no for direction R in state State-B
  5281. In State-B moving R
  5282. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5283. predict error 0
  5284. dir: dir isU
  5285. /|750: O: O1500 (predict-no)
  5286. I see 1 and I'm going to do: predict-no
  5287. ENV: Agent did: predict-no for direction U in state State-B
  5288. In State-B moving U
  5289. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5290. predict error 0
  5291. dir: dir isU
  5292. \-/751: O: O1502 (predict-no)
  5293. I see 1 and I'm going to do: predict-no
  5294. ENV: Agent did: predict-no for direction U in state State-B
  5295. In State-B moving U
  5296. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5297. predict error 0
  5298. dir: dir isR
  5299. |752: O: O1504 (predict-no)
  5300. I see 1 and I'm going to do: predict-no
  5301. ENV: Agent did: predict-no for direction R in state State-B
  5302. In State-B moving R
  5303. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5304. predict error 0
  5305. dir: dir isU
  5306. \-753: O: O1506 (predict-no)
  5307. I see 1 and I'm going to do: predict-no
  5308. ENV: Agent did: predict-no for direction U in state State-B
  5309. In State-B moving U
  5310. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5311. predict error 0
  5312. dir: dir isL
  5313. /|\754: O: O1507 (predict-yes)
  5314. I see 1 and I'm going to do: predict-yes
  5315. ENV: Agent did: predict-yes for direction L in state State-B
  5316. In State-B moving L
  5317. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5318. predict error 0
  5319. dir: dir isR
  5320. -/|755: O: O1509 (predict-yes)
  5321. I see 1 and I'm going to do: predict-yes
  5322. ENV: Agent did: predict-yes for direction R in state State-A
  5323. In State-A moving R
  5324. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5325. predict error 0
  5326. dir: dir isU
  5327. \-/756: O: O1512 (predict-no)
  5328. I see 1 and I'm going to do: predict-no
  5329. ENV: Agent did: predict-no for direction U in state State-B
  5330. In State-B moving U
  5331. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5332. predict error 0
  5333. dir: dir isL
  5334. |\-757: O: O1513 (predict-yes)
  5335. I see 1 and I'm going to do: predict-yes
  5336. ENV: Agent did: predict-yes for direction L in state State-B
  5337. In State-B moving L
  5338. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5339. predict error 0
  5340. dir: dir isU
  5341. /|758: O: O1516 (predict-no)
  5342. I see 1 and I'm going to do: predict-no
  5343. ENV: Agent did: predict-no for direction U in state State-A
  5344. In State-A moving U
  5345. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5346. predict error 0
  5347. dir: dir isU
  5348. \759: O: O1518 (predict-no)
  5349. I see 1 and I'm going to do: predict-no
  5350. ENV: Agent did: predict-no for direction U in state State-A
  5351. In State-A moving U
  5352. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5353. predict error 0
  5354. dir: dir isU
  5355. -/760: O: O1520 (predict-no)
  5356. I see 1 and I'm going to do: predict-no
  5357. ENV: Agent did: predict-no for direction U in state State-A
  5358. In State-A moving U
  5359. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5360. predict error 0
  5361. dir: dir isU
  5362. |761: O: O1522 (predict-no)
  5363. I see 1 and I'm going to do: predict-no
  5364. ENV: Agent did: predict-no for direction U in state State-A
  5365. In State-A moving U
  5366. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5367. predict error 0
  5368. dir: dir isL
  5369. \762: O: O1524 (predict-no)
  5370. I see 1 and I'm going to do: predict-no
  5371. ENV: Agent did: predict-no for direction L in state State-A
  5372. In State-A moving L
  5373. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5374. predict error 0
  5375. dir: dir isR
  5376. -/|763: O: O1526 (predict-no)
  5377. I see 1 and I'm going to do: predict-no
  5378. ENV: Agent did: predict-no for direction R in state State-A
  5379. In State-A moving R
  5380. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  5381. predict error 1
  5382. dir: dir isR
  5383. \-/764: O: O1528 (predict-no)
  5384. I see 0 and I'm going to do: predict-no
  5385. ENV: Agent did: predict-no for direction R in state State-B
  5386. In State-B moving R
  5387. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5388. predict error 0
  5389. dir: dir isR
  5390. |\765: O: O1530 (predict-no)
  5391. I see 1 and I'm going to do: predict-no
  5392. ENV: Agent did: predict-no for direction R in state State-B
  5393. In State-B moving R
  5394. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5395. predict error 0
  5396. dir: dir isR
  5397. -/|766: O: O1532 (predict-no)
  5398. I see 1 and I'm going to do: predict-no
  5399. ENV: Agent did: predict-no for direction R in state State-B
  5400. In State-B moving R
  5401. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5402. predict error 0
  5403. dir: dir isU
  5404. \767: O: O1534 (predict-no)
  5405. I see 1 and I'm going to do: predict-no
  5406. ENV: Agent did: predict-no for direction U in state State-B
  5407. In State-B moving U
  5408. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5409. predict error 0
  5410. dir: dir isL
  5411. -/768: O: O1535 (predict-yes)
  5412. I see 1 and I'm going to do: predict-yes
  5413. ENV: Agent did: predict-yes for direction L in state State-B
  5414. In State-B moving L
  5415. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5416. predict error 0
  5417. dir: dir isL
  5418. |769: O: O1538 (predict-no)
  5419. I see 1 and I'm going to do: predict-no
  5420. ENV: Agent did: predict-no for direction L in state State-A
  5421. In State-A moving L
  5422. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5423. predict error 0
  5424. dir: dir isR
  5425. \-770: O: O1539 (predict-yes)
  5426. I see 1 and I'm going to do: predict-yes
  5427. ENV: Agent did: predict-yes for direction R in state State-A
  5428. In State-A moving R
  5429. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5430. predict error 0
  5431. dir: dir isU
  5432. /|\771: O: O1542 (predict-no)
  5433. I see 1 and I'm going to do: predict-no
  5434. ENV: Agent did: predict-no for direction U in state State-B
  5435. In State-B moving U
  5436. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5437. predict error 0
  5438. dir: dir isU
  5439. -772: O: O1544 (predict-no)
  5440. I see 1 and I'm going to do: predict-no
  5441. ENV: Agent did: predict-no for direction U in state State-B
  5442. In State-B moving U
  5443. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5444. predict error 0
  5445. dir: dir isU
  5446. /|\773: O: O1546 (predict-no)
  5447. I see 1 and I'm going to do: predict-no
  5448. ENV: Agent did: predict-no for direction U in state State-B
  5449. In State-B moving U
  5450. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5451. predict error 0
  5452. dir: dir isL
  5453. -774: O: O1547 (predict-yes)
  5454. I see 1 and I'm going to do: predict-yes
  5455. ENV: Agent did: predict-yes for direction L in state State-B
  5456. In State-B moving L
  5457. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5458. predict error 0
  5459. dir: dir isU
  5460. /|775: O: O1550 (predict-no)
  5461. I see 1 and I'm going to do: predict-no
  5462. ENV: Agent did: predict-no for direction U in state State-A
  5463. In State-A moving U
  5464. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5465. predict error 0
  5466. dir: dir isL
  5467. \-776: O: O1552 (predict-no)
  5468. I see 1 and I'm going to do: predict-no
  5469. ENV: Agent did: predict-no for direction L in state State-A
  5470. In State-A moving L
  5471. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5472. predict error 0
  5473. dir: dir isL
  5474. /|777: O: O1554 (predict-no)
  5475. I see 1 and I'm going to do: predict-no
  5476. ENV: Agent did: predict-no for direction L in state State-A
  5477. In State-A moving L
  5478. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5479. predict error 0
  5480. dir: dir isU
  5481. \-/778: O: O1556 (predict-no)
  5482. I see 1 and I'm going to do: predict-no
  5483. ENV: Agent did: predict-no for direction U in state State-A
  5484. In State-A moving U
  5485. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5486. predict error 0
  5487. dir: dir isU
  5488. |\779: O: O1558 (predict-no)
  5489. I see 1 and I'm going to do: predict-no
  5490. ENV: Agent did: predict-no for direction U in state State-A
  5491. In State-A moving U
  5492. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5493. predict error 0
  5494. dir: dir isR
  5495. -/|780: O: O1559 (predict-yes)
  5496. I see 1 and I'm going to do: predict-yes
  5497. ENV: Agent did: predict-yes for direction R in state State-A
  5498. In State-A moving R
  5499. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5500. predict error 0
  5501. dir: dir isU
  5502. \-/781: O: O1562 (predict-no)
  5503. I see 1 and I'm going to do: predict-no
  5504. ENV: Agent did: predict-no for direction U in state State-B
  5505. In State-B moving U
  5506. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5507. predict error 0
  5508. dir: dir isL
  5509. |782: O: O1563 (predict-yes)
  5510. I see 1 and I'm going to do: predict-yes
  5511. ENV: Agent did: predict-yes for direction L in state State-B
  5512. In State-B moving L
  5513. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5514. predict error 0
  5515. dir: dir isR
  5516. \-/783: O: O1565 (predict-yes)
  5517. I see 1 and I'm going to do: predict-yes
  5518. ENV: Agent did: predict-yes for direction R in state State-A
  5519. In State-A moving R
  5520. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5521. predict error 0
  5522. dir: dir isL
  5523. |\784: O: O1567 (predict-yes)
  5524. I see 1 and I'm going to do: predict-yes
  5525. ENV: Agent did: predict-yes for direction L in state State-B
  5526. In State-B moving L
  5527. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5528. predict error 0
  5529. dir: dir isL
  5530. -/785: O: O1570 (predict-no)
  5531. I see 1 and I'm going to do: predict-no
  5532. ENV: Agent did: predict-no for direction L in state State-A
  5533. In State-A moving L
  5534. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5535. predict error 0
  5536. dir: dir isL
  5537. |\786: O: O1572 (predict-no)
  5538. I see 1 and I'm going to do: predict-no
  5539. ENV: Agent did: predict-no for direction L in state State-A
  5540. In State-A moving L
  5541. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5542. predict error 0
  5543. dir: dir isL
  5544. -787: O: O1574 (predict-no)
  5545. I see 1 and I'm going to do: predict-no
  5546. ENV: Agent did: predict-no for direction L in state State-A
  5547. In State-A moving L
  5548. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5549. predict error 0
  5550. dir: dir isR
  5551. /|\788: O: O1575 (predict-yes)
  5552. I see 1 and I'm going to do: predict-yes
  5553. ENV: Agent did: predict-yes for direction R in state State-A
  5554. In State-A moving R
  5555. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5556. predict error 0
  5557. dir: dir isR
  5558. -/|789: O: O1578 (predict-no)
  5559. I see 1 and I'm going to do: predict-no
  5560. ENV: Agent did: predict-no for direction R in state State-B
  5561. In State-B moving R
  5562. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5563. predict error 0
  5564. dir: dir isL
  5565. \-790: O: O1579 (predict-yes)
  5566. I see 1 and I'm going to do: predict-yes
  5567. ENV: Agent did: predict-yes for direction L in state State-B
  5568. In State-B moving L
  5569. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5570. predict error 0
  5571. dir: dir isR
  5572. /791: O: O1581 (predict-yes)
  5573. I see 1 and I'm going to do: predict-yes
  5574. ENV: Agent did: predict-yes for direction R in state State-A
  5575. In State-A moving R
  5576. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5577. predict error 0
  5578. dir: dir isL
  5579. |792: O: O1583 (predict-yes)
  5580. I see 1 and I'm going to do: predict-yes
  5581. ENV: Agent did: predict-yes for direction L in state State-B
  5582. In State-B moving L
  5583. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5584. predict error 0
  5585. dir: dir isR
  5586. \-793: O: O1585 (predict-yes)
  5587. I see 1 and I'm going to do: predict-yes
  5588. ENV: Agent did: predict-yes for direction R in state State-A
  5589. In State-A moving R
  5590. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5591. predict error 0
  5592. dir: dir isR
  5593. /|\794: O: O1588 (predict-no)
  5594. I see 1 and I'm going to do: predict-no
  5595. ENV: Agent did: predict-no for direction R in state State-B
  5596. In State-B moving R
  5597. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5598. predict error 0
  5599. dir: dir isR
  5600. -/795: O: O1590 (predict-no)
  5601. I see 1 and I'm going to do: predict-no
  5602. ENV: Agent did: predict-no for direction R in state State-B
  5603. In State-B moving R
  5604. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5605. predict error 0
  5606. dir: dir isU
  5607. |\-796: O: O1592 (predict-no)
  5608. I see 1 and I'm going to do: predict-no
  5609. ENV: Agent did: predict-no for direction U in state State-B
  5610. In State-B moving U
  5611. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5612. predict error 0
  5613. dir: dir isL
  5614. /|\797: O: O1593 (predict-yes)
  5615. I see 1 and I'm going to do: predict-yes
  5616. ENV: Agent did: predict-yes for direction L in state State-B
  5617. In State-B moving L
  5618. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5619. predict error 0
  5620. dir: dir isR
  5621. -/798: O: O1595 (predict-yes)
  5622. I see 1 and I'm going to do: predict-yes
  5623. ENV: Agent did: predict-yes for direction R in state State-A
  5624. In State-A moving R
  5625. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5626. predict error 0
  5627. dir: dir isR
  5628. |\-799: O: O1598 (predict-no)
  5629. I see 1 and I'm going to do: predict-no
  5630. ENV: Agent did: predict-no for direction R in state State-B
  5631. In State-B moving R
  5632. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5633. predict error 0
  5634. dir: dir isR
  5635. /|\800: O: O1600 (predict-no)
  5636. I see 1 and I'm going to do: predict-no
  5637. ENV: Agent did: predict-no for direction R in state State-B
  5638. In State-B moving R
  5639. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5640. predict error 0
  5641. dir: dir isU
  5642. -/801: O: O1602 (predict-no)
  5643. I see 1 and I'm going to do: predict-no
  5644. ENV: Agent did: predict-no for direction U in state State-B
  5645. In State-B moving U
  5646. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5647. predict error 0
  5648. dir: dir isR
  5649. |802: O: O1604 (predict-no)
  5650. I see 1 and I'm going to do: predict-no
  5651. ENV: Agent did: predict-no for direction R in state State-B
  5652. In State-B moving R
  5653. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5654. predict error 0
  5655. dir: dir isR
  5656. \-/803: O: O1606 (predict-no)
  5657. I see 1 and I'm going to do: predict-no
  5658. ENV: Agent did: predict-no for direction R in state State-B
  5659. In State-B moving R
  5660. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5661. predict error 0
  5662. dir: dir isR
  5663. |\804: O: O1608 (predict-no)
  5664. I see 1 and I'm going to do: predict-no
  5665. ENV: Agent did: predict-no for direction R in state State-B
  5666. In State-B moving R
  5667. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5668. predict error 0
  5669. dir: dir isR
  5670. -/805: O: O1610 (predict-no)
  5671. I see 1 and I'm going to do: predict-no
  5672. ENV: Agent did: predict-no for direction R in state State-B
  5673. In State-B moving R
  5674. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5675. predict error 0
  5676. dir: dir isU
  5677. |\-806: O: O1612 (predict-no)
  5678. I see 1 and I'm going to do: predict-no
  5679. ENV: Agent did: predict-no for direction U in state State-B
  5680. In State-B moving U
  5681. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5682. predict error 0
  5683. dir: dir isU
  5684. /807: O: O1614 (predict-no)
  5685. I see 1 and I'm going to do: predict-no
  5686. ENV: Agent did: predict-no for direction U in state State-B
  5687. In State-B moving U
  5688. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5689. predict error 0
  5690. dir: dir isR
  5691. |\-808: O: O1616 (predict-no)
  5692. I see 1 and I'm going to do: predict-no
  5693. ENV: Agent did: predict-no for direction R in state State-B
  5694. In State-B moving R
  5695. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5696. predict error 0
  5697. dir: dir isL
  5698. /|\809: O: O1617 (predict-yes)
  5699. I see 1 and I'm going to do: predict-yes
  5700. ENV: Agent did: predict-yes for direction L in state State-B
  5701. In State-B moving L
  5702. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5703. predict error 0
  5704. dir: dir isR
  5705. -/|810: O: O1619 (predict-yes)
  5706. I see 1 and I'm going to do: predict-yes
  5707. ENV: Agent did: predict-yes for direction R in state State-A
  5708. In State-A moving R
  5709. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5710. predict error 0
  5711. dir: dir isL
  5712. \-/811: O: O1621 (predict-yes)
  5713. I see 1 and I'm going to do: predict-yes
  5714. ENV: Agent did: predict-yes for direction L in state State-B
  5715. In State-B moving L
  5716. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5717. predict error 0
  5718. dir: dir isR
  5719. |812: O: O1623 (predict-yes)
  5720. I see 1 and I'm going to do: predict-yes
  5721. ENV: Agent did: predict-yes for direction R in state State-A
  5722. In State-A moving R
  5723. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5724. predict error 0
  5725. dir: dir isL
  5726. \-813: O: O1625 (predict-yes)
  5727. I see 1 and I'm going to do: predict-yes
  5728. ENV: Agent did: predict-yes for direction L in state State-B
  5729. In State-B moving L
  5730. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5731. predict error 0
  5732. dir: dir isU
  5733. /|\814: O: O1628 (predict-no)
  5734. I see 1 and I'm going to do: predict-no
  5735. ENV: Agent did: predict-no for direction U in state State-A
  5736. In State-A moving U
  5737. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5738. predict error 0
  5739. dir: dir isU
  5740. -/|815: O: O1630 (predict-no)
  5741. I see 1 and I'm going to do: predict-no
  5742. ENV: Agent did: predict-no for direction U in state State-A
  5743. In State-A moving U
  5744. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5745. predict error 0
  5746. dir: dir isR
  5747. \-/816: O: O1631 (predict-yes)
  5748. I see 1 and I'm going to do: predict-yes
  5749. ENV: Agent did: predict-yes for direction R in state State-A
  5750. In State-A moving R
  5751. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5752. predict error 0
  5753. dir: dir isR
  5754. |\817: O: O1634 (predict-no)
  5755. I see 1 and I'm going to do: predict-no
  5756. ENV: Agent did: predict-no for direction R in state State-B
  5757. In State-B moving R
  5758. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5759. predict error 0
  5760. dir: dir isL
  5761. -/|818: O: O1635 (predict-yes)
  5762. I see 1 and I'm going to do: predict-yes
  5763. ENV: Agent did: predict-yes for direction L in state State-B
  5764. In State-B moving L
  5765. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5766. predict error 0
  5767. dir: dir isR
  5768. \819: O: O1637 (predict-yes)
  5769. I see 1 and I'm going to do: predict-yes
  5770. ENV: Agent did: predict-yes for direction R in state State-A
  5771. In State-A moving R
  5772. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5773. predict error 0
  5774. dir: dir isR
  5775. -/|820: O: O1640 (predict-no)
  5776. I see 1 and I'm going to do: predict-no
  5777. ENV: Agent did: predict-no for direction R in state State-B
  5778. In State-B moving R
  5779. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5780. predict error 0
  5781. dir: dir isR
  5782. \-/821: O: O1642 (predict-no)
  5783. I see 1 and I'm going to do: predict-no
  5784. ENV: Agent did: predict-no for direction R in state State-B
  5785. In State-B moving R
  5786. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5787. predict error 0
  5788. dir: dir isL
  5789. |822: O: O1643 (predict-yes)
  5790. I see 1 and I'm going to do: predict-yes
  5791. ENV: Agent did: predict-yes for direction L in state State-B
  5792. In State-B moving L
  5793. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5794. predict error 0
  5795. dir: dir isL
  5796. \-/823: O: O1646 (predict-no)
  5797. I see 1 and I'm going to do: predict-no
  5798. ENV: Agent did: predict-no for direction L in state State-A
  5799. In State-A moving L
  5800. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5801. predict error 0
  5802. dir: dir isU
  5803. |\-824: O: O1648 (predict-no)
  5804. I see 1 and I'm going to do: predict-no
  5805. ENV: Agent did: predict-no for direction U in state State-A
  5806. In State-A moving U
  5807. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5808. predict error 0
  5809. dir: dir isU
  5810. /|825: O: O1650 (predict-no)
  5811. I see 1 and I'm going to do: predict-no
  5812. ENV: Agent did: predict-no for direction U in state State-A
  5813. In State-A moving U
  5814. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5815. predict error 0
  5816. dir: dir isU
  5817. \-/826: O: O1652 (predict-no)
  5818. I see 1 and I'm going to do: predict-no
  5819. ENV: Agent did: predict-no for direction U in state State-A
  5820. In State-A moving U
  5821. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5822. predict error 0
  5823. dir: dir isR
  5824. |\-827: O: O1653 (predict-yes)
  5825. I see 1 and I'm going to do: predict-yes
  5826. ENV: Agent did: predict-yes for direction R in state State-A
  5827. In State-A moving R
  5828. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5829. predict error 0
  5830. dir: dir isL
  5831. /|\828: O: O1655 (predict-yes)
  5832. I see 1 and I'm going to do: predict-yes
  5833. ENV: Agent did: predict-yes for direction L in state State-B
  5834. In State-B moving L
  5835. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5836. predict error 0
  5837. dir: dir isL
  5838. -/|829: O: O1658 (predict-no)
  5839. I see 1 and I'm going to do: predict-no
  5840. ENV: Agent did: predict-no for direction L in state State-A
  5841. In State-A moving L
  5842. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5843. predict error 0
  5844. dir: dir isU
  5845. \-/830: O: O1660 (predict-no)
  5846. I see 1 and I'm going to do: predict-no
  5847. ENV: Agent did: predict-no for direction U in state State-A
  5848. In State-A moving U
  5849. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5850. predict error 0
  5851. dir: dir isU
  5852. |\-831: O: O1662 (predict-no)
  5853. I see 1 and I'm going to do: predict-no
  5854. ENV: Agent did: predict-no for direction U in state State-A
  5855. In State-A moving U
  5856. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5857. predict error 0
  5858. dir: dir isR
  5859. /832: O: O1663 (predict-yes)
  5860. I see 1 and I'm going to do: predict-yes
  5861. ENV: Agent did: predict-yes for direction R in state State-A
  5862. In State-A moving R
  5863. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5864. predict error 0
  5865. dir: dir isR
  5866. |\833: O: O1666 (predict-no)
  5867. I see 1 and I'm going to do: predict-no
  5868. ENV: Agent did: predict-no for direction R in state State-B
  5869. In State-B moving R
  5870. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5871. predict error 0
  5872. dir: dir isU
  5873. -/|834: O: O1668 (predict-no)
  5874. I see 1 and I'm going to do: predict-no
  5875. ENV: Agent did: predict-no for direction U in state State-B
  5876. In State-B moving U
  5877. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5878. predict error 0
  5879. dir: dir isL
  5880. \-/835: O: O1669 (predict-yes)
  5881. I see 1 and I'm going to do: predict-yes
  5882. ENV: Agent did: predict-yes for direction L in state State-B
  5883. In State-B moving L
  5884. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5885. predict error 0
  5886. dir: dir isU
  5887. |836: O: O1672 (predict-no)
  5888. I see 1 and I'm going to do: predict-no
  5889. ENV: Agent did: predict-no for direction U in state State-A
  5890. In State-A moving U
  5891. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5892. predict error 0
  5893. dir: dir isR
  5894. \-/837: O: O1673 (predict-yes)
  5895. I see 1 and I'm going to do: predict-yes
  5896. ENV: Agent did: predict-yes for direction R in state State-A
  5897. In State-A moving R
  5898. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5899. predict error 0
  5900. dir: dir isU
  5901. |838: O: O1676 (predict-no)
  5902. I see 1 and I'm going to do: predict-no
  5903. ENV: Agent did: predict-no for direction U in state State-B
  5904. In State-B moving U
  5905. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5906. predict error 0
  5907. dir: dir isR
  5908. \-839: O: O1678 (predict-no)
  5909. I see 1 and I'm going to do: predict-no
  5910. ENV: Agent did: predict-no for direction R in state State-B
  5911. In State-B moving R
  5912. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5913. predict error 0
  5914. dir: dir isR
  5915. /|\840: O: O1680 (predict-no)
  5916. I see 1 and I'm going to do: predict-no
  5917. ENV: Agent did: predict-no for direction R in state State-B
  5918. In State-B moving R
  5919. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5920. predict error 0
  5921. dir: dir isR
  5922. -/|841: O: O1682 (predict-no)
  5923. I see 1 and I'm going to do: predict-no
  5924. ENV: Agent did: predict-no for direction R in state State-B
  5925. In State-B moving R
  5926. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5927. predict error 0
  5928. dir: dir isR
  5929. \842: O: O1684 (predict-no)
  5930. I see 1 and I'm going to do: predict-no
  5931. ENV: Agent did: predict-no for direction R in state State-B
  5932. In State-B moving R
  5933. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5934. predict error 0
  5935. dir: dir isR
  5936. -/843: O: O1686 (predict-no)
  5937. I see 1 and I'm going to do: predict-no
  5938. ENV: Agent did: predict-no for direction R in state State-B
  5939. In State-B moving R
  5940. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5941. predict error 0
  5942. dir: dir isU
  5943. |\-844: O: O1688 (predict-no)
  5944. I see 1 and I'm going to do: predict-no
  5945. ENV: Agent did: predict-no for direction U in state State-B
  5946. In State-B moving U
  5947. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5948. predict error 0
  5949. dir: dir isU
  5950. /|\845: O: O1690 (predict-no)
  5951. I see 1 and I'm going to do: predict-no
  5952. ENV: Agent did: predict-no for direction U in state State-B
  5953. In State-B moving U
  5954. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5955. predict error 0
  5956. dir: dir isR
  5957. -/|846: O: O1692 (predict-no)
  5958. I see 1 and I'm going to do: predict-no
  5959. ENV: Agent did: predict-no for direction R in state State-B
  5960. In State-B moving R
  5961. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5962. predict error 0
  5963. dir: dir isR
  5964. \-/847: O: O1694 (predict-no)
  5965. I see 1 and I'm going to do: predict-no
  5966. ENV: Agent did: predict-no for direction R in state State-B
  5967. In State-B moving R
  5968. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5969. predict error 0
  5970. dir: dir isU
  5971. |\848: O: O1696 (predict-no)
  5972. I see 1 and I'm going to do: predict-no
  5973. ENV: Agent did: predict-no for direction U in state State-B
  5974. In State-B moving U
  5975. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5976. predict error 0
  5977. dir: dir isU
  5978. -/|849: O: O1698 (predict-no)
  5979. I see 1 and I'm going to do: predict-no
  5980. ENV: Agent did: predict-no for direction U in state State-B
  5981. In State-B moving U
  5982. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5983. predict error 0
  5984. dir: dir isR
  5985. \-850: O: O1700 (predict-no)
  5986. I see 1 and I'm going to do: predict-no
  5987. ENV: Agent did: predict-no for direction R in state State-B
  5988. In State-B moving R
  5989. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5990. predict error 0
  5991. dir: dir isU
  5992. /|\851: O: O1702 (predict-no)
  5993. I see 1 and I'm going to do: predict-no
  5994. ENV: Agent did: predict-no for direction U in state State-B
  5995. In State-B moving U
  5996. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5997. predict error 0
  5998. dir: dir isL
  5999. -852: O: O1703 (predict-yes)
  6000. I see 1 and I'm going to do: predict-yes
  6001. ENV: Agent did: predict-yes for direction L in state State-B
  6002. In State-B moving L
  6003. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6004. predict error 0
  6005. dir: dir isR
  6006. /|\853: O: O1705 (predict-yes)
  6007. I see 1 and I'm going to do: predict-yes
  6008. ENV: Agent did: predict-yes for direction R in state State-A
  6009. In State-A moving R
  6010. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6011. predict error 0
  6012. dir: dir isR
  6013. -/854: O: O1708 (predict-no)
  6014. I see 1 and I'm going to do: predict-no
  6015. ENV: Agent did: predict-no for direction R in state State-B
  6016. In State-B moving R
  6017. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6018. predict error 0
  6019. dir: dir isR
  6020. |\855: O: O1710 (predict-no)
  6021. I see 1 and I'm going to do: predict-no
  6022. ENV: Agent did: predict-no for direction R in state State-B
  6023. In State-B moving R
  6024. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6025. predict error 0
  6026. dir: dir isU
  6027. -/|856: O: O1712 (predict-no)
  6028. I see 1 and I'm going to do: predict-no
  6029. ENV: Agent did: predict-no for direction U in state State-B
  6030. In State-B moving U
  6031. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6032. predict error 0
  6033. dir: dir isU
  6034. \-/857: O: O1714 (predict-no)
  6035. I see 1 and I'm going to do: predict-no
  6036. ENV: Agent did: predict-no for direction U in state State-B
  6037. In State-B moving U
  6038. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6039. predict error 0
  6040. dir: dir isR
  6041. |\858: O: O1716 (predict-no)
  6042. I see 1 and I'm going to do: predict-no
  6043. ENV: Agent did: predict-no for direction R in state State-B
  6044. In State-B moving R
  6045. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6046. predict error 0
  6047. dir: dir isR
  6048. -859: O: O1718 (predict-no)
  6049. I see 1 and I'm going to do: predict-no
  6050. ENV: Agent did: predict-no for direction R in state State-B
  6051. In State-B moving R
  6052. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6053. predict error 0
  6054. dir: dir isU
  6055. /|\860: O: O1720 (predict-no)
  6056. I see 1 and I'm going to do: predict-no
  6057. ENV: Agent did: predict-no for direction U in state State-B
  6058. In State-B moving U
  6059. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6060. predict error 0
  6061. dir: dir isU
  6062. -/|861: O: O1722 (predict-no)
  6063. I see 1 and I'm going to do: predict-no
  6064. ENV: Agent did: predict-no for direction U in state State-B
  6065. In State-B moving U
  6066. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6067. predict error 0
  6068. dir: dir isR
  6069. \862: O: O1724 (predict-no)
  6070. I see 1 and I'm going to do: predict-no
  6071. ENV: Agent did: predict-no for direction R in state State-B
  6072. In State-B moving R
  6073. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6074. predict error 0
  6075. dir: dir isU
  6076. -/|863: O: O1726 (predict-no)
  6077. I see 1 and I'm going to do: predict-no
  6078. ENV: Agent did: predict-no for direction U in state State-B
  6079. In State-B moving U
  6080. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6081. predict error 0
  6082. dir: dir isR
  6083. \-/864: O: O1728 (predict-no)
  6084. I see 1 and I'm going to do: predict-no
  6085. ENV: Agent did: predict-no for direction R in state State-B
  6086. In State-B moving R
  6087. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6088. predict error 0
  6089. dir: dir isL
  6090. |\-865: O: O1729 (predict-yes)
  6091. I see 1 and I'm going to do: predict-yes
  6092. ENV: Agent did: predict-yes for direction L in state State-B
  6093. In State-B moving L
  6094. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6095. predict error 0
  6096. dir: dir isR
  6097. /|866: O: O1731 (predict-yes)
  6098. I see 1 and I'm going to do: predict-yes
  6099. ENV: Agent did: predict-yes for direction R in state State-A
  6100. In State-A moving R
  6101. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6102. predict error 0
  6103. dir: dir isR
  6104. \-/867: O: O1734 (predict-no)
  6105. I see 1 and I'm going to do: predict-no
  6106. ENV: Agent did: predict-no for direction R in state State-B
  6107. In State-B moving R
  6108. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6109. predict error 0
  6110. dir: dir isR
  6111. |\-868: O: O1736 (predict-no)
  6112. I see 1 and I'm going to do: predict-no
  6113. ENV: Agent did: predict-no for direction R in state State-B
  6114. In State-B moving R
  6115. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6116. predict error 0
  6117. dir: dir isR
  6118. /869: O: O1738 (predict-no)
  6119. I see 1 and I'm going to do: predict-no
  6120. ENV: Agent did: predict-no for direction R in state State-B
  6121. In State-B moving R
  6122. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6123. predict error 0
  6124. dir: dir isU
  6125. |\-870: O: O1740 (predict-no)
  6126. I see 1 and I'm going to do: predict-no
  6127. ENV: Agent did: predict-no for direction U in state State-B
  6128. In State-B moving U
  6129. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6130. predict error 0
  6131. dir: dir isU
  6132. /|871: O: O1742 (predict-no)
  6133. I see 1 and I'm going to do: predict-no
  6134. ENV: Agent did: predict-no for direction U in state State-B
  6135. In State-B moving U
  6136. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6137. predict error 0
  6138. dir: dir isR
  6139. \872: O: O1744 (predict-no)
  6140. I see 1 and I'm going to do: predict-no
  6141. ENV: Agent did: predict-no for direction R in state State-B
  6142. In State-B moving R
  6143. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6144. predict error 0
  6145. dir: dir isU
  6146. -873: O: O1746 (predict-no)
  6147. I see 1 and I'm going to do: predict-no
  6148. ENV: Agent did: predict-no for direction U in state State-B
  6149. In State-B moving U
  6150. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6151. predict error 0
  6152. dir: dir isR
  6153. /|\874: O: O1748 (predict-no)
  6154. I see 1 and I'm going to do: predict-no
  6155. ENV: Agent did: predict-no for direction R in state State-B
  6156. In State-B moving R
  6157. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6158. predict error 0
  6159. dir: dir isR
  6160. -/|875: O: O1750 (predict-no)
  6161. I see 1 and I'm going to do: predict-no
  6162. ENV: Agent did: predict-no for direction R in state State-B
  6163. In State-B moving R
  6164. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6165. predict error 0
  6166. dir: dir isR
  6167. \-/876: O: O1752 (predict-no)
  6168. I see 1 and I'm going to do: predict-no
  6169. ENV: Agent did: predict-no for direction R in state State-B
  6170. In State-B moving R
  6171. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6172. predict error 0
  6173. dir: dir isU
  6174. |\877: O: O1754 (predict-no)
  6175. I see 1 and I'm going to do: predict-no
  6176. ENV: Agent did: predict-no for direction U in state State-B
  6177. In State-B moving U
  6178. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6179. predict error 0
  6180. dir: dir isU
  6181. -/|878: O: O1756 (predict-no)
  6182. I see 1 and I'm going to do: predict-no
  6183. ENV: Agent did: predict-no for direction U in state State-B
  6184. In State-B moving U
  6185. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6186. predict error 0
  6187. dir: dir isU
  6188. \-/879: O: O1758 (predict-no)
  6189. I see 1 and I'm going to do: predict-no
  6190. ENV: Agent did: predict-no for direction U in state State-B
  6191. In State-B moving U
  6192. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6193. predict error 0
  6194. dir: dir isU
  6195. |\-880: O: O1760 (predict-no)
  6196. I see 1 and I'm going to do: predict-no
  6197. ENV: Agent did: predict-no for direction U in state State-B
  6198. In State-B moving U
  6199. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6200. predict error 0
  6201. dir: dir isR
  6202. /|\881: O: O1762 (predict-no)
  6203. I see 1 and I'm going to do: predict-no
  6204. ENV: Agent did: predict-no for direction R in state State-B
  6205. In State-B moving R
  6206. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6207. predict error 0
  6208. dir: dir isL
  6209. -882: O: O1763 (predict-yes)
  6210. I see 1 and I'm going to do: predict-yes
  6211. ENV: Agent did: predict-yes for direction L in state State-B
  6212. In State-B moving L
  6213. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6214. predict error 0
  6215. dir: dir isR
  6216. /|\883: O: O1765 (predict-yes)
  6217. I see 1 and I'm going to do: predict-yes
  6218. ENV: Agent did: predict-yes for direction R in state State-A
  6219. In State-A moving R
  6220. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6221. predict error 0
  6222. dir: dir isL
  6223. -/|884: O: O1767 (predict-yes)
  6224. I see 1 and I'm going to do: predict-yes
  6225. ENV: Agent did: predict-yes for direction L in state State-B
  6226. In State-B moving L
  6227. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6228. predict error 0
  6229. dir: dir isU
  6230. \-/885: O: O1770 (predict-no)
  6231. I see 1 and I'm going to do: predict-no
  6232. ENV: Agent did: predict-no for direction U in state State-A
  6233. In State-A moving U
  6234. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6235. predict error 0
  6236. dir: dir isR
  6237. |\-886: O: O1771 (predict-yes)
  6238. I see 1 and I'm going to do: predict-yes
  6239. ENV: Agent did: predict-yes for direction R in state State-A
  6240. In State-A moving R
  6241. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6242. predict error 0
  6243. dir: dir isR
  6244. /|\887: O: O1774 (predict-no)
  6245. I see 1 and I'm going to do: predict-no
  6246. ENV: Agent did: predict-no for direction R in state State-B
  6247. In State-B moving R
  6248. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6249. predict error 0
  6250. dir: dir isU
  6251. -/|888: O: O1776 (predict-no)
  6252. I see 1 and I'm going to do: predict-no
  6253. ENV: Agent did: predict-no for direction U in state State-B
  6254. In State-B moving U
  6255. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6256. predict error 0
  6257. dir: dir isL
  6258. \889: O: O1777 (predict-yes)
  6259. I see 1 and I'm going to do: predict-yes
  6260. ENV: Agent did: predict-yes for direction L in state State-B
  6261. In State-B moving L
  6262. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6263. predict error 0
  6264. dir: dir isU
  6265. -890: O: O1780 (predict-no)
  6266. I see 1 and I'm going to do: predict-no
  6267. ENV: Agent did: predict-no for direction U in state State-A
  6268. In State-A moving U
  6269. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6270. predict error 0
  6271. dir: dir isL
  6272. /891: O: O1782 (predict-no)
  6273. I see 1 and I'm going to do: predict-no
  6274. ENV: Agent did: predict-no for direction L in state State-A
  6275. In State-A moving L
  6276. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6277. predict error 0
  6278. dir: dir isL
  6279. |892: O: O1784 (predict-no)
  6280. I see 1 and I'm going to do: predict-no
  6281. ENV: Agent did: predict-no for direction L in state State-A
  6282. In State-A moving L
  6283. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6284. predict error 0
  6285. dir: dir isR
  6286. \-/893: O: O1785 (predict-yes)
  6287. I see 1 and I'm going to do: predict-yes
  6288. ENV: Agent did: predict-yes for direction R in state State-A
  6289. In State-A moving R
  6290. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6291. predict error 0
  6292. dir: dir isR
  6293. |\894: O: O1788 (predict-no)
  6294. I see 1 and I'm going to do: predict-no
  6295. ENV: Agent did: predict-no for direction R in state State-B
  6296. In State-B moving R
  6297. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6298. predict error 0
  6299. dir: dir isU
  6300. -/|895: O: O1790 (predict-no)
  6301. I see 1 and I'm going to do: predict-no
  6302. ENV: Agent did: predict-no for direction U in state State-B
  6303. In State-B moving U
  6304. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6305. predict error 0
  6306. dir: dir isR
  6307. \-/896: O: O1792 (predict-no)
  6308. I see 1 and I'm going to do: predict-no
  6309. ENV: Agent did: predict-no for direction R in state State-B
  6310. In State-B moving R
  6311. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6312. predict error 0
  6313. dir: dir isL
  6314. |\-897: O: O1793 (predict-yes)
  6315. I see 1 and I'm going to do: predict-yes
  6316. ENV: Agent did: predict-yes for direction L in state State-B
  6317. In State-B moving L
  6318. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6319. predict error 0
  6320. dir: dir isL
  6321. /|\898: O: O1796 (predict-no)
  6322. I see 1 and I'm going to do: predict-no
  6323. ENV: Agent did: predict-no for direction L in state State-A
  6324. In State-A moving L
  6325. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6326. predict error 0
  6327. dir: dir isL
  6328. -/899: O: O1798 (predict-no)
  6329. I see 1 and I'm going to do: predict-no
  6330. ENV: Agent did: predict-no for direction L in state State-A
  6331. In State-A moving L
  6332. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6333. predict error 0
  6334. dir: dir isR
  6335. |\-/sleeping...
  6336. |900: O: O1799 (predict-yes)
  6337. I see 1 and I'm going to do: predict-yes
  6338. ENV: Agent did: predict-yes for direction R in state State-A
  6339. In State-A moving R
  6340. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6341. predict error 0
  6342. dir: dir isU
  6343. \-901: O: O1802 (predict-no)
  6344. I see 1 and I'm going to do: predict-no
  6345. ENV: Agent did: predict-no for direction U in state State-B
  6346. In State-B moving U
  6347. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6348. predict error 0
  6349. dir: dir isL
  6350. /902: O: O1803 (predict-yes)
  6351. I see 1 and I'm going to do: predict-yes
  6352. ENV: Agent did: predict-yes for direction L in state State-B
  6353. In State-B moving L
  6354. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6355. predict error 0
  6356. dir: dir isL
  6357. |\-903: O: O1806 (predict-no)
  6358. I see 1 and I'm going to do: predict-no
  6359. ENV: Agent did: predict-no for direction L in state State-A
  6360. In State-A moving L
  6361. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6362. predict error 0
  6363. dir: dir isR
  6364. /|\904: O: O1807 (predict-yes)
  6365. I see 1 and I'm going to do: predict-yes
  6366. ENV: Agent did: predict-yes for direction R in state State-A
  6367. In State-A moving R
  6368. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6369. predict error 0
  6370. dir: dir isU
  6371. -/|905: O: O1810 (predict-no)
  6372. I see 1 and I'm going to do: predict-no
  6373. ENV: Agent did: predict-no for direction U in state State-B
  6374. In State-B moving U
  6375. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6376. predict error 0
  6377. dir: dir isU
  6378. \-/906: O: O1812 (predict-no)
  6379. I see 1 and I'm going to do: predict-no
  6380. ENV: Agent did: predict-no for direction U in state State-B
  6381. In State-B moving U
  6382. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6383. predict error 0
  6384. dir: dir isU
  6385. |\-907: O: O1814 (predict-no)
  6386. I see 1 and I'm going to do: predict-no
  6387. ENV: Agent did: predict-no for direction U in state State-B
  6388. In State-B moving U
  6389. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6390. predict error 0
  6391. dir: dir isR
  6392. /|\908: O: O1816 (predict-no)
  6393. I see 1 and I'm going to do: predict-no
  6394. ENV: Agent did: predict-no for direction R in state State-B
  6395. In State-B moving R
  6396. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6397. predict error 0
  6398. dir: dir isR
  6399. -/|909: O: O1818 (predict-no)
  6400. I see 1 and I'm going to do: predict-no
  6401. ENV: Agent did: predict-no for direction R in state State-B
  6402. In State-B moving R
  6403. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6404. predict error 0
  6405. dir: dir isL
  6406. \910: O: O1819 (predict-yes)
  6407. I see 1 and I'm going to do: predict-yes
  6408. ENV: Agent did: predict-yes for direction L in state State-B
  6409. In State-B moving L
  6410. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6411. predict error 0
  6412. dir: dir isR
  6413. -/|911: O: O1821 (predict-yes)
  6414. I see 1 and I'm going to do: predict-yes
  6415. ENV: Agent did: predict-yes for direction R in state State-A
  6416. In State-A moving R
  6417. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6418. predict error 0
  6419. dir: dir isL
  6420. \912: O: O1823 (predict-yes)
  6421. I see 1 and I'm going to do: predict-yes
  6422. ENV: Agent did: predict-yes for direction L in state State-B
  6423. In State-B moving L
  6424. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6425. predict error 0
  6426. dir: dir isL
  6427. -/913: O: O1826 (predict-no)
  6428. I see 1 and I'm going to do: predict-no
  6429. ENV: Agent did: predict-no for direction L in state State-A
  6430. In State-A moving L
  6431. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6432. predict error 0
  6433. dir: dir isU
  6434. |\914: O: O1828 (predict-no)
  6435. I see 1 and I'm going to do: predict-no
  6436. ENV: Agent did: predict-no for direction U in state State-A
  6437. In State-A moving U
  6438. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6439. predict error 0
  6440. dir: dir isR
  6441. -/|915: O: O1829 (predict-yes)
  6442. I see 1 and I'm going to do: predict-yes
  6443. ENV: Agent did: predict-yes for direction R in state State-A
  6444. In State-A moving R
  6445. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6446. predict error 0
  6447. dir: dir isU
  6448. \-/916: O: O1832 (predict-no)
  6449. I see 1 and I'm going to do: predict-no
  6450. ENV: Agent did: predict-no for direction U in state State-B
  6451. In State-B moving U
  6452. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6453. predict error 0
  6454. dir: dir isR
  6455. |\917: O: O1834 (predict-no)
  6456. I see 1 and I'm going to do: predict-no
  6457. ENV: Agent did: predict-no for direction R in state State-B
  6458. In State-B moving R
  6459. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6460. predict error 0
  6461. dir: dir isL
  6462. -918: O: O1835 (predict-yes)
  6463. I see 1 and I'm going to do: predict-yes
  6464. ENV: Agent did: predict-yes for direction L in state State-B
  6465. In State-B moving L
  6466. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6467. predict error 0
  6468. dir: dir isR
  6469. /|\919: O: O1837 (predict-yes)
  6470. I see 1 and I'm going to do: predict-yes
  6471. ENV: Agent did: predict-yes for direction R in state State-A
  6472. In State-A moving R
  6473. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6474. predict error 0
  6475. dir: dir isR
  6476. -/|920: O: O1840 (predict-no)
  6477. I see 1 and I'm going to do: predict-no
  6478. ENV: Agent did: predict-no for direction R in state State-B
  6479. In State-B moving R
  6480. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6481. predict error 0
  6482. dir: dir isL
  6483. \-/921: O: O1841 (predict-yes)
  6484. I see 1 and I'm going to do: predict-yes
  6485. ENV: Agent did: predict-yes for direction L in state State-B
  6486. In State-B moving L
  6487. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6488. predict error 0
  6489. dir: dir isU
  6490. |922: O: O1844 (predict-no)
  6491. I see 1 and I'm going to do: predict-no
  6492. ENV: Agent did: predict-no for direction U in state State-A
  6493. In State-A moving U
  6494. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6495. predict error 0
  6496. dir: dir isU
  6497. \-/923: O: O1846 (predict-no)
  6498. I see 1 and I'm going to do: predict-no
  6499. ENV: Agent did: predict-no for direction U in state State-A
  6500. In State-A moving U
  6501. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6502. predict error 0
  6503. dir: dir isL
  6504. |\-924: O: O1848 (predict-no)
  6505. I see 1 and I'm going to do: predict-no
  6506. ENV: Agent did: predict-no for direction L in state State-A
  6507. In State-A moving L
  6508. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6509. predict error 0
  6510. dir: dir isL
  6511. /925: O: O1850 (predict-no)
  6512. I see 1 and I'm going to do: predict-no
  6513. ENV: Agent did: predict-no for direction L in state State-A
  6514. In State-A moving L
  6515. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6516. predict error 0
  6517. dir: dir isU
  6518. |\-926: O: O1852 (predict-no)
  6519. I see 1 and I'm going to do: predict-no
  6520. ENV: Agent did: predict-no for direction U in state State-A
  6521. In State-A moving U
  6522. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6523. predict error 0
  6524. dir: dir isU
  6525. /|\927: O: O1854 (predict-no)
  6526. I see 1 and I'm going to do: predict-no
  6527. ENV: Agent did: predict-no for direction U in state State-A
  6528. In State-A moving U
  6529. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6530. predict error 0
  6531. dir: dir isL
  6532. -/928: O: O1856 (predict-no)
  6533. I see 1 and I'm going to do: predict-no
  6534. ENV: Agent did: predict-no for direction L in state State-A
  6535. In State-A moving L
  6536. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6537. predict error 0
  6538. dir: dir isR
  6539. |\-929: O: O1857 (predict-yes)
  6540. I see 1 and I'm going to do: predict-yes
  6541. ENV: Agent did: predict-yes for direction R in state State-A
  6542. In State-A moving R
  6543. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6544. predict error 0
  6545. dir: dir isL
  6546. /|\930: O: O1859 (predict-yes)
  6547. I see 1 and I'm going to do: predict-yes
  6548. ENV: Agent did: predict-yes for direction L in state State-B
  6549. In State-B moving L
  6550. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6551. predict error 0
  6552. dir: dir isU
  6553. -/931: O: O1862 (predict-no)
  6554. I see 1 and I'm going to do: predict-no
  6555. ENV: Agent did: predict-no for direction U in state State-A
  6556. In State-A moving U
  6557. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6558. predict error 0
  6559. dir: dir isR
  6560. |932: O: O1863 (predict-yes)
  6561. I see 1 and I'm going to do: predict-yes
  6562. ENV: Agent did: predict-yes for direction R in state State-A
  6563. In State-A moving R
  6564. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6565. predict error 0
  6566. dir: dir isR
  6567. \-/933: O: O1866 (predict-no)
  6568. I see 1 and I'm going to do: predict-no
  6569. ENV: Agent did: predict-no for direction R in state State-B
  6570. In State-B moving R
  6571. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6572. predict error 0
  6573. dir: dir isL
  6574. |\-934: O: O1867 (predict-yes)
  6575. I see 1 and I'm going to do: predict-yes
  6576. ENV: Agent did: predict-yes for direction L in state State-B
  6577. In State-B moving L
  6578. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6579. predict error 0
  6580. dir: dir isR
  6581. /|\935: O: O1869 (predict-yes)
  6582. I see 1 and I'm going to do: predict-yes
  6583. ENV: Agent did: predict-yes for direction R in state State-A
  6584. In State-A moving R
  6585. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6586. predict error 0
  6587. dir: dir isR
  6588. -/936: O: O1872 (predict-no)
  6589. I see 1 and I'm going to do: predict-no
  6590. ENV: Agent did: predict-no for direction R in state State-B
  6591. In State-B moving R
  6592. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6593. predict error 0
  6594. dir: dir isR
  6595. |\-937: O: O1874 (predict-no)
  6596. I see 1 and I'm going to do: predict-no
  6597. ENV: Agent did: predict-no for direction R in state State-B
  6598. In State-B moving R
  6599. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6600. predict error 0
  6601. dir: dir isL
  6602. /|\938: O: O1875 (predict-yes)
  6603. I see 1 and I'm going to do: predict-yes
  6604. ENV: Agent did: predict-yes for direction L in state State-B
  6605. In State-B moving L
  6606. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6607. predict error 0
  6608. dir: dir isL
  6609. -/|939: O: O1878 (predict-no)
  6610. I see 1 and I'm going to do: predict-no
  6611. ENV: Agent did: predict-no for direction L in state State-A
  6612. In State-A moving L
  6613. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6614. predict error 0
  6615. dir: dir isU
  6616. \-/940: O: O1880 (predict-no)
  6617. I see 1 and I'm going to do: predict-no
  6618. ENV: Agent did: predict-no for direction U in state State-A
  6619. In State-A moving U
  6620. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6621. predict error 0
  6622. dir: dir isU
  6623. |\-941: O: O1882 (predict-no)
  6624. I see 1 and I'm going to do: predict-no
  6625. ENV: Agent did: predict-no for direction U in state State-A
  6626. In State-A moving U
  6627. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6628. predict error 0
  6629. dir: dir isU
  6630. /942: O: O1884 (predict-no)
  6631. I see 1 and I'm going to do: predict-no
  6632. ENV: Agent did: predict-no for direction U in state State-A
  6633. In State-A moving U
  6634. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6635. predict error 0
  6636. dir: dir isR
  6637. |\943: O: O1885 (predict-yes)
  6638. I see 1 and I'm going to do: predict-yes
  6639. ENV: Agent did: predict-yes for direction R in state State-A
  6640. In State-A moving R
  6641. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6642. predict error 0
  6643. dir: dir isU
  6644. -/|944: O: O1888 (predict-no)
  6645. I see 1 and I'm going to do: predict-no
  6646. ENV: Agent did: predict-no for direction U in state State-B
  6647. In State-B moving U
  6648. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6649. predict error 0
  6650. dir: dir isL
  6651. \-/945: O: O1889 (predict-yes)
  6652. I see 1 and I'm going to do: predict-yes
  6653. ENV: Agent did: predict-yes for direction L in state State-B
  6654. In State-B moving L
  6655. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6656. predict error 0
  6657. dir: dir isL
  6658. |\-946: O: O1892 (predict-no)
  6659. I see 1 and I'm going to do: predict-no
  6660. ENV: Agent did: predict-no for direction L in state State-A
  6661. In State-A moving L
  6662. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6663. predict error 0
  6664. dir: dir isU
  6665. /|947: O: O1894 (predict-no)
  6666. I see 1 and I'm going to do: predict-no
  6667. ENV: Agent did: predict-no for direction U in state State-A
  6668. In State-A moving U
  6669. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6670. predict error 0
  6671. dir: dir isL
  6672. \-948: O: O1896 (predict-no)
  6673. I see 1 and I'm going to do: predict-no
  6674. ENV: Agent did: predict-no for direction L in state State-A
  6675. In State-A moving L
  6676. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6677. predict error 0
  6678. dir: dir isL
  6679. /|\949: O: O1898 (predict-no)
  6680. I see 1 and I'm going to do: predict-no
  6681. ENV: Agent did: predict-no for direction L in state State-A
  6682. In State-A moving L
  6683. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6684. predict error 0
  6685. dir: dir isU
  6686. -/|950: O: O1900 (predict-no)
  6687. I see 1 and I'm going to do: predict-no
  6688. ENV: Agent did: predict-no for direction U in state State-A
  6689. In State-A moving U
  6690. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6691. predict error 0
  6692. dir: dir isL
  6693. \-/|\-/--- Input Phase ---
  6694. =>WM: (13326: I2 ^dir L)
  6695. =>WM: (13325: I2 ^reward 1)
  6696. =>WM: (13324: I2 ^see 0)
  6697. =>WM: (13323: N950 ^status complete)
  6698. <=WM: (13312: I2 ^dir U)
  6699. <=WM: (13311: I2 ^reward 1)
  6700. <=WM: (13310: I2 ^see 0)
  6701. =>WM: (13327: I2 ^level-1 L0-root)
  6702. <=WM: (13313: I2 ^level-1 L0-root)
  6703. --- END Input Phase ---
  6704. --- Proposal Phase ---
  6705. --- Inner Elaboration Phase, active level 1 (S1) ---
  6706. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  6707. -->
  6708. (S1 ^operator O1899 = -0.208713043145708)
  6709. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  6710. -->
  6711. (S1 ^operator O1900 = 0.6854017956462798)
  6712. Firing prefer*rvt*predict-no*H0*4*H1
  6713. -->
  6714. Firing prefer*rvt*predict-yes*H0*3*H1
  6715. -->
  6716. Firing elaborate*copy-see-to-output-link
  6717. -->
  6718. (I3 ^see 0 +)
  6719. Firing elaborate*reward*based*on*reward
  6720. -->
  6721. (R954 ^value 1 +)
  6722. (R1 ^reward R954 +)
  6723. Firing propose*predict-yes
  6724. -->
  6725. (O1901 ^name predict-yes +)
  6726. (S1 ^operator O1901 +)
  6727. Firing propose*predict-no
  6728. -->
  6729. (O1902 ^name predict-no +)
  6730. (S1 ^operator O1902 +)
  6731. Firing rl*prefer*rvt*predict-no*H0*4
  6732. -->
  6733. (S1 ^operator O1900 = 0.3145080651024651)
  6734. Firing rl*prefer*rvt*predict-yes*H0*3
  6735. -->
  6736. (S1 ^operator O1899 = 0.3908143935841644)
  6737. Firing prefer*rvt*predict-yes*H0
  6738. -->
  6739. Firing prefer*rvt*predict-no*H0
  6740. -->
  6741. Firing elaborate*copy-dir-to-output-link
  6742. -->
  6743. (I3 ^dir L +)
  6744. inner elaboration loop at bottom goal.
  6745. Retracting elaborate*copy-see-to-output-link
  6746. -->
  6747. (I3 ^see 0 +)
  6748. Retracting propose*predict-no
  6749. -->
  6750. (O1900 ^name predict-no +)
  6751. (S1 ^operator O1900 +)
  6752. Retracting propose*predict-yes
  6753. -->
  6754. (O1899 ^name predict-yes +)
  6755. (S1 ^operator O1899 +)
  6756. Retracting elaborate*reward*based*on*reward
  6757. -->
  6758. (R953 ^value 1 +)
  6759. (R1 ^reward R953 +)
  6760. Retracting elaborate*copy-dir-to-output-link
  6761. -->
  6762. (I3 ^dir U +)
  6763. Retracting rl*prefer*rvt*predict-no*H0*2
  6764. -->
  6765. (S1 ^operator O1900 = 1.)
  6766. Retracting rl*prefer*rvt*predict-yes*H0*1
  6767. -->
  6768. (S1 ^operator O1899 = 0.)
  6769. =>WM: (13334: S1 ^operator O1902 +)
  6770. =>WM: (13333: S1 ^operator O1901 +)
  6771. =>WM: (13332: I3 ^dir L)
  6772. =>WM: (13331: O1902 ^name predict-no)
  6773. =>WM: (13330: O1901 ^name predict-yes)
  6774. =>WM: (13329: R954 ^value 1)
  6775. =>WM: (13328: R1 ^reward R954)
  6776. <=WM: (13319: S1 ^operator O1899 +)
  6777. <=WM: (13320: S1 ^operator O1900 +)
  6778. <=WM: (13321: S1 ^operator O1900)
  6779. <=WM: (13318: I3 ^dir U)
  6780. <=WM: (13314: R1 ^reward R953)
  6781. <=WM: (13317: O1900 ^name predict-no)
  6782. <=WM: (13316: O1899 ^name predict-yes)
  6783. <=WM: (13315: R953 ^value 1)
  6784. --- Inner Elaboration Phase, active level 1 (S1) ---
  6785. Firing prefer*rvt*predict-yes*H0
  6786. -->
  6787. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  6788. -->
  6789. (S1 ^operator O1901 = -0.208713043145708)
  6790. Firing rl*prefer*rvt*predict-yes*H0*3
  6791. -->
  6792. (S1 ^operator O1901 = 0.3908143935841644)
  6793. Firing prefer*rvt*predict-yes*H0*3*H1
  6794. -->
  6795. Firing prefer*rvt*predict-no*H0
  6796. -->
  6797. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  6798. -->
  6799. (S1 ^operator O1902 = 0.6854017956462798)
  6800. Firing rl*prefer*rvt*predict-no*H0*4
  6801. -->
  6802. (S1 ^operator O1902 = 0.3145080651024651)
  6803. Firing prefer*rvt*predict-no*H0*4*H1
  6804. -->
  6805. inner elaboration loop at bottom goal.
  6806. Retracting rl*prefer*rvt*predict-no*H0*4
  6807. -->
  6808. (S1 ^operator O1900 = 0.3145080651024651)
  6809. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  6810. -->
  6811. (S1 ^operator O1900 = 0.6854017956462798)
  6812. Retracting rl*prefer*rvt*predict-yes*H0*3
  6813. -->
  6814. (S1 ^operator O1899 = 0.3908143935841644)
  6815. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  6816. -->
  6817. (S1 ^operator O1899 = -0.208713043145708)
  6818. --- END Proposal Phase ---
  6819. --- Decision Phase ---
  6820. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  6821. =>WM: (13335: S1 ^operator O1902)
  6822. 951: O: O1902 (predict-no)
  6823. --- END Decision Phase ---
  6824. --- Application Phase ---
  6825. --- Firing Productions (PE) For State At Depth 1 ---
  6826. --- Inner Elaboration Phase, active level 1 (S1) ---
  6827. Firing apply*operator
  6828. -->
  6829. (I3 ^predict-no N951 + :O )
  6830. Firing apply*operator*complete
  6831. -->
  6832. (I3 ^predict-no N950 - :O )
  6833. inner elaboration loop at bottom goal.
  6834. --- Change Working Memory (PE) ---
  6835. =>WM: (13336: I3 ^predict-no N951)
  6836. <=WM: (13323: N950 ^status complete)
  6837. <=WM: (13322: I3 ^predict-no N950)
  6838. --- Firing Productions (IE) For State At Depth 1 ---
  6839. --- Inner Elaboration Phase, active level 1 (S1) ---
  6840. Firing monitor*world
  6841. -->
  6842. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  6843. --- Change Working Memory (IE) ---
  6844. --- END Application Phase ---
  6845. --- Output Phase ---
  6846. ENV: Agent did: predict-no for direction L in state State-A
  6847. In State-A moving L
  6848. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6849. predict error 0
  6850. dir: dir isL
  6851. --- END Output Phase ---
  6852. |--- Input Phase ---
  6853. =>WM: (13340: I2 ^dir L)
  6854. =>WM: (13339: I2 ^reward 1)
  6855. =>WM: (13338: I2 ^see 0)
  6856. =>WM: (13337: N951 ^status complete)
  6857. <=WM: (13326: I2 ^dir L)
  6858. <=WM: (13325: I2 ^reward 1)
  6859. <=WM: (13324: I2 ^see 0)
  6860. =>WM: (13341: I2 ^level-1 L0-root)
  6861. <=WM: (13327: I2 ^level-1 L0-root)
  6862. --- END Input Phase ---
  6863. --- Proposal Phase ---
  6864. --- Inner Elaboration Phase, active level 1 (S1) ---
  6865. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  6866. -->
  6867. (S1 ^operator O1901 = -0.208713043145708)
  6868. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  6869. -->
  6870. (S1 ^operator O1902 = 0.6854017956462798)
  6871. Firing prefer*rvt*predict-no*H0*4*H1
  6872. -->
  6873. Firing prefer*rvt*predict-yes*H0*3*H1
  6874. -->
  6875. Firing elaborate*copy-see-to-output-link
  6876. -->
  6877. (I3 ^see 0 +)
  6878. Firing elaborate*reward*based*on*reward
  6879. -->
  6880. (R955 ^value 1 +)
  6881. (R1 ^reward R955 +)
  6882. Firing propose*predict-yes
  6883. -->
  6884. (O1903 ^name predict-yes +)
  6885. (S1 ^operator O1903 +)
  6886. Firing propose*predict-no
  6887. -->
  6888. (O1904 ^name predict-no +)
  6889. (S1 ^operator O1904 +)
  6890. Firing rl*prefer*rvt*predict-no*H0*4
  6891. -->
  6892. (S1 ^operator O1902 = 0.3145080651024651)
  6893. Firing rl*prefer*rvt*predict-yes*H0*3
  6894. -->
  6895. (S1 ^operator O1901 = 0.3908143935841644)
  6896. Firing prefer*rvt*predict-yes*H0
  6897. -->
  6898. Firing prefer*rvt*predict-no*H0
  6899. -->
  6900. Firing elaborate*copy-dir-to-output-link
  6901. -->
  6902. (I3 ^dir L +)
  6903. inner elaboration loop at bottom goal.
  6904. Retracting elaborate*copy-see-to-output-link
  6905. -->
  6906. (I3 ^see 0 +)
  6907. Retracting propose*predict-no
  6908. -->
  6909. (O1902 ^name predict-no +)
  6910. (S1 ^operator O1902 +)
  6911. Retracting propose*predict-yes
  6912. -->
  6913. (O1901 ^name predict-yes +)
  6914. (S1 ^operator O1901 +)
  6915. Retracting elaborate*reward*based*on*reward
  6916. -->
  6917. (R954 ^value 1 +)
  6918. (R1 ^reward R954 +)
  6919. Retracting elaborate*copy-dir-to-output-link
  6920. -->
  6921. (I3 ^dir L +)
  6922. Retracting rl*prefer*rvt*predict-no*H0*4
  6923. -->
  6924. (S1 ^operator O1902 = 0.3145080651024651)
  6925. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  6926. -->
  6927. (S1 ^operator O1902 = 0.6854017956462798)
  6928. Retracting rl*prefer*rvt*predict-yes*H0*3
  6929. -->
  6930. (S1 ^operator O1901 = 0.3908143935841644)
  6931. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  6932. -->
  6933. (S1 ^operator O1901 = -0.208713043145708)
  6934. =>WM: (13347: S1 ^operator O1904 +)
  6935. =>WM: (13346: S1 ^operator O1903 +)
  6936. =>WM: (13345: O1904 ^name predict-no)
  6937. =>WM: (13344: O1903 ^name predict-yes)
  6938. =>WM: (13343: R955 ^value 1)
  6939. =>WM: (13342: R1 ^reward R955)
  6940. <=WM: (13333: S1 ^operator O1901 +)
  6941. <=WM: (13334: S1 ^operator O1902 +)
  6942. <=WM: (13335: S1 ^operator O1902)
  6943. <=WM: (13328: R1 ^reward R954)
  6944. <=WM: (13331: O1902 ^name predict-no)
  6945. <=WM: (13330: O1901 ^name predict-yes)
  6946. <=WM: (13329: R954 ^value 1)
  6947. --- Inner Elaboration Phase, active level 1 (S1) ---
  6948. Firing prefer*rvt*predict-yes*H0
  6949. -->
  6950. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  6951. -->
  6952. (S1 ^operator O1903 = -0.208713043145708)
  6953. Firing rl*prefer*rvt*predict-yes*H0*3
  6954. -->
  6955. (S1 ^operator O1903 = 0.3908143935841644)
  6956. Firing prefer*rvt*predict-yes*H0*3*H1
  6957. -->
  6958. Firing prefer*rvt*predict-no*H0
  6959. -->
  6960. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  6961. -->
  6962. (S1 ^operator O1904 = 0.6854017956462798)
  6963. Firing rl*prefer*rvt*predict-no*H0*4
  6964. -->
  6965. (S1 ^operator O1904 = 0.3145080651024651)
  6966. Firing prefer*rvt*predict-no*H0*4*H1
  6967. -->
  6968. inner elaboration loop at bottom goal.
  6969. Retracting rl*prefer*rvt*predict-no*H0*4
  6970. -->
  6971. (S1 ^operator O1902 = 0.3145080651024651)
  6972. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  6973. -->
  6974. (S1 ^operator O1902 = 0.6854017956462798)
  6975. Retracting rl*prefer*rvt*predict-yes*H0*3
  6976. -->
  6977. (S1 ^operator O1901 = 0.3908143935841644)
  6978. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  6979. -->
  6980. (S1 ^operator O1901 = -0.208713043145708)
  6981. --- END Proposal Phase ---
  6982. --- Decision Phase ---
  6983. RL update rl*prefer*rvt*predict-no*H0*4 0.478556 -0.164048 0.314508 -> 0.478563 -0.164047 0.314516(R,m,v=1,0.917808,0.0759565)
  6984. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521362 0.16404 0.685402 -> 0.52137 0.16404 0.685411(R,m,v=1,1,0)
  6985. =>WM: (13348: S1 ^operator O1904)
  6986. 952: O: O1904 (predict-no)
  6987. --- END Decision Phase ---
  6988. --- Application Phase ---
  6989. --- Firing Productions (PE) For State At Depth 1 ---
  6990. --- Inner Elaboration Phase, active level 1 (S1) ---
  6991. Firing apply*operator
  6992. -->
  6993. (I3 ^predict-no N952 + :O )
  6994. Firing apply*operator*complete
  6995. -->
  6996. (I3 ^predict-no N951 - :O )
  6997. inner elaboration loop at bottom goal.
  6998. --- Change Working Memory (PE) ---
  6999. =>WM: (13349: I3 ^predict-no N952)
  7000. <=WM: (13337: N951 ^status complete)
  7001. <=WM: (13336: I3 ^predict-no N951)
  7002. --- Firing Productions (IE) For State At Depth 1 ---
  7003. --- Inner Elaboration Phase, active level 1 (S1) ---
  7004. Firing monitor*world
  7005. -->
  7006. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7007. --- Change Working Memory (IE) ---
  7008. --- END Application Phase ---
  7009. --- Output Phase ---
  7010. ENV: Agent did: predict-no for direction L in state State-A
  7011. In State-A moving L
  7012. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7013. predict error 0
  7014. dir: dir isR
  7015. --- END Output Phase ---
  7016. \-/--- Input Phase ---
  7017. =>WM: (13353: I2 ^dir R)
  7018. =>WM: (13352: I2 ^reward 1)
  7019. =>WM: (13351: I2 ^see 0)
  7020. =>WM: (13350: N952 ^status complete)
  7021. <=WM: (13340: I2 ^dir L)
  7022. <=WM: (13339: I2 ^reward 1)
  7023. <=WM: (13338: I2 ^see 0)
  7024. =>WM: (13354: I2 ^level-1 L0-root)
  7025. <=WM: (13341: I2 ^level-1 L0-root)
  7026. --- END Input Phase ---
  7027. --- Proposal Phase ---
  7028. --- Inner Elaboration Phase, active level 1 (S1) ---
  7029. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  7030. -->
  7031. (S1 ^operator O1903 = 0.8783877442642956)
  7032. Firing prefer*rvt*predict-yes*H0*5*H1
  7033. -->
  7034. Firing elaborate*copy-see-to-output-link
  7035. -->
  7036. (I3 ^see 0 +)
  7037. Firing elaborate*reward*based*on*reward
  7038. -->
  7039. (R956 ^value 1 +)
  7040. (R1 ^reward R956 +)
  7041. Firing propose*predict-yes
  7042. -->
  7043. (O1905 ^name predict-yes +)
  7044. (S1 ^operator O1905 +)
  7045. Firing propose*predict-no
  7046. -->
  7047. (O1906 ^name predict-no +)
  7048. (S1 ^operator O1906 +)
  7049. Firing rl*prefer*rvt*predict-no*H0*6
  7050. -->
  7051. (S1 ^operator O1904 = 0.999977424773942)
  7052. Firing rl*prefer*rvt*predict-yes*H0*5
  7053. -->
  7054. (S1 ^operator O1903 = 0.1215951465100475)
  7055. Firing prefer*rvt*predict-yes*H0
  7056. -->
  7057. Firing prefer*rvt*predict-no*H0
  7058. -->
  7059. Firing elaborate*copy-dir-to-output-link
  7060. -->
  7061. (I3 ^dir R +)
  7062. inner elaboration loop at bottom goal.
  7063. Retracting elaborate*copy-see-to-output-link
  7064. -->
  7065. (I3 ^see 0 +)
  7066. Retracting propose*predict-no
  7067. -->
  7068. (O1904 ^name predict-no +)
  7069. (S1 ^operator O1904 +)
  7070. Retracting propose*predict-yes
  7071. -->
  7072. (O1903 ^name predict-yes +)
  7073. (S1 ^operator O1903 +)
  7074. Retracting elaborate*reward*based*on*reward
  7075. -->
  7076. (R955 ^value 1 +)
  7077. (R1 ^reward R955 +)
  7078. Retracting elaborate*copy-dir-to-output-link
  7079. -->
  7080. (I3 ^dir L +)
  7081. Retracting rl*prefer*rvt*predict-no*H0*4
  7082. -->
  7083. (S1 ^operator O1904 = 0.3145155972863931)
  7084. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  7085. -->
  7086. (S1 ^operator O1904 = 0.6854105587116136)
  7087. Retracting rl*prefer*rvt*predict-yes*H0*3
  7088. -->
  7089. (S1 ^operator O1903 = 0.3908143935841644)
  7090. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  7091. -->
  7092. (S1 ^operator O1903 = -0.208713043145708)
  7093. =>WM: (13361: S1 ^operator O1906 +)
  7094. =>WM: (13360: S1 ^operator O1905 +)
  7095. =>WM: (13359: I3 ^dir R)
  7096. =>WM: (13358: O1906 ^name predict-no)
  7097. =>WM: (13357: O1905 ^name predict-yes)
  7098. =>WM: (13356: R956 ^value 1)
  7099. =>WM: (13355: R1 ^reward R956)
  7100. <=WM: (13346: S1 ^operator O1903 +)
  7101. <=WM: (13347: S1 ^operator O1904 +)
  7102. <=WM: (13348: S1 ^operator O1904)
  7103. <=WM: (13332: I3 ^dir L)
  7104. <=WM: (13342: R1 ^reward R955)
  7105. <=WM: (13345: O1904 ^name predict-no)
  7106. <=WM: (13344: O1903 ^name predict-yes)
  7107. <=WM: (13343: R955 ^value 1)
  7108. --- Inner Elaboration Phase, active level 1 (S1) ---
  7109. Firing prefer*rvt*predict-yes*H0
  7110. -->
  7111. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  7112. -->
  7113. (S1 ^operator O1905 = 0.8783877442642956)
  7114. Firing rl*prefer*rvt*predict-yes*H0*5
  7115. -->
  7116. (S1 ^operator O1905 = 0.1215951465100475)
  7117. Firing prefer*rvt*predict-yes*H0*5*H1
  7118. -->
  7119. Firing prefer*rvt*predict-no*H0
  7120. -->
  7121. Firing rl*prefer*rvt*predict-no*H0*6
  7122. -->
  7123. (S1 ^operator O1906 = 0.999977424773942)
  7124. inner elaboration loop at bottom goal.
  7125. Retracting rl*prefer*rvt*predict-no*H0*6
  7126. -->
  7127. (S1 ^operator O1904 = 0.999977424773942)
  7128. Retracting rl*prefer*rvt*predict-yes*H0*5
  7129. -->
  7130. (S1 ^operator O1903 = 0.1215951465100475)
  7131. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  7132. -->
  7133. (S1 ^operator O1903 = 0.8783877442642956)
  7134. --- END Proposal Phase ---
  7135. --- Decision Phase ---
  7136. RL update rl*prefer*rvt*predict-no*H0*4 0.478563 -0.164047 0.314516 -> 0.478568 -0.164047 0.314522(R,m,v=1,0.918367,0.0754822)
  7137. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.52137 0.16404 0.685411 -> 0.521377 0.164041 0.685418(R,m,v=1,1,0)
  7138. =>WM: (13362: S1 ^operator O1905)
  7139. 953: O: O1905 (predict-yes)
  7140. --- END Decision Phase ---
  7141. --- Application Phase ---
  7142. --- Firing Productions (PE) For State At Depth 1 ---
  7143. --- Inner Elaboration Phase, active level 1 (S1) ---
  7144. Firing apply*operator
  7145. -->
  7146. (I3 ^predict-yes N953 + :O )
  7147. Firing apply*operator*complete
  7148. -->
  7149. (I3 ^predict-no N952 - :O )
  7150. inner elaboration loop at bottom goal.
  7151. --- Change Working Memory (PE) ---
  7152. =>WM: (13363: I3 ^predict-yes N953)
  7153. <=WM: (13350: N952 ^status complete)
  7154. <=WM: (13349: I3 ^predict-no N952)
  7155. --- Firing Productions (IE) For State At Depth 1 ---
  7156. --- Inner Elaboration Phase, active level 1 (S1) ---
  7157. Firing monitor*world
  7158. -->
  7159. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7160. --- Change Working Memory (IE) ---
  7161. --- END Application Phase ---
  7162. --- Output Phase ---
  7163. ENV: Agent did: predict-yes for direction R in state State-A
  7164. In State-A moving R
  7165. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  7166. predict error 0
  7167. dir: dir isU
  7168. --- END Output Phase ---
  7169. |\--- Input Phase ---
  7170. =>WM: (13367: I2 ^dir U)
  7171. =>WM: (13366: I2 ^reward 1)
  7172. =>WM: (13365: I2 ^see 1)
  7173. =>WM: (13364: N953 ^status complete)
  7174. <=WM: (13353: I2 ^dir R)
  7175. <=WM: (13352: I2 ^reward 1)
  7176. <=WM: (13351: I2 ^see 0)
  7177. =>WM: (13368: I2 ^level-1 R1-root)
  7178. <=WM: (13354: I2 ^level-1 L0-root)
  7179. --- END Input Phase ---
  7180. --- Proposal Phase ---
  7181. --- Inner Elaboration Phase, active level 1 (S1) ---
  7182. Firing elaborate*copy-see-to-output-link
  7183. -->
  7184. (I3 ^see 1 +)
  7185. Firing elaborate*reward*based*on*reward
  7186. -->
  7187. (R957 ^value 1 +)
  7188. (R1 ^reward R957 +)
  7189. Firing propose*predict-yes
  7190. -->
  7191. (O1907 ^name predict-yes +)
  7192. (S1 ^operator O1907 +)
  7193. Firing propose*predict-no
  7194. -->
  7195. (O1908 ^name predict-no +)
  7196. (S1 ^operator O1908 +)
  7197. Firing rl*prefer*rvt*predict-no*H0*2
  7198. -->
  7199. (S1 ^operator O1906 = 1.)
  7200. Firing rl*prefer*rvt*predict-yes*H0*1
  7201. -->
  7202. (S1 ^operator O1905 = 0.)
  7203. Firing prefer*rvt*predict-yes*H0
  7204. -->
  7205. Firing prefer*rvt*predict-no*H0
  7206. -->
  7207. Firing elaborate*copy-dir-to-output-link
  7208. -->
  7209. (I3 ^dir U +)
  7210. inner elaboration loop at bottom goal.
  7211. Retracting elaborate*copy-see-to-output-link
  7212. -->
  7213. (I3 ^see 0 +)
  7214. Retracting propose*predict-no
  7215. -->
  7216. (O1906 ^name predict-no +)
  7217. (S1 ^operator O1906 +)
  7218. Retracting propose*predict-yes
  7219. -->
  7220. (O1905 ^name predict-yes +)
  7221. (S1 ^operator O1905 +)
  7222. Retracting elaborate*reward*based*on*reward
  7223. -->
  7224. (R956 ^value 1 +)
  7225. (R1 ^reward R956 +)
  7226. Retracting elaborate*copy-dir-to-output-link
  7227. -->
  7228. (I3 ^dir R +)
  7229. Retracting rl*prefer*rvt*predict-no*H0*6
  7230. -->
  7231. (S1 ^operator O1906 = 0.999977424773942)
  7232. Retracting rl*prefer*rvt*predict-yes*H0*5
  7233. -->
  7234. (S1 ^operator O1905 = 0.1215951465100475)
  7235. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  7236. -->
  7237. (S1 ^operator O1905 = 0.8783877442642956)
  7238. =>WM: (13376: S1 ^operator O1908 +)
  7239. =>WM: (13375: S1 ^operator O1907 +)
  7240. =>WM: (13374: I3 ^dir U)
  7241. =>WM: (13373: O1908 ^name predict-no)
  7242. =>WM: (13372: O1907 ^name predict-yes)
  7243. =>WM: (13371: R957 ^value 1)
  7244. =>WM: (13370: R1 ^reward R957)
  7245. =>WM: (13369: I3 ^see 1)
  7246. <=WM: (13360: S1 ^operator O1905 +)
  7247. <=WM: (13362: S1 ^operator O1905)
  7248. <=WM: (13361: S1 ^operator O1906 +)
  7249. <=WM: (13359: I3 ^dir R)
  7250. <=WM: (13355: R1 ^reward R956)
  7251. <=WM: (13272: I3 ^see 0)
  7252. <=WM: (13358: O1906 ^name predict-no)
  7253. <=WM: (13357: O1905 ^name predict-yes)
  7254. <=WM: (13356: R956 ^value 1)
  7255. --- Inner Elaboration Phase, active level 1 (S1) ---
  7256. Firing prefer*rvt*predict-yes*H0
  7257. -->
  7258. Firing rl*prefer*rvt*predict-yes*H0*1
  7259. -->
  7260. (S1 ^operator O1907 = 0.)
  7261. Firing prefer*rvt*predict-no*H0
  7262. -->
  7263. Firing rl*prefer*rvt*predict-no*H0*2
  7264. -->
  7265. (S1 ^operator O1908 = 1.)
  7266. inner elaboration loop at bottom goal.
  7267. Retracting rl*prefer*rvt*predict-no*H0*2
  7268. -->
  7269. (S1 ^operator O1906 = 1.)
  7270. Retracting rl*prefer*rvt*predict-yes*H0*1
  7271. -->
  7272. (S1 ^operator O1905 = 0.)
  7273. --- END Proposal Phase ---
  7274. --- Decision Phase ---
  7275. RL update rl*prefer*rvt*predict-yes*H0*5 0.534522 -0.412927 0.121595 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.857143,0.123182)
  7276. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465464 0.412924 0.878388 -> 0.465465 0.412924 0.878389(R,m,v=1,1,0)
  7277. =>WM: (13377: S1 ^operator O1908)
  7278. 954: O: O1908 (predict-no)
  7279. --- END Decision Phase ---
  7280. --- Application Phase ---
  7281. --- Firing Productions (PE) For State At Depth 1 ---
  7282. --- Inner Elaboration Phase, active level 1 (S1) ---
  7283. Firing apply*operator
  7284. -->
  7285. (I3 ^predict-no N954 + :O )
  7286. Firing apply*operator*complete
  7287. -->
  7288. (I3 ^predict-yes N953 - :O )
  7289. inner elaboration loop at bottom goal.
  7290. --- Change Working Memory (PE) ---
  7291. =>WM: (13378: I3 ^predict-no N954)
  7292. <=WM: (13364: N953 ^status complete)
  7293. <=WM: (13363: I3 ^predict-yes N953)
  7294. --- Firing Productions (IE) For State At Depth 1 ---
  7295. --- Inner Elaboration Phase, active level 1 (S1) ---
  7296. Firing monitor*world
  7297. -->
  7298. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7299. --- Change Working Memory (IE) ---
  7300. --- END Application Phase ---
  7301. --- Output Phase ---
  7302. ENV: Agent did: predict-no for direction U in state State-B
  7303. In State-B moving U
  7304. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7305. predict error 0
  7306. dir: dir isL
  7307. --- END Output Phase ---
  7308. -/--- Input Phase ---
  7309. =>WM: (13382: I2 ^dir L)
  7310. =>WM: (13381: I2 ^reward 1)
  7311. =>WM: (13380: I2 ^see 0)
  7312. =>WM: (13379: N954 ^status complete)
  7313. <=WM: (13367: I2 ^dir U)
  7314. <=WM: (13366: I2 ^reward 1)
  7315. <=WM: (13365: I2 ^see 1)
  7316. =>WM: (13383: I2 ^level-1 R1-root)
  7317. <=WM: (13368: I2 ^level-1 R1-root)
  7318. --- END Input Phase ---
  7319. --- Proposal Phase ---
  7320. --- Inner Elaboration Phase, active level 1 (S1) ---
  7321. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  7322. -->
  7323. (S1 ^operator O1908 = -0.168718511744511)
  7324. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  7325. -->
  7326. (S1 ^operator O1907 = 0.6093893278107597)
  7327. Firing prefer*rvt*predict-no*H0*4*H1
  7328. -->
  7329. Firing prefer*rvt*predict-yes*H0*3*H1
  7330. -->
  7331. Firing elaborate*copy-see-to-output-link
  7332. -->
  7333. (I3 ^see 0 +)
  7334. Firing elaborate*reward*based*on*reward
  7335. -->
  7336. (R958 ^value 1 +)
  7337. (R1 ^reward R958 +)
  7338. Firing propose*predict-yes
  7339. -->
  7340. (O1909 ^name predict-yes +)
  7341. (S1 ^operator O1909 +)
  7342. Firing propose*predict-no
  7343. -->
  7344. (O1910 ^name predict-no +)
  7345. (S1 ^operator O1910 +)
  7346. Firing rl*prefer*rvt*predict-no*H0*4
  7347. -->
  7348. (S1 ^operator O1908 = 0.3145217607813431)
  7349. Firing rl*prefer*rvt*predict-yes*H0*3
  7350. -->
  7351. (S1 ^operator O1907 = 0.3908143935841644)
  7352. Firing prefer*rvt*predict-yes*H0
  7353. -->
  7354. Firing prefer*rvt*predict-no*H0
  7355. -->
  7356. Firing elaborate*copy-dir-to-output-link
  7357. -->
  7358. (I3 ^dir L +)
  7359. inner elaboration loop at bottom goal.
  7360. Retracting elaborate*copy-see-to-output-link
  7361. -->
  7362. (I3 ^see 1 +)
  7363. Retracting propose*predict-no
  7364. -->
  7365. (O1908 ^name predict-no +)
  7366. (S1 ^operator O1908 +)
  7367. Retracting propose*predict-yes
  7368. -->
  7369. (O1907 ^name predict-yes +)
  7370. (S1 ^operator O1907 +)
  7371. Retracting elaborate*reward*based*on*reward
  7372. -->
  7373. (R957 ^value 1 +)
  7374. (R1 ^reward R957 +)
  7375. Retracting elaborate*copy-dir-to-output-link
  7376. -->
  7377. (I3 ^dir U +)
  7378. Retracting rl*prefer*rvt*predict-no*H0*2
  7379. -->
  7380. (S1 ^operator O1908 = 1.)
  7381. Retracting rl*prefer*rvt*predict-yes*H0*1
  7382. -->
  7383. (S1 ^operator O1907 = 0.)
  7384. =>WM: (13391: S1 ^operator O1910 +)
  7385. =>WM: (13390: S1 ^operator O1909 +)
  7386. =>WM: (13389: I3 ^dir L)
  7387. =>WM: (13388: O1910 ^name predict-no)
  7388. =>WM: (13387: O1909 ^name predict-yes)
  7389. =>WM: (13386: R958 ^value 1)
  7390. =>WM: (13385: R1 ^reward R958)
  7391. =>WM: (13384: I3 ^see 0)
  7392. <=WM: (13375: S1 ^operator O1907 +)
  7393. <=WM: (13376: S1 ^operator O1908 +)
  7394. <=WM: (13377: S1 ^operator O1908)
  7395. <=WM: (13374: I3 ^dir U)
  7396. <=WM: (13370: R1 ^reward R957)
  7397. <=WM: (13369: I3 ^see 1)
  7398. <=WM: (13373: O1908 ^name predict-no)
  7399. <=WM: (13372: O1907 ^name predict-yes)
  7400. <=WM: (13371: R957 ^value 1)
  7401. --- Inner Elaboration Phase, active level 1 (S1) ---
  7402. Firing prefer*rvt*predict-yes*H0
  7403. -->
  7404. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  7405. -->
  7406. (S1 ^operator O1909 = 0.6093893278107597)
  7407. Firing rl*prefer*rvt*predict-yes*H0*3
  7408. -->
  7409. (S1 ^operator O1909 = 0.3908143935841644)
  7410. Firing prefer*rvt*predict-yes*H0*3*H1
  7411. -->
  7412. Firing prefer*rvt*predict-no*H0
  7413. -->
  7414. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  7415. -->
  7416. (S1 ^operator O1910 = -0.168718511744511)
  7417. Firing rl*prefer*rvt*predict-no*H0*4
  7418. -->
  7419. (S1 ^operator O1910 = 0.3145217607813431)
  7420. Firing prefer*rvt*predict-no*H0*4*H1
  7421. -->
  7422. inner elaboration loop at bottom goal.
  7423. Retracting rl*prefer*rvt*predict-no*H0*4
  7424. -->
  7425. (S1 ^operator O1908 = 0.3145217607813431)
  7426. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  7427. -->
  7428. (S1 ^operator O1908 = -0.168718511744511)
  7429. Retracting rl*prefer*rvt*predict-yes*H0*3
  7430. -->
  7431. (S1 ^operator O1907 = 0.3908143935841644)
  7432. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  7433. -->
  7434. (S1 ^operator O1907 = 0.6093893278107597)
  7435. --- END Proposal Phase ---
  7436. --- Decision Phase ---
  7437. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7438. =>WM: (13392: S1 ^operator O1909)
  7439. 955: O: O1909 (predict-yes)
  7440. --- END Decision Phase ---
  7441. --- Application Phase ---
  7442. --- Firing Productions (PE) For State At Depth 1 ---
  7443. --- Inner Elaboration Phase, active level 1 (S1) ---
  7444. Firing apply*operator
  7445. -->
  7446. (I3 ^predict-yes N955 + :O )
  7447. Firing apply*operator*complete
  7448. -->
  7449. (I3 ^predict-no N954 - :O )
  7450. inner elaboration loop at bottom goal.
  7451. --- Change Working Memory (PE) ---
  7452. =>WM: (13393: I3 ^predict-yes N955)
  7453. <=WM: (13379: N954 ^status complete)
  7454. <=WM: (13378: I3 ^predict-no N954)
  7455. --- Firing Productions (IE) For State At Depth 1 ---
  7456. --- Inner Elaboration Phase, active level 1 (S1) ---
  7457. Firing monitor*world
  7458. -->
  7459. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7460. --- Change Working Memory (IE) ---
  7461. --- END Application Phase ---
  7462. --- Output Phase ---
  7463. ENV: Agent did: predict-yes for direction L in state State-B
  7464. In State-B moving L
  7465. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  7466. predict error 0
  7467. dir: dir isU
  7468. --- END Output Phase ---
  7469. |\---- Input Phase ---
  7470. =>WM: (13397: I2 ^dir U)
  7471. =>WM: (13396: I2 ^reward 1)
  7472. =>WM: (13395: I2 ^see 1)
  7473. =>WM: (13394: N955 ^status complete)
  7474. <=WM: (13382: I2 ^dir L)
  7475. <=WM: (13381: I2 ^reward 1)
  7476. <=WM: (13380: I2 ^see 0)
  7477. =>WM: (13398: I2 ^level-1 L1-root)
  7478. <=WM: (13383: I2 ^level-1 R1-root)
  7479. --- END Input Phase ---
  7480. --- Proposal Phase ---
  7481. --- Inner Elaboration Phase, active level 1 (S1) ---
  7482. Firing elaborate*copy-see-to-output-link
  7483. -->
  7484. (I3 ^see 1 +)
  7485. Firing elaborate*reward*based*on*reward
  7486. -->
  7487. (R959 ^value 1 +)
  7488. (R1 ^reward R959 +)
  7489. Firing propose*predict-yes
  7490. -->
  7491. (O1911 ^name predict-yes +)
  7492. (S1 ^operator O1911 +)
  7493. Firing propose*predict-no
  7494. -->
  7495. (O1912 ^name predict-no +)
  7496. (S1 ^operator O1912 +)
  7497. Firing rl*prefer*rvt*predict-no*H0*2
  7498. -->
  7499. (S1 ^operator O1910 = 1.)
  7500. Firing rl*prefer*rvt*predict-yes*H0*1
  7501. -->
  7502. (S1 ^operator O1909 = 0.)
  7503. Firing prefer*rvt*predict-yes*H0
  7504. -->
  7505. Firing prefer*rvt*predict-no*H0
  7506. -->
  7507. Firing elaborate*copy-dir-to-output-link
  7508. -->
  7509. (I3 ^dir U +)
  7510. inner elaboration loop at bottom goal.
  7511. Retracting elaborate*copy-see-to-output-link
  7512. -->
  7513. (I3 ^see 0 +)
  7514. Retracting propose*predict-no
  7515. -->
  7516. (O1910 ^name predict-no +)
  7517. (S1 ^operator O1910 +)
  7518. Retracting propose*predict-yes
  7519. -->
  7520. (O1909 ^name predict-yes +)
  7521. (S1 ^operator O1909 +)
  7522. Retracting elaborate*reward*based*on*reward
  7523. -->
  7524. (R958 ^value 1 +)
  7525. (R1 ^reward R958 +)
  7526. Retracting elaborate*copy-dir-to-output-link
  7527. -->
  7528. (I3 ^dir L +)
  7529. Retracting rl*prefer*rvt*predict-no*H0*4
  7530. -->
  7531. (S1 ^operator O1910 = 0.3145217607813431)
  7532. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  7533. -->
  7534. (S1 ^operator O1910 = -0.168718511744511)
  7535. Retracting rl*prefer*rvt*predict-yes*H0*3
  7536. -->
  7537. (S1 ^operator O1909 = 0.3908143935841644)
  7538. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  7539. -->
  7540. (S1 ^operator O1909 = 0.6093893278107597)
  7541. =>WM: (13406: S1 ^operator O1912 +)
  7542. =>WM: (13405: S1 ^operator O1911 +)
  7543. =>WM: (13404: I3 ^dir U)
  7544. =>WM: (13403: O1912 ^name predict-no)
  7545. =>WM: (13402: O1911 ^name predict-yes)
  7546. =>WM: (13401: R959 ^value 1)
  7547. =>WM: (13400: R1 ^reward R959)
  7548. =>WM: (13399: I3 ^see 1)
  7549. <=WM: (13390: S1 ^operator O1909 +)
  7550. <=WM: (13392: S1 ^operator O1909)
  7551. <=WM: (13391: S1 ^operator O1910 +)
  7552. <=WM: (13389: I3 ^dir L)
  7553. <=WM: (13385: R1 ^reward R958)
  7554. <=WM: (13384: I3 ^see 0)
  7555. <=WM: (13388: O1910 ^name predict-no)
  7556. <=WM: (13387: O1909 ^name predict-yes)
  7557. <=WM: (13386: R958 ^value 1)
  7558. --- Inner Elaboration Phase, active level 1 (S1) ---
  7559. Firing prefer*rvt*predict-yes*H0
  7560. -->
  7561. Firing rl*prefer*rvt*predict-yes*H0*1
  7562. -->
  7563. (S1 ^operator O1911 = 0.)
  7564. Firing prefer*rvt*predict-no*H0
  7565. -->
  7566. Firing rl*prefer*rvt*predict-no*H0*2
  7567. -->
  7568. (S1 ^operator O1912 = 1.)
  7569. inner elaboration loop at bottom goal.
  7570. Retracting rl*prefer*rvt*predict-no*H0*2
  7571. -->
  7572. (S1 ^operator O1910 = 1.)
  7573. Retracting rl*prefer*rvt*predict-yes*H0*1
  7574. -->
  7575. (S1 ^operator O1909 = 0.)
  7576. --- END Proposal Phase ---
  7577. --- Decision Phase ---
  7578. RL update rl*prefer*rvt*predict-yes*H0*3 0.472355 -0.0815405 0.390814 -> 0.47234 -0.081543 0.390797(R,m,v=1,0.940789,0.0560735)
  7579. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527819 0.0815706 0.609389 -> 0.527802 0.0815677 0.60937(R,m,v=1,1,0)
  7580. =>WM: (13407: S1 ^operator O1912)
  7581. 956: O: O1912 (predict-no)
  7582. --- END Decision Phase ---
  7583. --- Application Phase ---
  7584. --- Firing Productions (PE) For State At Depth 1 ---
  7585. --- Inner Elaboration Phase, active level 1 (S1) ---
  7586. Firing apply*operator
  7587. -->
  7588. (I3 ^predict-no N956 + :O )
  7589. Firing apply*operator*complete
  7590. -->
  7591. (I3 ^predict-yes N955 - :O )
  7592. inner elaboration loop at bottom goal.
  7593. --- Change Working Memory (PE) ---
  7594. =>WM: (13408: I3 ^predict-no N956)
  7595. <=WM: (13394: N955 ^status complete)
  7596. <=WM: (13393: I3 ^predict-yes N955)
  7597. --- Firing Productions (IE) For State At Depth 1 ---
  7598. --- Inner Elaboration Phase, active level 1 (S1) ---
  7599. Firing monitor*world
  7600. -->
  7601. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7602. --- Change Working Memory (IE) ---
  7603. --- END Application Phase ---
  7604. --- Output Phase ---
  7605. ENV: Agent did: predict-no for direction U in state State-A
  7606. In State-A moving U
  7607. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7608. predict error 0
  7609. dir: dir isL
  7610. --- END Output Phase ---
  7611. /|\--- Input Phase ---
  7612. =>WM: (13412: I2 ^dir L)
  7613. =>WM: (13411: I2 ^reward 1)
  7614. =>WM: (13410: I2 ^see 0)
  7615. =>WM: (13409: N956 ^status complete)
  7616. <=WM: (13397: I2 ^dir U)
  7617. <=WM: (13396: I2 ^reward 1)
  7618. <=WM: (13395: I2 ^see 1)
  7619. =>WM: (13413: I2 ^level-1 L1-root)
  7620. <=WM: (13398: I2 ^level-1 L1-root)
  7621. --- END Input Phase ---
  7622. --- Proposal Phase ---
  7623. --- Inner Elaboration Phase, active level 1 (S1) ---
  7624. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  7625. -->
  7626. (S1 ^operator O1911 = -0.2062723012911647)
  7627. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  7628. -->
  7629. (S1 ^operator O1912 = 0.6855673437364445)
  7630. Firing prefer*rvt*predict-no*H0*4*H1
  7631. -->
  7632. Firing prefer*rvt*predict-yes*H0*3*H1
  7633. -->
  7634. Firing elaborate*copy-see-to-output-link
  7635. -->
  7636. (I3 ^see 0 +)
  7637. Firing elaborate*reward*based*on*reward
  7638. -->
  7639. (R960 ^value 1 +)
  7640. (R1 ^reward R960 +)
  7641. Firing propose*predict-yes
  7642. -->
  7643. (O1913 ^name predict-yes +)
  7644. (S1 ^operator O1913 +)
  7645. Firing propose*predict-no
  7646. -->
  7647. (O1914 ^name predict-no +)
  7648. (S1 ^operator O1914 +)
  7649. Firing rl*prefer*rvt*predict-no*H0*4
  7650. -->
  7651. (S1 ^operator O1912 = 0.3145217607813431)
  7652. Firing rl*prefer*rvt*predict-yes*H0*3
  7653. -->
  7654. (S1 ^operator O1911 = 0.3907974841024591)
  7655. Firing prefer*rvt*predict-yes*H0
  7656. -->
  7657. Firing prefer*rvt*predict-no*H0
  7658. -->
  7659. Firing elaborate*copy-dir-to-output-link
  7660. -->
  7661. (I3 ^dir L +)
  7662. inner elaboration loop at bottom goal.
  7663. Retracting elaborate*copy-see-to-output-link
  7664. -->
  7665. (I3 ^see 1 +)
  7666. Retracting propose*predict-no
  7667. -->
  7668. (O1912 ^name predict-no +)
  7669. (S1 ^operator O1912 +)
  7670. Retracting propose*predict-yes
  7671. -->
  7672. (O1911 ^name predict-yes +)
  7673. (S1 ^operator O1911 +)
  7674. Retracting elaborate*reward*based*on*reward
  7675. -->
  7676. (R959 ^value 1 +)
  7677. (R1 ^reward R959 +)
  7678. Retracting elaborate*copy-dir-to-output-link
  7679. -->
  7680. (I3 ^dir U +)
  7681. Retracting rl*prefer*rvt*predict-no*H0*2
  7682. -->
  7683. (S1 ^operator O1912 = 1.)
  7684. Retracting rl*prefer*rvt*predict-yes*H0*1
  7685. -->
  7686. (S1 ^operator O1911 = 0.)
  7687. =>WM: (13421: S1 ^operator O1914 +)
  7688. =>WM: (13420: S1 ^operator O1913 +)
  7689. =>WM: (13419: I3 ^dir L)
  7690. =>WM: (13418: O1914 ^name predict-no)
  7691. =>WM: (13417: O1913 ^name predict-yes)
  7692. =>WM: (13416: R960 ^value 1)
  7693. =>WM: (13415: R1 ^reward R960)
  7694. =>WM: (13414: I3 ^see 0)
  7695. <=WM: (13405: S1 ^operator O1911 +)
  7696. <=WM: (13406: S1 ^operator O1912 +)
  7697. <=WM: (13407: S1 ^operator O1912)
  7698. <=WM: (13404: I3 ^dir U)
  7699. <=WM: (13400: R1 ^reward R959)
  7700. <=WM: (13399: I3 ^see 1)
  7701. <=WM: (13403: O1912 ^name predict-no)
  7702. <=WM: (13402: O1911 ^name predict-yes)
  7703. <=WM: (13401: R959 ^value 1)
  7704. --- Inner Elaboration Phase, active level 1 (S1) ---
  7705. Firing prefer*rvt*predict-yes*H0
  7706. -->
  7707. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  7708. -->
  7709. (S1 ^operator O1913 = -0.2062723012911647)
  7710. Firing rl*prefer*rvt*predict-yes*H0*3
  7711. -->
  7712. (S1 ^operator O1913 = 0.3907974841024591)
  7713. Firing prefer*rvt*predict-yes*H0*3*H1
  7714. -->
  7715. Firing prefer*rvt*predict-no*H0
  7716. -->
  7717. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  7718. -->
  7719. (S1 ^operator O1914 = 0.6855673437364445)
  7720. Firing rl*prefer*rvt*predict-no*H0*4
  7721. -->
  7722. (S1 ^operator O1914 = 0.3145217607813431)
  7723. Firing prefer*rvt*predict-no*H0*4*H1
  7724. -->
  7725. inner elaboration loop at bottom goal.
  7726. Retracting rl*prefer*rvt*predict-no*H0*4
  7727. -->
  7728. (S1 ^operator O1912 = 0.3145217607813431)
  7729. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  7730. -->
  7731. (S1 ^operator O1912 = 0.6855673437364445)
  7732. Retracting rl*prefer*rvt*predict-yes*H0*3
  7733. -->
  7734. (S1 ^operator O1911 = 0.3907974841024591)
  7735. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  7736. -->
  7737. (S1 ^operator O1911 = -0.2062723012911647)
  7738. --- END Proposal Phase ---
  7739. --- Decision Phase ---
  7740. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7741. =>WM: (13422: S1 ^operator O1914)
  7742. 957: O: O1914 (predict-no)
  7743. --- END Decision Phase ---
  7744. --- Application Phase ---
  7745. --- Firing Productions (PE) For State At Depth 1 ---
  7746. --- Inner Elaboration Phase, active level 1 (S1) ---
  7747. Firing apply*operator
  7748. -->
  7749. (I3 ^predict-no N957 + :O )
  7750. Firing apply*operator*complete
  7751. -->
  7752. (I3 ^predict-no N956 - :O )
  7753. inner elaboration loop at bottom goal.
  7754. --- Change Working Memory (PE) ---
  7755. =>WM: (13423: I3 ^predict-no N957)
  7756. <=WM: (13409: N956 ^status complete)
  7757. <=WM: (13408: I3 ^predict-no N956)
  7758. --- Firing Productions (IE) For State At Depth 1 ---
  7759. --- Inner Elaboration Phase, active level 1 (S1) ---
  7760. Firing monitor*world
  7761. -->
  7762. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7763. --- Change Working Memory (IE) ---
  7764. --- END Application Phase ---
  7765. --- Output Phase ---
  7766. ENV: Agent did: predict-no for direction L in state State-A
  7767. In State-A moving L
  7768. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7769. predict error 0
  7770. dir: dir isR
  7771. --- END Output Phase ---
  7772. -/|--- Input Phase ---
  7773. =>WM: (13427: I2 ^dir R)
  7774. =>WM: (13426: I2 ^reward 1)
  7775. =>WM: (13425: I2 ^see 0)
  7776. =>WM: (13424: N957 ^status complete)
  7777. <=WM: (13412: I2 ^dir L)
  7778. <=WM: (13411: I2 ^reward 1)
  7779. <=WM: (13410: I2 ^see 0)
  7780. =>WM: (13428: I2 ^level-1 L0-root)
  7781. <=WM: (13413: I2 ^level-1 L1-root)
  7782. --- END Input Phase ---
  7783. --- Proposal Phase ---
  7784. --- Inner Elaboration Phase, active level 1 (S1) ---
  7785. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  7786. -->
  7787. (S1 ^operator O1913 = 0.8783894024939338)
  7788. Firing prefer*rvt*predict-yes*H0*5*H1
  7789. -->
  7790. Firing elaborate*copy-see-to-output-link
  7791. -->
  7792. (I3 ^see 0 +)
  7793. Firing elaborate*reward*based*on*reward
  7794. -->
  7795. (R961 ^value 1 +)
  7796. (R1 ^reward R961 +)
  7797. Firing propose*predict-yes
  7798. -->
  7799. (O1915 ^name predict-yes +)
  7800. (S1 ^operator O1915 +)
  7801. Firing propose*predict-no
  7802. -->
  7803. (O1916 ^name predict-no +)
  7804. (S1 ^operator O1916 +)
  7805. Firing rl*prefer*rvt*predict-no*H0*6
  7806. -->
  7807. (S1 ^operator O1914 = 0.999977424773942)
  7808. Firing rl*prefer*rvt*predict-yes*H0*5
  7809. -->
  7810. (S1 ^operator O1913 = 0.1215965434178113)
  7811. Firing prefer*rvt*predict-yes*H0
  7812. -->
  7813. Firing prefer*rvt*predict-no*H0
  7814. -->
  7815. Firing elaborate*copy-dir-to-output-link
  7816. -->
  7817. (I3 ^dir R +)
  7818. inner elaboration loop at bottom goal.
  7819. Retracting elaborate*copy-see-to-output-link
  7820. -->
  7821. (I3 ^see 0 +)
  7822. Retracting propose*predict-no
  7823. -->
  7824. (O1914 ^name predict-no +)
  7825. (S1 ^operator O1914 +)
  7826. Retracting propose*predict-yes
  7827. -->
  7828. (O1913 ^name predict-yes +)
  7829. (S1 ^operator O1913 +)
  7830. Retracting elaborate*reward*based*on*reward
  7831. -->
  7832. (R960 ^value 1 +)
  7833. (R1 ^reward R960 +)
  7834. Retracting elaborate*copy-dir-to-output-link
  7835. -->
  7836. (I3 ^dir L +)
  7837. Retracting rl*prefer*rvt*predict-no*H0*4
  7838. -->
  7839. (S1 ^operator O1914 = 0.3145217607813431)
  7840. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  7841. -->
  7842. (S1 ^operator O1914 = 0.6855673437364445)
  7843. Retracting rl*prefer*rvt*predict-yes*H0*3
  7844. -->
  7845. (S1 ^operator O1913 = 0.3907974841024591)
  7846. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  7847. -->
  7848. (S1 ^operator O1913 = -0.2062723012911647)
  7849. =>WM: (13435: S1 ^operator O1916 +)
  7850. =>WM: (13434: S1 ^operator O1915 +)
  7851. =>WM: (13433: I3 ^dir R)
  7852. =>WM: (13432: O1916 ^name predict-no)
  7853. =>WM: (13431: O1915 ^name predict-yes)
  7854. =>WM: (13430: R961 ^value 1)
  7855. =>WM: (13429: R1 ^reward R961)
  7856. <=WM: (13420: S1 ^operator O1913 +)
  7857. <=WM: (13421: S1 ^operator O1914 +)
  7858. <=WM: (13422: S1 ^operator O1914)
  7859. <=WM: (13419: I3 ^dir L)
  7860. <=WM: (13415: R1 ^reward R960)
  7861. <=WM: (13418: O1914 ^name predict-no)
  7862. <=WM: (13417: O1913 ^name predict-yes)
  7863. <=WM: (13416: R960 ^value 1)
  7864. --- Inner Elaboration Phase, active level 1 (S1) ---
  7865. Firing prefer*rvt*predict-yes*H0
  7866. -->
  7867. Firing rl*prefer*rvt*predict-yes*H0*5
  7868. -->
  7869. (S1 ^operator O1915 = 0.1215965434178113)
  7870. Firing prefer*rvt*predict-yes*H0*5*H1
  7871. -->
  7872. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  7873. -->
  7874. (S1 ^operator O1915 = 0.8783894024939338)
  7875. Firing prefer*rvt*predict-no*H0
  7876. -->
  7877. Firing rl*prefer*rvt*predict-no*H0*6
  7878. -->
  7879. (S1 ^operator O1916 = 0.999977424773942)
  7880. inner elaboration loop at bottom goal.
  7881. Retracting rl*prefer*rvt*predict-no*H0*6
  7882. -->
  7883. (S1 ^operator O1914 = 0.999977424773942)
  7884. Retracting rl*prefer*rvt*predict-yes*H0*5
  7885. -->
  7886. (S1 ^operator O1913 = 0.1215965434178113)
  7887. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  7888. -->
  7889. (S1 ^operator O1913 = 0.8783894024939338)
  7890. --- END Proposal Phase ---
  7891. --- Decision Phase ---
  7892. RL update rl*prefer*rvt*predict-no*H0*4 0.478568 -0.164047 0.314522 -> 0.478562 -0.164047 0.314514(R,m,v=1,0.918919,0.0750138)
  7893. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521513 0.164055 0.685567 -> 0.521505 0.164054 0.685559(R,m,v=1,1,0)
  7894. =>WM: (13436: S1 ^operator O1915)
  7895. 958: O: O1915 (predict-yes)
  7896. --- END Decision Phase ---
  7897. --- Application Phase ---
  7898. --- Firing Productions (PE) For State At Depth 1 ---
  7899. --- Inner Elaboration Phase, active level 1 (S1) ---
  7900. Firing apply*operator
  7901. -->
  7902. (I3 ^predict-yes N958 + :O )
  7903. Firing apply*operator*complete
  7904. -->
  7905. (I3 ^predict-no N957 - :O )
  7906. inner elaboration loop at bottom goal.
  7907. --- Change Working Memory (PE) ---
  7908. =>WM: (13437: I3 ^predict-yes N958)
  7909. <=WM: (13424: N957 ^status complete)
  7910. <=WM: (13423: I3 ^predict-no N957)
  7911. --- Firing Productions (IE) For State At Depth 1 ---
  7912. --- Inner Elaboration Phase, active level 1 (S1) ---
  7913. Firing monitor*world
  7914. -->
  7915. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7916. --- Change Working Memory (IE) ---
  7917. --- END Application Phase ---
  7918. --- Output Phase ---
  7919. ENV: Agent did: predict-yes for direction R in state State-A
  7920. In State-A moving R
  7921. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  7922. predict error 0
  7923. dir: dir isR
  7924. --- END Output Phase ---
  7925. \-/--- Input Phase ---
  7926. =>WM: (13441: I2 ^dir R)
  7927. =>WM: (13440: I2 ^reward 1)
  7928. =>WM: (13439: I2 ^see 1)
  7929. =>WM: (13438: N958 ^status complete)
  7930. <=WM: (13427: I2 ^dir R)
  7931. <=WM: (13426: I2 ^reward 1)
  7932. <=WM: (13425: I2 ^see 0)
  7933. =>WM: (13442: I2 ^level-1 R1-root)
  7934. <=WM: (13428: I2 ^level-1 L0-root)
  7935. --- END Input Phase ---
  7936. --- Proposal Phase ---
  7937. --- Inner Elaboration Phase, active level 1 (S1) ---
  7938. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  7939. -->
  7940. (S1 ^operator O1915 = -0.04253361215288998)
  7941. Firing prefer*rvt*predict-yes*H0*5*H1
  7942. -->
  7943. Firing elaborate*copy-see-to-output-link
  7944. -->
  7945. (I3 ^see 1 +)
  7946. Firing elaborate*reward*based*on*reward
  7947. -->
  7948. (R962 ^value 1 +)
  7949. (R1 ^reward R962 +)
  7950. Firing propose*predict-yes
  7951. -->
  7952. (O1917 ^name predict-yes +)
  7953. (S1 ^operator O1917 +)
  7954. Firing propose*predict-no
  7955. -->
  7956. (O1918 ^name predict-no +)
  7957. (S1 ^operator O1918 +)
  7958. Firing rl*prefer*rvt*predict-no*H0*6
  7959. -->
  7960. (S1 ^operator O1916 = 0.999977424773942)
  7961. Firing rl*prefer*rvt*predict-yes*H0*5
  7962. -->
  7963. (S1 ^operator O1915 = 0.1215965434178113)
  7964. Firing prefer*rvt*predict-yes*H0
  7965. -->
  7966. Firing prefer*rvt*predict-no*H0
  7967. -->
  7968. Firing elaborate*copy-dir-to-output-link
  7969. -->
  7970. (I3 ^dir R +)
  7971. inner elaboration loop at bottom goal.
  7972. Retracting elaborate*copy-see-to-output-link
  7973. -->
  7974. (I3 ^see 0 +)
  7975. Retracting propose*predict-no
  7976. -->
  7977. (O1916 ^name predict-no +)
  7978. (S1 ^operator O1916 +)
  7979. Retracting propose*predict-yes
  7980. -->
  7981. (O1915 ^name predict-yes +)
  7982. (S1 ^operator O1915 +)
  7983. Retracting elaborate*reward*based*on*reward
  7984. -->
  7985. (R961 ^value 1 +)
  7986. (R1 ^reward R961 +)
  7987. Retracting elaborate*copy-dir-to-output-link
  7988. -->
  7989. (I3 ^dir R +)
  7990. Retracting rl*prefer*rvt*predict-no*H0*6
  7991. -->
  7992. (S1 ^operator O1916 = 0.999977424773942)
  7993. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  7994. -->
  7995. (S1 ^operator O1915 = 0.8783894024939338)
  7996. Retracting rl*prefer*rvt*predict-yes*H0*5
  7997. -->
  7998. (S1 ^operator O1915 = 0.1215965434178113)
  7999. =>WM: (13449: S1 ^operator O1918 +)
  8000. =>WM: (13448: S1 ^operator O1917 +)
  8001. =>WM: (13447: O1918 ^name predict-no)
  8002. =>WM: (13446: O1917 ^name predict-yes)
  8003. =>WM: (13445: R962 ^value 1)
  8004. =>WM: (13444: R1 ^reward R962)
  8005. =>WM: (13443: I3 ^see 1)
  8006. <=WM: (13434: S1 ^operator O1915 +)
  8007. <=WM: (13436: S1 ^operator O1915)
  8008. <=WM: (13435: S1 ^operator O1916 +)
  8009. <=WM: (13429: R1 ^reward R961)
  8010. <=WM: (13414: I3 ^see 0)
  8011. <=WM: (13432: O1916 ^name predict-no)
  8012. <=WM: (13431: O1915 ^name predict-yes)
  8013. <=WM: (13430: R961 ^value 1)
  8014. --- Inner Elaboration Phase, active level 1 (S1) ---
  8015. Firing prefer*rvt*predict-yes*H0
  8016. -->
  8017. Firing rl*prefer*rvt*predict-yes*H0*5
  8018. -->
  8019. (S1 ^operator O1917 = 0.1215965434178113)
  8020. Firing prefer*rvt*predict-yes*H0*5*H1
  8021. -->
  8022. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  8023. -->
  8024. (S1 ^operator O1917 = -0.04253361215288998)
  8025. Firing prefer*rvt*predict-no*H0
  8026. -->
  8027. Firing rl*prefer*rvt*predict-no*H0*6
  8028. -->
  8029. (S1 ^operator O1918 = 0.999977424773942)
  8030. inner elaboration loop at bottom goal.
  8031. Retracting rl*prefer*rvt*predict-no*H0*6
  8032. -->
  8033. (S1 ^operator O1916 = 0.999977424773942)
  8034. Retracting rl*prefer*rvt*predict-yes*H0*5
  8035. -->
  8036. (S1 ^operator O1915 = 0.1215965434178113)
  8037. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  8038. -->
  8039. (S1 ^operator O1915 = -0.04253361215288998)
  8040. --- END Proposal Phase ---
  8041. --- Decision Phase ---
  8042. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.857988,0.12257)
  8043. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465465 0.412924 0.878389 -> 0.465467 0.412924 0.878391(R,m,v=1,1,0)
  8044. =>WM: (13450: S1 ^operator O1918)
  8045. 959: O: O1918 (predict-no)
  8046. --- END Decision Phase ---
  8047. --- Application Phase ---
  8048. --- Firing Productions (PE) For State At Depth 1 ---
  8049. --- Inner Elaboration Phase, active level 1 (S1) ---
  8050. Firing apply*operator
  8051. -->
  8052. (I3 ^predict-no N959 + :O )
  8053. Firing apply*operator*complete
  8054. -->
  8055. (I3 ^predict-yes N958 - :O )
  8056. inner elaboration loop at bottom goal.
  8057. --- Change Working Memory (PE) ---
  8058. =>WM: (13451: I3 ^predict-no N959)
  8059. <=WM: (13438: N958 ^status complete)
  8060. <=WM: (13437: I3 ^predict-yes N958)
  8061. --- Firing Productions (IE) For State At Depth 1 ---
  8062. --- Inner Elaboration Phase, active level 1 (S1) ---
  8063. Firing monitor*world
  8064. -->
  8065. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8066. --- Change Working Memory (IE) ---
  8067. --- END Application Phase ---
  8068. --- Output Phase ---
  8069. ENV: Agent did: predict-no for direction R in state State-B
  8070. In State-B moving R
  8071. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8072. predict error 0
  8073. dir: dir isL
  8074. --- END Output Phase ---
  8075. |\---- Input Phase ---
  8076. =>WM: (13455: I2 ^dir L)
  8077. =>WM: (13454: I2 ^reward 1)
  8078. =>WM: (13453: I2 ^see 0)
  8079. =>WM: (13452: N959 ^status complete)
  8080. <=WM: (13441: I2 ^dir R)
  8081. <=WM: (13440: I2 ^reward 1)
  8082. <=WM: (13439: I2 ^see 1)
  8083. =>WM: (13456: I2 ^level-1 R0-root)
  8084. <=WM: (13442: I2 ^level-1 R1-root)
  8085. --- END Input Phase ---
  8086. --- Proposal Phase ---
  8087. --- Inner Elaboration Phase, active level 1 (S1) ---
  8088. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  8089. -->
  8090. (S1 ^operator O1918 = -0.1984300550322165)
  8091. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  8092. -->
  8093. (S1 ^operator O1917 = 0.6090773459257411)
  8094. Firing prefer*rvt*predict-no*H0*4*H1
  8095. -->
  8096. Firing prefer*rvt*predict-yes*H0*3*H1
  8097. -->
  8098. Firing elaborate*copy-see-to-output-link
  8099. -->
  8100. (I3 ^see 0 +)
  8101. Firing elaborate*reward*based*on*reward
  8102. -->
  8103. (R963 ^value 1 +)
  8104. (R1 ^reward R963 +)
  8105. Firing propose*predict-yes
  8106. -->
  8107. (O1919 ^name predict-yes +)
  8108. (S1 ^operator O1919 +)
  8109. Firing propose*predict-no
  8110. -->
  8111. (O1920 ^name predict-no +)
  8112. (S1 ^operator O1920 +)
  8113. Firing rl*prefer*rvt*predict-no*H0*4
  8114. -->
  8115. (S1 ^operator O1918 = 0.3145143319532709)
  8116. Firing rl*prefer*rvt*predict-yes*H0*3
  8117. -->
  8118. (S1 ^operator O1917 = 0.3907974841024591)
  8119. Firing prefer*rvt*predict-yes*H0
  8120. -->
  8121. Firing prefer*rvt*predict-no*H0
  8122. -->
  8123. Firing elaborate*copy-dir-to-output-link
  8124. -->
  8125. (I3 ^dir L +)
  8126. inner elaboration loop at bottom goal.
  8127. Retracting elaborate*copy-see-to-output-link
  8128. -->
  8129. (I3 ^see 1 +)
  8130. Retracting propose*predict-no
  8131. -->
  8132. (O1918 ^name predict-no +)
  8133. (S1 ^operator O1918 +)
  8134. Retracting propose*predict-yes
  8135. -->
  8136. (O1917 ^name predict-yes +)
  8137. (S1 ^operator O1917 +)
  8138. Retracting elaborate*reward*based*on*reward
  8139. -->
  8140. (R962 ^value 1 +)
  8141. (R1 ^reward R962 +)
  8142. Retracting elaborate*copy-dir-to-output-link
  8143. -->
  8144. (I3 ^dir R +)
  8145. Retracting rl*prefer*rvt*predict-no*H0*6
  8146. -->
  8147. (S1 ^operator O1918 = 0.999977424773942)
  8148. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  8149. -->
  8150. (S1 ^operator O1917 = -0.04253361215288998)
  8151. Retracting rl*prefer*rvt*predict-yes*H0*5
  8152. -->
  8153. (S1 ^operator O1917 = 0.121597689773478)
  8154. =>WM: (13464: S1 ^operator O1920 +)
  8155. =>WM: (13463: S1 ^operator O1919 +)
  8156. =>WM: (13462: I3 ^dir L)
  8157. =>WM: (13461: O1920 ^name predict-no)
  8158. =>WM: (13460: O1919 ^name predict-yes)
  8159. =>WM: (13459: R963 ^value 1)
  8160. =>WM: (13458: R1 ^reward R963)
  8161. =>WM: (13457: I3 ^see 0)
  8162. <=WM: (13448: S1 ^operator O1917 +)
  8163. <=WM: (13449: S1 ^operator O1918 +)
  8164. <=WM: (13450: S1 ^operator O1918)
  8165. <=WM: (13433: I3 ^dir R)
  8166. <=WM: (13444: R1 ^reward R962)
  8167. <=WM: (13443: I3 ^see 1)
  8168. <=WM: (13447: O1918 ^name predict-no)
  8169. <=WM: (13446: O1917 ^name predict-yes)
  8170. <=WM: (13445: R962 ^value 1)
  8171. --- Inner Elaboration Phase, active level 1 (S1) ---
  8172. Firing prefer*rvt*predict-yes*H0
  8173. -->
  8174. Firing rl*prefer*rvt*predict-yes*H0*3
  8175. -->
  8176. (S1 ^operator O1919 = 0.3907974841024591)
  8177. Firing prefer*rvt*predict-yes*H0*3*H1
  8178. -->
  8179. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  8180. -->
  8181. (S1 ^operator O1919 = 0.6090773459257411)
  8182. Firing prefer*rvt*predict-no*H0
  8183. -->
  8184. Firing rl*prefer*rvt*predict-no*H0*4
  8185. -->
  8186. (S1 ^operator O1920 = 0.3145143319532709)
  8187. Firing prefer*rvt*predict-no*H0*4*H1
  8188. -->
  8189. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  8190. -->
  8191. (S1 ^operator O1920 = -0.1984300550322165)
  8192. inner elaboration loop at bottom goal.
  8193. Retracting rl*prefer*rvt*predict-no*H0*4
  8194. -->
  8195. (S1 ^operator O1918 = 0.3145143319532709)
  8196. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  8197. -->
  8198. (S1 ^operator O1918 = -0.1984300550322165)
  8199. Retracting rl*prefer*rvt*predict-yes*H0*3
  8200. -->
  8201. (S1 ^operator O1917 = 0.3907974841024591)
  8202. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  8203. -->
  8204. (S1 ^operator O1917 = 0.6090773459257411)
  8205. --- END Proposal Phase ---
  8206. --- Decision Phase ---
  8207. RL update rl*prefer*rvt*predict-no*H0*6 0.999977 0 0.999977 -> 0.999981 0 0.999981(R,m,v=1,0.936782,0.0595641)
  8208. =>WM: (13465: S1 ^operator O1919)
  8209. 960: O: O1919 (predict-yes)
  8210. --- END Decision Phase ---
  8211. --- Application Phase ---
  8212. --- Firing Productions (PE) For State At Depth 1 ---
  8213. --- Inner Elaboration Phase, active level 1 (S1) ---
  8214. Firing apply*operator
  8215. -->
  8216. (I3 ^predict-yes N960 + :O )
  8217. Firing apply*operator*complete
  8218. -->
  8219. (I3 ^predict-no N959 - :O )
  8220. inner elaboration loop at bottom goal.
  8221. --- Change Working Memory (PE) ---
  8222. =>WM: (13466: I3 ^predict-yes N960)
  8223. <=WM: (13452: N959 ^status complete)
  8224. <=WM: (13451: I3 ^predict-no N959)
  8225. --- Firing Productions (IE) For State At Depth 1 ---
  8226. --- Inner Elaboration Phase, active level 1 (S1) ---
  8227. Firing monitor*world
  8228. -->
  8229. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8230. --- Change Working Memory (IE) ---
  8231. --- END Application Phase ---
  8232. --- Output Phase ---
  8233. ENV: Agent did: predict-yes for direction L in state State-B
  8234. In State-B moving L
  8235. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  8236. predict error 0
  8237. dir: dir isU
  8238. --- END Output Phase ---
  8239. /|\--- Input Phase ---
  8240. =>WM: (13470: I2 ^dir U)
  8241. =>WM: (13469: I2 ^reward 1)
  8242. =>WM: (13468: I2 ^see 1)
  8243. =>WM: (13467: N960 ^status complete)
  8244. <=WM: (13455: I2 ^dir L)
  8245. <=WM: (13454: I2 ^reward 1)
  8246. <=WM: (13453: I2 ^see 0)
  8247. =>WM: (13471: I2 ^level-1 L1-root)
  8248. <=WM: (13456: I2 ^level-1 R0-root)
  8249. --- END Input Phase ---
  8250. --- Proposal Phase ---
  8251. --- Inner Elaboration Phase, active level 1 (S1) ---
  8252. Firing elaborate*copy-see-to-output-link
  8253. -->
  8254. (I3 ^see 1 +)
  8255. Firing elaborate*reward*based*on*reward
  8256. -->
  8257. (R964 ^value 1 +)
  8258. (R1 ^reward R964 +)
  8259. Firing propose*predict-yes
  8260. -->
  8261. (O1921 ^name predict-yes +)
  8262. (S1 ^operator O1921 +)
  8263. Firing propose*predict-no
  8264. -->
  8265. (O1922 ^name predict-no +)
  8266. (S1 ^operator O1922 +)
  8267. Firing rl*prefer*rvt*predict-no*H0*2
  8268. -->
  8269. (S1 ^operator O1920 = 1.)
  8270. Firing rl*prefer*rvt*predict-yes*H0*1
  8271. -->
  8272. (S1 ^operator O1919 = 0.)
  8273. Firing prefer*rvt*predict-yes*H0
  8274. -->
  8275. Firing prefer*rvt*predict-no*H0
  8276. -->
  8277. Firing elaborate*copy-dir-to-output-link
  8278. -->
  8279. (I3 ^dir U +)
  8280. inner elaboration loop at bottom goal.
  8281. Retracting elaborate*copy-see-to-output-link
  8282. -->
  8283. (I3 ^see 0 +)
  8284. Retracting propose*predict-no
  8285. -->
  8286. (O1920 ^name predict-no +)
  8287. (S1 ^operator O1920 +)
  8288. Retracting propose*predict-yes
  8289. -->
  8290. (O1919 ^name predict-yes +)
  8291. (S1 ^operator O1919 +)
  8292. Retracting elaborate*reward*based*on*reward
  8293. -->
  8294. (R963 ^value 1 +)
  8295. (R1 ^reward R963 +)
  8296. Retracting elaborate*copy-dir-to-output-link
  8297. -->
  8298. (I3 ^dir L +)
  8299. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  8300. -->
  8301. (S1 ^operator O1920 = -0.1984300550322165)
  8302. Retracting rl*prefer*rvt*predict-no*H0*4
  8303. -->
  8304. (S1 ^operator O1920 = 0.3145143319532709)
  8305. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  8306. -->
  8307. (S1 ^operator O1919 = 0.6090773459257411)
  8308. Retracting rl*prefer*rvt*predict-yes*H0*3
  8309. -->
  8310. (S1 ^operator O1919 = 0.3907974841024591)
  8311. =>WM: (13479: S1 ^operator O1922 +)
  8312. =>WM: (13478: S1 ^operator O1921 +)
  8313. =>WM: (13477: I3 ^dir U)
  8314. =>WM: (13476: O1922 ^name predict-no)
  8315. =>WM: (13475: O1921 ^name predict-yes)
  8316. =>WM: (13474: R964 ^value 1)
  8317. =>WM: (13473: R1 ^reward R964)
  8318. =>WM: (13472: I3 ^see 1)
  8319. <=WM: (13463: S1 ^operator O1919 +)
  8320. <=WM: (13465: S1 ^operator O1919)
  8321. <=WM: (13464: S1 ^operator O1920 +)
  8322. <=WM: (13462: I3 ^dir L)
  8323. <=WM: (13458: R1 ^reward R963)
  8324. <=WM: (13457: I3 ^see 0)
  8325. <=WM: (13461: O1920 ^name predict-no)
  8326. <=WM: (13460: O1919 ^name predict-yes)
  8327. <=WM: (13459: R963 ^value 1)
  8328. --- Inner Elaboration Phase, active level 1 (S1) ---
  8329. Firing prefer*rvt*predict-yes*H0
  8330. -->
  8331. Firing rl*prefer*rvt*predict-yes*H0*1
  8332. -->
  8333. (S1 ^operator O1921 = 0.)
  8334. Firing prefer*rvt*predict-no*H0
  8335. -->
  8336. Firing rl*prefer*rvt*predict-no*H0*2
  8337. -->
  8338. (S1 ^operator O1922 = 1.)
  8339. inner elaboration loop at bottom goal.
  8340. Retracting rl*prefer*rvt*predict-no*H0*2
  8341. -->
  8342. (S1 ^operator O1920 = 1.)
  8343. Retracting rl*prefer*rvt*predict-yes*H0*1
  8344. -->
  8345. (S1 ^operator O1919 = 0.)
  8346. --- END Proposal Phase ---
  8347. --- Decision Phase ---
  8348. RL update rl*prefer*rvt*predict-yes*H0*3 0.47234 -0.081543 0.390797 -> 0.472349 -0.0815415 0.390808(R,m,v=1,0.941176,0.0557276)
  8349. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527553 0.0815245 0.609077 -> 0.527563 0.0815262 0.609089(R,m,v=1,1,0)
  8350. =>WM: (13480: S1 ^operator O1922)
  8351. 961: O: O1922 (predict-no)
  8352. --- END Decision Phase ---
  8353. --- Application Phase ---
  8354. --- Firing Productions (PE) For State At Depth 1 ---
  8355. --- Inner Elaboration Phase, active level 1 (S1) ---
  8356. Firing apply*operator
  8357. -->
  8358. (I3 ^predict-no N961 + :O )
  8359. Firing apply*operator*complete
  8360. -->
  8361. (I3 ^predict-yes N960 - :O )
  8362. inner elaboration loop at bottom goal.
  8363. --- Change Working Memory (PE) ---
  8364. =>WM: (13481: I3 ^predict-no N961)
  8365. <=WM: (13467: N960 ^status complete)
  8366. <=WM: (13466: I3 ^predict-yes N960)
  8367. --- Firing Productions (IE) For State At Depth 1 ---
  8368. --- Inner Elaboration Phase, active level 1 (S1) ---
  8369. Firing monitor*world
  8370. -->
  8371. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8372. --- Change Working Memory (IE) ---
  8373. --- END Application Phase ---
  8374. --- Output Phase ---
  8375. ENV: Agent did: predict-no for direction U in state State-A
  8376. In State-A moving U
  8377. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8378. predict error 0
  8379. dir: dir isL
  8380. --- END Output Phase ---
  8381. ---- Input Phase ---
  8382. =>WM: (13485: I2 ^dir L)
  8383. =>WM: (13484: I2 ^reward 1)
  8384. =>WM: (13483: I2 ^see 0)
  8385. =>WM: (13482: N961 ^status complete)
  8386. <=WM: (13470: I2 ^dir U)
  8387. <=WM: (13469: I2 ^reward 1)
  8388. <=WM: (13468: I2 ^see 1)
  8389. =>WM: (13486: I2 ^level-1 L1-root)
  8390. <=WM: (13471: I2 ^level-1 L1-root)
  8391. --- END Input Phase ---
  8392. --- Proposal Phase ---
  8393. --- Inner Elaboration Phase, active level 1 (S1) ---
  8394. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  8395. -->
  8396. (S1 ^operator O1921 = -0.2062723012911647)
  8397. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  8398. -->
  8399. (S1 ^operator O1922 = 0.685558831823503)
  8400. Firing prefer*rvt*predict-no*H0*4*H1
  8401. -->
  8402. Firing prefer*rvt*predict-yes*H0*3*H1
  8403. -->
  8404. Firing elaborate*copy-see-to-output-link
  8405. -->
  8406. (I3 ^see 0 +)
  8407. Firing elaborate*reward*based*on*reward
  8408. -->
  8409. (R965 ^value 1 +)
  8410. (R1 ^reward R965 +)
  8411. Firing propose*predict-yes
  8412. -->
  8413. (O1923 ^name predict-yes +)
  8414. (S1 ^operator O1923 +)
  8415. Firing propose*predict-no
  8416. -->
  8417. (O1924 ^name predict-no +)
  8418. (S1 ^operator O1924 +)
  8419. Firing rl*prefer*rvt*predict-no*H0*4
  8420. -->
  8421. (S1 ^operator O1922 = 0.3145143319532709)
  8422. Firing rl*prefer*rvt*predict-yes*H0*3
  8423. -->
  8424. (S1 ^operator O1921 = 0.390807862285058)
  8425. Firing prefer*rvt*predict-yes*H0
  8426. -->
  8427. Firing prefer*rvt*predict-no*H0
  8428. -->
  8429. Firing elaborate*copy-dir-to-output-link
  8430. -->
  8431. (I3 ^dir L +)
  8432. inner elaboration loop at bottom goal.
  8433. Retracting elaborate*copy-see-to-output-link
  8434. -->
  8435. (I3 ^see 1 +)
  8436. Retracting propose*predict-no
  8437. -->
  8438. (O1922 ^name predict-no +)
  8439. (S1 ^operator O1922 +)
  8440. Retracting propose*predict-yes
  8441. -->
  8442. (O1921 ^name predict-yes +)
  8443. (S1 ^operator O1921 +)
  8444. Retracting elaborate*reward*based*on*reward
  8445. -->
  8446. (R964 ^value 1 +)
  8447. (R1 ^reward R964 +)
  8448. Retracting elaborate*copy-dir-to-output-link
  8449. -->
  8450. (I3 ^dir U +)
  8451. Retracting rl*prefer*rvt*predict-no*H0*2
  8452. -->
  8453. (S1 ^operator O1922 = 1.)
  8454. Retracting rl*prefer*rvt*predict-yes*H0*1
  8455. -->
  8456. (S1 ^operator O1921 = 0.)
  8457. =>WM: (13494: S1 ^operator O1924 +)
  8458. =>WM: (13493: S1 ^operator O1923 +)
  8459. =>WM: (13492: I3 ^dir L)
  8460. =>WM: (13491: O1924 ^name predict-no)
  8461. =>WM: (13490: O1923 ^name predict-yes)
  8462. =>WM: (13489: R965 ^value 1)
  8463. =>WM: (13488: R1 ^reward R965)
  8464. =>WM: (13487: I3 ^see 0)
  8465. <=WM: (13478: S1 ^operator O1921 +)
  8466. <=WM: (13479: S1 ^operator O1922 +)
  8467. <=WM: (13480: S1 ^operator O1922)
  8468. <=WM: (13477: I3 ^dir U)
  8469. <=WM: (13473: R1 ^reward R964)
  8470. <=WM: (13472: I3 ^see 1)
  8471. <=WM: (13476: O1922 ^name predict-no)
  8472. <=WM: (13475: O1921 ^name predict-yes)
  8473. <=WM: (13474: R964 ^value 1)
  8474. --- Inner Elaboration Phase, active level 1 (S1) ---
  8475. Firing prefer*rvt*predict-yes*H0
  8476. -->
  8477. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  8478. -->
  8479. (S1 ^operator O1923 = -0.2062723012911647)
  8480. Firing rl*prefer*rvt*predict-yes*H0*3
  8481. -->
  8482. (S1 ^operator O1923 = 0.390807862285058)
  8483. Firing prefer*rvt*predict-yes*H0*3*H1
  8484. -->
  8485. Firing prefer*rvt*predict-no*H0
  8486. -->
  8487. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  8488. -->
  8489. (S1 ^operator O1924 = 0.685558831823503)
  8490. Firing rl*prefer*rvt*predict-no*H0*4
  8491. -->
  8492. (S1 ^operator O1924 = 0.3145143319532709)
  8493. Firing prefer*rvt*predict-no*H0*4*H1
  8494. -->
  8495. inner elaboration loop at bottom goal.
  8496. Retracting rl*prefer*rvt*predict-no*H0*4
  8497. -->
  8498. (S1 ^operator O1922 = 0.3145143319532709)
  8499. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  8500. -->
  8501. (S1 ^operator O1922 = 0.685558831823503)
  8502. Retracting rl*prefer*rvt*predict-yes*H0*3
  8503. -->
  8504. (S1 ^operator O1921 = 0.390807862285058)
  8505. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  8506. -->
  8507. (S1 ^operator O1921 = -0.2062723012911647)
  8508. --- END Proposal Phase ---
  8509. --- Decision Phase ---
  8510. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8511. =>WM: (13495: S1 ^operator O1924)
  8512. 962: O: O1924 (predict-no)
  8513. --- END Decision Phase ---
  8514. --- Application Phase ---
  8515. --- Firing Productions (PE) For State At Depth 1 ---
  8516. --- Inner Elaboration Phase, active level 1 (S1) ---
  8517. Firing apply*operator
  8518. -->
  8519. (I3 ^predict-no N962 + :O )
  8520. Firing apply*operator*complete
  8521. -->
  8522. (I3 ^predict-no N961 - :O )
  8523. inner elaboration loop at bottom goal.
  8524. --- Change Working Memory (PE) ---
  8525. =>WM: (13496: I3 ^predict-no N962)
  8526. <=WM: (13482: N961 ^status complete)
  8527. <=WM: (13481: I3 ^predict-no N961)
  8528. --- Firing Productions (IE) For State At Depth 1 ---
  8529. --- Inner Elaboration Phase, active level 1 (S1) ---
  8530. Firing monitor*world
  8531. -->
  8532. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8533. --- Change Working Memory (IE) ---
  8534. --- END Application Phase ---
  8535. --- Output Phase ---
  8536. ENV: Agent did: predict-no for direction L in state State-A
  8537. In State-A moving L
  8538. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8539. predict error 0
  8540. dir: dir isU
  8541. --- END Output Phase ---
  8542. /|\--- Input Phase ---
  8543. =>WM: (13500: I2 ^dir U)
  8544. =>WM: (13499: I2 ^reward 1)
  8545. =>WM: (13498: I2 ^see 0)
  8546. =>WM: (13497: N962 ^status complete)
  8547. <=WM: (13485: I2 ^dir L)
  8548. <=WM: (13484: I2 ^reward 1)
  8549. <=WM: (13483: I2 ^see 0)
  8550. =>WM: (13501: I2 ^level-1 L0-root)
  8551. <=WM: (13486: I2 ^level-1 L1-root)
  8552. --- END Input Phase ---
  8553. --- Proposal Phase ---
  8554. --- Inner Elaboration Phase, active level 1 (S1) ---
  8555. Firing elaborate*copy-see-to-output-link
  8556. -->
  8557. (I3 ^see 0 +)
  8558. Firing elaborate*reward*based*on*reward
  8559. -->
  8560. (R966 ^value 1 +)
  8561. (R1 ^reward R966 +)
  8562. Firing propose*predict-yes
  8563. -->
  8564. (O1925 ^name predict-yes +)
  8565. (S1 ^operator O1925 +)
  8566. Firing propose*predict-no
  8567. -->
  8568. (O1926 ^name predict-no +)
  8569. (S1 ^operator O1926 +)
  8570. Firing rl*prefer*rvt*predict-no*H0*2
  8571. -->
  8572. (S1 ^operator O1924 = 1.)
  8573. Firing rl*prefer*rvt*predict-yes*H0*1
  8574. -->
  8575. (S1 ^operator O1923 = 0.)
  8576. Firing prefer*rvt*predict-yes*H0
  8577. -->
  8578. Firing prefer*rvt*predict-no*H0
  8579. -->
  8580. Firing elaborate*copy-dir-to-output-link
  8581. -->
  8582. (I3 ^dir U +)
  8583. inner elaboration loop at bottom goal.
  8584. Retracting elaborate*copy-see-to-output-link
  8585. -->
  8586. (I3 ^see 0 +)
  8587. Retracting propose*predict-no
  8588. -->
  8589. (O1924 ^name predict-no +)
  8590. (S1 ^operator O1924 +)
  8591. Retracting propose*predict-yes
  8592. -->
  8593. (O1923 ^name predict-yes +)
  8594. (S1 ^operator O1923 +)
  8595. Retracting elaborate*reward*based*on*reward
  8596. -->
  8597. (R965 ^value 1 +)
  8598. (R1 ^reward R965 +)
  8599. Retracting elaborate*copy-dir-to-output-link
  8600. -->
  8601. (I3 ^dir L +)
  8602. Retracting rl*prefer*rvt*predict-no*H0*4
  8603. -->
  8604. (S1 ^operator O1924 = 0.3145143319532709)
  8605. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  8606. -->
  8607. (S1 ^operator O1924 = 0.685558831823503)
  8608. Retracting rl*prefer*rvt*predict-yes*H0*3
  8609. -->
  8610. (S1 ^operator O1923 = 0.390807862285058)
  8611. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  8612. -->
  8613. (S1 ^operator O1923 = -0.2062723012911647)
  8614. =>WM: (13508: S1 ^operator O1926 +)
  8615. =>WM: (13507: S1 ^operator O1925 +)
  8616. =>WM: (13506: I3 ^dir U)
  8617. =>WM: (13505: O1926 ^name predict-no)
  8618. =>WM: (13504: O1925 ^name predict-yes)
  8619. =>WM: (13503: R966 ^value 1)
  8620. =>WM: (13502: R1 ^reward R966)
  8621. <=WM: (13493: S1 ^operator O1923 +)
  8622. <=WM: (13494: S1 ^operator O1924 +)
  8623. <=WM: (13495: S1 ^operator O1924)
  8624. <=WM: (13492: I3 ^dir L)
  8625. <=WM: (13488: R1 ^reward R965)
  8626. <=WM: (13491: O1924 ^name predict-no)
  8627. <=WM: (13490: O1923 ^name predict-yes)
  8628. <=WM: (13489: R965 ^value 1)
  8629. --- Inner Elaboration Phase, active level 1 (S1) ---
  8630. Firing prefer*rvt*predict-yes*H0
  8631. -->
  8632. Firing rl*prefer*rvt*predict-yes*H0*1
  8633. -->
  8634. (S1 ^operator O1925 = 0.)
  8635. Firing prefer*rvt*predict-no*H0
  8636. -->
  8637. Firing rl*prefer*rvt*predict-no*H0*2
  8638. -->
  8639. (S1 ^operator O1926 = 1.)
  8640. inner elaboration loop at bottom goal.
  8641. Retracting rl*prefer*rvt*predict-no*H0*2
  8642. -->
  8643. (S1 ^operator O1924 = 1.)
  8644. Retracting rl*prefer*rvt*predict-yes*H0*1
  8645. -->
  8646. (S1 ^operator O1923 = 0.)
  8647. --- END Proposal Phase ---
  8648. --- Decision Phase ---
  8649. RL update rl*prefer*rvt*predict-no*H0*4 0.478562 -0.164047 0.314514 -> 0.478556 -0.164048 0.314508(R,m,v=1,0.919463,0.0745511)
  8650. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521505 0.164054 0.685559 -> 0.521498 0.164053 0.685552(R,m,v=1,1,0)
  8651. =>WM: (13509: S1 ^operator O1926)
  8652. 963: O: O1926 (predict-no)
  8653. --- END Decision Phase ---
  8654. --- Application Phase ---
  8655. --- Firing Productions (PE) For State At Depth 1 ---
  8656. --- Inner Elaboration Phase, active level 1 (S1) ---
  8657. Firing apply*operator
  8658. -->
  8659. (I3 ^predict-no N963 + :O )
  8660. Firing apply*operator*complete
  8661. -->
  8662. (I3 ^predict-no N962 - :O )
  8663. inner elaboration loop at bottom goal.
  8664. --- Change Working Memory (PE) ---
  8665. =>WM: (13510: I3 ^predict-no N963)
  8666. <=WM: (13497: N962 ^status complete)
  8667. <=WM: (13496: I3 ^predict-no N962)
  8668. --- Firing Productions (IE) For State At Depth 1 ---
  8669. --- Inner Elaboration Phase, active level 1 (S1) ---
  8670. Firing monitor*world
  8671. -->
  8672. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8673. --- Change Working Memory (IE) ---
  8674. --- END Application Phase ---
  8675. --- Output Phase ---
  8676. ENV: Agent did: predict-no for direction U in state State-A
  8677. In State-A moving U
  8678. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8679. predict error 0
  8680. dir: dir isU
  8681. --- END Output Phase ---
  8682. -/--- Input Phase ---
  8683. =>WM: (13514: I2 ^dir U)
  8684. =>WM: (13513: I2 ^reward 1)
  8685. =>WM: (13512: I2 ^see 0)
  8686. =>WM: (13511: N963 ^status complete)
  8687. <=WM: (13500: I2 ^dir U)
  8688. <=WM: (13499: I2 ^reward 1)
  8689. <=WM: (13498: I2 ^see 0)
  8690. =>WM: (13515: I2 ^level-1 L0-root)
  8691. <=WM: (13501: I2 ^level-1 L0-root)
  8692. --- END Input Phase ---
  8693. --- Proposal Phase ---
  8694. --- Inner Elaboration Phase, active level 1 (S1) ---
  8695. Firing elaborate*copy-see-to-output-link
  8696. -->
  8697. (I3 ^see 0 +)
  8698. Firing elaborate*reward*based*on*reward
  8699. -->
  8700. (R967 ^value 1 +)
  8701. (R1 ^reward R967 +)
  8702. Firing propose*predict-yes
  8703. -->
  8704. (O1927 ^name predict-yes +)
  8705. (S1 ^operator O1927 +)
  8706. Firing propose*predict-no
  8707. -->
  8708. (O1928 ^name predict-no +)
  8709. (S1 ^operator O1928 +)
  8710. Firing rl*prefer*rvt*predict-no*H0*2
  8711. -->
  8712. (S1 ^operator O1926 = 1.)
  8713. Firing rl*prefer*rvt*predict-yes*H0*1
  8714. -->
  8715. (S1 ^operator O1925 = 0.)
  8716. Firing prefer*rvt*predict-yes*H0
  8717. -->
  8718. Firing prefer*rvt*predict-no*H0
  8719. -->
  8720. Firing elaborate*copy-dir-to-output-link
  8721. -->
  8722. (I3 ^dir U +)
  8723. inner elaboration loop at bottom goal.
  8724. Retracting elaborate*copy-see-to-output-link
  8725. -->
  8726. (I3 ^see 0 +)
  8727. Retracting propose*predict-no
  8728. -->
  8729. (O1926 ^name predict-no +)
  8730. (S1 ^operator O1926 +)
  8731. Retracting propose*predict-yes
  8732. -->
  8733. (O1925 ^name predict-yes +)
  8734. (S1 ^operator O1925 +)
  8735. Retracting elaborate*reward*based*on*reward
  8736. -->
  8737. (R966 ^value 1 +)
  8738. (R1 ^reward R966 +)
  8739. Retracting elaborate*copy-dir-to-output-link
  8740. -->
  8741. (I3 ^dir U +)
  8742. Retracting rl*prefer*rvt*predict-no*H0*2
  8743. -->
  8744. (S1 ^operator O1926 = 1.)
  8745. Retracting rl*prefer*rvt*predict-yes*H0*1
  8746. -->
  8747. (S1 ^operator O1925 = 0.)
  8748. =>WM: (13521: S1 ^operator O1928 +)
  8749. =>WM: (13520: S1 ^operator O1927 +)
  8750. =>WM: (13519: O1928 ^name predict-no)
  8751. =>WM: (13518: O1927 ^name predict-yes)
  8752. =>WM: (13517: R967 ^value 1)
  8753. =>WM: (13516: R1 ^reward R967)
  8754. <=WM: (13507: S1 ^operator O1925 +)
  8755. <=WM: (13508: S1 ^operator O1926 +)
  8756. <=WM: (13509: S1 ^operator O1926)
  8757. <=WM: (13502: R1 ^reward R966)
  8758. <=WM: (13505: O1926 ^name predict-no)
  8759. <=WM: (13504: O1925 ^name predict-yes)
  8760. <=WM: (13503: R966 ^value 1)
  8761. --- Inner Elaboration Phase, active level 1 (S1) ---
  8762. Firing prefer*rvt*predict-yes*H0
  8763. -->
  8764. Firing rl*prefer*rvt*predict-yes*H0*1
  8765. -->
  8766. (S1 ^operator O1927 = 0.)
  8767. Firing prefer*rvt*predict-no*H0
  8768. -->
  8769. Firing rl*prefer*rvt*predict-no*H0*2
  8770. -->
  8771. (S1 ^operator O1928 = 1.)
  8772. inner elaboration loop at bottom goal.
  8773. Retracting rl*prefer*rvt*predict-no*H0*2
  8774. -->
  8775. (S1 ^operator O1926 = 1.)
  8776. Retracting rl*prefer*rvt*predict-yes*H0*1
  8777. -->
  8778. (S1 ^operator O1925 = 0.)
  8779. --- END Proposal Phase ---
  8780. --- Decision Phase ---
  8781. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8782. =>WM: (13522: S1 ^operator O1928)
  8783. 964: O: O1928 (predict-no)
  8784. --- END Decision Phase ---
  8785. --- Application Phase ---
  8786. --- Firing Productions (PE) For State At Depth 1 ---
  8787. --- Inner Elaboration Phase, active level 1 (S1) ---
  8788. Firing apply*operator
  8789. -->
  8790. (I3 ^predict-no N964 + :O )
  8791. Firing apply*operator*complete
  8792. -->
  8793. (I3 ^predict-no N963 - :O )
  8794. inner elaboration loop at bottom goal.
  8795. --- Change Working Memory (PE) ---
  8796. =>WM: (13523: I3 ^predict-no N964)
  8797. <=WM: (13511: N963 ^status complete)
  8798. <=WM: (13510: I3 ^predict-no N963)
  8799. --- Firing Productions (IE) For State At Depth 1 ---
  8800. --- Inner Elaboration Phase, active level 1 (S1) ---
  8801. Firing monitor*world
  8802. -->
  8803. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8804. --- Change Working Memory (IE) ---
  8805. --- END Application Phase ---
  8806. --- Output Phase ---
  8807. ENV: Agent did: predict-no for direction U in state State-A
  8808. In State-A moving U
  8809. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8810. predict error 0
  8811. dir: dir isR
  8812. --- END Output Phase ---
  8813. |\--- Input Phase ---
  8814. =>WM: (13527: I2 ^dir R)
  8815. =>WM: (13526: I2 ^reward 1)
  8816. =>WM: (13525: I2 ^see 0)
  8817. =>WM: (13524: N964 ^status complete)
  8818. <=WM: (13514: I2 ^dir U)
  8819. <=WM: (13513: I2 ^reward 1)
  8820. <=WM: (13512: I2 ^see 0)
  8821. =>WM: (13528: I2 ^level-1 L0-root)
  8822. <=WM: (13515: I2 ^level-1 L0-root)
  8823. --- END Input Phase ---
  8824. --- Proposal Phase ---
  8825. --- Inner Elaboration Phase, active level 1 (S1) ---
  8826. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  8827. -->
  8828. (S1 ^operator O1927 = 0.878390760537652)
  8829. Firing prefer*rvt*predict-yes*H0*5*H1
  8830. -->
  8831. Firing elaborate*copy-see-to-output-link
  8832. -->
  8833. (I3 ^see 0 +)
  8834. Firing elaborate*reward*based*on*reward
  8835. -->
  8836. (R968 ^value 1 +)
  8837. (R1 ^reward R968 +)
  8838. Firing propose*predict-yes
  8839. -->
  8840. (O1929 ^name predict-yes +)
  8841. (S1 ^operator O1929 +)
  8842. Firing propose*predict-no
  8843. -->
  8844. (O1930 ^name predict-no +)
  8845. (S1 ^operator O1930 +)
  8846. Firing rl*prefer*rvt*predict-no*H0*6
  8847. -->
  8848. (S1 ^operator O1928 = 0.9999810901454903)
  8849. Firing rl*prefer*rvt*predict-yes*H0*5
  8850. -->
  8851. (S1 ^operator O1927 = 0.121597689773478)
  8852. Firing prefer*rvt*predict-yes*H0
  8853. -->
  8854. Firing prefer*rvt*predict-no*H0
  8855. -->
  8856. Firing elaborate*copy-dir-to-output-link
  8857. -->
  8858. (I3 ^dir R +)
  8859. inner elaboration loop at bottom goal.
  8860. Retracting elaborate*copy-see-to-output-link
  8861. -->
  8862. (I3 ^see 0 +)
  8863. Retracting propose*predict-no
  8864. -->
  8865. (O1928 ^name predict-no +)
  8866. (S1 ^operator O1928 +)
  8867. Retracting propose*predict-yes
  8868. -->
  8869. (O1927 ^name predict-yes +)
  8870. (S1 ^operator O1927 +)
  8871. Retracting elaborate*reward*based*on*reward
  8872. -->
  8873. (R967 ^value 1 +)
  8874. (R1 ^reward R967 +)
  8875. Retracting elaborate*copy-dir-to-output-link
  8876. -->
  8877. (I3 ^dir U +)
  8878. Retracting rl*prefer*rvt*predict-no*H0*2
  8879. -->
  8880. (S1 ^operator O1928 = 1.)
  8881. Retracting rl*prefer*rvt*predict-yes*H0*1
  8882. -->
  8883. (S1 ^operator O1927 = 0.)
  8884. =>WM: (13535: S1 ^operator O1930 +)
  8885. =>WM: (13534: S1 ^operator O1929 +)
  8886. =>WM: (13533: I3 ^dir R)
  8887. =>WM: (13532: O1930 ^name predict-no)
  8888. =>WM: (13531: O1929 ^name predict-yes)
  8889. =>WM: (13530: R968 ^value 1)
  8890. =>WM: (13529: R1 ^reward R968)
  8891. <=WM: (13520: S1 ^operator O1927 +)
  8892. <=WM: (13521: S1 ^operator O1928 +)
  8893. <=WM: (13522: S1 ^operator O1928)
  8894. <=WM: (13506: I3 ^dir U)
  8895. <=WM: (13516: R1 ^reward R967)
  8896. <=WM: (13519: O1928 ^name predict-no)
  8897. <=WM: (13518: O1927 ^name predict-yes)
  8898. <=WM: (13517: R967 ^value 1)
  8899. --- Inner Elaboration Phase, active level 1 (S1) ---
  8900. Firing prefer*rvt*predict-yes*H0
  8901. -->
  8902. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  8903. -->
  8904. (S1 ^operator O1929 = 0.878390760537652)
  8905. Firing rl*prefer*rvt*predict-yes*H0*5
  8906. -->
  8907. (S1 ^operator O1929 = 0.121597689773478)
  8908. Firing prefer*rvt*predict-yes*H0*5*H1
  8909. -->
  8910. Firing prefer*rvt*predict-no*H0
  8911. -->
  8912. Firing rl*prefer*rvt*predict-no*H0*6
  8913. -->
  8914. (S1 ^operator O1930 = 0.9999810901454903)
  8915. inner elaboration loop at bottom goal.
  8916. Retracting rl*prefer*rvt*predict-no*H0*6
  8917. -->
  8918. (S1 ^operator O1928 = 0.9999810901454903)
  8919. Retracting rl*prefer*rvt*predict-yes*H0*5
  8920. -->
  8921. (S1 ^operator O1927 = 0.121597689773478)
  8922. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  8923. -->
  8924. (S1 ^operator O1927 = 0.878390760537652)
  8925. --- END Proposal Phase ---
  8926. --- Decision Phase ---
  8927. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8928. =>WM: (13536: S1 ^operator O1929)
  8929. 965: O: O1929 (predict-yes)
  8930. --- END Decision Phase ---
  8931. --- Application Phase ---
  8932. --- Firing Productions (PE) For State At Depth 1 ---
  8933. --- Inner Elaboration Phase, active level 1 (S1) ---
  8934. Firing apply*operator
  8935. -->
  8936. (I3 ^predict-yes N965 + :O )
  8937. Firing apply*operator*complete
  8938. -->
  8939. (I3 ^predict-no N964 - :O )
  8940. inner elaboration loop at bottom goal.
  8941. --- Change Working Memory (PE) ---
  8942. =>WM: (13537: I3 ^predict-yes N965)
  8943. <=WM: (13524: N964 ^status complete)
  8944. <=WM: (13523: I3 ^predict-no N964)
  8945. --- Firing Productions (IE) For State At Depth 1 ---
  8946. --- Inner Elaboration Phase, active level 1 (S1) ---
  8947. Firing monitor*world
  8948. -->
  8949. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8950. --- Change Working Memory (IE) ---
  8951. --- END Application Phase ---
  8952. --- Output Phase ---
  8953. ENV: Agent did: predict-yes for direction R in state State-A
  8954. In State-A moving R
  8955. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8956. predict error 0
  8957. dir: dir isU
  8958. --- END Output Phase ---
  8959. -/|--- Input Phase ---
  8960. =>WM: (13541: I2 ^dir U)
  8961. =>WM: (13540: I2 ^reward 1)
  8962. =>WM: (13539: I2 ^see 1)
  8963. =>WM: (13538: N965 ^status complete)
  8964. <=WM: (13527: I2 ^dir R)
  8965. <=WM: (13526: I2 ^reward 1)
  8966. <=WM: (13525: I2 ^see 0)
  8967. =>WM: (13542: I2 ^level-1 R1-root)
  8968. <=WM: (13528: I2 ^level-1 L0-root)
  8969. --- END Input Phase ---
  8970. --- Proposal Phase ---
  8971. --- Inner Elaboration Phase, active level 1 (S1) ---
  8972. Firing elaborate*copy-see-to-output-link
  8973. -->
  8974. (I3 ^see 1 +)
  8975. Firing elaborate*reward*based*on*reward
  8976. -->
  8977. (R969 ^value 1 +)
  8978. (R1 ^reward R969 +)
  8979. Firing propose*predict-yes
  8980. -->
  8981. (O1931 ^name predict-yes +)
  8982. (S1 ^operator O1931 +)
  8983. Firing propose*predict-no
  8984. -->
  8985. (O1932 ^name predict-no +)
  8986. (S1 ^operator O1932 +)
  8987. Firing rl*prefer*rvt*predict-no*H0*2
  8988. -->
  8989. (S1 ^operator O1930 = 1.)
  8990. Firing rl*prefer*rvt*predict-yes*H0*1
  8991. -->
  8992. (S1 ^operator O1929 = 0.)
  8993. Firing prefer*rvt*predict-yes*H0
  8994. -->
  8995. Firing prefer*rvt*predict-no*H0
  8996. -->
  8997. Firing elaborate*copy-dir-to-output-link
  8998. -->
  8999. (I3 ^dir U +)
  9000. inner elaboration loop at bottom goal.
  9001. Retracting elaborate*copy-see-to-output-link
  9002. -->
  9003. (I3 ^see 0 +)
  9004. Retracting propose*predict-no
  9005. -->
  9006. (O1930 ^name predict-no +)
  9007. (S1 ^operator O1930 +)
  9008. Retracting propose*predict-yes
  9009. -->
  9010. (O1929 ^name predict-yes +)
  9011. (S1 ^operator O1929 +)
  9012. Retracting elaborate*reward*based*on*reward
  9013. -->
  9014. (R968 ^value 1 +)
  9015. (R1 ^reward R968 +)
  9016. Retracting elaborate*copy-dir-to-output-link
  9017. -->
  9018. (I3 ^dir R +)
  9019. Retracting rl*prefer*rvt*predict-no*H0*6
  9020. -->
  9021. (S1 ^operator O1930 = 0.9999810901454903)
  9022. Retracting rl*prefer*rvt*predict-yes*H0*5
  9023. -->
  9024. (S1 ^operator O1929 = 0.121597689773478)
  9025. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  9026. -->
  9027. (S1 ^operator O1929 = 0.878390760537652)
  9028. =>WM: (13550: S1 ^operator O1932 +)
  9029. =>WM: (13549: S1 ^operator O1931 +)
  9030. =>WM: (13548: I3 ^dir U)
  9031. =>WM: (13547: O1932 ^name predict-no)
  9032. =>WM: (13546: O1931 ^name predict-yes)
  9033. =>WM: (13545: R969 ^value 1)
  9034. =>WM: (13544: R1 ^reward R969)
  9035. =>WM: (13543: I3 ^see 1)
  9036. <=WM: (13534: S1 ^operator O1929 +)
  9037. <=WM: (13536: S1 ^operator O1929)
  9038. <=WM: (13535: S1 ^operator O1930 +)
  9039. <=WM: (13533: I3 ^dir R)
  9040. <=WM: (13529: R1 ^reward R968)
  9041. <=WM: (13487: I3 ^see 0)
  9042. <=WM: (13532: O1930 ^name predict-no)
  9043. <=WM: (13531: O1929 ^name predict-yes)
  9044. <=WM: (13530: R968 ^value 1)
  9045. --- Inner Elaboration Phase, active level 1 (S1) ---
  9046. Firing prefer*rvt*predict-yes*H0
  9047. -->
  9048. Firing rl*prefer*rvt*predict-yes*H0*1
  9049. -->
  9050. (S1 ^operator O1931 = 0.)
  9051. Firing prefer*rvt*predict-no*H0
  9052. -->
  9053. Firing rl*prefer*rvt*predict-no*H0*2
  9054. -->
  9055. (S1 ^operator O1932 = 1.)
  9056. inner elaboration loop at bottom goal.
  9057. Retracting rl*prefer*rvt*predict-no*H0*2
  9058. -->
  9059. (S1 ^operator O1930 = 1.)
  9060. Retracting rl*prefer*rvt*predict-yes*H0*1
  9061. -->
  9062. (S1 ^operator O1929 = 0.)
  9063. --- END Proposal Phase ---
  9064. --- Decision Phase ---
  9065. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.858824,0.121963)
  9066. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465467 0.412924 0.878391 -> 0.465467 0.412924 0.878392(R,m,v=1,1,0)
  9067. =>WM: (13551: S1 ^operator O1932)
  9068. 966: O: O1932 (predict-no)
  9069. --- END Decision Phase ---
  9070. --- Application Phase ---
  9071. --- Firing Productions (PE) For State At Depth 1 ---
  9072. --- Inner Elaboration Phase, active level 1 (S1) ---
  9073. Firing apply*operator
  9074. -->
  9075. (I3 ^predict-no N966 + :O )
  9076. Firing apply*operator*complete
  9077. -->
  9078. (I3 ^predict-yes N965 - :O )
  9079. inner elaboration loop at bottom goal.
  9080. --- Change Working Memory (PE) ---
  9081. =>WM: (13552: I3 ^predict-no N966)
  9082. <=WM: (13538: N965 ^status complete)
  9083. <=WM: (13537: I3 ^predict-yes N965)
  9084. --- Firing Productions (IE) For State At Depth 1 ---
  9085. --- Inner Elaboration Phase, active level 1 (S1) ---
  9086. Firing monitor*world
  9087. -->
  9088. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9089. --- Change Working Memory (IE) ---
  9090. --- END Application Phase ---
  9091. --- Output Phase ---
  9092. ENV: Agent did: predict-no for direction U in state State-B
  9093. In State-B moving U
  9094. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9095. predict error 0
  9096. dir: dir isL
  9097. --- END Output Phase ---
  9098. \---- Input Phase ---
  9099. =>WM: (13556: I2 ^dir L)
  9100. =>WM: (13555: I2 ^reward 1)
  9101. =>WM: (13554: I2 ^see 0)
  9102. =>WM: (13553: N966 ^status complete)
  9103. <=WM: (13541: I2 ^dir U)
  9104. <=WM: (13540: I2 ^reward 1)
  9105. <=WM: (13539: I2 ^see 1)
  9106. =>WM: (13557: I2 ^level-1 R1-root)
  9107. <=WM: (13542: I2 ^level-1 R1-root)
  9108. --- END Input Phase ---
  9109. --- Proposal Phase ---
  9110. --- Inner Elaboration Phase, active level 1 (S1) ---
  9111. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  9112. -->
  9113. (S1 ^operator O1932 = -0.168718511744511)
  9114. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  9115. -->
  9116. (S1 ^operator O1931 = 0.6093697568764296)
  9117. Firing prefer*rvt*predict-no*H0*4*H1
  9118. -->
  9119. Firing prefer*rvt*predict-yes*H0*3*H1
  9120. -->
  9121. Firing elaborate*copy-see-to-output-link
  9122. -->
  9123. (I3 ^see 0 +)
  9124. Firing elaborate*reward*based*on*reward
  9125. -->
  9126. (R970 ^value 1 +)
  9127. (R1 ^reward R970 +)
  9128. Firing propose*predict-yes
  9129. -->
  9130. (O1933 ^name predict-yes +)
  9131. (S1 ^operator O1933 +)
  9132. Firing propose*predict-no
  9133. -->
  9134. (O1934 ^name predict-no +)
  9135. (S1 ^operator O1934 +)
  9136. Firing rl*prefer*rvt*predict-no*H0*4
  9137. -->
  9138. (S1 ^operator O1932 = 0.3145082389793297)
  9139. Firing rl*prefer*rvt*predict-yes*H0*3
  9140. -->
  9141. (S1 ^operator O1931 = 0.390807862285058)
  9142. Firing prefer*rvt*predict-yes*H0
  9143. -->
  9144. Firing prefer*rvt*predict-no*H0
  9145. -->
  9146. Firing elaborate*copy-dir-to-output-link
  9147. -->
  9148. (I3 ^dir L +)
  9149. inner elaboration loop at bottom goal.
  9150. Retracting elaborate*copy-see-to-output-link
  9151. -->
  9152. (I3 ^see 1 +)
  9153. Retracting propose*predict-no
  9154. -->
  9155. (O1932 ^name predict-no +)
  9156. (S1 ^operator O1932 +)
  9157. Retracting propose*predict-yes
  9158. -->
  9159. (O1931 ^name predict-yes +)
  9160. (S1 ^operator O1931 +)
  9161. Retracting elaborate*reward*based*on*reward
  9162. -->
  9163. (R969 ^value 1 +)
  9164. (R1 ^reward R969 +)
  9165. Retracting elaborate*copy-dir-to-output-link
  9166. -->
  9167. (I3 ^dir U +)
  9168. Retracting rl*prefer*rvt*predict-no*H0*2
  9169. -->
  9170. (S1 ^operator O1932 = 1.)
  9171. Retracting rl*prefer*rvt*predict-yes*H0*1
  9172. -->
  9173. (S1 ^operator O1931 = 0.)
  9174. =>WM: (13565: S1 ^operator O1934 +)
  9175. =>WM: (13564: S1 ^operator O1933 +)
  9176. =>WM: (13563: I3 ^dir L)
  9177. =>WM: (13562: O1934 ^name predict-no)
  9178. =>WM: (13561: O1933 ^name predict-yes)
  9179. =>WM: (13560: R970 ^value 1)
  9180. =>WM: (13559: R1 ^reward R970)
  9181. =>WM: (13558: I3 ^see 0)
  9182. <=WM: (13549: S1 ^operator O1931 +)
  9183. <=WM: (13550: S1 ^operator O1932 +)
  9184. <=WM: (13551: S1 ^operator O1932)
  9185. <=WM: (13548: I3 ^dir U)
  9186. <=WM: (13544: R1 ^reward R969)
  9187. <=WM: (13543: I3 ^see 1)
  9188. <=WM: (13547: O1932 ^name predict-no)
  9189. <=WM: (13546: O1931 ^name predict-yes)
  9190. <=WM: (13545: R969 ^value 1)
  9191. --- Inner Elaboration Phase, active level 1 (S1) ---
  9192. Firing prefer*rvt*predict-yes*H0
  9193. -->
  9194. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  9195. -->
  9196. (S1 ^operator O1933 = 0.6093697568764296)
  9197. Firing rl*prefer*rvt*predict-yes*H0*3
  9198. -->
  9199. (S1 ^operator O1933 = 0.390807862285058)
  9200. Firing prefer*rvt*predict-yes*H0*3*H1
  9201. -->
  9202. Firing prefer*rvt*predict-no*H0
  9203. -->
  9204. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  9205. -->
  9206. (S1 ^operator O1934 = -0.168718511744511)
  9207. Firing rl*prefer*rvt*predict-no*H0*4
  9208. -->
  9209. (S1 ^operator O1934 = 0.3145082389793297)
  9210. Firing prefer*rvt*predict-no*H0*4*H1
  9211. -->
  9212. inner elaboration loop at bottom goal.
  9213. Retracting rl*prefer*rvt*predict-no*H0*4
  9214. -->
  9215. (S1 ^operator O1932 = 0.3145082389793297)
  9216. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  9217. -->
  9218. (S1 ^operator O1932 = -0.168718511744511)
  9219. Retracting rl*prefer*rvt*predict-yes*H0*3
  9220. -->
  9221. (S1 ^operator O1931 = 0.390807862285058)
  9222. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  9223. -->
  9224. (S1 ^operator O1931 = 0.6093697568764296)
  9225. --- END Proposal Phase ---
  9226. --- Decision Phase ---
  9227. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9228. =>WM: (13566: S1 ^operator O1933)
  9229. 967: O: O1933 (predict-yes)
  9230. --- END Decision Phase ---
  9231. --- Application Phase ---
  9232. --- Firing Productions (PE) For State At Depth 1 ---
  9233. --- Inner Elaboration Phase, active level 1 (S1) ---
  9234. Firing apply*operator
  9235. -->
  9236. (I3 ^predict-yes N967 + :O )
  9237. Firing apply*operator*complete
  9238. -->
  9239. (I3 ^predict-no N966 - :O )
  9240. inner elaboration loop at bottom goal.
  9241. --- Change Working Memory (PE) ---
  9242. =>WM: (13567: I3 ^predict-yes N967)
  9243. <=WM: (13553: N966 ^status complete)
  9244. <=WM: (13552: I3 ^predict-no N966)
  9245. --- Firing Productions (IE) For State At Depth 1 ---
  9246. --- Inner Elaboration Phase, active level 1 (S1) ---
  9247. Firing monitor*world
  9248. -->
  9249. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9250. --- Change Working Memory (IE) ---
  9251. --- END Application Phase ---
  9252. --- Output Phase ---
  9253. ENV: Agent did: predict-yes for direction L in state State-B
  9254. In State-B moving L
  9255. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9256. predict error 0
  9257. dir: dir isL
  9258. --- END Output Phase ---
  9259. /|\--- Input Phase ---
  9260. =>WM: (13571: I2 ^dir L)
  9261. =>WM: (13570: I2 ^reward 1)
  9262. =>WM: (13569: I2 ^see 1)
  9263. =>WM: (13568: N967 ^status complete)
  9264. <=WM: (13556: I2 ^dir L)
  9265. <=WM: (13555: I2 ^reward 1)
  9266. <=WM: (13554: I2 ^see 0)
  9267. =>WM: (13572: I2 ^level-1 L1-root)
  9268. <=WM: (13557: I2 ^level-1 R1-root)
  9269. --- END Input Phase ---
  9270. --- Proposal Phase ---
  9271. --- Inner Elaboration Phase, active level 1 (S1) ---
  9272. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  9273. -->
  9274. (S1 ^operator O1933 = -0.2062723012911647)
  9275. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  9276. -->
  9277. (S1 ^operator O1934 = 0.685551861847024)
  9278. Firing prefer*rvt*predict-no*H0*4*H1
  9279. -->
  9280. Firing prefer*rvt*predict-yes*H0*3*H1
  9281. -->
  9282. Firing elaborate*copy-see-to-output-link
  9283. -->
  9284. (I3 ^see 1 +)
  9285. Firing elaborate*reward*based*on*reward
  9286. -->
  9287. (R971 ^value 1 +)
  9288. (R1 ^reward R971 +)
  9289. Firing propose*predict-yes
  9290. -->
  9291. (O1935 ^name predict-yes +)
  9292. (S1 ^operator O1935 +)
  9293. Firing propose*predict-no
  9294. -->
  9295. (O1936 ^name predict-no +)
  9296. (S1 ^operator O1936 +)
  9297. Firing rl*prefer*rvt*predict-no*H0*4
  9298. -->
  9299. (S1 ^operator O1934 = 0.3145082389793297)
  9300. Firing rl*prefer*rvt*predict-yes*H0*3
  9301. -->
  9302. (S1 ^operator O1933 = 0.390807862285058)
  9303. Firing prefer*rvt*predict-yes*H0
  9304. -->
  9305. Firing prefer*rvt*predict-no*H0
  9306. -->
  9307. Firing elaborate*copy-dir-to-output-link
  9308. -->
  9309. (I3 ^dir L +)
  9310. inner elaboration loop at bottom goal.
  9311. Retracting elaborate*copy-see-to-output-link
  9312. -->
  9313. (I3 ^see 0 +)
  9314. Retracting propose*predict-no
  9315. -->
  9316. (O1934 ^name predict-no +)
  9317. (S1 ^operator O1934 +)
  9318. Retracting propose*predict-yes
  9319. -->
  9320. (O1933 ^name predict-yes +)
  9321. (S1 ^operator O1933 +)
  9322. Retracting elaborate*reward*based*on*reward
  9323. -->
  9324. (R970 ^value 1 +)
  9325. (R1 ^reward R970 +)
  9326. Retracting elaborate*copy-dir-to-output-link
  9327. -->
  9328. (I3 ^dir L +)
  9329. Retracting rl*prefer*rvt*predict-no*H0*4
  9330. -->
  9331. (S1 ^operator O1934 = 0.3145082389793297)
  9332. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  9333. -->
  9334. (S1 ^operator O1934 = -0.168718511744511)
  9335. Retracting rl*prefer*rvt*predict-yes*H0*3
  9336. -->
  9337. (S1 ^operator O1933 = 0.390807862285058)
  9338. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  9339. -->
  9340. (S1 ^operator O1933 = 0.6093697568764296)
  9341. =>WM: (13579: S1 ^operator O1936 +)
  9342. =>WM: (13578: S1 ^operator O1935 +)
  9343. =>WM: (13577: O1936 ^name predict-no)
  9344. =>WM: (13576: O1935 ^name predict-yes)
  9345. =>WM: (13575: R971 ^value 1)
  9346. =>WM: (13574: R1 ^reward R971)
  9347. =>WM: (13573: I3 ^see 1)
  9348. <=WM: (13564: S1 ^operator O1933 +)
  9349. <=WM: (13566: S1 ^operator O1933)
  9350. <=WM: (13565: S1 ^operator O1934 +)
  9351. <=WM: (13559: R1 ^reward R970)
  9352. <=WM: (13558: I3 ^see 0)
  9353. <=WM: (13562: O1934 ^name predict-no)
  9354. <=WM: (13561: O1933 ^name predict-yes)
  9355. <=WM: (13560: R970 ^value 1)
  9356. --- Inner Elaboration Phase, active level 1 (S1) ---
  9357. Firing prefer*rvt*predict-yes*H0
  9358. -->
  9359. Firing rl*prefer*rvt*predict-yes*H0*3
  9360. -->
  9361. (S1 ^operator O1935 = 0.390807862285058)
  9362. Firing prefer*rvt*predict-yes*H0*3*H1
  9363. -->
  9364. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  9365. -->
  9366. (S1 ^operator O1935 = -0.2062723012911647)
  9367. Firing prefer*rvt*predict-no*H0
  9368. -->
  9369. Firing rl*prefer*rvt*predict-no*H0*4
  9370. -->
  9371. (S1 ^operator O1936 = 0.3145082389793297)
  9372. Firing prefer*rvt*predict-no*H0*4*H1
  9373. -->
  9374. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  9375. -->
  9376. (S1 ^operator O1936 = 0.685551861847024)
  9377. inner elaboration loop at bottom goal.
  9378. Retracting rl*prefer*rvt*predict-no*H0*4
  9379. -->
  9380. (S1 ^operator O1934 = 0.3145082389793297)
  9381. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  9382. -->
  9383. (S1 ^operator O1934 = 0.685551861847024)
  9384. Retracting rl*prefer*rvt*predict-yes*H0*3
  9385. -->
  9386. (S1 ^operator O1933 = 0.390807862285058)
  9387. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  9388. -->
  9389. (S1 ^operator O1933 = -0.2062723012911647)
  9390. --- END Proposal Phase ---
  9391. --- Decision Phase ---
  9392. RL update rl*prefer*rvt*predict-yes*H0*3 0.472349 -0.0815415 0.390808 -> 0.472337 -0.0815436 0.390793(R,m,v=1,0.941558,0.0553858)
  9393. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527802 0.0815677 0.60937 -> 0.527788 0.0815652 0.609353(R,m,v=1,1,0)
  9394. =>WM: (13580: S1 ^operator O1936)
  9395. 968: O: O1936 (predict-no)
  9396. --- END Decision Phase ---
  9397. --- Application Phase ---
  9398. --- Firing Productions (PE) For State At Depth 1 ---
  9399. --- Inner Elaboration Phase, active level 1 (S1) ---
  9400. Firing apply*operator
  9401. -->
  9402. (I3 ^predict-no N968 + :O )
  9403. Firing apply*operator*complete
  9404. -->
  9405. (I3 ^predict-yes N967 - :O )
  9406. inner elaboration loop at bottom goal.
  9407. --- Change Working Memory (PE) ---
  9408. =>WM: (13581: I3 ^predict-no N968)
  9409. <=WM: (13568: N967 ^status complete)
  9410. <=WM: (13567: I3 ^predict-yes N967)
  9411. --- Firing Productions (IE) For State At Depth 1 ---
  9412. --- Inner Elaboration Phase, active level 1 (S1) ---
  9413. Firing monitor*world
  9414. -->
  9415. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9416. --- Change Working Memory (IE) ---
  9417. --- END Application Phase ---
  9418. --- Output Phase ---
  9419. ENV: Agent did: predict-no for direction L in state State-A
  9420. In State-A moving L
  9421. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9422. predict error 0
  9423. dir: dir isR
  9424. --- END Output Phase ---
  9425. -/|--- Input Phase ---
  9426. =>WM: (13585: I2 ^dir R)
  9427. =>WM: (13584: I2 ^reward 1)
  9428. =>WM: (13583: I2 ^see 0)
  9429. =>WM: (13582: N968 ^status complete)
  9430. <=WM: (13571: I2 ^dir L)
  9431. <=WM: (13570: I2 ^reward 1)
  9432. <=WM: (13569: I2 ^see 1)
  9433. =>WM: (13586: I2 ^level-1 L0-root)
  9434. <=WM: (13572: I2 ^level-1 L1-root)
  9435. --- END Input Phase ---
  9436. --- Proposal Phase ---
  9437. --- Inner Elaboration Phase, active level 1 (S1) ---
  9438. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  9439. -->
  9440. (S1 ^operator O1935 = 0.8783918732984659)
  9441. Firing prefer*rvt*predict-yes*H0*5*H1
  9442. -->
  9443. Firing elaborate*copy-see-to-output-link
  9444. -->
  9445. (I3 ^see 0 +)
  9446. Firing elaborate*reward*based*on*reward
  9447. -->
  9448. (R972 ^value 1 +)
  9449. (R1 ^reward R972 +)
  9450. Firing propose*predict-yes
  9451. -->
  9452. (O1937 ^name predict-yes +)
  9453. (S1 ^operator O1937 +)
  9454. Firing propose*predict-no
  9455. -->
  9456. (O1938 ^name predict-no +)
  9457. (S1 ^operator O1938 +)
  9458. Firing rl*prefer*rvt*predict-no*H0*6
  9459. -->
  9460. (S1 ^operator O1936 = 0.9999810901454903)
  9461. Firing rl*prefer*rvt*predict-yes*H0*5
  9462. -->
  9463. (S1 ^operator O1935 = 0.1215986309459259)
  9464. Firing prefer*rvt*predict-yes*H0
  9465. -->
  9466. Firing prefer*rvt*predict-no*H0
  9467. -->
  9468. Firing elaborate*copy-dir-to-output-link
  9469. -->
  9470. (I3 ^dir R +)
  9471. inner elaboration loop at bottom goal.
  9472. Retracting elaborate*copy-see-to-output-link
  9473. -->
  9474. (I3 ^see 1 +)
  9475. Retracting propose*predict-no
  9476. -->
  9477. (O1936 ^name predict-no +)
  9478. (S1 ^operator O1936 +)
  9479. Retracting propose*predict-yes
  9480. -->
  9481. (O1935 ^name predict-yes +)
  9482. (S1 ^operator O1935 +)
  9483. Retracting elaborate*reward*based*on*reward
  9484. -->
  9485. (R971 ^value 1 +)
  9486. (R1 ^reward R971 +)
  9487. Retracting elaborate*copy-dir-to-output-link
  9488. -->
  9489. (I3 ^dir L +)
  9490. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  9491. -->
  9492. (S1 ^operator O1936 = 0.685551861847024)
  9493. Retracting rl*prefer*rvt*predict-no*H0*4
  9494. -->
  9495. (S1 ^operator O1936 = 0.3145082389793297)
  9496. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  9497. -->
  9498. (S1 ^operator O1935 = -0.2062723012911647)
  9499. Retracting rl*prefer*rvt*predict-yes*H0*3
  9500. -->
  9501. (S1 ^operator O1935 = 0.3907931512898603)
  9502. =>WM: (13594: S1 ^operator O1938 +)
  9503. =>WM: (13593: S1 ^operator O1937 +)
  9504. =>WM: (13592: I3 ^dir R)
  9505. =>WM: (13591: O1938 ^name predict-no)
  9506. =>WM: (13590: O1937 ^name predict-yes)
  9507. =>WM: (13589: R972 ^value 1)
  9508. =>WM: (13588: R1 ^reward R972)
  9509. =>WM: (13587: I3 ^see 0)
  9510. <=WM: (13578: S1 ^operator O1935 +)
  9511. <=WM: (13579: S1 ^operator O1936 +)
  9512. <=WM: (13580: S1 ^operator O1936)
  9513. <=WM: (13563: I3 ^dir L)
  9514. <=WM: (13574: R1 ^reward R971)
  9515. <=WM: (13573: I3 ^see 1)
  9516. <=WM: (13577: O1936 ^name predict-no)
  9517. <=WM: (13576: O1935 ^name predict-yes)
  9518. <=WM: (13575: R971 ^value 1)
  9519. --- Inner Elaboration Phase, active level 1 (S1) ---
  9520. Firing prefer*rvt*predict-yes*H0
  9521. -->
  9522. Firing rl*prefer*rvt*predict-yes*H0*5
  9523. -->
  9524. (S1 ^operator O1937 = 0.1215986309459259)
  9525. Firing prefer*rvt*predict-yes*H0*5*H1
  9526. -->
  9527. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  9528. -->
  9529. (S1 ^operator O1937 = 0.8783918732984659)
  9530. Firing prefer*rvt*predict-no*H0
  9531. -->
  9532. Firing rl*prefer*rvt*predict-no*H0*6
  9533. -->
  9534. (S1 ^operator O1938 = 0.9999810901454903)
  9535. inner elaboration loop at bottom goal.
  9536. Retracting rl*prefer*rvt*predict-no*H0*6
  9537. -->
  9538. (S1 ^operator O1936 = 0.9999810901454903)
  9539. Retracting rl*prefer*rvt*predict-yes*H0*5
  9540. -->
  9541. (S1 ^operator O1935 = 0.1215986309459259)
  9542. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  9543. -->
  9544. (S1 ^operator O1935 = 0.8783918732984659)
  9545. --- END Proposal Phase ---
  9546. --- Decision Phase ---
  9547. RL update rl*prefer*rvt*predict-no*H0*4 0.478556 -0.164048 0.314508 -> 0.478552 -0.164048 0.314503(R,m,v=1,0.92,0.074094)
  9548. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521498 0.164053 0.685552 -> 0.521493 0.164053 0.685546(R,m,v=1,1,0)
  9549. =>WM: (13595: S1 ^operator O1937)
  9550. 969: O: O1937 (predict-yes)
  9551. --- END Decision Phase ---
  9552. --- Application Phase ---
  9553. --- Firing Productions (PE) For State At Depth 1 ---
  9554. --- Inner Elaboration Phase, active level 1 (S1) ---
  9555. Firing apply*operator
  9556. -->
  9557. (I3 ^predict-yes N969 + :O )
  9558. Firing apply*operator*complete
  9559. -->
  9560. (I3 ^predict-no N968 - :O )
  9561. inner elaboration loop at bottom goal.
  9562. --- Change Working Memory (PE) ---
  9563. =>WM: (13596: I3 ^predict-yes N969)
  9564. <=WM: (13582: N968 ^status complete)
  9565. <=WM: (13581: I3 ^predict-no N968)
  9566. --- Firing Productions (IE) For State At Depth 1 ---
  9567. --- Inner Elaboration Phase, active level 1 (S1) ---
  9568. Firing monitor*world
  9569. -->
  9570. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9571. --- Change Working Memory (IE) ---
  9572. --- END Application Phase ---
  9573. --- Output Phase ---
  9574. ENV: Agent did: predict-yes for direction R in state State-A
  9575. In State-A moving R
  9576. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  9577. predict error 0
  9578. dir: dir isL
  9579. --- END Output Phase ---
  9580. \-/--- Input Phase ---
  9581. =>WM: (13600: I2 ^dir L)
  9582. =>WM: (13599: I2 ^reward 1)
  9583. =>WM: (13598: I2 ^see 1)
  9584. =>WM: (13597: N969 ^status complete)
  9585. <=WM: (13585: I2 ^dir R)
  9586. <=WM: (13584: I2 ^reward 1)
  9587. <=WM: (13583: I2 ^see 0)
  9588. =>WM: (13601: I2 ^level-1 R1-root)
  9589. <=WM: (13586: I2 ^level-1 L0-root)
  9590. --- END Input Phase ---
  9591. --- Proposal Phase ---
  9592. --- Inner Elaboration Phase, active level 1 (S1) ---
  9593. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  9594. -->
  9595. (S1 ^operator O1938 = -0.168718511744511)
  9596. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  9597. -->
  9598. (S1 ^operator O1937 = 0.6093527419421177)
  9599. Firing prefer*rvt*predict-no*H0*4*H1
  9600. -->
  9601. Firing prefer*rvt*predict-yes*H0*3*H1
  9602. -->
  9603. Firing elaborate*copy-see-to-output-link
  9604. -->
  9605. (I3 ^see 1 +)
  9606. Firing elaborate*reward*based*on*reward
  9607. -->
  9608. (R973 ^value 1 +)
  9609. (R1 ^reward R973 +)
  9610. Firing propose*predict-yes
  9611. -->
  9612. (O1939 ^name predict-yes +)
  9613. (S1 ^operator O1939 +)
  9614. Firing propose*predict-no
  9615. -->
  9616. (O1940 ^name predict-no +)
  9617. (S1 ^operator O1940 +)
  9618. Firing rl*prefer*rvt*predict-no*H0*4
  9619. -->
  9620. (S1 ^operator O1938 = 0.3145032394390637)
  9621. Firing rl*prefer*rvt*predict-yes*H0*3
  9622. -->
  9623. (S1 ^operator O1937 = 0.3907931512898603)
  9624. Firing prefer*rvt*predict-yes*H0
  9625. -->
  9626. Firing prefer*rvt*predict-no*H0
  9627. -->
  9628. Firing elaborate*copy-dir-to-output-link
  9629. -->
  9630. (I3 ^dir L +)
  9631. inner elaboration loop at bottom goal.
  9632. Retracting elaborate*copy-see-to-output-link
  9633. -->
  9634. (I3 ^see 0 +)
  9635. Retracting propose*predict-no
  9636. -->
  9637. (O1938 ^name predict-no +)
  9638. (S1 ^operator O1938 +)
  9639. Retracting propose*predict-yes
  9640. -->
  9641. (O1937 ^name predict-yes +)
  9642. (S1 ^operator O1937 +)
  9643. Retracting elaborate*reward*based*on*reward
  9644. -->
  9645. (R972 ^value 1 +)
  9646. (R1 ^reward R972 +)
  9647. Retracting elaborate*copy-dir-to-output-link
  9648. -->
  9649. (I3 ^dir R +)
  9650. Retracting rl*prefer*rvt*predict-no*H0*6
  9651. -->
  9652. (S1 ^operator O1938 = 0.9999810901454903)
  9653. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  9654. -->
  9655. (S1 ^operator O1937 = 0.8783918732984659)
  9656. Retracting rl*prefer*rvt*predict-yes*H0*5
  9657. -->
  9658. (S1 ^operator O1937 = 0.1215986309459259)
  9659. =>WM: (13609: S1 ^operator O1940 +)
  9660. =>WM: (13608: S1 ^operator O1939 +)
  9661. =>WM: (13607: I3 ^dir L)
  9662. =>WM: (13606: O1940 ^name predict-no)
  9663. =>WM: (13605: O1939 ^name predict-yes)
  9664. =>WM: (13604: R973 ^value 1)
  9665. =>WM: (13603: R1 ^reward R973)
  9666. =>WM: (13602: I3 ^see 1)
  9667. <=WM: (13593: S1 ^operator O1937 +)
  9668. <=WM: (13595: S1 ^operator O1937)
  9669. <=WM: (13594: S1 ^operator O1938 +)
  9670. <=WM: (13592: I3 ^dir R)
  9671. <=WM: (13588: R1 ^reward R972)
  9672. <=WM: (13587: I3 ^see 0)
  9673. <=WM: (13591: O1938 ^name predict-no)
  9674. <=WM: (13590: O1937 ^name predict-yes)
  9675. <=WM: (13589: R972 ^value 1)
  9676. --- Inner Elaboration Phase, active level 1 (S1) ---
  9677. Firing prefer*rvt*predict-yes*H0
  9678. -->
  9679. Firing rl*prefer*rvt*predict-yes*H0*3
  9680. -->
  9681. (S1 ^operator O1939 = 0.3907931512898603)
  9682. Firing prefer*rvt*predict-yes*H0*3*H1
  9683. -->
  9684. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  9685. -->
  9686. (S1 ^operator O1939 = 0.6093527419421177)
  9687. Firing prefer*rvt*predict-no*H0
  9688. -->
  9689. Firing rl*prefer*rvt*predict-no*H0*4
  9690. -->
  9691. (S1 ^operator O1940 = 0.3145032394390637)
  9692. Firing prefer*rvt*predict-no*H0*4*H1
  9693. -->
  9694. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  9695. -->
  9696. (S1 ^operator O1940 = -0.168718511744511)
  9697. inner elaboration loop at bottom goal.
  9698. Retracting rl*prefer*rvt*predict-no*H0*4
  9699. -->
  9700. (S1 ^operator O1938 = 0.3145032394390637)
  9701. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  9702. -->
  9703. (S1 ^operator O1938 = -0.168718511744511)
  9704. Retracting rl*prefer*rvt*predict-yes*H0*3
  9705. -->
  9706. (S1 ^operator O1937 = 0.3907931512898603)
  9707. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  9708. -->
  9709. (S1 ^operator O1937 = 0.6093527419421177)
  9710. --- END Proposal Phase ---
  9711. --- Decision Phase ---
  9712. RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.859649,0.121362)
  9713. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465467 0.412924 0.878392 -> 0.465468 0.412925 0.878393(R,m,v=1,1,0)
  9714. =>WM: (13610: S1 ^operator O1939)
  9715. 970: O: O1939 (predict-yes)
  9716. --- END Decision Phase ---
  9717. --- Application Phase ---
  9718. --- Firing Productions (PE) For State At Depth 1 ---
  9719. --- Inner Elaboration Phase, active level 1 (S1) ---
  9720. Firing apply*operator
  9721. -->
  9722. (I3 ^predict-yes N970 + :O )
  9723. Firing apply*operator*complete
  9724. -->
  9725. (I3 ^predict-yes N969 - :O )
  9726. inner elaboration loop at bottom goal.
  9727. --- Change Working Memory (PE) ---
  9728. =>WM: (13611: I3 ^predict-yes N970)
  9729. <=WM: (13597: N969 ^status complete)
  9730. <=WM: (13596: I3 ^predict-yes N969)
  9731. --- Firing Productions (IE) For State At Depth 1 ---
  9732. --- Inner Elaboration Phase, active level 1 (S1) ---
  9733. Firing monitor*world
  9734. -->
  9735. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9736. --- Change Working Memory (IE) ---
  9737. --- END Application Phase ---
  9738. --- Output Phase ---
  9739. ENV: Agent did: predict-yes for direction L in state State-B
  9740. In State-B moving L
  9741. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9742. predict error 0
  9743. dir: dir isU
  9744. --- END Output Phase ---
  9745. |\--- Input Phase ---
  9746. =>WM: (13615: I2 ^dir U)
  9747. =>WM: (13614: I2 ^reward 1)
  9748. =>WM: (13613: I2 ^see 1)
  9749. =>WM: (13612: N970 ^status complete)
  9750. <=WM: (13600: I2 ^dir L)
  9751. <=WM: (13599: I2 ^reward 1)
  9752. <=WM: (13598: I2 ^see 1)
  9753. =>WM: (13616: I2 ^level-1 L1-root)
  9754. <=WM: (13601: I2 ^level-1 R1-root)
  9755. --- END Input Phase ---
  9756. --- Proposal Phase ---
  9757. --- Inner Elaboration Phase, active level 1 (S1) ---
  9758. Firing elaborate*copy-see-to-output-link
  9759. -->
  9760. (I3 ^see 1 +)
  9761. Firing elaborate*reward*based*on*reward
  9762. -->
  9763. (R974 ^value 1 +)
  9764. (R1 ^reward R974 +)
  9765. Firing propose*predict-yes
  9766. -->
  9767. (O1941 ^name predict-yes +)
  9768. (S1 ^operator O1941 +)
  9769. Firing propose*predict-no
  9770. -->
  9771. (O1942 ^name predict-no +)
  9772. (S1 ^operator O1942 +)
  9773. Firing rl*prefer*rvt*predict-no*H0*2
  9774. -->
  9775. (S1 ^operator O1940 = 1.)
  9776. Firing rl*prefer*rvt*predict-yes*H0*1
  9777. -->
  9778. (S1 ^operator O1939 = 0.)
  9779. Firing prefer*rvt*predict-yes*H0
  9780. -->
  9781. Firing prefer*rvt*predict-no*H0
  9782. -->
  9783. Firing elaborate*copy-dir-to-output-link
  9784. -->
  9785. (I3 ^dir U +)
  9786. inner elaboration loop at bottom goal.
  9787. Retracting elaborate*copy-see-to-output-link
  9788. -->
  9789. (I3 ^see 1 +)
  9790. Retracting propose*predict-no
  9791. -->
  9792. (O1940 ^name predict-no +)
  9793. (S1 ^operator O1940 +)
  9794. Retracting propose*predict-yes
  9795. -->
  9796. (O1939 ^name predict-yes +)
  9797. (S1 ^operator O1939 +)
  9798. Retracting elaborate*reward*based*on*reward
  9799. -->
  9800. (R973 ^value 1 +)
  9801. (R1 ^reward R973 +)
  9802. Retracting elaborate*copy-dir-to-output-link
  9803. -->
  9804. (I3 ^dir L +)
  9805. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  9806. -->
  9807. (S1 ^operator O1940 = -0.168718511744511)
  9808. Retracting rl*prefer*rvt*predict-no*H0*4
  9809. -->
  9810. (S1 ^operator O1940 = 0.3145032394390637)
  9811. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  9812. -->
  9813. (S1 ^operator O1939 = 0.6093527419421177)
  9814. Retracting rl*prefer*rvt*predict-yes*H0*3
  9815. -->
  9816. (S1 ^operator O1939 = 0.3907931512898603)
  9817. =>WM: (13623: S1 ^operator O1942 +)
  9818. =>WM: (13622: S1 ^operator O1941 +)
  9819. =>WM: (13621: I3 ^dir U)
  9820. =>WM: (13620: O1942 ^name predict-no)
  9821. =>WM: (13619: O1941 ^name predict-yes)
  9822. =>WM: (13618: R974 ^value 1)
  9823. =>WM: (13617: R1 ^reward R974)
  9824. <=WM: (13608: S1 ^operator O1939 +)
  9825. <=WM: (13610: S1 ^operator O1939)
  9826. <=WM: (13609: S1 ^operator O1940 +)
  9827. <=WM: (13607: I3 ^dir L)
  9828. <=WM: (13603: R1 ^reward R973)
  9829. <=WM: (13606: O1940 ^name predict-no)
  9830. <=WM: (13605: O1939 ^name predict-yes)
  9831. <=WM: (13604: R973 ^value 1)
  9832. --- Inner Elaboration Phase, active level 1 (S1) ---
  9833. Firing prefer*rvt*predict-yes*H0
  9834. -->
  9835. Firing rl*prefer*rvt*predict-yes*H0*1
  9836. -->
  9837. (S1 ^operator O1941 = 0.)
  9838. Firing prefer*rvt*predict-no*H0
  9839. -->
  9840. Firing rl*prefer*rvt*predict-no*H0*2
  9841. -->
  9842. (S1 ^operator O1942 = 1.)
  9843. inner elaboration loop at bottom goal.
  9844. Retracting rl*prefer*rvt*predict-no*H0*2
  9845. -->
  9846. (S1 ^operator O1940 = 1.)
  9847. Retracting rl*prefer*rvt*predict-yes*H0*1
  9848. -->
  9849. (S1 ^operator O1939 = 0.)
  9850. --- END Proposal Phase ---
  9851. --- Decision Phase ---
  9852. RL update rl*prefer*rvt*predict-yes*H0*3 0.472337 -0.0815436 0.390793 -> 0.472327 -0.0815454 0.390781(R,m,v=1,0.941935,0.0550482)
  9853. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527788 0.0815652 0.609353 -> 0.527776 0.0815632 0.609339(R,m,v=1,1,0)
  9854. =>WM: (13624: S1 ^operator O1942)
  9855. 971: O: O1942 (predict-no)
  9856. --- END Decision Phase ---
  9857. --- Application Phase ---
  9858. --- Firing Productions (PE) For State At Depth 1 ---
  9859. --- Inner Elaboration Phase, active level 1 (S1) ---
  9860. Firing apply*operator
  9861. -->
  9862. (I3 ^predict-no N971 + :O )
  9863. Firing apply*operator*complete
  9864. -->
  9865. (I3 ^predict-yes N970 - :O )
  9866. inner elaboration loop at bottom goal.
  9867. --- Change Working Memory (PE) ---
  9868. =>WM: (13625: I3 ^predict-no N971)
  9869. <=WM: (13612: N970 ^status complete)
  9870. <=WM: (13611: I3 ^predict-yes N970)
  9871. --- Firing Productions (IE) For State At Depth 1 ---
  9872. --- Inner Elaboration Phase, active level 1 (S1) ---
  9873. Firing monitor*world
  9874. -->
  9875. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9876. --- Change Working Memory (IE) ---
  9877. --- END Application Phase ---
  9878. --- Output Phase ---
  9879. ENV: Agent did: predict-no for direction U in state State-A
  9880. In State-A moving U
  9881. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9882. predict error 0
  9883. dir: dir isR
  9884. --- END Output Phase ---
  9885. ---- Input Phase ---
  9886. =>WM: (13629: I2 ^dir R)
  9887. =>WM: (13628: I2 ^reward 1)
  9888. =>WM: (13627: I2 ^see 0)
  9889. =>WM: (13626: N971 ^status complete)
  9890. <=WM: (13615: I2 ^dir U)
  9891. <=WM: (13614: I2 ^reward 1)
  9892. <=WM: (13613: I2 ^see 1)
  9893. =>WM: (13630: I2 ^level-1 L1-root)
  9894. <=WM: (13616: I2 ^level-1 L1-root)
  9895. --- END Input Phase ---
  9896. --- Proposal Phase ---
  9897. --- Inner Elaboration Phase, active level 1 (S1) ---
  9898. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  9899. -->
  9900. (S1 ^operator O1941 = 0.8784169509457307)
  9901. Firing prefer*rvt*predict-yes*H0*5*H1
  9902. -->
  9903. Firing elaborate*copy-see-to-output-link
  9904. -->
  9905. (I3 ^see 0 +)
  9906. Firing elaborate*reward*based*on*reward
  9907. -->
  9908. (R975 ^value 1 +)
  9909. (R1 ^reward R975 +)
  9910. Firing propose*predict-yes
  9911. -->
  9912. (O1943 ^name predict-yes +)
  9913. (S1 ^operator O1943 +)
  9914. Firing propose*predict-no
  9915. -->
  9916. (O1944 ^name predict-no +)
  9917. (S1 ^operator O1944 +)
  9918. Firing rl*prefer*rvt*predict-no*H0*6
  9919. -->
  9920. (S1 ^operator O1942 = 0.9999810901454903)
  9921. Firing rl*prefer*rvt*predict-yes*H0*5
  9922. -->
  9923. (S1 ^operator O1941 = 0.1215994040064755)
  9924. Firing prefer*rvt*predict-yes*H0
  9925. -->
  9926. Firing prefer*rvt*predict-no*H0
  9927. -->
  9928. Firing elaborate*copy-dir-to-output-link
  9929. -->
  9930. (I3 ^dir R +)
  9931. inner elaboration loop at bottom goal.
  9932. Retracting elaborate*copy-see-to-output-link
  9933. -->
  9934. (I3 ^see 1 +)
  9935. Retracting propose*predict-no
  9936. -->
  9937. (O1942 ^name predict-no +)
  9938. (S1 ^operator O1942 +)
  9939. Retracting propose*predict-yes
  9940. -->
  9941. (O1941 ^name predict-yes +)
  9942. (S1 ^operator O1941 +)
  9943. Retracting elaborate*reward*based*on*reward
  9944. -->
  9945. (R974 ^value 1 +)
  9946. (R1 ^reward R974 +)
  9947. Retracting elaborate*copy-dir-to-output-link
  9948. -->
  9949. (I3 ^dir U +)
  9950. Retracting rl*prefer*rvt*predict-no*H0*2
  9951. -->
  9952. (S1 ^operator O1942 = 1.)
  9953. Retracting rl*prefer*rvt*predict-yes*H0*1
  9954. -->
  9955. (S1 ^operator O1941 = 0.)
  9956. =>WM: (13638: S1 ^operator O1944 +)
  9957. =>WM: (13637: S1 ^operator O1943 +)
  9958. =>WM: (13636: I3 ^dir R)
  9959. =>WM: (13635: O1944 ^name predict-no)
  9960. =>WM: (13634: O1943 ^name predict-yes)
  9961. =>WM: (13633: R975 ^value 1)
  9962. =>WM: (13632: R1 ^reward R975)
  9963. =>WM: (13631: I3 ^see 0)
  9964. <=WM: (13622: S1 ^operator O1941 +)
  9965. <=WM: (13623: S1 ^operator O1942 +)
  9966. <=WM: (13624: S1 ^operator O1942)
  9967. <=WM: (13621: I3 ^dir U)
  9968. <=WM: (13617: R1 ^reward R974)
  9969. <=WM: (13602: I3 ^see 1)
  9970. <=WM: (13620: O1942 ^name predict-no)
  9971. <=WM: (13619: O1941 ^name predict-yes)
  9972. <=WM: (13618: R974 ^value 1)
  9973. --- Inner Elaboration Phase, active level 1 (S1) ---
  9974. Firing prefer*rvt*predict-yes*H0
  9975. -->
  9976. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  9977. -->
  9978. (S1 ^operator O1943 = 0.8784169509457307)
  9979. Firing rl*prefer*rvt*predict-yes*H0*5
  9980. -->
  9981. (S1 ^operator O1943 = 0.1215994040064755)
  9982. Firing prefer*rvt*predict-yes*H0*5*H1
  9983. -->
  9984. Firing prefer*rvt*predict-no*H0
  9985. -->
  9986. Firing rl*prefer*rvt*predict-no*H0*6
  9987. -->
  9988. (S1 ^operator O1944 = 0.9999810901454903)
  9989. inner elaboration loop at bottom goal.
  9990. Retracting rl*prefer*rvt*predict-no*H0*6
  9991. -->
  9992. (S1 ^operator O1942 = 0.9999810901454903)
  9993. Retracting rl*prefer*rvt*predict-yes*H0*5
  9994. -->
  9995. (S1 ^operator O1941 = 0.1215994040064755)
  9996. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  9997. -->
  9998. (S1 ^operator O1941 = 0.8784169509457307)
  9999. --- END Proposal Phase ---
  10000. --- Decision Phase ---
  10001. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10002. =>WM: (13639: S1 ^operator O1943)
  10003. 972: O: O1943 (predict-yes)
  10004. --- END Decision Phase ---
  10005. --- Application Phase ---
  10006. --- Firing Productions (PE) For State At Depth 1 ---
  10007. --- Inner Elaboration Phase, active level 1 (S1) ---
  10008. Firing apply*operator
  10009. -->
  10010. (I3 ^predict-yes N972 + :O )
  10011. Firing apply*operator*complete
  10012. -->
  10013. (I3 ^predict-no N971 - :O )
  10014. inner elaboration loop at bottom goal.
  10015. --- Change Working Memory (PE) ---
  10016. =>WM: (13640: I3 ^predict-yes N972)
  10017. <=WM: (13626: N971 ^status complete)
  10018. <=WM: (13625: I3 ^predict-no N971)
  10019. --- Firing Productions (IE) For State At Depth 1 ---
  10020. --- Inner Elaboration Phase, active level 1 (S1) ---
  10021. Firing monitor*world
  10022. -->
  10023. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10024. --- Change Working Memory (IE) ---
  10025. --- END Application Phase ---
  10026. --- Output Phase ---
  10027. ENV: Agent did: predict-yes for direction R in state State-A
  10028. In State-A moving R
  10029. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10030. predict error 0
  10031. dir: dir isU
  10032. --- END Output Phase ---
  10033. /|\--- Input Phase ---
  10034. =>WM: (13644: I2 ^dir U)
  10035. =>WM: (13643: I2 ^reward 1)
  10036. =>WM: (13642: I2 ^see 1)
  10037. =>WM: (13641: N972 ^status complete)
  10038. <=WM: (13629: I2 ^dir R)
  10039. <=WM: (13628: I2 ^reward 1)
  10040. <=WM: (13627: I2 ^see 0)
  10041. =>WM: (13645: I2 ^level-1 R1-root)
  10042. <=WM: (13630: I2 ^level-1 L1-root)
  10043. --- END Input Phase ---
  10044. --- Proposal Phase ---
  10045. --- Inner Elaboration Phase, active level 1 (S1) ---
  10046. Firing elaborate*copy-see-to-output-link
  10047. -->
  10048. (I3 ^see 1 +)
  10049. Firing elaborate*reward*based*on*reward
  10050. -->
  10051. (R976 ^value 1 +)
  10052. (R1 ^reward R976 +)
  10053. Firing propose*predict-yes
  10054. -->
  10055. (O1945 ^name predict-yes +)
  10056. (S1 ^operator O1945 +)
  10057. Firing propose*predict-no
  10058. -->
  10059. (O1946 ^name predict-no +)
  10060. (S1 ^operator O1946 +)
  10061. Firing rl*prefer*rvt*predict-no*H0*2
  10062. -->
  10063. (S1 ^operator O1944 = 1.)
  10064. Firing rl*prefer*rvt*predict-yes*H0*1
  10065. -->
  10066. (S1 ^operator O1943 = 0.)
  10067. Firing prefer*rvt*predict-yes*H0
  10068. -->
  10069. Firing prefer*rvt*predict-no*H0
  10070. -->
  10071. Firing elaborate*copy-dir-to-output-link
  10072. -->
  10073. (I3 ^dir U +)
  10074. inner elaboration loop at bottom goal.
  10075. Retracting elaborate*copy-see-to-output-link
  10076. -->
  10077. (I3 ^see 0 +)
  10078. Retracting propose*predict-no
  10079. -->
  10080. (O1944 ^name predict-no +)
  10081. (S1 ^operator O1944 +)
  10082. Retracting propose*predict-yes
  10083. -->
  10084. (O1943 ^name predict-yes +)
  10085. (S1 ^operator O1943 +)
  10086. Retracting elaborate*reward*based*on*reward
  10087. -->
  10088. (R975 ^value 1 +)
  10089. (R1 ^reward R975 +)
  10090. Retracting elaborate*copy-dir-to-output-link
  10091. -->
  10092. (I3 ^dir R +)
  10093. Retracting rl*prefer*rvt*predict-no*H0*6
  10094. -->
  10095. (S1 ^operator O1944 = 0.9999810901454903)
  10096. Retracting rl*prefer*rvt*predict-yes*H0*5
  10097. -->
  10098. (S1 ^operator O1943 = 0.1215994040064755)
  10099. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  10100. -->
  10101. (S1 ^operator O1943 = 0.8784169509457307)
  10102. =>WM: (13653: S1 ^operator O1946 +)
  10103. =>WM: (13652: S1 ^operator O1945 +)
  10104. =>WM: (13651: I3 ^dir U)
  10105. =>WM: (13650: O1946 ^name predict-no)
  10106. =>WM: (13649: O1945 ^name predict-yes)
  10107. =>WM: (13648: R976 ^value 1)
  10108. =>WM: (13647: R1 ^reward R976)
  10109. =>WM: (13646: I3 ^see 1)
  10110. <=WM: (13637: S1 ^operator O1943 +)
  10111. <=WM: (13639: S1 ^operator O1943)
  10112. <=WM: (13638: S1 ^operator O1944 +)
  10113. <=WM: (13636: I3 ^dir R)
  10114. <=WM: (13632: R1 ^reward R975)
  10115. <=WM: (13631: I3 ^see 0)
  10116. <=WM: (13635: O1944 ^name predict-no)
  10117. <=WM: (13634: O1943 ^name predict-yes)
  10118. <=WM: (13633: R975 ^value 1)
  10119. --- Inner Elaboration Phase, active level 1 (S1) ---
  10120. Firing prefer*rvt*predict-yes*H0
  10121. -->
  10122. Firing rl*prefer*rvt*predict-yes*H0*1
  10123. -->
  10124. (S1 ^operator O1945 = 0.)
  10125. Firing prefer*rvt*predict-no*H0
  10126. -->
  10127. Firing rl*prefer*rvt*predict-no*H0*2
  10128. -->
  10129. (S1 ^operator O1946 = 1.)
  10130. inner elaboration loop at bottom goal.
  10131. Retracting rl*prefer*rvt*predict-no*H0*2
  10132. -->
  10133. (S1 ^operator O1944 = 1.)
  10134. Retracting rl*prefer*rvt*predict-yes*H0*1
  10135. -->
  10136. (S1 ^operator O1943 = 0.)
  10137. --- END Proposal Phase ---
  10138. --- Decision Phase ---
  10139. RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.860465,0.120767)
  10140. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465488 0.412929 0.878417 -> 0.465487 0.412928 0.878415(R,m,v=1,1,0)
  10141. =>WM: (13654: S1 ^operator O1946)
  10142. 973: O: O1946 (predict-no)
  10143. --- END Decision Phase ---
  10144. --- Application Phase ---
  10145. --- Firing Productions (PE) For State At Depth 1 ---
  10146. --- Inner Elaboration Phase, active level 1 (S1) ---
  10147. Firing apply*operator
  10148. -->
  10149. (I3 ^predict-no N973 + :O )
  10150. Firing apply*operator*complete
  10151. -->
  10152. (I3 ^predict-yes N972 - :O )
  10153. inner elaboration loop at bottom goal.
  10154. --- Change Working Memory (PE) ---
  10155. =>WM: (13655: I3 ^predict-no N973)
  10156. <=WM: (13641: N972 ^status complete)
  10157. <=WM: (13640: I3 ^predict-yes N972)
  10158. --- Firing Productions (IE) For State At Depth 1 ---
  10159. --- Inner Elaboration Phase, active level 1 (S1) ---
  10160. Firing monitor*world
  10161. -->
  10162. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10163. --- Change Working Memory (IE) ---
  10164. --- END Application Phase ---
  10165. --- Output Phase ---
  10166. ENV: Agent did: predict-no for direction U in state State-B
  10167. In State-B moving U
  10168. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10169. predict error 0
  10170. dir: dir isL
  10171. --- END Output Phase ---
  10172. -/--- Input Phase ---
  10173. =>WM: (13659: I2 ^dir L)
  10174. =>WM: (13658: I2 ^reward 1)
  10175. =>WM: (13657: I2 ^see 0)
  10176. =>WM: (13656: N973 ^status complete)
  10177. <=WM: (13644: I2 ^dir U)
  10178. <=WM: (13643: I2 ^reward 1)
  10179. <=WM: (13642: I2 ^see 1)
  10180. =>WM: (13660: I2 ^level-1 R1-root)
  10181. <=WM: (13645: I2 ^level-1 R1-root)
  10182. --- END Input Phase ---
  10183. --- Proposal Phase ---
  10184. --- Inner Elaboration Phase, active level 1 (S1) ---
  10185. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  10186. -->
  10187. (S1 ^operator O1946 = -0.168718511744511)
  10188. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  10189. -->
  10190. (S1 ^operator O1945 = 0.609338805157315)
  10191. Firing prefer*rvt*predict-no*H0*4*H1
  10192. -->
  10193. Firing prefer*rvt*predict-yes*H0*3*H1
  10194. -->
  10195. Firing elaborate*copy-see-to-output-link
  10196. -->
  10197. (I3 ^see 0 +)
  10198. Firing elaborate*reward*based*on*reward
  10199. -->
  10200. (R977 ^value 1 +)
  10201. (R1 ^reward R977 +)
  10202. Firing propose*predict-yes
  10203. -->
  10204. (O1947 ^name predict-yes +)
  10205. (S1 ^operator O1947 +)
  10206. Firing propose*predict-no
  10207. -->
  10208. (O1948 ^name predict-no +)
  10209. (S1 ^operator O1948 +)
  10210. Firing rl*prefer*rvt*predict-no*H0*4
  10211. -->
  10212. (S1 ^operator O1946 = 0.3145032394390637)
  10213. Firing rl*prefer*rvt*predict-yes*H0*3
  10214. -->
  10215. (S1 ^operator O1945 = 0.3907810808803528)
  10216. Firing prefer*rvt*predict-yes*H0
  10217. -->
  10218. Firing prefer*rvt*predict-no*H0
  10219. -->
  10220. Firing elaborate*copy-dir-to-output-link
  10221. -->
  10222. (I3 ^dir L +)
  10223. inner elaboration loop at bottom goal.
  10224. Retracting elaborate*copy-see-to-output-link
  10225. -->
  10226. (I3 ^see 1 +)
  10227. Retracting propose*predict-no
  10228. -->
  10229. (O1946 ^name predict-no +)
  10230. (S1 ^operator O1946 +)
  10231. Retracting propose*predict-yes
  10232. -->
  10233. (O1945 ^name predict-yes +)
  10234. (S1 ^operator O1945 +)
  10235. Retracting elaborate*reward*based*on*reward
  10236. -->
  10237. (R976 ^value 1 +)
  10238. (R1 ^reward R976 +)
  10239. Retracting elaborate*copy-dir-to-output-link
  10240. -->
  10241. (I3 ^dir U +)
  10242. Retracting rl*prefer*rvt*predict-no*H0*2
  10243. -->
  10244. (S1 ^operator O1946 = 1.)
  10245. Retracting rl*prefer*rvt*predict-yes*H0*1
  10246. -->
  10247. (S1 ^operator O1945 = 0.)
  10248. =>WM: (13668: S1 ^operator O1948 +)
  10249. =>WM: (13667: S1 ^operator O1947 +)
  10250. =>WM: (13666: I3 ^dir L)
  10251. =>WM: (13665: O1948 ^name predict-no)
  10252. =>WM: (13664: O1947 ^name predict-yes)
  10253. =>WM: (13663: R977 ^value 1)
  10254. =>WM: (13662: R1 ^reward R977)
  10255. =>WM: (13661: I3 ^see 0)
  10256. <=WM: (13652: S1 ^operator O1945 +)
  10257. <=WM: (13653: S1 ^operator O1946 +)
  10258. <=WM: (13654: S1 ^operator O1946)
  10259. <=WM: (13651: I3 ^dir U)
  10260. <=WM: (13647: R1 ^reward R976)
  10261. <=WM: (13646: I3 ^see 1)
  10262. <=WM: (13650: O1946 ^name predict-no)
  10263. <=WM: (13649: O1945 ^name predict-yes)
  10264. <=WM: (13648: R976 ^value 1)
  10265. --- Inner Elaboration Phase, active level 1 (S1) ---
  10266. Firing prefer*rvt*predict-yes*H0
  10267. -->
  10268. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  10269. -->
  10270. (S1 ^operator O1947 = 0.609338805157315)
  10271. Firing rl*prefer*rvt*predict-yes*H0*3
  10272. -->
  10273. (S1 ^operator O1947 = 0.3907810808803528)
  10274. Firing prefer*rvt*predict-yes*H0*3*H1
  10275. -->
  10276. Firing prefer*rvt*predict-no*H0
  10277. -->
  10278. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  10279. -->
  10280. (S1 ^operator O1948 = -0.168718511744511)
  10281. Firing rl*prefer*rvt*predict-no*H0*4
  10282. -->
  10283. (S1 ^operator O1948 = 0.3145032394390637)
  10284. Firing prefer*rvt*predict-no*H0*4*H1
  10285. -->
  10286. inner elaboration loop at bottom goal.
  10287. Retracting rl*prefer*rvt*predict-no*H0*4
  10288. -->
  10289. (S1 ^operator O1946 = 0.3145032394390637)
  10290. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  10291. -->
  10292. (S1 ^operator O1946 = -0.168718511744511)
  10293. Retracting rl*prefer*rvt*predict-yes*H0*3
  10294. -->
  10295. (S1 ^operator O1945 = 0.3907810808803528)
  10296. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  10297. -->
  10298. (S1 ^operator O1945 = 0.609338805157315)
  10299. --- END Proposal Phase ---
  10300. --- Decision Phase ---
  10301. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10302. =>WM: (13669: S1 ^operator O1947)
  10303. 974: O: O1947 (predict-yes)
  10304. --- END Decision Phase ---
  10305. --- Application Phase ---
  10306. --- Firing Productions (PE) For State At Depth 1 ---
  10307. --- Inner Elaboration Phase, active level 1 (S1) ---
  10308. Firing apply*operator
  10309. -->
  10310. (I3 ^predict-yes N974 + :O )
  10311. Firing apply*operator*complete
  10312. -->
  10313. (I3 ^predict-no N973 - :O )
  10314. inner elaboration loop at bottom goal.
  10315. --- Change Working Memory (PE) ---
  10316. =>WM: (13670: I3 ^predict-yes N974)
  10317. <=WM: (13656: N973 ^status complete)
  10318. <=WM: (13655: I3 ^predict-no N973)
  10319. --- Firing Productions (IE) For State At Depth 1 ---
  10320. --- Inner Elaboration Phase, active level 1 (S1) ---
  10321. Firing monitor*world
  10322. -->
  10323. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10324. --- Change Working Memory (IE) ---
  10325. --- END Application Phase ---
  10326. --- Output Phase ---
  10327. ENV: Agent did: predict-yes for direction L in state State-B
  10328. In State-B moving L
  10329. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  10330. predict error 0
  10331. dir: dir isL
  10332. --- END Output Phase ---
  10333. |\--- Input Phase ---
  10334. =>WM: (13674: I2 ^dir L)
  10335. =>WM: (13673: I2 ^reward 1)
  10336. =>WM: (13672: I2 ^see 1)
  10337. =>WM: (13671: N974 ^status complete)
  10338. <=WM: (13659: I2 ^dir L)
  10339. <=WM: (13658: I2 ^reward 1)
  10340. <=WM: (13657: I2 ^see 0)
  10341. =>WM: (13675: I2 ^level-1 L1-root)
  10342. <=WM: (13660: I2 ^level-1 R1-root)
  10343. --- END Input Phase ---
  10344. --- Proposal Phase ---
  10345. --- Inner Elaboration Phase, active level 1 (S1) ---
  10346. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  10347. -->
  10348. (S1 ^operator O1947 = -0.2062723012911647)
  10349. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  10350. -->
  10351. (S1 ^operator O1948 = 0.6855461517499103)
  10352. Firing prefer*rvt*predict-no*H0*4*H1
  10353. -->
  10354. Firing prefer*rvt*predict-yes*H0*3*H1
  10355. -->
  10356. Firing elaborate*copy-see-to-output-link
  10357. -->
  10358. (I3 ^see 1 +)
  10359. Firing elaborate*reward*based*on*reward
  10360. -->
  10361. (R978 ^value 1 +)
  10362. (R1 ^reward R978 +)
  10363. Firing propose*predict-yes
  10364. -->
  10365. (O1949 ^name predict-yes +)
  10366. (S1 ^operator O1949 +)
  10367. Firing propose*predict-no
  10368. -->
  10369. (O1950 ^name predict-no +)
  10370. (S1 ^operator O1950 +)
  10371. Firing rl*prefer*rvt*predict-no*H0*4
  10372. -->
  10373. (S1 ^operator O1948 = 0.3145032394390637)
  10374. Firing rl*prefer*rvt*predict-yes*H0*3
  10375. -->
  10376. (S1 ^operator O1947 = 0.3907810808803528)
  10377. Firing prefer*rvt*predict-yes*H0
  10378. -->
  10379. Firing prefer*rvt*predict-no*H0
  10380. -->
  10381. Firing elaborate*copy-dir-to-output-link
  10382. -->
  10383. (I3 ^dir L +)
  10384. inner elaboration loop at bottom goal.
  10385. Retracting elaborate*copy-see-to-output-link
  10386. -->
  10387. (I3 ^see 0 +)
  10388. Retracting propose*predict-no
  10389. -->
  10390. (O1948 ^name predict-no +)
  10391. (S1 ^operator O1948 +)
  10392. Retracting propose*predict-yes
  10393. -->
  10394. (O1947 ^name predict-yes +)
  10395. (S1 ^operator O1947 +)
  10396. Retracting elaborate*reward*based*on*reward
  10397. -->
  10398. (R977 ^value 1 +)
  10399. (R1 ^reward R977 +)
  10400. Retracting elaborate*copy-dir-to-output-link
  10401. -->
  10402. (I3 ^dir L +)
  10403. Retracting rl*prefer*rvt*predict-no*H0*4
  10404. -->
  10405. (S1 ^operator O1948 = 0.3145032394390637)
  10406. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  10407. -->
  10408. (S1 ^operator O1948 = -0.168718511744511)
  10409. Retracting rl*prefer*rvt*predict-yes*H0*3
  10410. -->
  10411. (S1 ^operator O1947 = 0.3907810808803528)
  10412. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  10413. -->
  10414. (S1 ^operator O1947 = 0.609338805157315)
  10415. =>WM: (13682: S1 ^operator O1950 +)
  10416. =>WM: (13681: S1 ^operator O1949 +)
  10417. =>WM: (13680: O1950 ^name predict-no)
  10418. =>WM: (13679: O1949 ^name predict-yes)
  10419. =>WM: (13678: R978 ^value 1)
  10420. =>WM: (13677: R1 ^reward R978)
  10421. =>WM: (13676: I3 ^see 1)
  10422. <=WM: (13667: S1 ^operator O1947 +)
  10423. <=WM: (13669: S1 ^operator O1947)
  10424. <=WM: (13668: S1 ^operator O1948 +)
  10425. <=WM: (13662: R1 ^reward R977)
  10426. <=WM: (13661: I3 ^see 0)
  10427. <=WM: (13665: O1948 ^name predict-no)
  10428. <=WM: (13664: O1947 ^name predict-yes)
  10429. <=WM: (13663: R977 ^value 1)
  10430. --- Inner Elaboration Phase, active level 1 (S1) ---
  10431. Firing prefer*rvt*predict-yes*H0
  10432. -->
  10433. Firing rl*prefer*rvt*predict-yes*H0*3
  10434. -->
  10435. (S1 ^operator O1949 = 0.3907810808803528)
  10436. Firing prefer*rvt*predict-yes*H0*3*H1
  10437. -->
  10438. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  10439. -->
  10440. (S1 ^operator O1949 = -0.2062723012911647)
  10441. Firing prefer*rvt*predict-no*H0
  10442. -->
  10443. Firing rl*prefer*rvt*predict-no*H0*4
  10444. -->
  10445. (S1 ^operator O1950 = 0.3145032394390637)
  10446. Firing prefer*rvt*predict-no*H0*4*H1
  10447. -->
  10448. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  10449. -->
  10450. (S1 ^operator O1950 = 0.6855461517499103)
  10451. inner elaboration loop at bottom goal.
  10452. Retracting rl*prefer*rvt*predict-no*H0*4
  10453. -->
  10454. (S1 ^operator O1948 = 0.3145032394390637)
  10455. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  10456. -->
  10457. (S1 ^operator O1948 = 0.6855461517499103)
  10458. Retracting rl*prefer*rvt*predict-yes*H0*3
  10459. -->
  10460. (S1 ^operator O1947 = 0.3907810808803528)
  10461. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  10462. -->
  10463. (S1 ^operator O1947 = -0.2062723012911647)
  10464. --- END Proposal Phase ---
  10465. --- Decision Phase ---
  10466. RL update rl*prefer*rvt*predict-yes*H0*3 0.472327 -0.0815454 0.390781 -> 0.472318 -0.0815469 0.390771(R,m,v=1,0.942308,0.0547146)
  10467. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527776 0.0815632 0.609339 -> 0.527766 0.0815615 0.609327(R,m,v=1,1,0)
  10468. =>WM: (13683: S1 ^operator O1950)
  10469. 975: O: O1950 (predict-no)
  10470. --- END Decision Phase ---
  10471. --- Application Phase ---
  10472. --- Firing Productions (PE) For State At Depth 1 ---
  10473. --- Inner Elaboration Phase, active level 1 (S1) ---
  10474. Firing apply*operator
  10475. -->
  10476. (I3 ^predict-no N975 + :O )
  10477. Firing apply*operator*complete
  10478. -->
  10479. (I3 ^predict-yes N974 - :O )
  10480. inner elaboration loop at bottom goal.
  10481. --- Change Working Memory (PE) ---
  10482. =>WM: (13684: I3 ^predict-no N975)
  10483. <=WM: (13671: N974 ^status complete)
  10484. <=WM: (13670: I3 ^predict-yes N974)
  10485. --- Firing Productions (IE) For State At Depth 1 ---
  10486. --- Inner Elaboration Phase, active level 1 (S1) ---
  10487. Firing monitor*world
  10488. -->
  10489. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10490. --- Change Working Memory (IE) ---
  10491. --- END Application Phase ---
  10492. --- Output Phase ---
  10493. ENV: Agent did: predict-no for direction L in state State-A
  10494. In State-A moving L
  10495. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10496. predict error 0
  10497. dir: dir isU
  10498. --- END Output Phase ---
  10499. -/|--- Input Phase ---
  10500. =>WM: (13688: I2 ^dir U)
  10501. =>WM: (13687: I2 ^reward 1)
  10502. =>WM: (13686: I2 ^see 0)
  10503. =>WM: (13685: N975 ^status complete)
  10504. <=WM: (13674: I2 ^dir L)
  10505. <=WM: (13673: I2 ^reward 1)
  10506. <=WM: (13672: I2 ^see 1)
  10507. =>WM: (13689: I2 ^level-1 L0-root)
  10508. <=WM: (13675: I2 ^level-1 L1-root)
  10509. --- END Input Phase ---
  10510. --- Proposal Phase ---
  10511. --- Inner Elaboration Phase, active level 1 (S1) ---
  10512. Firing elaborate*copy-see-to-output-link
  10513. -->
  10514. (I3 ^see 0 +)
  10515. Firing elaborate*reward*based*on*reward
  10516. -->
  10517. (R979 ^value 1 +)
  10518. (R1 ^reward R979 +)
  10519. Firing propose*predict-yes
  10520. -->
  10521. (O1951 ^name predict-yes +)
  10522. (S1 ^operator O1951 +)
  10523. Firing propose*predict-no
  10524. -->
  10525. (O1952 ^name predict-no +)
  10526. (S1 ^operator O1952 +)
  10527. Firing rl*prefer*rvt*predict-no*H0*2
  10528. -->
  10529. (S1 ^operator O1950 = 1.)
  10530. Firing rl*prefer*rvt*predict-yes*H0*1
  10531. -->
  10532. (S1 ^operator O1949 = 0.)
  10533. Firing prefer*rvt*predict-yes*H0
  10534. -->
  10535. Firing prefer*rvt*predict-no*H0
  10536. -->
  10537. Firing elaborate*copy-dir-to-output-link
  10538. -->
  10539. (I3 ^dir U +)
  10540. inner elaboration loop at bottom goal.
  10541. Retracting elaborate*copy-see-to-output-link
  10542. -->
  10543. (I3 ^see 1 +)
  10544. Retracting propose*predict-no
  10545. -->
  10546. (O1950 ^name predict-no +)
  10547. (S1 ^operator O1950 +)
  10548. Retracting propose*predict-yes
  10549. -->
  10550. (O1949 ^name predict-yes +)
  10551. (S1 ^operator O1949 +)
  10552. Retracting elaborate*reward*based*on*reward
  10553. -->
  10554. (R978 ^value 1 +)
  10555. (R1 ^reward R978 +)
  10556. Retracting elaborate*copy-dir-to-output-link
  10557. -->
  10558. (I3 ^dir L +)
  10559. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  10560. -->
  10561. (S1 ^operator O1950 = 0.6855461517499103)
  10562. Retracting rl*prefer*rvt*predict-no*H0*4
  10563. -->
  10564. (S1 ^operator O1950 = 0.3145032394390637)
  10565. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  10566. -->
  10567. (S1 ^operator O1949 = -0.2062723012911647)
  10568. Retracting rl*prefer*rvt*predict-yes*H0*3
  10569. -->
  10570. (S1 ^operator O1949 = 0.3907711727075364)
  10571. =>WM: (13697: S1 ^operator O1952 +)
  10572. =>WM: (13696: S1 ^operator O1951 +)
  10573. =>WM: (13695: I3 ^dir U)
  10574. =>WM: (13694: O1952 ^name predict-no)
  10575. =>WM: (13693: O1951 ^name predict-yes)
  10576. =>WM: (13692: R979 ^value 1)
  10577. =>WM: (13691: R1 ^reward R979)
  10578. =>WM: (13690: I3 ^see 0)
  10579. <=WM: (13681: S1 ^operator O1949 +)
  10580. <=WM: (13682: S1 ^operator O1950 +)
  10581. <=WM: (13683: S1 ^operator O1950)
  10582. <=WM: (13666: I3 ^dir L)
  10583. <=WM: (13677: R1 ^reward R978)
  10584. <=WM: (13676: I3 ^see 1)
  10585. <=WM: (13680: O1950 ^name predict-no)
  10586. <=WM: (13679: O1949 ^name predict-yes)
  10587. <=WM: (13678: R978 ^value 1)
  10588. --- Inner Elaboration Phase, active level 1 (S1) ---
  10589. Firing prefer*rvt*predict-yes*H0
  10590. -->
  10591. Firing rl*prefer*rvt*predict-yes*H0*1
  10592. -->
  10593. (S1 ^operator O1951 = 0.)
  10594. Firing prefer*rvt*predict-no*H0
  10595. -->
  10596. Firing rl*prefer*rvt*predict-no*H0*2
  10597. -->
  10598. (S1 ^operator O1952 = 1.)
  10599. inner elaboration loop at bottom goal.
  10600. Retracting rl*prefer*rvt*predict-no*H0*2
  10601. -->
  10602. (S1 ^operator O1950 = 1.)
  10603. Retracting rl*prefer*rvt*predict-yes*H0*1
  10604. -->
  10605. (S1 ^operator O1949 = 0.)
  10606. --- END Proposal Phase ---
  10607. --- Decision Phase ---
  10608. RL update rl*prefer*rvt*predict-no*H0*4 0.478552 -0.164048 0.314503 -> 0.478548 -0.164049 0.314499(R,m,v=1,0.92053,0.0736424)
  10609. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521493 0.164053 0.685546 -> 0.521489 0.164052 0.685541(R,m,v=1,1,0)
  10610. =>WM: (13698: S1 ^operator O1952)
  10611. 976: O: O1952 (predict-no)
  10612. --- END Decision Phase ---
  10613. --- Application Phase ---
  10614. --- Firing Productions (PE) For State At Depth 1 ---
  10615. --- Inner Elaboration Phase, active level 1 (S1) ---
  10616. Firing apply*operator
  10617. -->
  10618. (I3 ^predict-no N976 + :O )
  10619. Firing apply*operator*complete
  10620. -->
  10621. (I3 ^predict-no N975 - :O )
  10622. inner elaboration loop at bottom goal.
  10623. --- Change Working Memory (PE) ---
  10624. =>WM: (13699: I3 ^predict-no N976)
  10625. <=WM: (13685: N975 ^status complete)
  10626. <=WM: (13684: I3 ^predict-no N975)
  10627. --- Firing Productions (IE) For State At Depth 1 ---
  10628. --- Inner Elaboration Phase, active level 1 (S1) ---
  10629. Firing monitor*world
  10630. -->
  10631. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10632. --- Change Working Memory (IE) ---
  10633. --- END Application Phase ---
  10634. --- Output Phase ---
  10635. ENV: Agent did: predict-no for direction U in state State-A
  10636. In State-A moving U
  10637. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10638. predict error 0
  10639. dir: dir isL
  10640. --- END Output Phase ---
  10641. \-/--- Input Phase ---
  10642. =>WM: (13703: I2 ^dir L)
  10643. =>WM: (13702: I2 ^reward 1)
  10644. =>WM: (13701: I2 ^see 0)
  10645. =>WM: (13700: N976 ^status complete)
  10646. <=WM: (13688: I2 ^dir U)
  10647. <=WM: (13687: I2 ^reward 1)
  10648. <=WM: (13686: I2 ^see 0)
  10649. =>WM: (13704: I2 ^level-1 L0-root)
  10650. <=WM: (13689: I2 ^level-1 L0-root)
  10651. --- END Input Phase ---
  10652. --- Proposal Phase ---
  10653. --- Inner Elaboration Phase, active level 1 (S1) ---
  10654. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  10655. -->
  10656. (S1 ^operator O1951 = -0.208713043145708)
  10657. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  10658. -->
  10659. (S1 ^operator O1952 = 0.6854177156873388)
  10660. Firing prefer*rvt*predict-no*H0*4*H1
  10661. -->
  10662. Firing prefer*rvt*predict-yes*H0*3*H1
  10663. -->
  10664. Firing elaborate*copy-see-to-output-link
  10665. -->
  10666. (I3 ^see 0 +)
  10667. Firing elaborate*reward*based*on*reward
  10668. -->
  10669. (R980 ^value 1 +)
  10670. (R1 ^reward R980 +)
  10671. Firing propose*predict-yes
  10672. -->
  10673. (O1953 ^name predict-yes +)
  10674. (S1 ^operator O1953 +)
  10675. Firing propose*predict-no
  10676. -->
  10677. (O1954 ^name predict-no +)
  10678. (S1 ^operator O1954 +)
  10679. Firing rl*prefer*rvt*predict-no*H0*4
  10680. -->
  10681. (S1 ^operator O1952 = 0.3144991353263821)
  10682. Firing rl*prefer*rvt*predict-yes*H0*3
  10683. -->
  10684. (S1 ^operator O1951 = 0.3907711727075364)
  10685. Firing prefer*rvt*predict-yes*H0
  10686. -->
  10687. Firing prefer*rvt*predict-no*H0
  10688. -->
  10689. Firing elaborate*copy-dir-to-output-link
  10690. -->
  10691. (I3 ^dir L +)
  10692. inner elaboration loop at bottom goal.
  10693. Retracting elaborate*copy-see-to-output-link
  10694. -->
  10695. (I3 ^see 0 +)
  10696. Retracting propose*predict-no
  10697. -->
  10698. (O1952 ^name predict-no +)
  10699. (S1 ^operator O1952 +)
  10700. Retracting propose*predict-yes
  10701. -->
  10702. (O1951 ^name predict-yes +)
  10703. (S1 ^operator O1951 +)
  10704. Retracting elaborate*reward*based*on*reward
  10705. -->
  10706. (R979 ^value 1 +)
  10707. (R1 ^reward R979 +)
  10708. Retracting elaborate*copy-dir-to-output-link
  10709. -->
  10710. (I3 ^dir U +)
  10711. Retracting rl*prefer*rvt*predict-no*H0*2
  10712. -->
  10713. (S1 ^operator O1952 = 1.)
  10714. Retracting rl*prefer*rvt*predict-yes*H0*1
  10715. -->
  10716. (S1 ^operator O1951 = 0.)
  10717. =>WM: (13711: S1 ^operator O1954 +)
  10718. =>WM: (13710: S1 ^operator O1953 +)
  10719. =>WM: (13709: I3 ^dir L)
  10720. =>WM: (13708: O1954 ^name predict-no)
  10721. =>WM: (13707: O1953 ^name predict-yes)
  10722. =>WM: (13706: R980 ^value 1)
  10723. =>WM: (13705: R1 ^reward R980)
  10724. <=WM: (13696: S1 ^operator O1951 +)
  10725. <=WM: (13697: S1 ^operator O1952 +)
  10726. <=WM: (13698: S1 ^operator O1952)
  10727. <=WM: (13695: I3 ^dir U)
  10728. <=WM: (13691: R1 ^reward R979)
  10729. <=WM: (13694: O1952 ^name predict-no)
  10730. <=WM: (13693: O1951 ^name predict-yes)
  10731. <=WM: (13692: R979 ^value 1)
  10732. --- Inner Elaboration Phase, active level 1 (S1) ---
  10733. Firing prefer*rvt*predict-yes*H0
  10734. -->
  10735. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  10736. -->
  10737. (S1 ^operator O1953 = -0.208713043145708)
  10738. Firing rl*prefer*rvt*predict-yes*H0*3
  10739. -->
  10740. (S1 ^operator O1953 = 0.3907711727075364)
  10741. Firing prefer*rvt*predict-yes*H0*3*H1
  10742. -->
  10743. Firing prefer*rvt*predict-no*H0
  10744. -->
  10745. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  10746. -->
  10747. (S1 ^operator O1954 = 0.6854177156873388)
  10748. Firing rl*prefer*rvt*predict-no*H0*4
  10749. -->
  10750. (S1 ^operator O1954 = 0.3144991353263821)
  10751. Firing prefer*rvt*predict-no*H0*4*H1
  10752. -->
  10753. inner elaboration loop at bottom goal.
  10754. Retracting rl*prefer*rvt*predict-no*H0*4
  10755. -->
  10756. (S1 ^operator O1952 = 0.3144991353263821)
  10757. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  10758. -->
  10759. (S1 ^operator O1952 = 0.6854177156873388)
  10760. Retracting rl*prefer*rvt*predict-yes*H0*3
  10761. -->
  10762. (S1 ^operator O1951 = 0.3907711727075364)
  10763. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  10764. -->
  10765. (S1 ^operator O1951 = -0.208713043145708)
  10766. --- END Proposal Phase ---
  10767. --- Decision Phase ---
  10768. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10769. =>WM: (13712: S1 ^operator O1954)
  10770. 977: O: O1954 (predict-no)
  10771. --- END Decision Phase ---
  10772. --- Application Phase ---
  10773. --- Firing Productions (PE) For State At Depth 1 ---
  10774. --- Inner Elaboration Phase, active level 1 (S1) ---
  10775. Firing apply*operator
  10776. -->
  10777. (I3 ^predict-no N977 + :O )
  10778. Firing apply*operator*complete
  10779. -->
  10780. (I3 ^predict-no N976 - :O )
  10781. inner elaboration loop at bottom goal.
  10782. --- Change Working Memory (PE) ---
  10783. =>WM: (13713: I3 ^predict-no N977)
  10784. <=WM: (13700: N976 ^status complete)
  10785. <=WM: (13699: I3 ^predict-no N976)
  10786. --- Firing Productions (IE) For State At Depth 1 ---
  10787. --- Inner Elaboration Phase, active level 1 (S1) ---
  10788. Firing monitor*world
  10789. -->
  10790. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10791. --- Change Working Memory (IE) ---
  10792. --- END Application Phase ---
  10793. --- Output Phase ---
  10794. ENV: Agent did: predict-no for direction L in state State-A
  10795. In State-A moving L
  10796. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10797. predict error 0
  10798. dir: dir isR
  10799. --- END Output Phase ---
  10800. |\---- Input Phase ---
  10801. =>WM: (13717: I2 ^dir R)
  10802. =>WM: (13716: I2 ^reward 1)
  10803. =>WM: (13715: I2 ^see 0)
  10804. =>WM: (13714: N977 ^status complete)
  10805. <=WM: (13703: I2 ^dir L)
  10806. <=WM: (13702: I2 ^reward 1)
  10807. <=WM: (13701: I2 ^see 0)
  10808. =>WM: (13718: I2 ^level-1 L0-root)
  10809. <=WM: (13704: I2 ^level-1 L0-root)
  10810. --- END Input Phase ---
  10811. --- Proposal Phase ---
  10812. --- Inner Elaboration Phase, active level 1 (S1) ---
  10813. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  10814. -->
  10815. (S1 ^operator O1953 = 0.8783927855286688)
  10816. Firing prefer*rvt*predict-yes*H0*5*H1
  10817. -->
  10818. Firing elaborate*copy-see-to-output-link
  10819. -->
  10820. (I3 ^see 0 +)
  10821. Firing elaborate*reward*based*on*reward
  10822. -->
  10823. (R981 ^value 1 +)
  10824. (R1 ^reward R981 +)
  10825. Firing propose*predict-yes
  10826. -->
  10827. (O1955 ^name predict-yes +)
  10828. (S1 ^operator O1955 +)
  10829. Firing propose*predict-no
  10830. -->
  10831. (O1956 ^name predict-no +)
  10832. (S1 ^operator O1956 +)
  10833. Firing rl*prefer*rvt*predict-no*H0*6
  10834. -->
  10835. (S1 ^operator O1954 = 0.9999810901454903)
  10836. Firing rl*prefer*rvt*predict-yes*H0*5
  10837. -->
  10838. (S1 ^operator O1953 = 0.1215980737936329)
  10839. Firing prefer*rvt*predict-yes*H0
  10840. -->
  10841. Firing prefer*rvt*predict-no*H0
  10842. -->
  10843. Firing elaborate*copy-dir-to-output-link
  10844. -->
  10845. (I3 ^dir R +)
  10846. inner elaboration loop at bottom goal.
  10847. Retracting elaborate*copy-see-to-output-link
  10848. -->
  10849. (I3 ^see 0 +)
  10850. Retracting propose*predict-no
  10851. -->
  10852. (O1954 ^name predict-no +)
  10853. (S1 ^operator O1954 +)
  10854. Retracting propose*predict-yes
  10855. -->
  10856. (O1953 ^name predict-yes +)
  10857. (S1 ^operator O1953 +)
  10858. Retracting elaborate*reward*based*on*reward
  10859. -->
  10860. (R980 ^value 1 +)
  10861. (R1 ^reward R980 +)
  10862. Retracting elaborate*copy-dir-to-output-link
  10863. -->
  10864. (I3 ^dir L +)
  10865. Retracting rl*prefer*rvt*predict-no*H0*4
  10866. -->
  10867. (S1 ^operator O1954 = 0.3144991353263821)
  10868. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  10869. -->
  10870. (S1 ^operator O1954 = 0.6854177156873388)
  10871. Retracting rl*prefer*rvt*predict-yes*H0*3
  10872. -->
  10873. (S1 ^operator O1953 = 0.3907711727075364)
  10874. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  10875. -->
  10876. (S1 ^operator O1953 = -0.208713043145708)
  10877. =>WM: (13725: S1 ^operator O1956 +)
  10878. =>WM: (13724: S1 ^operator O1955 +)
  10879. =>WM: (13723: I3 ^dir R)
  10880. =>WM: (13722: O1956 ^name predict-no)
  10881. =>WM: (13721: O1955 ^name predict-yes)
  10882. =>WM: (13720: R981 ^value 1)
  10883. =>WM: (13719: R1 ^reward R981)
  10884. <=WM: (13710: S1 ^operator O1953 +)
  10885. <=WM: (13711: S1 ^operator O1954 +)
  10886. <=WM: (13712: S1 ^operator O1954)
  10887. <=WM: (13709: I3 ^dir L)
  10888. <=WM: (13705: R1 ^reward R980)
  10889. <=WM: (13708: O1954 ^name predict-no)
  10890. <=WM: (13707: O1953 ^name predict-yes)
  10891. <=WM: (13706: R980 ^value 1)
  10892. --- Inner Elaboration Phase, active level 1 (S1) ---
  10893. Firing prefer*rvt*predict-yes*H0
  10894. -->
  10895. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  10896. -->
  10897. (S1 ^operator O1955 = 0.8783927855286688)
  10898. Firing rl*prefer*rvt*predict-yes*H0*5
  10899. -->
  10900. (S1 ^operator O1955 = 0.1215980737936329)
  10901. Firing prefer*rvt*predict-yes*H0*5*H1
  10902. -->
  10903. Firing prefer*rvt*predict-no*H0
  10904. -->
  10905. Firing rl*prefer*rvt*predict-no*H0*6
  10906. -->
  10907. (S1 ^operator O1956 = 0.9999810901454903)
  10908. inner elaboration loop at bottom goal.
  10909. Retracting rl*prefer*rvt*predict-no*H0*6
  10910. -->
  10911. (S1 ^operator O1954 = 0.9999810901454903)
  10912. Retracting rl*prefer*rvt*predict-yes*H0*5
  10913. -->
  10914. (S1 ^operator O1953 = 0.1215980737936329)
  10915. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  10916. -->
  10917. (S1 ^operator O1953 = 0.8783927855286688)
  10918. --- END Proposal Phase ---
  10919. --- Decision Phase ---
  10920. RL update rl*prefer*rvt*predict-no*H0*4 0.478548 -0.164049 0.314499 -> 0.478554 -0.164048 0.314506(R,m,v=1,0.921053,0.0731962)
  10921. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521377 0.164041 0.685418 -> 0.521384 0.164042 0.685426(R,m,v=1,1,0)
  10922. =>WM: (13726: S1 ^operator O1955)
  10923. 978: O: O1955 (predict-yes)
  10924. --- END Decision Phase ---
  10925. --- Application Phase ---
  10926. --- Firing Productions (PE) For State At Depth 1 ---
  10927. --- Inner Elaboration Phase, active level 1 (S1) ---
  10928. Firing apply*operator
  10929. -->
  10930. (I3 ^predict-yes N978 + :O )
  10931. Firing apply*operator*complete
  10932. -->
  10933. (I3 ^predict-no N977 - :O )
  10934. inner elaboration loop at bottom goal.
  10935. --- Change Working Memory (PE) ---
  10936. =>WM: (13727: I3 ^predict-yes N978)
  10937. <=WM: (13714: N977 ^status complete)
  10938. <=WM: (13713: I3 ^predict-no N977)
  10939. --- Firing Productions (IE) For State At Depth 1 ---
  10940. --- Inner Elaboration Phase, active level 1 (S1) ---
  10941. Firing monitor*world
  10942. -->
  10943. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10944. --- Change Working Memory (IE) ---
  10945. --- END Application Phase ---
  10946. --- Output Phase ---
  10947. ENV: Agent did: predict-yes for direction R in state State-A
  10948. In State-A moving R
  10949. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10950. predict error 0
  10951. dir: dir isL
  10952. --- END Output Phase ---
  10953. /|\--- Input Phase ---
  10954. =>WM: (13731: I2 ^dir L)
  10955. =>WM: (13730: I2 ^reward 1)
  10956. =>WM: (13729: I2 ^see 1)
  10957. =>WM: (13728: N978 ^status complete)
  10958. <=WM: (13717: I2 ^dir R)
  10959. <=WM: (13716: I2 ^reward 1)
  10960. <=WM: (13715: I2 ^see 0)
  10961. =>WM: (13732: I2 ^level-1 R1-root)
  10962. <=WM: (13718: I2 ^level-1 L0-root)
  10963. --- END Input Phase ---
  10964. --- Proposal Phase ---
  10965. --- Inner Elaboration Phase, active level 1 (S1) ---
  10966. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  10967. -->
  10968. (S1 ^operator O1956 = -0.168718511744511)
  10969. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  10970. -->
  10971. (S1 ^operator O1955 = 0.6093273841659509)
  10972. Firing prefer*rvt*predict-no*H0*4*H1
  10973. -->
  10974. Firing prefer*rvt*predict-yes*H0*3*H1
  10975. -->
  10976. Firing elaborate*copy-see-to-output-link
  10977. -->
  10978. (I3 ^see 1 +)
  10979. Firing elaborate*reward*based*on*reward
  10980. -->
  10981. (R982 ^value 1 +)
  10982. (R1 ^reward R982 +)
  10983. Firing propose*predict-yes
  10984. -->
  10985. (O1957 ^name predict-yes +)
  10986. (S1 ^operator O1957 +)
  10987. Firing propose*predict-no
  10988. -->
  10989. (O1958 ^name predict-no +)
  10990. (S1 ^operator O1958 +)
  10991. Firing rl*prefer*rvt*predict-no*H0*4
  10992. -->
  10993. (S1 ^operator O1956 = 0.3145060369395525)
  10994. Firing rl*prefer*rvt*predict-yes*H0*3
  10995. -->
  10996. (S1 ^operator O1955 = 0.3907711727075364)
  10997. Firing prefer*rvt*predict-yes*H0
  10998. -->
  10999. Firing prefer*rvt*predict-no*H0
  11000. -->
  11001. Firing elaborate*copy-dir-to-output-link
  11002. -->
  11003. (I3 ^dir L +)
  11004. inner elaboration loop at bottom goal.
  11005. Retracting elaborate*copy-see-to-output-link
  11006. -->
  11007. (I3 ^see 0 +)
  11008. Retracting propose*predict-no
  11009. -->
  11010. (O1956 ^name predict-no +)
  11011. (S1 ^operator O1956 +)
  11012. Retracting propose*predict-yes
  11013. -->
  11014. (O1955 ^name predict-yes +)
  11015. (S1 ^operator O1955 +)
  11016. Retracting elaborate*reward*based*on*reward
  11017. -->
  11018. (R981 ^value 1 +)
  11019. (R1 ^reward R981 +)
  11020. Retracting elaborate*copy-dir-to-output-link
  11021. -->
  11022. (I3 ^dir R +)
  11023. Retracting rl*prefer*rvt*predict-no*H0*6
  11024. -->
  11025. (S1 ^operator O1956 = 0.9999810901454903)
  11026. Retracting rl*prefer*rvt*predict-yes*H0*5
  11027. -->
  11028. (S1 ^operator O1955 = 0.1215980737936329)
  11029. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  11030. -->
  11031. (S1 ^operator O1955 = 0.8783927855286688)
  11032. =>WM: (13740: S1 ^operator O1958 +)
  11033. =>WM: (13739: S1 ^operator O1957 +)
  11034. =>WM: (13738: I3 ^dir L)
  11035. =>WM: (13737: O1958 ^name predict-no)
  11036. =>WM: (13736: O1957 ^name predict-yes)
  11037. =>WM: (13735: R982 ^value 1)
  11038. =>WM: (13734: R1 ^reward R982)
  11039. =>WM: (13733: I3 ^see 1)
  11040. <=WM: (13724: S1 ^operator O1955 +)
  11041. <=WM: (13726: S1 ^operator O1955)
  11042. <=WM: (13725: S1 ^operator O1956 +)
  11043. <=WM: (13723: I3 ^dir R)
  11044. <=WM: (13719: R1 ^reward R981)
  11045. <=WM: (13690: I3 ^see 0)
  11046. <=WM: (13722: O1956 ^name predict-no)
  11047. <=WM: (13721: O1955 ^name predict-yes)
  11048. <=WM: (13720: R981 ^value 1)
  11049. --- Inner Elaboration Phase, active level 1 (S1) ---
  11050. Firing prefer*rvt*predict-yes*H0
  11051. -->
  11052. Firing rl*prefer*rvt*predict-yes*H0*3
  11053. -->
  11054. (S1 ^operator O1957 = 0.3907711727075364)
  11055. Firing prefer*rvt*predict-yes*H0*3*H1
  11056. -->
  11057. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  11058. -->
  11059. (S1 ^operator O1957 = 0.6093273841659509)
  11060. Firing prefer*rvt*predict-no*H0
  11061. -->
  11062. Firing rl*prefer*rvt*predict-no*H0*4
  11063. -->
  11064. (S1 ^operator O1958 = 0.3145060369395525)
  11065. Firing prefer*rvt*predict-no*H0*4*H1
  11066. -->
  11067. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  11068. -->
  11069. (S1 ^operator O1958 = -0.168718511744511)
  11070. inner elaboration loop at bottom goal.
  11071. Retracting rl*prefer*rvt*predict-no*H0*4
  11072. -->
  11073. (S1 ^operator O1956 = 0.3145060369395525)
  11074. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  11075. -->
  11076. (S1 ^operator O1956 = -0.168718511744511)
  11077. Retracting rl*prefer*rvt*predict-yes*H0*3
  11078. -->
  11079. (S1 ^operator O1955 = 0.3907711727075364)
  11080. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  11081. -->
  11082. (S1 ^operator O1955 = 0.6093273841659509)
  11083. --- END Proposal Phase ---
  11084. --- Decision Phase ---
  11085. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.861272,0.120177)
  11086. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465468 0.412925 0.878393 -> 0.465469 0.412925 0.878394(R,m,v=1,1,0)
  11087. =>WM: (13741: S1 ^operator O1957)
  11088. 979: O: O1957 (predict-yes)
  11089. --- END Decision Phase ---
  11090. --- Application Phase ---
  11091. --- Firing Productions (PE) For State At Depth 1 ---
  11092. --- Inner Elaboration Phase, active level 1 (S1) ---
  11093. Firing apply*operator
  11094. -->
  11095. (I3 ^predict-yes N979 + :O )
  11096. Firing apply*operator*complete
  11097. -->
  11098. (I3 ^predict-yes N978 - :O )
  11099. inner elaboration loop at bottom goal.
  11100. --- Change Working Memory (PE) ---
  11101. =>WM: (13742: I3 ^predict-yes N979)
  11102. <=WM: (13728: N978 ^status complete)
  11103. <=WM: (13727: I3 ^predict-yes N978)
  11104. --- Firing Productions (IE) For State At Depth 1 ---
  11105. --- Inner Elaboration Phase, active level 1 (S1) ---
  11106. Firing monitor*world
  11107. -->
  11108. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11109. --- Change Working Memory (IE) ---
  11110. --- END Application Phase ---
  11111. --- Output Phase ---
  11112. ENV: Agent did: predict-yes for direction L in state State-B
  11113. In State-B moving L
  11114. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11115. predict error 0
  11116. dir: dir isR
  11117. --- END Output Phase ---
  11118. -/|--- Input Phase ---
  11119. =>WM: (13746: I2 ^dir R)
  11120. =>WM: (13745: I2 ^reward 1)
  11121. =>WM: (13744: I2 ^see 1)
  11122. =>WM: (13743: N979 ^status complete)
  11123. <=WM: (13731: I2 ^dir L)
  11124. <=WM: (13730: I2 ^reward 1)
  11125. <=WM: (13729: I2 ^see 1)
  11126. =>WM: (13747: I2 ^level-1 L1-root)
  11127. <=WM: (13732: I2 ^level-1 R1-root)
  11128. --- END Input Phase ---
  11129. --- Proposal Phase ---
  11130. --- Inner Elaboration Phase, active level 1 (S1) ---
  11131. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  11132. -->
  11133. (S1 ^operator O1957 = 0.8784154092082219)
  11134. Firing prefer*rvt*predict-yes*H0*5*H1
  11135. -->
  11136. Firing elaborate*copy-see-to-output-link
  11137. -->
  11138. (I3 ^see 1 +)
  11139. Firing elaborate*reward*based*on*reward
  11140. -->
  11141. (R983 ^value 1 +)
  11142. (R1 ^reward R983 +)
  11143. Firing propose*predict-yes
  11144. -->
  11145. (O1959 ^name predict-yes +)
  11146. (S1 ^operator O1959 +)
  11147. Firing propose*predict-no
  11148. -->
  11149. (O1960 ^name predict-no +)
  11150. (S1 ^operator O1960 +)
  11151. Firing rl*prefer*rvt*predict-no*H0*6
  11152. -->
  11153. (S1 ^operator O1958 = 0.9999810901454903)
  11154. Firing rl*prefer*rvt*predict-yes*H0*5
  11155. -->
  11156. (S1 ^operator O1957 = 0.1215988165406292)
  11157. Firing prefer*rvt*predict-yes*H0
  11158. -->
  11159. Firing prefer*rvt*predict-no*H0
  11160. -->
  11161. Firing elaborate*copy-dir-to-output-link
  11162. -->
  11163. (I3 ^dir R +)
  11164. inner elaboration loop at bottom goal.
  11165. Retracting elaborate*copy-see-to-output-link
  11166. -->
  11167. (I3 ^see 1 +)
  11168. Retracting propose*predict-no
  11169. -->
  11170. (O1958 ^name predict-no +)
  11171. (S1 ^operator O1958 +)
  11172. Retracting propose*predict-yes
  11173. -->
  11174. (O1957 ^name predict-yes +)
  11175. (S1 ^operator O1957 +)
  11176. Retracting elaborate*reward*based*on*reward
  11177. -->
  11178. (R982 ^value 1 +)
  11179. (R1 ^reward R982 +)
  11180. Retracting elaborate*copy-dir-to-output-link
  11181. -->
  11182. (I3 ^dir L +)
  11183. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  11184. -->
  11185. (S1 ^operator O1958 = -0.168718511744511)
  11186. Retracting rl*prefer*rvt*predict-no*H0*4
  11187. -->
  11188. (S1 ^operator O1958 = 0.3145060369395525)
  11189. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  11190. -->
  11191. (S1 ^operator O1957 = 0.6093273841659509)
  11192. Retracting rl*prefer*rvt*predict-yes*H0*3
  11193. -->
  11194. (S1 ^operator O1957 = 0.3907711727075364)
  11195. =>WM: (13754: S1 ^operator O1960 +)
  11196. =>WM: (13753: S1 ^operator O1959 +)
  11197. =>WM: (13752: I3 ^dir R)
  11198. =>WM: (13751: O1960 ^name predict-no)
  11199. =>WM: (13750: O1959 ^name predict-yes)
  11200. =>WM: (13749: R983 ^value 1)
  11201. =>WM: (13748: R1 ^reward R983)
  11202. <=WM: (13739: S1 ^operator O1957 +)
  11203. <=WM: (13741: S1 ^operator O1957)
  11204. <=WM: (13740: S1 ^operator O1958 +)
  11205. <=WM: (13738: I3 ^dir L)
  11206. <=WM: (13734: R1 ^reward R982)
  11207. <=WM: (13737: O1958 ^name predict-no)
  11208. <=WM: (13736: O1957 ^name predict-yes)
  11209. <=WM: (13735: R982 ^value 1)
  11210. --- Inner Elaboration Phase, active level 1 (S1) ---
  11211. Firing prefer*rvt*predict-yes*H0
  11212. -->
  11213. Firing rl*prefer*rvt*predict-yes*H0*5
  11214. -->
  11215. (S1 ^operator O1959 = 0.1215988165406292)
  11216. Firing prefer*rvt*predict-yes*H0*5*H1
  11217. -->
  11218. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  11219. -->
  11220. (S1 ^operator O1959 = 0.8784154092082219)
  11221. Firing prefer*rvt*predict-no*H0
  11222. -->
  11223. Firing rl*prefer*rvt*predict-no*H0*6
  11224. -->
  11225. (S1 ^operator O1960 = 0.9999810901454903)
  11226. inner elaboration loop at bottom goal.
  11227. Retracting rl*prefer*rvt*predict-no*H0*6
  11228. -->
  11229. (S1 ^operator O1958 = 0.9999810901454903)
  11230. Retracting rl*prefer*rvt*predict-yes*H0*5
  11231. -->
  11232. (S1 ^operator O1957 = 0.1215988165406292)
  11233. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  11234. -->
  11235. (S1 ^operator O1957 = 0.8784154092082219)
  11236. --- END Proposal Phase ---
  11237. --- Decision Phase ---
  11238. RL update rl*prefer*rvt*predict-yes*H0*3 0.472318 -0.0815469 0.390771 -> 0.472311 -0.0815481 0.390763(R,m,v=1,0.942675,0.0543851)
  11239. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527766 0.0815615 0.609327 -> 0.527758 0.0815601 0.609318(R,m,v=1,1,0)
  11240. =>WM: (13755: S1 ^operator O1959)
  11241. 980: O: O1959 (predict-yes)
  11242. --- END Decision Phase ---
  11243. --- Application Phase ---
  11244. --- Firing Productions (PE) For State At Depth 1 ---
  11245. --- Inner Elaboration Phase, active level 1 (S1) ---
  11246. Firing apply*operator
  11247. -->
  11248. (I3 ^predict-yes N980 + :O )
  11249. Firing apply*operator*complete
  11250. -->
  11251. (I3 ^predict-yes N979 - :O )
  11252. inner elaboration loop at bottom goal.
  11253. --- Change Working Memory (PE) ---
  11254. =>WM: (13756: I3 ^predict-yes N980)
  11255. <=WM: (13743: N979 ^status complete)
  11256. <=WM: (13742: I3 ^predict-yes N979)
  11257. --- Firing Productions (IE) For State At Depth 1 ---
  11258. --- Inner Elaboration Phase, active level 1 (S1) ---
  11259. Firing monitor*world
  11260. -->
  11261. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11262. --- Change Working Memory (IE) ---
  11263. --- END Application Phase ---
  11264. --- Output Phase ---
  11265. ENV: Agent did: predict-yes for direction R in state State-A
  11266. In State-A moving R
  11267. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  11268. predict error 0
  11269. dir: dir isR
  11270. --- END Output Phase ---
  11271. \---- Input Phase ---
  11272. =>WM: (13760: I2 ^dir R)
  11273. =>WM: (13759: I2 ^reward 1)
  11274. =>WM: (13758: I2 ^see 1)
  11275. =>WM: (13757: N980 ^status complete)
  11276. <=WM: (13746: I2 ^dir R)
  11277. <=WM: (13745: I2 ^reward 1)
  11278. <=WM: (13744: I2 ^see 1)
  11279. =>WM: (13761: I2 ^level-1 R1-root)
  11280. <=WM: (13747: I2 ^level-1 L1-root)
  11281. --- END Input Phase ---
  11282. --- Proposal Phase ---
  11283. --- Inner Elaboration Phase, active level 1 (S1) ---
  11284. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  11285. -->
  11286. (S1 ^operator O1959 = -0.04253361215288998)
  11287. Firing prefer*rvt*predict-yes*H0*5*H1
  11288. -->
  11289. Firing elaborate*copy-see-to-output-link
  11290. -->
  11291. (I3 ^see 1 +)
  11292. Firing elaborate*reward*based*on*reward
  11293. -->
  11294. (R984 ^value 1 +)
  11295. (R1 ^reward R984 +)
  11296. Firing propose*predict-yes
  11297. -->
  11298. (O1961 ^name predict-yes +)
  11299. (S1 ^operator O1961 +)
  11300. Firing propose*predict-no
  11301. -->
  11302. (O1962 ^name predict-no +)
  11303. (S1 ^operator O1962 +)
  11304. Firing rl*prefer*rvt*predict-no*H0*6
  11305. -->
  11306. (S1 ^operator O1960 = 0.9999810901454903)
  11307. Firing rl*prefer*rvt*predict-yes*H0*5
  11308. -->
  11309. (S1 ^operator O1959 = 0.1215988165406292)
  11310. Firing prefer*rvt*predict-yes*H0
  11311. -->
  11312. Firing prefer*rvt*predict-no*H0
  11313. -->
  11314. Firing elaborate*copy-dir-to-output-link
  11315. -->
  11316. (I3 ^dir R +)
  11317. inner elaboration loop at bottom goal.
  11318. Retracting elaborate*copy-see-to-output-link
  11319. -->
  11320. (I3 ^see 1 +)
  11321. Retracting propose*predict-no
  11322. -->
  11323. (O1960 ^name predict-no +)
  11324. (S1 ^operator O1960 +)
  11325. Retracting propose*predict-yes
  11326. -->
  11327. (O1959 ^name predict-yes +)
  11328. (S1 ^operator O1959 +)
  11329. Retracting elaborate*reward*based*on*reward
  11330. -->
  11331. (R983 ^value 1 +)
  11332. (R1 ^reward R983 +)
  11333. Retracting elaborate*copy-dir-to-output-link
  11334. -->
  11335. (I3 ^dir R +)
  11336. Retracting rl*prefer*rvt*predict-no*H0*6
  11337. -->
  11338. (S1 ^operator O1960 = 0.9999810901454903)
  11339. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  11340. -->
  11341. (S1 ^operator O1959 = 0.8784154092082219)
  11342. Retracting rl*prefer*rvt*predict-yes*H0*5
  11343. -->
  11344. (S1 ^operator O1959 = 0.1215988165406292)
  11345. =>WM: (13767: S1 ^operator O1962 +)
  11346. =>WM: (13766: S1 ^operator O1961 +)
  11347. =>WM: (13765: O1962 ^name predict-no)
  11348. =>WM: (13764: O1961 ^name predict-yes)
  11349. =>WM: (13763: R984 ^value 1)
  11350. =>WM: (13762: R1 ^reward R984)
  11351. <=WM: (13753: S1 ^operator O1959 +)
  11352. <=WM: (13755: S1 ^operator O1959)
  11353. <=WM: (13754: S1 ^operator O1960 +)
  11354. <=WM: (13748: R1 ^reward R983)
  11355. <=WM: (13751: O1960 ^name predict-no)
  11356. <=WM: (13750: O1959 ^name predict-yes)
  11357. <=WM: (13749: R983 ^value 1)
  11358. --- Inner Elaboration Phase, active level 1 (S1) ---
  11359. Firing prefer*rvt*predict-yes*H0
  11360. -->
  11361. Firing rl*prefer*rvt*predict-yes*H0*5
  11362. -->
  11363. (S1 ^operator O1961 = 0.1215988165406292)
  11364. Firing prefer*rvt*predict-yes*H0*5*H1
  11365. -->
  11366. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  11367. -->
  11368. (S1 ^operator O1961 = -0.04253361215288998)
  11369. Firing prefer*rvt*predict-no*H0
  11370. -->
  11371. Firing rl*prefer*rvt*predict-no*H0*6
  11372. -->
  11373. (S1 ^operator O1962 = 0.9999810901454903)
  11374. inner elaboration loop at bottom goal.
  11375. Retracting rl*prefer*rvt*predict-no*H0*6
  11376. -->
  11377. (S1 ^operator O1960 = 0.9999810901454903)
  11378. Retracting rl*prefer*rvt*predict-yes*H0*5
  11379. -->
  11380. (S1 ^operator O1959 = 0.1215988165406292)
  11381. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  11382. -->
  11383. (S1 ^operator O1959 = -0.04253361215288998)
  11384. --- END Proposal Phase ---
  11385. --- Decision Phase ---
  11386. RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.862069,0.119593)
  11387. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465487 0.412928 0.878415 -> 0.465486 0.412928 0.878414(R,m,v=1,1,0)
  11388. =>WM: (13768: S1 ^operator O1962)
  11389. 981: O: O1962 (predict-no)
  11390. --- END Decision Phase ---
  11391. --- Application Phase ---
  11392. --- Firing Productions (PE) For State At Depth 1 ---
  11393. --- Inner Elaboration Phase, active level 1 (S1) ---
  11394. Firing apply*operator
  11395. -->
  11396. (I3 ^predict-no N981 + :O )
  11397. Firing apply*operator*complete
  11398. -->
  11399. (I3 ^predict-yes N980 - :O )
  11400. inner elaboration loop at bottom goal.
  11401. --- Change Working Memory (PE) ---
  11402. =>WM: (13769: I3 ^predict-no N981)
  11403. <=WM: (13757: N980 ^status complete)
  11404. <=WM: (13756: I3 ^predict-yes N980)
  11405. --- Firing Productions (IE) For State At Depth 1 ---
  11406. --- Inner Elaboration Phase, active level 1 (S1) ---
  11407. Firing monitor*world
  11408. -->
  11409. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11410. --- Change Working Memory (IE) ---
  11411. --- END Application Phase ---
  11412. --- Output Phase ---
  11413. ENV: Agent did: predict-no for direction R in state State-B
  11414. In State-B moving R
  11415. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11416. predict error 0
  11417. dir: dir isL
  11418. --- END Output Phase ---
  11419. /--- Input Phase ---
  11420. =>WM: (13773: I2 ^dir L)
  11421. =>WM: (13772: I2 ^reward 1)
  11422. =>WM: (13771: I2 ^see 0)
  11423. =>WM: (13770: N981 ^status complete)
  11424. <=WM: (13760: I2 ^dir R)
  11425. <=WM: (13759: I2 ^reward 1)
  11426. <=WM: (13758: I2 ^see 1)
  11427. =>WM: (13774: I2 ^level-1 R0-root)
  11428. <=WM: (13761: I2 ^level-1 R1-root)
  11429. --- END Input Phase ---
  11430. --- Proposal Phase ---
  11431. --- Inner Elaboration Phase, active level 1 (S1) ---
  11432. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  11433. -->
  11434. (S1 ^operator O1962 = -0.1984300550322165)
  11435. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  11436. -->
  11437. (S1 ^operator O1961 = 0.609089086334031)
  11438. Firing prefer*rvt*predict-no*H0*4*H1
  11439. -->
  11440. Firing prefer*rvt*predict-yes*H0*3*H1
  11441. -->
  11442. Firing elaborate*copy-see-to-output-link
  11443. -->
  11444. (I3 ^see 0 +)
  11445. Firing elaborate*reward*based*on*reward
  11446. -->
  11447. (R985 ^value 1 +)
  11448. (R1 ^reward R985 +)
  11449. Firing propose*predict-yes
  11450. -->
  11451. (O1963 ^name predict-yes +)
  11452. (S1 ^operator O1963 +)
  11453. Firing propose*predict-no
  11454. -->
  11455. (O1964 ^name predict-no +)
  11456. (S1 ^operator O1964 +)
  11457. Firing rl*prefer*rvt*predict-no*H0*4
  11458. -->
  11459. (S1 ^operator O1962 = 0.3145060369395525)
  11460. Firing rl*prefer*rvt*predict-yes*H0*3
  11461. -->
  11462. (S1 ^operator O1961 = 0.39076303591152)
  11463. Firing prefer*rvt*predict-yes*H0
  11464. -->
  11465. Firing prefer*rvt*predict-no*H0
  11466. -->
  11467. Firing elaborate*copy-dir-to-output-link
  11468. -->
  11469. (I3 ^dir L +)
  11470. inner elaboration loop at bottom goal.
  11471. Retracting elaborate*copy-see-to-output-link
  11472. -->
  11473. (I3 ^see 1 +)
  11474. Retracting propose*predict-no
  11475. -->
  11476. (O1962 ^name predict-no +)
  11477. (S1 ^operator O1962 +)
  11478. Retracting propose*predict-yes
  11479. -->
  11480. (O1961 ^name predict-yes +)
  11481. (S1 ^operator O1961 +)
  11482. Retracting elaborate*reward*based*on*reward
  11483. -->
  11484. (R984 ^value 1 +)
  11485. (R1 ^reward R984 +)
  11486. Retracting elaborate*copy-dir-to-output-link
  11487. -->
  11488. (I3 ^dir R +)
  11489. Retracting rl*prefer*rvt*predict-no*H0*6
  11490. -->
  11491. (S1 ^operator O1962 = 0.9999810901454903)
  11492. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  11493. -->
  11494. (S1 ^operator O1961 = -0.04253361215288998)
  11495. Retracting rl*prefer*rvt*predict-yes*H0*5
  11496. -->
  11497. (S1 ^operator O1961 = 0.1215976616761118)
  11498. =>WM: (13782: S1 ^operator O1964 +)
  11499. =>WM: (13781: S1 ^operator O1963 +)
  11500. =>WM: (13780: I3 ^dir L)
  11501. =>WM: (13779: O1964 ^name predict-no)
  11502. =>WM: (13778: O1963 ^name predict-yes)
  11503. =>WM: (13777: R985 ^value 1)
  11504. =>WM: (13776: R1 ^reward R985)
  11505. =>WM: (13775: I3 ^see 0)
  11506. <=WM: (13766: S1 ^operator O1961 +)
  11507. <=WM: (13767: S1 ^operator O1962 +)
  11508. <=WM: (13768: S1 ^operator O1962)
  11509. <=WM: (13752: I3 ^dir R)
  11510. <=WM: (13762: R1 ^reward R984)
  11511. <=WM: (13733: I3 ^see 1)
  11512. <=WM: (13765: O1962 ^name predict-no)
  11513. <=WM: (13764: O1961 ^name predict-yes)
  11514. <=WM: (13763: R984 ^value 1)
  11515. --- Inner Elaboration Phase, active level 1 (S1) ---
  11516. Firing prefer*rvt*predict-yes*H0
  11517. -->
  11518. Firing rl*prefer*rvt*predict-yes*H0*3
  11519. -->
  11520. (S1 ^operator O1963 = 0.39076303591152)
  11521. Firing prefer*rvt*predict-yes*H0*3*H1
  11522. -->
  11523. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  11524. -->
  11525. (S1 ^operator O1963 = 0.609089086334031)
  11526. Firing prefer*rvt*predict-no*H0
  11527. -->
  11528. Firing rl*prefer*rvt*predict-no*H0*4
  11529. -->
  11530. (S1 ^operator O1964 = 0.3145060369395525)
  11531. Firing prefer*rvt*predict-no*H0*4*H1
  11532. -->
  11533. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  11534. -->
  11535. (S1 ^operator O1964 = -0.1984300550322165)
  11536. inner elaboration loop at bottom goal.
  11537. Retracting rl*prefer*rvt*predict-no*H0*4
  11538. -->
  11539. (S1 ^operator O1962 = 0.3145060369395525)
  11540. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  11541. -->
  11542. (S1 ^operator O1962 = -0.1984300550322165)
  11543. Retracting rl*prefer*rvt*predict-yes*H0*3
  11544. -->
  11545. (S1 ^operator O1961 = 0.39076303591152)
  11546. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  11547. -->
  11548. (S1 ^operator O1961 = 0.609089086334031)
  11549. --- END Proposal Phase ---
  11550. --- Decision Phase ---
  11551. RL update rl*prefer*rvt*predict-no*H0*6 0.999981 0 0.999981 -> 0.999984 0 0.999984(R,m,v=1,0.937143,0.0592447)
  11552. =>WM: (13783: S1 ^operator O1963)
  11553. 982: O: O1963 (predict-yes)
  11554. --- END Decision Phase ---
  11555. --- Application Phase ---
  11556. --- Firing Productions (PE) For State At Depth 1 ---
  11557. --- Inner Elaboration Phase, active level 1 (S1) ---
  11558. Firing apply*operator
  11559. -->
  11560. (I3 ^predict-yes N982 + :O )
  11561. Firing apply*operator*complete
  11562. -->
  11563. (I3 ^predict-no N981 - :O )
  11564. inner elaboration loop at bottom goal.
  11565. --- Change Working Memory (PE) ---
  11566. =>WM: (13784: I3 ^predict-yes N982)
  11567. <=WM: (13770: N981 ^status complete)
  11568. <=WM: (13769: I3 ^predict-no N981)
  11569. --- Firing Productions (IE) For State At Depth 1 ---
  11570. --- Inner Elaboration Phase, active level 1 (S1) ---
  11571. Firing monitor*world
  11572. -->
  11573. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11574. --- Change Working Memory (IE) ---
  11575. --- END Application Phase ---
  11576. --- Output Phase ---
  11577. ENV: Agent did: predict-yes for direction L in state State-B
  11578. In State-B moving L
  11579. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11580. predict error 0
  11581. dir: dir isL
  11582. --- END Output Phase ---
  11583. |\--- Input Phase ---
  11584. =>WM: (13788: I2 ^dir L)
  11585. =>WM: (13787: I2 ^reward 1)
  11586. =>WM: (13786: I2 ^see 1)
  11587. =>WM: (13785: N982 ^status complete)
  11588. <=WM: (13773: I2 ^dir L)
  11589. <=WM: (13772: I2 ^reward 1)
  11590. <=WM: (13771: I2 ^see 0)
  11591. =>WM: (13789: I2 ^level-1 L1-root)
  11592. <=WM: (13774: I2 ^level-1 R0-root)
  11593. --- END Input Phase ---
  11594. --- Proposal Phase ---
  11595. --- Inner Elaboration Phase, active level 1 (S1) ---
  11596. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  11597. -->
  11598. (S1 ^operator O1963 = -0.2062723012911647)
  11599. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  11600. -->
  11601. (S1 ^operator O1964 = 0.6855414715988584)
  11602. Firing prefer*rvt*predict-no*H0*4*H1
  11603. -->
  11604. Firing prefer*rvt*predict-yes*H0*3*H1
  11605. -->
  11606. Firing elaborate*copy-see-to-output-link
  11607. -->
  11608. (I3 ^see 1 +)
  11609. Firing elaborate*reward*based*on*reward
  11610. -->
  11611. (R986 ^value 1 +)
  11612. (R1 ^reward R986 +)
  11613. Firing propose*predict-yes
  11614. -->
  11615. (O1965 ^name predict-yes +)
  11616. (S1 ^operator O1965 +)
  11617. Firing propose*predict-no
  11618. -->
  11619. (O1966 ^name predict-no +)
  11620. (S1 ^operator O1966 +)
  11621. Firing rl*prefer*rvt*predict-no*H0*4
  11622. -->
  11623. (S1 ^operator O1964 = 0.3145060369395525)
  11624. Firing rl*prefer*rvt*predict-yes*H0*3
  11625. -->
  11626. (S1 ^operator O1963 = 0.39076303591152)
  11627. Firing prefer*rvt*predict-yes*H0
  11628. -->
  11629. Firing prefer*rvt*predict-no*H0
  11630. -->
  11631. Firing elaborate*copy-dir-to-output-link
  11632. -->
  11633. (I3 ^dir L +)
  11634. inner elaboration loop at bottom goal.
  11635. Retracting elaborate*copy-see-to-output-link
  11636. -->
  11637. (I3 ^see 0 +)
  11638. Retracting propose*predict-no
  11639. -->
  11640. (O1964 ^name predict-no +)
  11641. (S1 ^operator O1964 +)
  11642. Retracting propose*predict-yes
  11643. -->
  11644. (O1963 ^name predict-yes +)
  11645. (S1 ^operator O1963 +)
  11646. Retracting elaborate*reward*based*on*reward
  11647. -->
  11648. (R985 ^value 1 +)
  11649. (R1 ^reward R985 +)
  11650. Retracting elaborate*copy-dir-to-output-link
  11651. -->
  11652. (I3 ^dir L +)
  11653. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  11654. -->
  11655. (S1 ^operator O1964 = -0.1984300550322165)
  11656. Retracting rl*prefer*rvt*predict-no*H0*4
  11657. -->
  11658. (S1 ^operator O1964 = 0.3145060369395525)
  11659. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  11660. -->
  11661. (S1 ^operator O1963 = 0.609089086334031)
  11662. Retracting rl*prefer*rvt*predict-yes*H0*3
  11663. -->
  11664. (S1 ^operator O1963 = 0.39076303591152)
  11665. =>WM: (13796: S1 ^operator O1966 +)
  11666. =>WM: (13795: S1 ^operator O1965 +)
  11667. =>WM: (13794: O1966 ^name predict-no)
  11668. =>WM: (13793: O1965 ^name predict-yes)
  11669. =>WM: (13792: R986 ^value 1)
  11670. =>WM: (13791: R1 ^reward R986)
  11671. =>WM: (13790: I3 ^see 1)
  11672. <=WM: (13781: S1 ^operator O1963 +)
  11673. <=WM: (13783: S1 ^operator O1963)
  11674. <=WM: (13782: S1 ^operator O1964 +)
  11675. <=WM: (13776: R1 ^reward R985)
  11676. <=WM: (13775: I3 ^see 0)
  11677. <=WM: (13779: O1964 ^name predict-no)
  11678. <=WM: (13778: O1963 ^name predict-yes)
  11679. <=WM: (13777: R985 ^value 1)
  11680. --- Inner Elaboration Phase, active level 1 (S1) ---
  11681. Firing prefer*rvt*predict-yes*H0
  11682. -->
  11683. Firing rl*prefer*rvt*predict-yes*H0*3
  11684. -->
  11685. (S1 ^operator O1965 = 0.39076303591152)
  11686. Firing prefer*rvt*predict-yes*H0*3*H1
  11687. -->
  11688. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  11689. -->
  11690. (S1 ^operator O1965 = -0.2062723012911647)
  11691. Firing prefer*rvt*predict-no*H0
  11692. -->
  11693. Firing rl*prefer*rvt*predict-no*H0*4
  11694. -->
  11695. (S1 ^operator O1966 = 0.3145060369395525)
  11696. Firing prefer*rvt*predict-no*H0*4*H1
  11697. -->
  11698. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  11699. -->
  11700. (S1 ^operator O1966 = 0.6855414715988584)
  11701. inner elaboration loop at bottom goal.
  11702. Retracting rl*prefer*rvt*predict-no*H0*4
  11703. -->
  11704. (S1 ^operator O1964 = 0.3145060369395525)
  11705. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  11706. -->
  11707. (S1 ^operator O1964 = 0.6855414715988584)
  11708. Retracting rl*prefer*rvt*predict-yes*H0*3
  11709. -->
  11710. (S1 ^operator O1963 = 0.39076303591152)
  11711. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  11712. -->
  11713. (S1 ^operator O1963 = -0.2062723012911647)
  11714. --- END Proposal Phase ---
  11715. --- Decision Phase ---
  11716. RL update rl*prefer*rvt*predict-yes*H0*3 0.472311 -0.0815481 0.390763 -> 0.472322 -0.0815463 0.390775(R,m,v=1,0.943038,0.0540595)
  11717. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527563 0.0815262 0.609089 -> 0.527575 0.0815283 0.609103(R,m,v=1,1,0)
  11718. =>WM: (13797: S1 ^operator O1966)
  11719. 983: O: O1966 (predict-no)
  11720. --- END Decision Phase ---
  11721. --- Application Phase ---
  11722. --- Firing Productions (PE) For State At Depth 1 ---
  11723. --- Inner Elaboration Phase, active level 1 (S1) ---
  11724. Firing apply*operator
  11725. -->
  11726. (I3 ^predict-no N983 + :O )
  11727. Firing apply*operator*complete
  11728. -->
  11729. (I3 ^predict-yes N982 - :O )
  11730. inner elaboration loop at bottom goal.
  11731. --- Change Working Memory (PE) ---
  11732. =>WM: (13798: I3 ^predict-no N983)
  11733. <=WM: (13785: N982 ^status complete)
  11734. <=WM: (13784: I3 ^predict-yes N982)
  11735. --- Firing Productions (IE) For State At Depth 1 ---
  11736. --- Inner Elaboration Phase, active level 1 (S1) ---
  11737. Firing monitor*world
  11738. -->
  11739. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11740. --- Change Working Memory (IE) ---
  11741. --- END Application Phase ---
  11742. --- Output Phase ---
  11743. ENV: Agent did: predict-no for direction L in state State-A
  11744. In State-A moving L
  11745. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  11746. predict error 0
  11747. dir: dir isR
  11748. --- END Output Phase ---
  11749. -/--- Input Phase ---
  11750. =>WM: (13802: I2 ^dir R)
  11751. =>WM: (13801: I2 ^reward 1)
  11752. =>WM: (13800: I2 ^see 0)
  11753. =>WM: (13799: N983 ^status complete)
  11754. <=WM: (13788: I2 ^dir L)
  11755. <=WM: (13787: I2 ^reward 1)
  11756. <=WM: (13786: I2 ^see 1)
  11757. =>WM: (13803: I2 ^level-1 L0-root)
  11758. <=WM: (13789: I2 ^level-1 L1-root)
  11759. --- END Input Phase ---
  11760. --- Proposal Phase ---
  11761. --- Inner Elaboration Phase, active level 1 (S1) ---
  11762. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  11763. -->
  11764. (S1 ^operator O1965 = 0.8783936611550894)
  11765. Firing prefer*rvt*predict-yes*H0*5*H1
  11766. -->
  11767. Firing elaborate*copy-see-to-output-link
  11768. -->
  11769. (I3 ^see 0 +)
  11770. Firing elaborate*reward*based*on*reward
  11771. -->
  11772. (R987 ^value 1 +)
  11773. (R1 ^reward R987 +)
  11774. Firing propose*predict-yes
  11775. -->
  11776. (O1967 ^name predict-yes +)
  11777. (S1 ^operator O1967 +)
  11778. Firing propose*predict-no
  11779. -->
  11780. (O1968 ^name predict-no +)
  11781. (S1 ^operator O1968 +)
  11782. Firing rl*prefer*rvt*predict-no*H0*6
  11783. -->
  11784. (S1 ^operator O1966 = 0.9999841575438704)
  11785. Firing rl*prefer*rvt*predict-yes*H0*5
  11786. -->
  11787. (S1 ^operator O1965 = 0.1215976616761118)
  11788. Firing prefer*rvt*predict-yes*H0
  11789. -->
  11790. Firing prefer*rvt*predict-no*H0
  11791. -->
  11792. Firing elaborate*copy-dir-to-output-link
  11793. -->
  11794. (I3 ^dir R +)
  11795. inner elaboration loop at bottom goal.
  11796. Retracting elaborate*copy-see-to-output-link
  11797. -->
  11798. (I3 ^see 1 +)
  11799. Retracting propose*predict-no
  11800. -->
  11801. (O1966 ^name predict-no +)
  11802. (S1 ^operator O1966 +)
  11803. Retracting propose*predict-yes
  11804. -->
  11805. (O1965 ^name predict-yes +)
  11806. (S1 ^operator O1965 +)
  11807. Retracting elaborate*reward*based*on*reward
  11808. -->
  11809. (R986 ^value 1 +)
  11810. (R1 ^reward R986 +)
  11811. Retracting elaborate*copy-dir-to-output-link
  11812. -->
  11813. (I3 ^dir L +)
  11814. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  11815. -->
  11816. (S1 ^operator O1966 = 0.6855414715988584)
  11817. Retracting rl*prefer*rvt*predict-no*H0*4
  11818. -->
  11819. (S1 ^operator O1966 = 0.3145060369395525)
  11820. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  11821. -->
  11822. (S1 ^operator O1965 = -0.2062723012911647)
  11823. Retracting rl*prefer*rvt*predict-yes*H0*3
  11824. -->
  11825. (S1 ^operator O1965 = 0.390775231823802)
  11826. =>WM: (13811: S1 ^operator O1968 +)
  11827. =>WM: (13810: S1 ^operator O1967 +)
  11828. =>WM: (13809: I3 ^dir R)
  11829. =>WM: (13808: O1968 ^name predict-no)
  11830. =>WM: (13807: O1967 ^name predict-yes)
  11831. =>WM: (13806: R987 ^value 1)
  11832. =>WM: (13805: R1 ^reward R987)
  11833. =>WM: (13804: I3 ^see 0)
  11834. <=WM: (13795: S1 ^operator O1965 +)
  11835. <=WM: (13796: S1 ^operator O1966 +)
  11836. <=WM: (13797: S1 ^operator O1966)
  11837. <=WM: (13780: I3 ^dir L)
  11838. <=WM: (13791: R1 ^reward R986)
  11839. <=WM: (13790: I3 ^see 1)
  11840. <=WM: (13794: O1966 ^name predict-no)
  11841. <=WM: (13793: O1965 ^name predict-yes)
  11842. <=WM: (13792: R986 ^value 1)
  11843. --- Inner Elaboration Phase, active level 1 (S1) ---
  11844. Firing prefer*rvt*predict-yes*H0
  11845. -->
  11846. Firing rl*prefer*rvt*predict-yes*H0*5
  11847. -->
  11848. (S1 ^operator O1967 = 0.1215976616761118)
  11849. Firing prefer*rvt*predict-yes*H0*5*H1
  11850. -->
  11851. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  11852. -->
  11853. (S1 ^operator O1967 = 0.8783936611550894)
  11854. Firing prefer*rvt*predict-no*H0
  11855. -->
  11856. Firing rl*prefer*rvt*predict-no*H0*6
  11857. -->
  11858. (S1 ^operator O1968 = 0.9999841575438704)
  11859. inner elaboration loop at bottom goal.
  11860. Retracting rl*prefer*rvt*predict-no*H0*6
  11861. -->
  11862. (S1 ^operator O1966 = 0.9999841575438704)
  11863. Retracting rl*prefer*rvt*predict-yes*H0*5
  11864. -->
  11865. (S1 ^operator O1965 = 0.1215976616761118)
  11866. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  11867. -->
  11868. (S1 ^operator O1965 = 0.8783936611550894)
  11869. --- END Proposal Phase ---
  11870. --- Decision Phase ---
  11871. RL update rl*prefer*rvt*predict-no*H0*4 0.478554 -0.164048 0.314506 -> 0.478551 -0.164048 0.314502(R,m,v=1,0.921569,0.0727554)
  11872. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521489 0.164052 0.685541 -> 0.521485 0.164052 0.685537(R,m,v=1,1,0)
  11873. =>WM: (13812: S1 ^operator O1967)
  11874. 984: O: O1967 (predict-yes)
  11875. --- END Decision Phase ---
  11876. --- Application Phase ---
  11877. --- Firing Productions (PE) For State At Depth 1 ---
  11878. --- Inner Elaboration Phase, active level 1 (S1) ---
  11879. Firing apply*operator
  11880. -->
  11881. (I3 ^predict-yes N984 + :O )
  11882. Firing apply*operator*complete
  11883. -->
  11884. (I3 ^predict-no N983 - :O )
  11885. inner elaboration loop at bottom goal.
  11886. --- Change Working Memory (PE) ---
  11887. =>WM: (13813: I3 ^predict-yes N984)
  11888. <=WM: (13799: N983 ^status complete)
  11889. <=WM: (13798: I3 ^predict-no N983)
  11890. --- Firing Productions (IE) For State At Depth 1 ---
  11891. --- Inner Elaboration Phase, active level 1 (S1) ---
  11892. Firing monitor*world
  11893. -->
  11894. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11895. --- Change Working Memory (IE) ---
  11896. --- END Application Phase ---
  11897. --- Output Phase ---
  11898. ENV: Agent did: predict-yes for direction R in state State-A
  11899. In State-A moving R
  11900. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  11901. predict error 0
  11902. dir: dir isU
  11903. --- END Output Phase ---
  11904. |\---- Input Phase ---
  11905. =>WM: (13817: I2 ^dir U)
  11906. =>WM: (13816: I2 ^reward 1)
  11907. =>WM: (13815: I2 ^see 1)
  11908. =>WM: (13814: N984 ^status complete)
  11909. <=WM: (13802: I2 ^dir R)
  11910. <=WM: (13801: I2 ^reward 1)
  11911. <=WM: (13800: I2 ^see 0)
  11912. =>WM: (13818: I2 ^level-1 R1-root)
  11913. <=WM: (13803: I2 ^level-1 L0-root)
  11914. --- END Input Phase ---
  11915. --- Proposal Phase ---
  11916. --- Inner Elaboration Phase, active level 1 (S1) ---
  11917. Firing elaborate*copy-see-to-output-link
  11918. -->
  11919. (I3 ^see 1 +)
  11920. Firing elaborate*reward*based*on*reward
  11921. -->
  11922. (R988 ^value 1 +)
  11923. (R1 ^reward R988 +)
  11924. Firing propose*predict-yes
  11925. -->
  11926. (O1969 ^name predict-yes +)
  11927. (S1 ^operator O1969 +)
  11928. Firing propose*predict-no
  11929. -->
  11930. (O1970 ^name predict-no +)
  11931. (S1 ^operator O1970 +)
  11932. Firing rl*prefer*rvt*predict-no*H0*2
  11933. -->
  11934. (S1 ^operator O1968 = 1.)
  11935. Firing rl*prefer*rvt*predict-yes*H0*1
  11936. -->
  11937. (S1 ^operator O1967 = 0.)
  11938. Firing prefer*rvt*predict-yes*H0
  11939. -->
  11940. Firing prefer*rvt*predict-no*H0
  11941. -->
  11942. Firing elaborate*copy-dir-to-output-link
  11943. -->
  11944. (I3 ^dir U +)
  11945. inner elaboration loop at bottom goal.
  11946. Retracting elaborate*copy-see-to-output-link
  11947. -->
  11948. (I3 ^see 0 +)
  11949. Retracting propose*predict-no
  11950. -->
  11951. (O1968 ^name predict-no +)
  11952. (S1 ^operator O1968 +)
  11953. Retracting propose*predict-yes
  11954. -->
  11955. (O1967 ^name predict-yes +)
  11956. (S1 ^operator O1967 +)
  11957. Retracting elaborate*reward*based*on*reward
  11958. -->
  11959. (R987 ^value 1 +)
  11960. (R1 ^reward R987 +)
  11961. Retracting elaborate*copy-dir-to-output-link
  11962. -->
  11963. (I3 ^dir R +)
  11964. Retracting rl*prefer*rvt*predict-no*H0*6
  11965. -->
  11966. (S1 ^operator O1968 = 0.9999841575438704)
  11967. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  11968. -->
  11969. (S1 ^operator O1967 = 0.8783936611550894)
  11970. Retracting rl*prefer*rvt*predict-yes*H0*5
  11971. -->
  11972. (S1 ^operator O1967 = 0.1215976616761118)
  11973. =>WM: (13826: S1 ^operator O1970 +)
  11974. =>WM: (13825: S1 ^operator O1969 +)
  11975. =>WM: (13824: I3 ^dir U)
  11976. =>WM: (13823: O1970 ^name predict-no)
  11977. =>WM: (13822: O1969 ^name predict-yes)
  11978. =>WM: (13821: R988 ^value 1)
  11979. =>WM: (13820: R1 ^reward R988)
  11980. =>WM: (13819: I3 ^see 1)
  11981. <=WM: (13810: S1 ^operator O1967 +)
  11982. <=WM: (13812: S1 ^operator O1967)
  11983. <=WM: (13811: S1 ^operator O1968 +)
  11984. <=WM: (13809: I3 ^dir R)
  11985. <=WM: (13805: R1 ^reward R987)
  11986. <=WM: (13804: I3 ^see 0)
  11987. <=WM: (13808: O1968 ^name predict-no)
  11988. <=WM: (13807: O1967 ^name predict-yes)
  11989. <=WM: (13806: R987 ^value 1)
  11990. --- Inner Elaboration Phase, active level 1 (S1) ---
  11991. Firing prefer*rvt*predict-yes*H0
  11992. -->
  11993. Firing rl*prefer*rvt*predict-yes*H0*1
  11994. -->
  11995. (S1 ^operator O1969 = 0.)
  11996. Firing prefer*rvt*predict-no*H0
  11997. -->
  11998. Firing rl*prefer*rvt*predict-no*H0*2
  11999. -->
  12000. (S1 ^operator O1970 = 1.)
  12001. inner elaboration loop at bottom goal.
  12002. Retracting rl*prefer*rvt*predict-no*H0*2
  12003. -->
  12004. (S1 ^operator O1968 = 1.)
  12005. Retracting rl*prefer*rvt*predict-yes*H0*1
  12006. -->
  12007. (S1 ^operator O1967 = 0.)
  12008. --- END Proposal Phase ---
  12009. --- Decision Phase ---
  12010. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.862857,0.119015)
  12011. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465469 0.412925 0.878394 -> 0.46547 0.412925 0.878394(R,m,v=1,1,0)
  12012. =>WM: (13827: S1 ^operator O1970)
  12013. 985: O: O1970 (predict-no)
  12014. --- END Decision Phase ---
  12015. --- Application Phase ---
  12016. --- Firing Productions (PE) For State At Depth 1 ---
  12017. --- Inner Elaboration Phase, active level 1 (S1) ---
  12018. Firing apply*operator
  12019. -->
  12020. (I3 ^predict-no N985 + :O )
  12021. Firing apply*operator*complete
  12022. -->
  12023. (I3 ^predict-yes N984 - :O )
  12024. inner elaboration loop at bottom goal.
  12025. --- Change Working Memory (PE) ---
  12026. =>WM: (13828: I3 ^predict-no N985)
  12027. <=WM: (13814: N984 ^status complete)
  12028. <=WM: (13813: I3 ^predict-yes N984)
  12029. --- Firing Productions (IE) For State At Depth 1 ---
  12030. --- Inner Elaboration Phase, active level 1 (S1) ---
  12031. Firing monitor*world
  12032. -->
  12033. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12034. --- Change Working Memory (IE) ---
  12035. --- END Application Phase ---
  12036. --- Output Phase ---
  12037. ENV: Agent did: predict-no for direction U in state State-B
  12038. In State-B moving U
  12039. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12040. predict error 0
  12041. dir: dir isL
  12042. --- END Output Phase ---
  12043. /|--- Input Phase ---
  12044. =>WM: (13832: I2 ^dir L)
  12045. =>WM: (13831: I2 ^reward 1)
  12046. =>WM: (13830: I2 ^see 0)
  12047. =>WM: (13829: N985 ^status complete)
  12048. <=WM: (13817: I2 ^dir U)
  12049. <=WM: (13816: I2 ^reward 1)
  12050. <=WM: (13815: I2 ^see 1)
  12051. =>WM: (13833: I2 ^level-1 R1-root)
  12052. <=WM: (13818: I2 ^level-1 R1-root)
  12053. --- END Input Phase ---
  12054. --- Proposal Phase ---
  12055. --- Inner Elaboration Phase, active level 1 (S1) ---
  12056. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  12057. -->
  12058. (S1 ^operator O1970 = -0.168718511744511)
  12059. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  12060. -->
  12061. (S1 ^operator O1969 = 0.6093180204125221)
  12062. Firing prefer*rvt*predict-no*H0*4*H1
  12063. -->
  12064. Firing prefer*rvt*predict-yes*H0*3*H1
  12065. -->
  12066. Firing elaborate*copy-see-to-output-link
  12067. -->
  12068. (I3 ^see 0 +)
  12069. Firing elaborate*reward*based*on*reward
  12070. -->
  12071. (R989 ^value 1 +)
  12072. (R1 ^reward R989 +)
  12073. Firing propose*predict-yes
  12074. -->
  12075. (O1971 ^name predict-yes +)
  12076. (S1 ^operator O1971 +)
  12077. Firing propose*predict-no
  12078. -->
  12079. (O1972 ^name predict-no +)
  12080. (S1 ^operator O1972 +)
  12081. Firing rl*prefer*rvt*predict-no*H0*4
  12082. -->
  12083. (S1 ^operator O1970 = 0.3145020978774952)
  12084. Firing rl*prefer*rvt*predict-yes*H0*3
  12085. -->
  12086. (S1 ^operator O1969 = 0.390775231823802)
  12087. Firing prefer*rvt*predict-yes*H0
  12088. -->
  12089. Firing prefer*rvt*predict-no*H0
  12090. -->
  12091. Firing elaborate*copy-dir-to-output-link
  12092. -->
  12093. (I3 ^dir L +)
  12094. inner elaboration loop at bottom goal.
  12095. Retracting elaborate*copy-see-to-output-link
  12096. -->
  12097. (I3 ^see 1 +)
  12098. Retracting propose*predict-no
  12099. -->
  12100. (O1970 ^name predict-no +)
  12101. (S1 ^operator O1970 +)
  12102. Retracting propose*predict-yes
  12103. -->
  12104. (O1969 ^name predict-yes +)
  12105. (S1 ^operator O1969 +)
  12106. Retracting elaborate*reward*based*on*reward
  12107. -->
  12108. (R988 ^value 1 +)
  12109. (R1 ^reward R988 +)
  12110. Retracting elaborate*copy-dir-to-output-link
  12111. -->
  12112. (I3 ^dir U +)
  12113. Retracting rl*prefer*rvt*predict-no*H0*2
  12114. -->
  12115. (S1 ^operator O1970 = 1.)
  12116. Retracting rl*prefer*rvt*predict-yes*H0*1
  12117. -->
  12118. (S1 ^operator O1969 = 0.)
  12119. =>WM: (13841: S1 ^operator O1972 +)
  12120. =>WM: (13840: S1 ^operator O1971 +)
  12121. =>WM: (13839: I3 ^dir L)
  12122. =>WM: (13838: O1972 ^name predict-no)
  12123. =>WM: (13837: O1971 ^name predict-yes)
  12124. =>WM: (13836: R989 ^value 1)
  12125. =>WM: (13835: R1 ^reward R989)
  12126. =>WM: (13834: I3 ^see 0)
  12127. <=WM: (13825: S1 ^operator O1969 +)
  12128. <=WM: (13826: S1 ^operator O1970 +)
  12129. <=WM: (13827: S1 ^operator O1970)
  12130. <=WM: (13824: I3 ^dir U)
  12131. <=WM: (13820: R1 ^reward R988)
  12132. <=WM: (13819: I3 ^see 1)
  12133. <=WM: (13823: O1970 ^name predict-no)
  12134. <=WM: (13822: O1969 ^name predict-yes)
  12135. <=WM: (13821: R988 ^value 1)
  12136. --- Inner Elaboration Phase, active level 1 (S1) ---
  12137. Firing prefer*rvt*predict-yes*H0
  12138. -->
  12139. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  12140. -->
  12141. (S1 ^operator O1971 = 0.6093180204125221)
  12142. Firing rl*prefer*rvt*predict-yes*H0*3
  12143. -->
  12144. (S1 ^operator O1971 = 0.390775231823802)
  12145. Firing prefer*rvt*predict-yes*H0*3*H1
  12146. -->
  12147. Firing prefer*rvt*predict-no*H0
  12148. -->
  12149. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  12150. -->
  12151. (S1 ^operator O1972 = -0.168718511744511)
  12152. Firing rl*prefer*rvt*predict-no*H0*4
  12153. -->
  12154. (S1 ^operator O1972 = 0.3145020978774952)
  12155. Firing prefer*rvt*predict-no*H0*4*H1
  12156. -->
  12157. inner elaboration loop at bottom goal.
  12158. Retracting rl*prefer*rvt*predict-no*H0*4
  12159. -->
  12160. (S1 ^operator O1970 = 0.3145020978774952)
  12161. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  12162. -->
  12163. (S1 ^operator O1970 = -0.168718511744511)
  12164. Retracting rl*prefer*rvt*predict-yes*H0*3
  12165. -->
  12166. (S1 ^operator O1969 = 0.390775231823802)
  12167. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  12168. -->
  12169. (S1 ^operator O1969 = 0.6093180204125221)
  12170. --- END Proposal Phase ---
  12171. --- Decision Phase ---
  12172. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12173. =>WM: (13842: S1 ^operator O1971)
  12174. 986: O: O1971 (predict-yes)
  12175. --- END Decision Phase ---
  12176. --- Application Phase ---
  12177. --- Firing Productions (PE) For State At Depth 1 ---
  12178. --- Inner Elaboration Phase, active level 1 (S1) ---
  12179. Firing apply*operator
  12180. -->
  12181. (I3 ^predict-yes N986 + :O )
  12182. Firing apply*operator*complete
  12183. -->
  12184. (I3 ^predict-no N985 - :O )
  12185. inner elaboration loop at bottom goal.
  12186. --- Change Working Memory (PE) ---
  12187. =>WM: (13843: I3 ^predict-yes N986)
  12188. <=WM: (13829: N985 ^status complete)
  12189. <=WM: (13828: I3 ^predict-no N985)
  12190. --- Firing Productions (IE) For State At Depth 1 ---
  12191. --- Inner Elaboration Phase, active level 1 (S1) ---
  12192. Firing monitor*world
  12193. -->
  12194. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12195. --- Change Working Memory (IE) ---
  12196. --- END Application Phase ---
  12197. --- Output Phase ---
  12198. ENV: Agent did: predict-yes for direction L in state State-B
  12199. In State-B moving L
  12200. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  12201. predict error 0
  12202. dir: dir isL
  12203. --- END Output Phase ---
  12204. \-/--- Input Phase ---
  12205. =>WM: (13847: I2 ^dir L)
  12206. =>WM: (13846: I2 ^reward 1)
  12207. =>WM: (13845: I2 ^see 1)
  12208. =>WM: (13844: N986 ^status complete)
  12209. <=WM: (13832: I2 ^dir L)
  12210. <=WM: (13831: I2 ^reward 1)
  12211. <=WM: (13830: I2 ^see 0)
  12212. =>WM: (13848: I2 ^level-1 L1-root)
  12213. <=WM: (13833: I2 ^level-1 R1-root)
  12214. --- END Input Phase ---
  12215. --- Proposal Phase ---
  12216. --- Inner Elaboration Phase, active level 1 (S1) ---
  12217. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  12218. -->
  12219. (S1 ^operator O1971 = -0.2062723012911647)
  12220. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  12221. -->
  12222. (S1 ^operator O1972 = 0.6855369815787629)
  12223. Firing prefer*rvt*predict-no*H0*4*H1
  12224. -->
  12225. Firing prefer*rvt*predict-yes*H0*3*H1
  12226. -->
  12227. Firing elaborate*copy-see-to-output-link
  12228. -->
  12229. (I3 ^see 1 +)
  12230. Firing elaborate*reward*based*on*reward
  12231. -->
  12232. (R990 ^value 1 +)
  12233. (R1 ^reward R990 +)
  12234. Firing propose*predict-yes
  12235. -->
  12236. (O1973 ^name predict-yes +)
  12237. (S1 ^operator O1973 +)
  12238. Firing propose*predict-no
  12239. -->
  12240. (O1974 ^name predict-no +)
  12241. (S1 ^operator O1974 +)
  12242. Firing rl*prefer*rvt*predict-no*H0*4
  12243. -->
  12244. (S1 ^operator O1972 = 0.3145020978774952)
  12245. Firing rl*prefer*rvt*predict-yes*H0*3
  12246. -->
  12247. (S1 ^operator O1971 = 0.390775231823802)
  12248. Firing prefer*rvt*predict-yes*H0
  12249. -->
  12250. Firing prefer*rvt*predict-no*H0
  12251. -->
  12252. Firing elaborate*copy-dir-to-output-link
  12253. -->
  12254. (I3 ^dir L +)
  12255. inner elaboration loop at bottom goal.
  12256. Retracting elaborate*copy-see-to-output-link
  12257. -->
  12258. (I3 ^see 0 +)
  12259. Retracting propose*predict-no
  12260. -->
  12261. (O1972 ^name predict-no +)
  12262. (S1 ^operator O1972 +)
  12263. Retracting propose*predict-yes
  12264. -->
  12265. (O1971 ^name predict-yes +)
  12266. (S1 ^operator O1971 +)
  12267. Retracting elaborate*reward*based*on*reward
  12268. -->
  12269. (R989 ^value 1 +)
  12270. (R1 ^reward R989 +)
  12271. Retracting elaborate*copy-dir-to-output-link
  12272. -->
  12273. (I3 ^dir L +)
  12274. Retracting rl*prefer*rvt*predict-no*H0*4
  12275. -->
  12276. (S1 ^operator O1972 = 0.3145020978774952)
  12277. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  12278. -->
  12279. (S1 ^operator O1972 = -0.168718511744511)
  12280. Retracting rl*prefer*rvt*predict-yes*H0*3
  12281. -->
  12282. (S1 ^operator O1971 = 0.390775231823802)
  12283. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  12284. -->
  12285. (S1 ^operator O1971 = 0.6093180204125221)
  12286. =>WM: (13855: S1 ^operator O1974 +)
  12287. =>WM: (13854: S1 ^operator O1973 +)
  12288. =>WM: (13853: O1974 ^name predict-no)
  12289. =>WM: (13852: O1973 ^name predict-yes)
  12290. =>WM: (13851: R990 ^value 1)
  12291. =>WM: (13850: R1 ^reward R990)
  12292. =>WM: (13849: I3 ^see 1)
  12293. <=WM: (13840: S1 ^operator O1971 +)
  12294. <=WM: (13842: S1 ^operator O1971)
  12295. <=WM: (13841: S1 ^operator O1972 +)
  12296. <=WM: (13835: R1 ^reward R989)
  12297. <=WM: (13834: I3 ^see 0)
  12298. <=WM: (13838: O1972 ^name predict-no)
  12299. <=WM: (13837: O1971 ^name predict-yes)
  12300. <=WM: (13836: R989 ^value 1)
  12301. --- Inner Elaboration Phase, active level 1 (S1) ---
  12302. Firing prefer*rvt*predict-yes*H0
  12303. -->
  12304. Firing rl*prefer*rvt*predict-yes*H0*3
  12305. -->
  12306. (S1 ^operator O1973 = 0.390775231823802)
  12307. Firing prefer*rvt*predict-yes*H0*3*H1
  12308. -->
  12309. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  12310. -->
  12311. (S1 ^operator O1973 = -0.2062723012911647)
  12312. Firing prefer*rvt*predict-no*H0
  12313. -->
  12314. Firing rl*prefer*rvt*predict-no*H0*4
  12315. -->
  12316. (S1 ^operator O1974 = 0.3145020978774952)
  12317. Firing prefer*rvt*predict-no*H0*4*H1
  12318. -->
  12319. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  12320. -->
  12321. (S1 ^operator O1974 = 0.6855369815787629)
  12322. inner elaboration loop at bottom goal.
  12323. Retracting rl*prefer*rvt*predict-no*H0*4
  12324. -->
  12325. (S1 ^operator O1972 = 0.3145020978774952)
  12326. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  12327. -->
  12328. (S1 ^operator O1972 = 0.6855369815787629)
  12329. Retracting rl*prefer*rvt*predict-yes*H0*3
  12330. -->
  12331. (S1 ^operator O1971 = 0.390775231823802)
  12332. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  12333. -->
  12334. (S1 ^operator O1971 = -0.2062723012911647)
  12335. --- END Proposal Phase ---
  12336. --- Decision Phase ---
  12337. RL update rl*prefer*rvt*predict-yes*H0*3 0.472322 -0.0815463 0.390775 -> 0.472315 -0.0815474 0.390768(R,m,v=1,0.943396,0.0537378)
  12338. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527758 0.0815601 0.609318 -> 0.52775 0.0815588 0.609309(R,m,v=1,1,0)
  12339. =>WM: (13856: S1 ^operator O1974)
  12340. 987: O: O1974 (predict-no)
  12341. --- END Decision Phase ---
  12342. --- Application Phase ---
  12343. --- Firing Productions (PE) For State At Depth 1 ---
  12344. --- Inner Elaboration Phase, active level 1 (S1) ---
  12345. Firing apply*operator
  12346. -->
  12347. (I3 ^predict-no N987 + :O )
  12348. Firing apply*operator*complete
  12349. -->
  12350. (I3 ^predict-yes N986 - :O )
  12351. inner elaboration loop at bottom goal.
  12352. --- Change Working Memory (PE) ---
  12353. =>WM: (13857: I3 ^predict-no N987)
  12354. <=WM: (13844: N986 ^status complete)
  12355. <=WM: (13843: I3 ^predict-yes N986)
  12356. --- Firing Productions (IE) For State At Depth 1 ---
  12357. --- Inner Elaboration Phase, active level 1 (S1) ---
  12358. Firing monitor*world
  12359. -->
  12360. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12361. --- Change Working Memory (IE) ---
  12362. --- END Application Phase ---
  12363. --- Output Phase ---
  12364. ENV: Agent did: predict-no for direction L in state State-A
  12365. In State-A moving L
  12366. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  12367. predict error 0
  12368. dir: dir isR
  12369. --- END Output Phase ---
  12370. |\---- Input Phase ---
  12371. =>WM: (13861: I2 ^dir R)
  12372. =>WM: (13860: I2 ^reward 1)
  12373. =>WM: (13859: I2 ^see 0)
  12374. =>WM: (13858: N987 ^status complete)
  12375. <=WM: (13847: I2 ^dir L)
  12376. <=WM: (13846: I2 ^reward 1)
  12377. <=WM: (13845: I2 ^see 1)
  12378. =>WM: (13862: I2 ^level-1 L0-root)
  12379. <=WM: (13848: I2 ^level-1 L1-root)
  12380. --- END Input Phase ---
  12381. --- Proposal Phase ---
  12382. --- Inner Elaboration Phase, active level 1 (S1) ---
  12383. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  12384. -->
  12385. (S1 ^operator O1973 = 0.8783944900614931)
  12386. Firing prefer*rvt*predict-yes*H0*5*H1
  12387. -->
  12388. Firing elaborate*copy-see-to-output-link
  12389. -->
  12390. (I3 ^see 0 +)
  12391. Firing elaborate*reward*based*on*reward
  12392. -->
  12393. (R991 ^value 1 +)
  12394. (R1 ^reward R991 +)
  12395. Firing propose*predict-yes
  12396. -->
  12397. (O1975 ^name predict-yes +)
  12398. (S1 ^operator O1975 +)
  12399. Firing propose*predict-no
  12400. -->
  12401. (O1976 ^name predict-no +)
  12402. (S1 ^operator O1976 +)
  12403. Firing rl*prefer*rvt*predict-no*H0*6
  12404. -->
  12405. (S1 ^operator O1974 = 0.9999841575438704)
  12406. Firing rl*prefer*rvt*predict-yes*H0*5
  12407. -->
  12408. (S1 ^operator O1973 = 0.1215983654449722)
  12409. Firing prefer*rvt*predict-yes*H0
  12410. -->
  12411. Firing prefer*rvt*predict-no*H0
  12412. -->
  12413. Firing elaborate*copy-dir-to-output-link
  12414. -->
  12415. (I3 ^dir R +)
  12416. inner elaboration loop at bottom goal.
  12417. Retracting elaborate*copy-see-to-output-link
  12418. -->
  12419. (I3 ^see 1 +)
  12420. Retracting propose*predict-no
  12421. -->
  12422. (O1974 ^name predict-no +)
  12423. (S1 ^operator O1974 +)
  12424. Retracting propose*predict-yes
  12425. -->
  12426. (O1973 ^name predict-yes +)
  12427. (S1 ^operator O1973 +)
  12428. Retracting elaborate*reward*based*on*reward
  12429. -->
  12430. (R990 ^value 1 +)
  12431. (R1 ^reward R990 +)
  12432. Retracting elaborate*copy-dir-to-output-link
  12433. -->
  12434. (I3 ^dir L +)
  12435. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  12436. -->
  12437. (S1 ^operator O1974 = 0.6855369815787629)
  12438. Retracting rl*prefer*rvt*predict-no*H0*4
  12439. -->
  12440. (S1 ^operator O1974 = 0.3145020978774952)
  12441. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  12442. -->
  12443. (S1 ^operator O1973 = -0.2062723012911647)
  12444. Retracting rl*prefer*rvt*predict-yes*H0*3
  12445. -->
  12446. (S1 ^operator O1973 = 0.3907675490335307)
  12447. =>WM: (13870: S1 ^operator O1976 +)
  12448. =>WM: (13869: S1 ^operator O1975 +)
  12449. =>WM: (13868: I3 ^dir R)
  12450. =>WM: (13867: O1976 ^name predict-no)
  12451. =>WM: (13866: O1975 ^name predict-yes)
  12452. =>WM: (13865: R991 ^value 1)
  12453. =>WM: (13864: R1 ^reward R991)
  12454. =>WM: (13863: I3 ^see 0)
  12455. <=WM: (13854: S1 ^operator O1973 +)
  12456. <=WM: (13855: S1 ^operator O1974 +)
  12457. <=WM: (13856: S1 ^operator O1974)
  12458. <=WM: (13839: I3 ^dir L)
  12459. <=WM: (13850: R1 ^reward R990)
  12460. <=WM: (13849: I3 ^see 1)
  12461. <=WM: (13853: O1974 ^name predict-no)
  12462. <=WM: (13852: O1973 ^name predict-yes)
  12463. <=WM: (13851: R990 ^value 1)
  12464. --- Inner Elaboration Phase, active level 1 (S1) ---
  12465. Firing prefer*rvt*predict-yes*H0
  12466. -->
  12467. Firing rl*prefer*rvt*predict-yes*H0*5
  12468. -->
  12469. (S1 ^operator O1975 = 0.1215983654449722)
  12470. Firing prefer*rvt*predict-yes*H0*5*H1
  12471. -->
  12472. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  12473. -->
  12474. (S1 ^operator O1975 = 0.8783944900614931)
  12475. Firing prefer*rvt*predict-no*H0
  12476. -->
  12477. Firing rl*prefer*rvt*predict-no*H0*6
  12478. -->
  12479. (S1 ^operator O1976 = 0.9999841575438704)
  12480. inner elaboration loop at bottom goal.
  12481. Retracting rl*prefer*rvt*predict-no*H0*6
  12482. -->
  12483. (S1 ^operator O1974 = 0.9999841575438704)
  12484. Retracting rl*prefer*rvt*predict-yes*H0*5
  12485. -->
  12486. (S1 ^operator O1973 = 0.1215983654449722)
  12487. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  12488. -->
  12489. (S1 ^operator O1973 = 0.8783944900614931)
  12490. --- END Proposal Phase ---
  12491. --- Decision Phase ---
  12492. RL update rl*prefer*rvt*predict-no*H0*4 0.478551 -0.164048 0.314502 -> 0.478548 -0.164049 0.314499(R,m,v=1,0.922078,0.0723198)
  12493. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521485 0.164052 0.685537 -> 0.521482 0.164052 0.685533(R,m,v=1,1,0)
  12494. =>WM: (13871: S1 ^operator O1975)
  12495. 988: O: O1975 (predict-yes)
  12496. --- END Decision Phase ---
  12497. --- Application Phase ---
  12498. --- Firing Productions (PE) For State At Depth 1 ---
  12499. --- Inner Elaboration Phase, active level 1 (S1) ---
  12500. Firing apply*operator
  12501. -->
  12502. (I3 ^predict-yes N988 + :O )
  12503. Firing apply*operator*complete
  12504. -->
  12505. (I3 ^predict-no N987 - :O )
  12506. inner elaboration loop at bottom goal.
  12507. --- Change Working Memory (PE) ---
  12508. =>WM: (13872: I3 ^predict-yes N988)
  12509. <=WM: (13858: N987 ^status complete)
  12510. <=WM: (13857: I3 ^predict-no N987)
  12511. --- Firing Productions (IE) For State At Depth 1 ---
  12512. --- Inner Elaboration Phase, active level 1 (S1) ---
  12513. Firing monitor*world
  12514. -->
  12515. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12516. --- Change Working Memory (IE) ---
  12517. --- END Application Phase ---
  12518. --- Output Phase ---
  12519. ENV: Agent did: predict-yes for direction R in state State-A
  12520. In State-A moving R
  12521. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12522. predict error 0
  12523. dir: dir isR
  12524. --- END Output Phase ---
  12525. /|\--- Input Phase ---
  12526. =>WM: (13876: I2 ^dir R)
  12527. =>WM: (13875: I2 ^reward 1)
  12528. =>WM: (13874: I2 ^see 1)
  12529. =>WM: (13873: N988 ^status complete)
  12530. <=WM: (13861: I2 ^dir R)
  12531. <=WM: (13860: I2 ^reward 1)
  12532. <=WM: (13859: I2 ^see 0)
  12533. =>WM: (13877: I2 ^level-1 R1-root)
  12534. <=WM: (13862: I2 ^level-1 L0-root)
  12535. --- END Input Phase ---
  12536. --- Proposal Phase ---
  12537. --- Inner Elaboration Phase, active level 1 (S1) ---
  12538. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  12539. -->
  12540. (S1 ^operator O1975 = -0.04253361215288998)
  12541. Firing prefer*rvt*predict-yes*H0*5*H1
  12542. -->
  12543. Firing elaborate*copy-see-to-output-link
  12544. -->
  12545. (I3 ^see 1 +)
  12546. Firing elaborate*reward*based*on*reward
  12547. -->
  12548. (R992 ^value 1 +)
  12549. (R1 ^reward R992 +)
  12550. Firing propose*predict-yes
  12551. -->
  12552. (O1977 ^name predict-yes +)
  12553. (S1 ^operator O1977 +)
  12554. Firing propose*predict-no
  12555. -->
  12556. (O1978 ^name predict-no +)
  12557. (S1 ^operator O1978 +)
  12558. Firing rl*prefer*rvt*predict-no*H0*6
  12559. -->
  12560. (S1 ^operator O1976 = 0.9999841575438704)
  12561. Firing rl*prefer*rvt*predict-yes*H0*5
  12562. -->
  12563. (S1 ^operator O1975 = 0.1215983654449722)
  12564. Firing prefer*rvt*predict-yes*H0
  12565. -->
  12566. Firing prefer*rvt*predict-no*H0
  12567. -->
  12568. Firing elaborate*copy-dir-to-output-link
  12569. -->
  12570. (I3 ^dir R +)
  12571. inner elaboration loop at bottom goal.
  12572. Retracting elaborate*copy-see-to-output-link
  12573. -->
  12574. (I3 ^see 0 +)
  12575. Retracting propose*predict-no
  12576. -->
  12577. (O1976 ^name predict-no +)
  12578. (S1 ^operator O1976 +)
  12579. Retracting propose*predict-yes
  12580. -->
  12581. (O1975 ^name predict-yes +)
  12582. (S1 ^operator O1975 +)
  12583. Retracting elaborate*reward*based*on*reward
  12584. -->
  12585. (R991 ^value 1 +)
  12586. (R1 ^reward R991 +)
  12587. Retracting elaborate*copy-dir-to-output-link
  12588. -->
  12589. (I3 ^dir R +)
  12590. Retracting rl*prefer*rvt*predict-no*H0*6
  12591. -->
  12592. (S1 ^operator O1976 = 0.9999841575438704)
  12593. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  12594. -->
  12595. (S1 ^operator O1975 = 0.8783944900614931)
  12596. Retracting rl*prefer*rvt*predict-yes*H0*5
  12597. -->
  12598. (S1 ^operator O1975 = 0.1215983654449722)
  12599. =>WM: (13884: S1 ^operator O1978 +)
  12600. =>WM: (13883: S1 ^operator O1977 +)
  12601. =>WM: (13882: O1978 ^name predict-no)
  12602. =>WM: (13881: O1977 ^name predict-yes)
  12603. =>WM: (13880: R992 ^value 1)
  12604. =>WM: (13879: R1 ^reward R992)
  12605. =>WM: (13878: I3 ^see 1)
  12606. <=WM: (13869: S1 ^operator O1975 +)
  12607. <=WM: (13871: S1 ^operator O1975)
  12608. <=WM: (13870: S1 ^operator O1976 +)
  12609. <=WM: (13864: R1 ^reward R991)
  12610. <=WM: (13863: I3 ^see 0)
  12611. <=WM: (13867: O1976 ^name predict-no)
  12612. <=WM: (13866: O1975 ^name predict-yes)
  12613. <=WM: (13865: R991 ^value 1)
  12614. --- Inner Elaboration Phase, active level 1 (S1) ---
  12615. Firing prefer*rvt*predict-yes*H0
  12616. -->
  12617. Firing rl*prefer*rvt*predict-yes*H0*5
  12618. -->
  12619. (S1 ^operator O1977 = 0.1215983654449722)
  12620. Firing prefer*rvt*predict-yes*H0*5*H1
  12621. -->
  12622. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  12623. -->
  12624. (S1 ^operator O1977 = -0.04253361215288998)
  12625. Firing prefer*rvt*predict-no*H0
  12626. -->
  12627. Firing rl*prefer*rvt*predict-no*H0*6
  12628. -->
  12629. (S1 ^operator O1978 = 0.9999841575438704)
  12630. inner elaboration loop at bottom goal.
  12631. Retracting rl*prefer*rvt*predict-no*H0*6
  12632. -->
  12633. (S1 ^operator O1976 = 0.9999841575438704)
  12634. Retracting rl*prefer*rvt*predict-yes*H0*5
  12635. -->
  12636. (S1 ^operator O1975 = 0.1215983654449722)
  12637. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  12638. -->
  12639. (S1 ^operator O1975 = -0.04253361215288998)
  12640. --- END Proposal Phase ---
  12641. --- Decision Phase ---
  12642. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.863636,0.118442)
  12643. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.46547 0.412925 0.878394 -> 0.46547 0.412925 0.878395(R,m,v=1,1,0)
  12644. =>WM: (13885: S1 ^operator O1978)
  12645. 989: O: O1978 (predict-no)
  12646. --- END Decision Phase ---
  12647. --- Application Phase ---
  12648. --- Firing Productions (PE) For State At Depth 1 ---
  12649. --- Inner Elaboration Phase, active level 1 (S1) ---
  12650. Firing apply*operator
  12651. -->
  12652. (I3 ^predict-no N989 + :O )
  12653. Firing apply*operator*complete
  12654. -->
  12655. (I3 ^predict-yes N988 - :O )
  12656. inner elaboration loop at bottom goal.
  12657. --- Change Working Memory (PE) ---
  12658. =>WM: (13886: I3 ^predict-no N989)
  12659. <=WM: (13873: N988 ^status complete)
  12660. <=WM: (13872: I3 ^predict-yes N988)
  12661. --- Firing Productions (IE) For State At Depth 1 ---
  12662. --- Inner Elaboration Phase, active level 1 (S1) ---
  12663. Firing monitor*world
  12664. -->
  12665. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12666. --- Change Working Memory (IE) ---
  12667. --- END Application Phase ---
  12668. --- Output Phase ---
  12669. ENV: Agent did: predict-no for direction R in state State-B
  12670. In State-B moving R
  12671. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12672. predict error 0
  12673. dir: dir isU
  12674. --- END Output Phase ---
  12675. -/--- Input Phase ---
  12676. =>WM: (13890: I2 ^dir U)
  12677. =>WM: (13889: I2 ^reward 1)
  12678. =>WM: (13888: I2 ^see 0)
  12679. =>WM: (13887: N989 ^status complete)
  12680. <=WM: (13876: I2 ^dir R)
  12681. <=WM: (13875: I2 ^reward 1)
  12682. <=WM: (13874: I2 ^see 1)
  12683. =>WM: (13891: I2 ^level-1 R0-root)
  12684. <=WM: (13877: I2 ^level-1 R1-root)
  12685. --- END Input Phase ---
  12686. --- Proposal Phase ---
  12687. --- Inner Elaboration Phase, active level 1 (S1) ---
  12688. Firing elaborate*copy-see-to-output-link
  12689. -->
  12690. (I3 ^see 0 +)
  12691. Firing elaborate*reward*based*on*reward
  12692. -->
  12693. (R993 ^value 1 +)
  12694. (R1 ^reward R993 +)
  12695. Firing propose*predict-yes
  12696. -->
  12697. (O1979 ^name predict-yes +)
  12698. (S1 ^operator O1979 +)
  12699. Firing propose*predict-no
  12700. -->
  12701. (O1980 ^name predict-no +)
  12702. (S1 ^operator O1980 +)
  12703. Firing rl*prefer*rvt*predict-no*H0*2
  12704. -->
  12705. (S1 ^operator O1978 = 1.)
  12706. Firing rl*prefer*rvt*predict-yes*H0*1
  12707. -->
  12708. (S1 ^operator O1977 = 0.)
  12709. Firing prefer*rvt*predict-yes*H0
  12710. -->
  12711. Firing prefer*rvt*predict-no*H0
  12712. -->
  12713. Firing elaborate*copy-dir-to-output-link
  12714. -->
  12715. (I3 ^dir U +)
  12716. inner elaboration loop at bottom goal.
  12717. Retracting elaborate*copy-see-to-output-link
  12718. -->
  12719. (I3 ^see 1 +)
  12720. Retracting propose*predict-no
  12721. -->
  12722. (O1978 ^name predict-no +)
  12723. (S1 ^operator O1978 +)
  12724. Retracting propose*predict-yes
  12725. -->
  12726. (O1977 ^name predict-yes +)
  12727. (S1 ^operator O1977 +)
  12728. Retracting elaborate*reward*based*on*reward
  12729. -->
  12730. (R992 ^value 1 +)
  12731. (R1 ^reward R992 +)
  12732. Retracting elaborate*copy-dir-to-output-link
  12733. -->
  12734. (I3 ^dir R +)
  12735. Retracting rl*prefer*rvt*predict-no*H0*6
  12736. -->
  12737. (S1 ^operator O1978 = 0.9999841575438704)
  12738. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  12739. -->
  12740. (S1 ^operator O1977 = -0.04253361215288998)
  12741. Retracting rl*prefer*rvt*predict-yes*H0*5
  12742. -->
  12743. (S1 ^operator O1977 = 0.1215989443698621)
  12744. =>WM: (13899: S1 ^operator O1980 +)
  12745. =>WM: (13898: S1 ^operator O1979 +)
  12746. =>WM: (13897: I3 ^dir U)
  12747. =>WM: (13896: O1980 ^name predict-no)
  12748. =>WM: (13895: O1979 ^name predict-yes)
  12749. =>WM: (13894: R993 ^value 1)
  12750. =>WM: (13893: R1 ^reward R993)
  12751. =>WM: (13892: I3 ^see 0)
  12752. <=WM: (13883: S1 ^operator O1977 +)
  12753. <=WM: (13884: S1 ^operator O1978 +)
  12754. <=WM: (13885: S1 ^operator O1978)
  12755. <=WM: (13868: I3 ^dir R)
  12756. <=WM: (13879: R1 ^reward R992)
  12757. <=WM: (13878: I3 ^see 1)
  12758. <=WM: (13882: O1978 ^name predict-no)
  12759. <=WM: (13881: O1977 ^name predict-yes)
  12760. <=WM: (13880: R992 ^value 1)
  12761. --- Inner Elaboration Phase, active level 1 (S1) ---
  12762. Firing prefer*rvt*predict-yes*H0
  12763. -->
  12764. Firing rl*prefer*rvt*predict-yes*H0*1
  12765. -->
  12766. (S1 ^operator O1979 = 0.)
  12767. Firing prefer*rvt*predict-no*H0
  12768. -->
  12769. Firing rl*prefer*rvt*predict-no*H0*2
  12770. -->
  12771. (S1 ^operator O1980 = 1.)
  12772. inner elaboration loop at bottom goal.
  12773. Retracting rl*prefer*rvt*predict-no*H0*2
  12774. -->
  12775. (S1 ^operator O1978 = 1.)
  12776. Retracting rl*prefer*rvt*predict-yes*H0*1
  12777. -->
  12778. (S1 ^operator O1977 = 0.)
  12779. --- END Proposal Phase ---
  12780. --- Decision Phase ---
  12781. RL update rl*prefer*rvt*predict-no*H0*6 0.999984 0 0.999984 -> 0.999987 0 0.999987(R,m,v=1,0.9375,0.0589286)
  12782. =>WM: (13900: S1 ^operator O1980)
  12783. 990: O: O1980 (predict-no)
  12784. --- END Decision Phase ---
  12785. --- Application Phase ---
  12786. --- Firing Productions (PE) For State At Depth 1 ---
  12787. --- Inner Elaboration Phase, active level 1 (S1) ---
  12788. Firing apply*operator
  12789. -->
  12790. (I3 ^predict-no N990 + :O )
  12791. Firing apply*operator*complete
  12792. -->
  12793. (I3 ^predict-no N989 - :O )
  12794. inner elaboration loop at bottom goal.
  12795. --- Change Working Memory (PE) ---
  12796. =>WM: (13901: I3 ^predict-no N990)
  12797. <=WM: (13887: N989 ^status complete)
  12798. <=WM: (13886: I3 ^predict-no N989)
  12799. --- Firing Productions (IE) For State At Depth 1 ---
  12800. --- Inner Elaboration Phase, active level 1 (S1) ---
  12801. Firing monitor*world
  12802. -->
  12803. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12804. --- Change Working Memory (IE) ---
  12805. --- END Application Phase ---
  12806. --- Output Phase ---
  12807. ENV: Agent did: predict-no for direction U in state State-B
  12808. In State-B moving U
  12809. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12810. predict error 0
  12811. dir: dir isR
  12812. --- END Output Phase ---
  12813. |\---- Input Phase ---
  12814. =>WM: (13905: I2 ^dir R)
  12815. =>WM: (13904: I2 ^reward 1)
  12816. =>WM: (13903: I2 ^see 0)
  12817. =>WM: (13902: N990 ^status complete)
  12818. <=WM: (13890: I2 ^dir U)
  12819. <=WM: (13889: I2 ^reward 1)
  12820. <=WM: (13888: I2 ^see 0)
  12821. =>WM: (13906: I2 ^level-1 R0-root)
  12822. <=WM: (13891: I2 ^level-1 R0-root)
  12823. --- END Input Phase ---
  12824. --- Proposal Phase ---
  12825. --- Inner Elaboration Phase, active level 1 (S1) ---
  12826. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  12827. -->
  12828. (S1 ^operator O1979 = -0.1512366769350551)
  12829. Firing prefer*rvt*predict-yes*H0*5*H1
  12830. -->
  12831. Firing elaborate*copy-see-to-output-link
  12832. -->
  12833. (I3 ^see 0 +)
  12834. Firing elaborate*reward*based*on*reward
  12835. -->
  12836. (R994 ^value 1 +)
  12837. (R1 ^reward R994 +)
  12838. Firing propose*predict-yes
  12839. -->
  12840. (O1981 ^name predict-yes +)
  12841. (S1 ^operator O1981 +)
  12842. Firing propose*predict-no
  12843. -->
  12844. (O1982 ^name predict-no +)
  12845. (S1 ^operator O1982 +)
  12846. Firing rl*prefer*rvt*predict-no*H0*6
  12847. -->
  12848. (S1 ^operator O1980 = 0.9999867250014868)
  12849. Firing rl*prefer*rvt*predict-yes*H0*5
  12850. -->
  12851. (S1 ^operator O1979 = 0.1215989443698621)
  12852. Firing prefer*rvt*predict-yes*H0
  12853. -->
  12854. Firing prefer*rvt*predict-no*H0
  12855. -->
  12856. Firing elaborate*copy-dir-to-output-link
  12857. -->
  12858. (I3 ^dir R +)
  12859. inner elaboration loop at bottom goal.
  12860. Retracting elaborate*copy-see-to-output-link
  12861. -->
  12862. (I3 ^see 0 +)
  12863. Retracting propose*predict-no
  12864. -->
  12865. (O1980 ^name predict-no +)
  12866. (S1 ^operator O1980 +)
  12867. Retracting propose*predict-yes
  12868. -->
  12869. (O1979 ^name predict-yes +)
  12870. (S1 ^operator O1979 +)
  12871. Retracting elaborate*reward*based*on*reward
  12872. -->
  12873. (R993 ^value 1 +)
  12874. (R1 ^reward R993 +)
  12875. Retracting elaborate*copy-dir-to-output-link
  12876. -->
  12877. (I3 ^dir U +)
  12878. Retracting rl*prefer*rvt*predict-no*H0*2
  12879. -->
  12880. (S1 ^operator O1980 = 1.)
  12881. Retracting rl*prefer*rvt*predict-yes*H0*1
  12882. -->
  12883. (S1 ^operator O1979 = 0.)
  12884. =>WM: (13913: S1 ^operator O1982 +)
  12885. =>WM: (13912: S1 ^operator O1981 +)
  12886. =>WM: (13911: I3 ^dir R)
  12887. =>WM: (13910: O1982 ^name predict-no)
  12888. =>WM: (13909: O1981 ^name predict-yes)
  12889. =>WM: (13908: R994 ^value 1)
  12890. =>WM: (13907: R1 ^reward R994)
  12891. <=WM: (13898: S1 ^operator O1979 +)
  12892. <=WM: (13899: S1 ^operator O1980 +)
  12893. <=WM: (13900: S1 ^operator O1980)
  12894. <=WM: (13897: I3 ^dir U)
  12895. <=WM: (13893: R1 ^reward R993)
  12896. <=WM: (13896: O1980 ^name predict-no)
  12897. <=WM: (13895: O1979 ^name predict-yes)
  12898. <=WM: (13894: R993 ^value 1)
  12899. --- Inner Elaboration Phase, active level 1 (S1) ---
  12900. Firing prefer*rvt*predict-yes*H0
  12901. -->
  12902. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  12903. -->
  12904. (S1 ^operator O1981 = -0.1512366769350551)
  12905. Firing rl*prefer*rvt*predict-yes*H0*5
  12906. -->
  12907. (S1 ^operator O1981 = 0.1215989443698621)
  12908. Firing prefer*rvt*predict-yes*H0*5*H1
  12909. -->
  12910. Firing prefer*rvt*predict-no*H0
  12911. -->
  12912. Firing rl*prefer*rvt*predict-no*H0*6
  12913. -->
  12914. (S1 ^operator O1982 = 0.9999867250014868)
  12915. inner elaboration loop at bottom goal.
  12916. Retracting rl*prefer*rvt*predict-no*H0*6
  12917. -->
  12918. (S1 ^operator O1980 = 0.9999867250014868)
  12919. Retracting rl*prefer*rvt*predict-yes*H0*5
  12920. -->
  12921. (S1 ^operator O1979 = 0.1215989443698621)
  12922. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  12923. -->
  12924. (S1 ^operator O1979 = -0.1512366769350551)
  12925. --- END Proposal Phase ---
  12926. --- Decision Phase ---
  12927. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12928. =>WM: (13914: S1 ^operator O1982)
  12929. 991: O: O1982 (predict-no)
  12930. --- END Decision Phase ---
  12931. --- Application Phase ---
  12932. --- Firing Productions (PE) For State At Depth 1 ---
  12933. --- Inner Elaboration Phase, active level 1 (S1) ---
  12934. Firing apply*operator
  12935. -->
  12936. (I3 ^predict-no N991 + :O )
  12937. Firing apply*operator*complete
  12938. -->
  12939. (I3 ^predict-no N990 - :O )
  12940. inner elaboration loop at bottom goal.
  12941. --- Change Working Memory (PE) ---
  12942. =>WM: (13915: I3 ^predict-no N991)
  12943. <=WM: (13902: N990 ^status complete)
  12944. <=WM: (13901: I3 ^predict-no N990)
  12945. --- Firing Productions (IE) For State At Depth 1 ---
  12946. --- Inner Elaboration Phase, active level 1 (S1) ---
  12947. Firing monitor*world
  12948. -->
  12949. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12950. --- Change Working Memory (IE) ---
  12951. --- END Application Phase ---
  12952. --- Output Phase ---
  12953. ENV: Agent did: predict-no for direction R in state State-B
  12954. In State-B moving R
  12955. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12956. predict error 0
  12957. dir: dir isU
  12958. --- END Output Phase ---
  12959. /--- Input Phase ---
  12960. =>WM: (13919: I2 ^dir U)
  12961. =>WM: (13918: I2 ^reward 1)
  12962. =>WM: (13917: I2 ^see 0)
  12963. =>WM: (13916: N991 ^status complete)
  12964. <=WM: (13905: I2 ^dir R)
  12965. <=WM: (13904: I2 ^reward 1)
  12966. <=WM: (13903: I2 ^see 0)
  12967. =>WM: (13920: I2 ^level-1 R0-root)
  12968. <=WM: (13906: I2 ^level-1 R0-root)
  12969. --- END Input Phase ---
  12970. --- Proposal Phase ---
  12971. --- Inner Elaboration Phase, active level 1 (S1) ---
  12972. Firing elaborate*copy-see-to-output-link
  12973. -->
  12974. (I3 ^see 0 +)
  12975. Firing elaborate*reward*based*on*reward
  12976. -->
  12977. (R995 ^value 1 +)
  12978. (R1 ^reward R995 +)
  12979. Firing propose*predict-yes
  12980. -->
  12981. (O1983 ^name predict-yes +)
  12982. (S1 ^operator O1983 +)
  12983. Firing propose*predict-no
  12984. -->
  12985. (O1984 ^name predict-no +)
  12986. (S1 ^operator O1984 +)
  12987. Firing rl*prefer*rvt*predict-no*H0*2
  12988. -->
  12989. (S1 ^operator O1982 = 1.)
  12990. Firing rl*prefer*rvt*predict-yes*H0*1
  12991. -->
  12992. (S1 ^operator O1981 = 0.)
  12993. Firing prefer*rvt*predict-yes*H0
  12994. -->
  12995. Firing prefer*rvt*predict-no*H0
  12996. -->
  12997. Firing elaborate*copy-dir-to-output-link
  12998. -->
  12999. (I3 ^dir U +)
  13000. inner elaboration loop at bottom goal.
  13001. Retracting elaborate*copy-see-to-output-link
  13002. -->
  13003. (I3 ^see 0 +)
  13004. Retracting propose*predict-no
  13005. -->
  13006. (O1982 ^name predict-no +)
  13007. (S1 ^operator O1982 +)
  13008. Retracting propose*predict-yes
  13009. -->
  13010. (O1981 ^name predict-yes +)
  13011. (S1 ^operator O1981 +)
  13012. Retracting elaborate*reward*based*on*reward
  13013. -->
  13014. (R994 ^value 1 +)
  13015. (R1 ^reward R994 +)
  13016. Retracting elaborate*copy-dir-to-output-link
  13017. -->
  13018. (I3 ^dir R +)
  13019. Retracting rl*prefer*rvt*predict-no*H0*6
  13020. -->
  13021. (S1 ^operator O1982 = 0.9999867250014868)
  13022. Retracting rl*prefer*rvt*predict-yes*H0*5
  13023. -->
  13024. (S1 ^operator O1981 = 0.1215989443698621)
  13025. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  13026. -->
  13027. (S1 ^operator O1981 = -0.1512366769350551)
  13028. =>WM: (13927: S1 ^operator O1984 +)
  13029. =>WM: (13926: S1 ^operator O1983 +)
  13030. =>WM: (13925: I3 ^dir U)
  13031. =>WM: (13924: O1984 ^name predict-no)
  13032. =>WM: (13923: O1983 ^name predict-yes)
  13033. =>WM: (13922: R995 ^value 1)
  13034. =>WM: (13921: R1 ^reward R995)
  13035. <=WM: (13912: S1 ^operator O1981 +)
  13036. <=WM: (13913: S1 ^operator O1982 +)
  13037. <=WM: (13914: S1 ^operator O1982)
  13038. <=WM: (13911: I3 ^dir R)
  13039. <=WM: (13907: R1 ^reward R994)
  13040. <=WM: (13910: O1982 ^name predict-no)
  13041. <=WM: (13909: O1981 ^name predict-yes)
  13042. <=WM: (13908: R994 ^value 1)
  13043. --- Inner Elaboration Phase, active level 1 (S1) ---
  13044. Firing prefer*rvt*predict-yes*H0
  13045. -->
  13046. Firing rl*prefer*rvt*predict-yes*H0*1
  13047. -->
  13048. (S1 ^operator O1983 = 0.)
  13049. Firing prefer*rvt*predict-no*H0
  13050. -->
  13051. Firing rl*prefer*rvt*predict-no*H0*2
  13052. -->
  13053. (S1 ^operator O1984 = 1.)
  13054. inner elaboration loop at bottom goal.
  13055. Retracting rl*prefer*rvt*predict-no*H0*2
  13056. -->
  13057. (S1 ^operator O1982 = 1.)
  13058. Retracting rl*prefer*rvt*predict-yes*H0*1
  13059. -->
  13060. (S1 ^operator O1981 = 0.)
  13061. --- END Proposal Phase ---
  13062. --- Decision Phase ---
  13063. RL update rl*prefer*rvt*predict-no*H0*6 0.999987 0 0.999987 -> 0.999989 0 0.999989(R,m,v=1,0.937853,0.0586158)
  13064. =>WM: (13928: S1 ^operator O1984)
  13065. 992: O: O1984 (predict-no)
  13066. --- END Decision Phase ---
  13067. --- Application Phase ---
  13068. --- Firing Productions (PE) For State At Depth 1 ---
  13069. --- Inner Elaboration Phase, active level 1 (S1) ---
  13070. Firing apply*operator
  13071. -->
  13072. (I3 ^predict-no N992 + :O )
  13073. Firing apply*operator*complete
  13074. -->
  13075. (I3 ^predict-no N991 - :O )
  13076. inner elaboration loop at bottom goal.
  13077. --- Change Working Memory (PE) ---
  13078. =>WM: (13929: I3 ^predict-no N992)
  13079. <=WM: (13916: N991 ^status complete)
  13080. <=WM: (13915: I3 ^predict-no N991)
  13081. --- Firing Productions (IE) For State At Depth 1 ---
  13082. --- Inner Elaboration Phase, active level 1 (S1) ---
  13083. Firing monitor*world
  13084. -->
  13085. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13086. --- Change Working Memory (IE) ---
  13087. --- END Application Phase ---
  13088. --- Output Phase ---
  13089. ENV: Agent did: predict-no for direction U in state State-B
  13090. In State-B moving U
  13091. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13092. predict error 0
  13093. dir: dir isL
  13094. --- END Output Phase ---
  13095. |\--- Input Phase ---
  13096. =>WM: (13933: I2 ^dir L)
  13097. =>WM: (13932: I2 ^reward 1)
  13098. =>WM: (13931: I2 ^see 0)
  13099. =>WM: (13930: N992 ^status complete)
  13100. <=WM: (13919: I2 ^dir U)
  13101. <=WM: (13918: I2 ^reward 1)
  13102. <=WM: (13917: I2 ^see 0)
  13103. =>WM: (13934: I2 ^level-1 R0-root)
  13104. <=WM: (13920: I2 ^level-1 R0-root)
  13105. --- END Input Phase ---
  13106. --- Proposal Phase ---
  13107. --- Inner Elaboration Phase, active level 1 (S1) ---
  13108. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  13109. -->
  13110. (S1 ^operator O1984 = -0.1984300550322165)
  13111. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  13112. -->
  13113. (S1 ^operator O1983 = 0.6091029227055655)
  13114. Firing prefer*rvt*predict-no*H0*4*H1
  13115. -->
  13116. Firing prefer*rvt*predict-yes*H0*3*H1
  13117. -->
  13118. Firing elaborate*copy-see-to-output-link
  13119. -->
  13120. (I3 ^see 0 +)
  13121. Firing elaborate*reward*based*on*reward
  13122. -->
  13123. (R996 ^value 1 +)
  13124. (R1 ^reward R996 +)
  13125. Firing propose*predict-yes
  13126. -->
  13127. (O1985 ^name predict-yes +)
  13128. (S1 ^operator O1985 +)
  13129. Firing propose*predict-no
  13130. -->
  13131. (O1986 ^name predict-no +)
  13132. (S1 ^operator O1986 +)
  13133. Firing rl*prefer*rvt*predict-no*H0*4
  13134. -->
  13135. (S1 ^operator O1984 = 0.3144988611901438)
  13136. Firing rl*prefer*rvt*predict-yes*H0*3
  13137. -->
  13138. (S1 ^operator O1983 = 0.3907675490335307)
  13139. Firing prefer*rvt*predict-yes*H0
  13140. -->
  13141. Firing prefer*rvt*predict-no*H0
  13142. -->
  13143. Firing elaborate*copy-dir-to-output-link
  13144. -->
  13145. (I3 ^dir L +)
  13146. inner elaboration loop at bottom goal.
  13147. Retracting elaborate*copy-see-to-output-link
  13148. -->
  13149. (I3 ^see 0 +)
  13150. Retracting propose*predict-no
  13151. -->
  13152. (O1984 ^name predict-no +)
  13153. (S1 ^operator O1984 +)
  13154. Retracting propose*predict-yes
  13155. -->
  13156. (O1983 ^name predict-yes +)
  13157. (S1 ^operator O1983 +)
  13158. Retracting elaborate*reward*based*on*reward
  13159. -->
  13160. (R995 ^value 1 +)
  13161. (R1 ^reward R995 +)
  13162. Retracting elaborate*copy-dir-to-output-link
  13163. -->
  13164. (I3 ^dir U +)
  13165. Retracting rl*prefer*rvt*predict-no*H0*2
  13166. -->
  13167. (S1 ^operator O1984 = 1.)
  13168. Retracting rl*prefer*rvt*predict-yes*H0*1
  13169. -->
  13170. (S1 ^operator O1983 = 0.)
  13171. =>WM: (13941: S1 ^operator O1986 +)
  13172. =>WM: (13940: S1 ^operator O1985 +)
  13173. =>WM: (13939: I3 ^dir L)
  13174. =>WM: (13938: O1986 ^name predict-no)
  13175. =>WM: (13937: O1985 ^name predict-yes)
  13176. =>WM: (13936: R996 ^value 1)
  13177. =>WM: (13935: R1 ^reward R996)
  13178. <=WM: (13926: S1 ^operator O1983 +)
  13179. <=WM: (13927: S1 ^operator O1984 +)
  13180. <=WM: (13928: S1 ^operator O1984)
  13181. <=WM: (13925: I3 ^dir U)
  13182. <=WM: (13921: R1 ^reward R995)
  13183. <=WM: (13924: O1984 ^name predict-no)
  13184. <=WM: (13923: O1983 ^name predict-yes)
  13185. <=WM: (13922: R995 ^value 1)
  13186. --- Inner Elaboration Phase, active level 1 (S1) ---
  13187. Firing prefer*rvt*predict-yes*H0
  13188. -->
  13189. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  13190. -->
  13191. (S1 ^operator O1985 = 0.6091029227055655)
  13192. Firing rl*prefer*rvt*predict-yes*H0*3
  13193. -->
  13194. (S1 ^operator O1985 = 0.3907675490335307)
  13195. Firing prefer*rvt*predict-yes*H0*3*H1
  13196. -->
  13197. Firing prefer*rvt*predict-no*H0
  13198. -->
  13199. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  13200. -->
  13201. (S1 ^operator O1986 = -0.1984300550322165)
  13202. Firing rl*prefer*rvt*predict-no*H0*4
  13203. -->
  13204. (S1 ^operator O1986 = 0.3144988611901438)
  13205. Firing prefer*rvt*predict-no*H0*4*H1
  13206. -->
  13207. inner elaboration loop at bottom goal.
  13208. Retracting rl*prefer*rvt*predict-no*H0*4
  13209. -->
  13210. (S1 ^operator O1984 = 0.3144988611901438)
  13211. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  13212. -->
  13213. (S1 ^operator O1984 = -0.1984300550322165)
  13214. Retracting rl*prefer*rvt*predict-yes*H0*3
  13215. -->
  13216. (S1 ^operator O1983 = 0.3907675490335307)
  13217. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  13218. -->
  13219. (S1 ^operator O1983 = 0.6091029227055655)
  13220. --- END Proposal Phase ---
  13221. --- Decision Phase ---
  13222. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13223. =>WM: (13942: S1 ^operator O1985)
  13224. 993: O: O1985 (predict-yes)
  13225. --- END Decision Phase ---
  13226. --- Application Phase ---
  13227. --- Firing Productions (PE) For State At Depth 1 ---
  13228. --- Inner Elaboration Phase, active level 1 (S1) ---
  13229. Firing apply*operator
  13230. -->
  13231. (I3 ^predict-yes N993 + :O )
  13232. Firing apply*operator*complete
  13233. -->
  13234. (I3 ^predict-no N992 - :O )
  13235. inner elaboration loop at bottom goal.
  13236. --- Change Working Memory (PE) ---
  13237. =>WM: (13943: I3 ^predict-yes N993)
  13238. <=WM: (13930: N992 ^status complete)
  13239. <=WM: (13929: I3 ^predict-no N992)
  13240. --- Firing Productions (IE) For State At Depth 1 ---
  13241. --- Inner Elaboration Phase, active level 1 (S1) ---
  13242. Firing monitor*world
  13243. -->
  13244. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13245. --- Change Working Memory (IE) ---
  13246. --- END Application Phase ---
  13247. --- Output Phase ---
  13248. ENV: Agent did: predict-yes for direction L in state State-B
  13249. In State-B moving L
  13250. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13251. predict error 0
  13252. dir: dir isU
  13253. --- END Output Phase ---
  13254. -/|--- Input Phase ---
  13255. =>WM: (13947: I2 ^dir U)
  13256. =>WM: (13946: I2 ^reward 1)
  13257. =>WM: (13945: I2 ^see 1)
  13258. =>WM: (13944: N993 ^status complete)
  13259. <=WM: (13933: I2 ^dir L)
  13260. <=WM: (13932: I2 ^reward 1)
  13261. <=WM: (13931: I2 ^see 0)
  13262. =>WM: (13948: I2 ^level-1 L1-root)
  13263. <=WM: (13934: I2 ^level-1 R0-root)
  13264. --- END Input Phase ---
  13265. --- Proposal Phase ---
  13266. --- Inner Elaboration Phase, active level 1 (S1) ---
  13267. Firing elaborate*copy-see-to-output-link
  13268. -->
  13269. (I3 ^see 1 +)
  13270. Firing elaborate*reward*based*on*reward
  13271. -->
  13272. (R997 ^value 1 +)
  13273. (R1 ^reward R997 +)
  13274. Firing propose*predict-yes
  13275. -->
  13276. (O1987 ^name predict-yes +)
  13277. (S1 ^operator O1987 +)
  13278. Firing propose*predict-no
  13279. -->
  13280. (O1988 ^name predict-no +)
  13281. (S1 ^operator O1988 +)
  13282. Firing rl*prefer*rvt*predict-no*H0*2
  13283. -->
  13284. (S1 ^operator O1986 = 1.)
  13285. Firing rl*prefer*rvt*predict-yes*H0*1
  13286. -->
  13287. (S1 ^operator O1985 = 0.)
  13288. Firing prefer*rvt*predict-yes*H0
  13289. -->
  13290. Firing prefer*rvt*predict-no*H0
  13291. -->
  13292. Firing elaborate*copy-dir-to-output-link
  13293. -->
  13294. (I3 ^dir U +)
  13295. inner elaboration loop at bottom goal.
  13296. Retracting elaborate*copy-see-to-output-link
  13297. -->
  13298. (I3 ^see 0 +)
  13299. Retracting propose*predict-no
  13300. -->
  13301. (O1986 ^name predict-no +)
  13302. (S1 ^operator O1986 +)
  13303. Retracting propose*predict-yes
  13304. -->
  13305. (O1985 ^name predict-yes +)
  13306. (S1 ^operator O1985 +)
  13307. Retracting elaborate*reward*based*on*reward
  13308. -->
  13309. (R996 ^value 1 +)
  13310. (R1 ^reward R996 +)
  13311. Retracting elaborate*copy-dir-to-output-link
  13312. -->
  13313. (I3 ^dir L +)
  13314. Retracting rl*prefer*rvt*predict-no*H0*4
  13315. -->
  13316. (S1 ^operator O1986 = 0.3144988611901438)
  13317. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  13318. -->
  13319. (S1 ^operator O1986 = -0.1984300550322165)
  13320. Retracting rl*prefer*rvt*predict-yes*H0*3
  13321. -->
  13322. (S1 ^operator O1985 = 0.3907675490335307)
  13323. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  13324. -->
  13325. (S1 ^operator O1985 = 0.6091029227055655)
  13326. =>WM: (13956: S1 ^operator O1988 +)
  13327. =>WM: (13955: S1 ^operator O1987 +)
  13328. =>WM: (13954: I3 ^dir U)
  13329. =>WM: (13953: O1988 ^name predict-no)
  13330. =>WM: (13952: O1987 ^name predict-yes)
  13331. =>WM: (13951: R997 ^value 1)
  13332. =>WM: (13950: R1 ^reward R997)
  13333. =>WM: (13949: I3 ^see 1)
  13334. <=WM: (13940: S1 ^operator O1985 +)
  13335. <=WM: (13942: S1 ^operator O1985)
  13336. <=WM: (13941: S1 ^operator O1986 +)
  13337. <=WM: (13939: I3 ^dir L)
  13338. <=WM: (13935: R1 ^reward R996)
  13339. <=WM: (13892: I3 ^see 0)
  13340. <=WM: (13938: O1986 ^name predict-no)
  13341. <=WM: (13937: O1985 ^name predict-yes)
  13342. <=WM: (13936: R996 ^value 1)
  13343. --- Inner Elaboration Phase, active level 1 (S1) ---
  13344. Firing prefer*rvt*predict-yes*H0
  13345. -->
  13346. Firing rl*prefer*rvt*predict-yes*H0*1
  13347. -->
  13348. (S1 ^operator O1987 = 0.)
  13349. Firing prefer*rvt*predict-no*H0
  13350. -->
  13351. Firing rl*prefer*rvt*predict-no*H0*2
  13352. -->
  13353. (S1 ^operator O1988 = 1.)
  13354. inner elaboration loop at bottom goal.
  13355. Retracting rl*prefer*rvt*predict-no*H0*2
  13356. -->
  13357. (S1 ^operator O1986 = 1.)
  13358. Retracting rl*prefer*rvt*predict-yes*H0*1
  13359. -->
  13360. (S1 ^operator O1985 = 0.)
  13361. --- END Proposal Phase ---
  13362. --- Decision Phase ---
  13363. RL update rl*prefer*rvt*predict-yes*H0*3 0.472315 -0.0815474 0.390768 -> 0.472324 -0.0815458 0.390778(R,m,v=1,0.94375,0.0534198)
  13364. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527575 0.0815283 0.609103 -> 0.527585 0.0815301 0.609115(R,m,v=1,1,0)
  13365. =>WM: (13957: S1 ^operator O1988)
  13366. 994: O: O1988 (predict-no)
  13367. --- END Decision Phase ---
  13368. --- Application Phase ---
  13369. --- Firing Productions (PE) For State At Depth 1 ---
  13370. --- Inner Elaboration Phase, active level 1 (S1) ---
  13371. Firing apply*operator
  13372. -->
  13373. (I3 ^predict-no N994 + :O )
  13374. Firing apply*operator*complete
  13375. -->
  13376. (I3 ^predict-yes N993 - :O )
  13377. inner elaboration loop at bottom goal.
  13378. --- Change Working Memory (PE) ---
  13379. =>WM: (13958: I3 ^predict-no N994)
  13380. <=WM: (13944: N993 ^status complete)
  13381. <=WM: (13943: I3 ^predict-yes N993)
  13382. --- Firing Productions (IE) For State At Depth 1 ---
  13383. --- Inner Elaboration Phase, active level 1 (S1) ---
  13384. Firing monitor*world
  13385. -->
  13386. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13387. --- Change Working Memory (IE) ---
  13388. --- END Application Phase ---
  13389. --- Output Phase ---
  13390. ENV: Agent did: predict-no for direction U in state State-A
  13391. In State-A moving U
  13392. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13393. predict error 0
  13394. dir: dir isL
  13395. --- END Output Phase ---
  13396. \-/--- Input Phase ---
  13397. =>WM: (13962: I2 ^dir L)
  13398. =>WM: (13961: I2 ^reward 1)
  13399. =>WM: (13960: I2 ^see 0)
  13400. =>WM: (13959: N994 ^status complete)
  13401. <=WM: (13947: I2 ^dir U)
  13402. <=WM: (13946: I2 ^reward 1)
  13403. <=WM: (13945: I2 ^see 1)
  13404. =>WM: (13963: I2 ^level-1 L1-root)
  13405. <=WM: (13948: I2 ^level-1 L1-root)
  13406. --- END Input Phase ---
  13407. --- Proposal Phase ---
  13408. --- Inner Elaboration Phase, active level 1 (S1) ---
  13409. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  13410. -->
  13411. (S1 ^operator O1987 = -0.2062723012911647)
  13412. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  13413. -->
  13414. (S1 ^operator O1988 = 0.685533297663165)
  13415. Firing prefer*rvt*predict-no*H0*4*H1
  13416. -->
  13417. Firing prefer*rvt*predict-yes*H0*3*H1
  13418. -->
  13419. Firing elaborate*copy-see-to-output-link
  13420. -->
  13421. (I3 ^see 0 +)
  13422. Firing elaborate*reward*based*on*reward
  13423. -->
  13424. (R998 ^value 1 +)
  13425. (R1 ^reward R998 +)
  13426. Firing propose*predict-yes
  13427. -->
  13428. (O1989 ^name predict-yes +)
  13429. (S1 ^operator O1989 +)
  13430. Firing propose*predict-no
  13431. -->
  13432. (O1990 ^name predict-no +)
  13433. (S1 ^operator O1990 +)
  13434. Firing rl*prefer*rvt*predict-no*H0*4
  13435. -->
  13436. (S1 ^operator O1988 = 0.3144988611901438)
  13437. Firing rl*prefer*rvt*predict-yes*H0*3
  13438. -->
  13439. (S1 ^operator O1987 = 0.3907782094907327)
  13440. Firing prefer*rvt*predict-yes*H0
  13441. -->
  13442. Firing prefer*rvt*predict-no*H0
  13443. -->
  13444. Firing elaborate*copy-dir-to-output-link
  13445. -->
  13446. (I3 ^dir L +)
  13447. inner elaboration loop at bottom goal.
  13448. Retracting elaborate*copy-see-to-output-link
  13449. -->
  13450. (I3 ^see 1 +)
  13451. Retracting propose*predict-no
  13452. -->
  13453. (O1988 ^name predict-no +)
  13454. (S1 ^operator O1988 +)
  13455. Retracting propose*predict-yes
  13456. -->
  13457. (O1987 ^name predict-yes +)
  13458. (S1 ^operator O1987 +)
  13459. Retracting elaborate*reward*based*on*reward
  13460. -->
  13461. (R997 ^value 1 +)
  13462. (R1 ^reward R997 +)
  13463. Retracting elaborate*copy-dir-to-output-link
  13464. -->
  13465. (I3 ^dir U +)
  13466. Retracting rl*prefer*rvt*predict-no*H0*2
  13467. -->
  13468. (S1 ^operator O1988 = 1.)
  13469. Retracting rl*prefer*rvt*predict-yes*H0*1
  13470. -->
  13471. (S1 ^operator O1987 = 0.)
  13472. =>WM: (13971: S1 ^operator O1990 +)
  13473. =>WM: (13970: S1 ^operator O1989 +)
  13474. =>WM: (13969: I3 ^dir L)
  13475. =>WM: (13968: O1990 ^name predict-no)
  13476. =>WM: (13967: O1989 ^name predict-yes)
  13477. =>WM: (13966: R998 ^value 1)
  13478. =>WM: (13965: R1 ^reward R998)
  13479. =>WM: (13964: I3 ^see 0)
  13480. <=WM: (13955: S1 ^operator O1987 +)
  13481. <=WM: (13956: S1 ^operator O1988 +)
  13482. <=WM: (13957: S1 ^operator O1988)
  13483. <=WM: (13954: I3 ^dir U)
  13484. <=WM: (13950: R1 ^reward R997)
  13485. <=WM: (13949: I3 ^see 1)
  13486. <=WM: (13953: O1988 ^name predict-no)
  13487. <=WM: (13952: O1987 ^name predict-yes)
  13488. <=WM: (13951: R997 ^value 1)
  13489. --- Inner Elaboration Phase, active level 1 (S1) ---
  13490. Firing prefer*rvt*predict-yes*H0
  13491. -->
  13492. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  13493. -->
  13494. (S1 ^operator O1989 = -0.2062723012911647)
  13495. Firing rl*prefer*rvt*predict-yes*H0*3
  13496. -->
  13497. (S1 ^operator O1989 = 0.3907782094907327)
  13498. Firing prefer*rvt*predict-yes*H0*3*H1
  13499. -->
  13500. Firing prefer*rvt*predict-no*H0
  13501. -->
  13502. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  13503. -->
  13504. (S1 ^operator O1990 = 0.685533297663165)
  13505. Firing rl*prefer*rvt*predict-no*H0*4
  13506. -->
  13507. (S1 ^operator O1990 = 0.3144988611901438)
  13508. Firing prefer*rvt*predict-no*H0*4*H1
  13509. -->
  13510. inner elaboration loop at bottom goal.
  13511. Retracting rl*prefer*rvt*predict-no*H0*4
  13512. -->
  13513. (S1 ^operator O1988 = 0.3144988611901438)
  13514. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  13515. -->
  13516. (S1 ^operator O1988 = 0.685533297663165)
  13517. Retracting rl*prefer*rvt*predict-yes*H0*3
  13518. -->
  13519. (S1 ^operator O1987 = 0.3907782094907327)
  13520. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  13521. -->
  13522. (S1 ^operator O1987 = -0.2062723012911647)
  13523. --- END Proposal Phase ---
  13524. --- Decision Phase ---
  13525. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13526. =>WM: (13972: S1 ^operator O1990)
  13527. 995: O: O1990 (predict-no)
  13528. --- END Decision Phase ---
  13529. --- Application Phase ---
  13530. --- Firing Productions (PE) For State At Depth 1 ---
  13531. --- Inner Elaboration Phase, active level 1 (S1) ---
  13532. Firing apply*operator
  13533. -->
  13534. (I3 ^predict-no N995 + :O )
  13535. Firing apply*operator*complete
  13536. -->
  13537. (I3 ^predict-no N994 - :O )
  13538. inner elaboration loop at bottom goal.
  13539. --- Change Working Memory (PE) ---
  13540. =>WM: (13973: I3 ^predict-no N995)
  13541. <=WM: (13959: N994 ^status complete)
  13542. <=WM: (13958: I3 ^predict-no N994)
  13543. --- Firing Productions (IE) For State At Depth 1 ---
  13544. --- Inner Elaboration Phase, active level 1 (S1) ---
  13545. Firing monitor*world
  13546. -->
  13547. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13548. --- Change Working Memory (IE) ---
  13549. --- END Application Phase ---
  13550. --- Output Phase ---
  13551. ENV: Agent did: predict-no for direction L in state State-A
  13552. In State-A moving L
  13553. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13554. predict error 0
  13555. dir: dir isL
  13556. --- END Output Phase ---
  13557. |\---- Input Phase ---
  13558. =>WM: (13977: I2 ^dir L)
  13559. =>WM: (13976: I2 ^reward 1)
  13560. =>WM: (13975: I2 ^see 0)
  13561. =>WM: (13974: N995 ^status complete)
  13562. <=WM: (13962: I2 ^dir L)
  13563. <=WM: (13961: I2 ^reward 1)
  13564. <=WM: (13960: I2 ^see 0)
  13565. =>WM: (13978: I2 ^level-1 L0-root)
  13566. <=WM: (13963: I2 ^level-1 L1-root)
  13567. --- END Input Phase ---
  13568. --- Proposal Phase ---
  13569. --- Inner Elaboration Phase, active level 1 (S1) ---
  13570. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  13571. -->
  13572. (S1 ^operator O1989 = -0.208713043145708)
  13573. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  13574. -->
  13575. (S1 ^operator O1990 = 0.6854257503571404)
  13576. Firing prefer*rvt*predict-no*H0*4*H1
  13577. -->
  13578. Firing prefer*rvt*predict-yes*H0*3*H1
  13579. -->
  13580. Firing elaborate*copy-see-to-output-link
  13581. -->
  13582. (I3 ^see 0 +)
  13583. Firing elaborate*reward*based*on*reward
  13584. -->
  13585. (R999 ^value 1 +)
  13586. (R1 ^reward R999 +)
  13587. Firing propose*predict-yes
  13588. -->
  13589. (O1991 ^name predict-yes +)
  13590. (S1 ^operator O1991 +)
  13591. Firing propose*predict-no
  13592. -->
  13593. (O1992 ^name predict-no +)
  13594. (S1 ^operator O1992 +)
  13595. Firing rl*prefer*rvt*predict-no*H0*4
  13596. -->
  13597. (S1 ^operator O1990 = 0.3144988611901438)
  13598. Firing rl*prefer*rvt*predict-yes*H0*3
  13599. -->
  13600. (S1 ^operator O1989 = 0.3907782094907327)
  13601. Firing prefer*rvt*predict-yes*H0
  13602. -->
  13603. Firing prefer*rvt*predict-no*H0
  13604. -->
  13605. Firing elaborate*copy-dir-to-output-link
  13606. -->
  13607. (I3 ^dir L +)
  13608. inner elaboration loop at bottom goal.
  13609. Retracting elaborate*copy-see-to-output-link
  13610. -->
  13611. (I3 ^see 0 +)
  13612. Retracting propose*predict-no
  13613. -->
  13614. (O1990 ^name predict-no +)
  13615. (S1 ^operator O1990 +)
  13616. Retracting propose*predict-yes
  13617. -->
  13618. (O1989 ^name predict-yes +)
  13619. (S1 ^operator O1989 +)
  13620. Retracting elaborate*reward*based*on*reward
  13621. -->
  13622. (R998 ^value 1 +)
  13623. (R1 ^reward R998 +)
  13624. Retracting elaborate*copy-dir-to-output-link
  13625. -->
  13626. (I3 ^dir L +)
  13627. Retracting rl*prefer*rvt*predict-no*H0*4
  13628. -->
  13629. (S1 ^operator O1990 = 0.3144988611901438)
  13630. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  13631. -->
  13632. (S1 ^operator O1990 = 0.685533297663165)
  13633. Retracting rl*prefer*rvt*predict-yes*H0*3
  13634. -->
  13635. (S1 ^operator O1989 = 0.3907782094907327)
  13636. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  13637. -->
  13638. (S1 ^operator O1989 = -0.2062723012911647)
  13639. =>WM: (13984: S1 ^operator O1992 +)
  13640. =>WM: (13983: S1 ^operator O1991 +)
  13641. =>WM: (13982: O1992 ^name predict-no)
  13642. =>WM: (13981: O1991 ^name predict-yes)
  13643. =>WM: (13980: R999 ^value 1)
  13644. =>WM: (13979: R1 ^reward R999)
  13645. <=WM: (13970: S1 ^operator O1989 +)
  13646. <=WM: (13971: S1 ^operator O1990 +)
  13647. <=WM: (13972: S1 ^operator O1990)
  13648. <=WM: (13965: R1 ^reward R998)
  13649. <=WM: (13968: O1990 ^name predict-no)
  13650. <=WM: (13967: O1989 ^name predict-yes)
  13651. <=WM: (13966: R998 ^value 1)
  13652. --- Inner Elaboration Phase, active level 1 (S1) ---
  13653. Firing prefer*rvt*predict-yes*H0
  13654. -->
  13655. Firing rl*prefer*rvt*predict-yes*H0*3
  13656. -->
  13657. (S1 ^operator O1991 = 0.3907782094907327)
  13658. Firing prefer*rvt*predict-yes*H0*3*H1
  13659. -->
  13660. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  13661. -->
  13662. (S1 ^operator O1991 = -0.208713043145708)
  13663. Firing prefer*rvt*predict-no*H0
  13664. -->
  13665. Firing rl*prefer*rvt*predict-no*H0*4
  13666. -->
  13667. (S1 ^operator O1992 = 0.3144988611901438)
  13668. Firing prefer*rvt*predict-no*H0*4*H1
  13669. -->
  13670. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  13671. -->
  13672. (S1 ^operator O1992 = 0.6854257503571404)
  13673. inner elaboration loop at bottom goal.
  13674. Retracting rl*prefer*rvt*predict-no*H0*4
  13675. -->
  13676. (S1 ^operator O1990 = 0.3144988611901438)
  13677. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  13678. -->
  13679. (S1 ^operator O1990 = 0.6854257503571404)
  13680. Retracting rl*prefer*rvt*predict-yes*H0*3
  13681. -->
  13682. (S1 ^operator O1989 = 0.3907782094907327)
  13683. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  13684. -->
  13685. (S1 ^operator O1989 = -0.208713043145708)
  13686. --- END Proposal Phase ---
  13687. --- Decision Phase ---
  13688. RL update rl*prefer*rvt*predict-no*H0*4 0.478548 -0.164049 0.314499 -> 0.478545 -0.164049 0.314496(R,m,v=1,0.922581,0.0718894)
  13689. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521482 0.164052 0.685533 -> 0.521479 0.164051 0.68553(R,m,v=1,1,0)
  13690. =>WM: (13985: S1 ^operator O1992)
  13691. 996: O: O1992 (predict-no)
  13692. --- END Decision Phase ---
  13693. --- Application Phase ---
  13694. --- Firing Productions (PE) For State At Depth 1 ---
  13695. --- Inner Elaboration Phase, active level 1 (S1) ---
  13696. Firing apply*operator
  13697. -->
  13698. (I3 ^predict-no N996 + :O )
  13699. Firing apply*operator*complete
  13700. -->
  13701. (I3 ^predict-no N995 - :O )
  13702. inner elaboration loop at bottom goal.
  13703. --- Change Working Memory (PE) ---
  13704. =>WM: (13986: I3 ^predict-no N996)
  13705. <=WM: (13974: N995 ^status complete)
  13706. <=WM: (13973: I3 ^predict-no N995)
  13707. --- Firing Productions (IE) For State At Depth 1 ---
  13708. --- Inner Elaboration Phase, active level 1 (S1) ---
  13709. Firing monitor*world
  13710. -->
  13711. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13712. --- Change Working Memory (IE) ---
  13713. --- END Application Phase ---
  13714. --- Output Phase ---
  13715. ENV: Agent did: predict-no for direction L in state State-A
  13716. In State-A moving L
  13717. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13718. predict error 0
  13719. dir: dir isL
  13720. --- END Output Phase ---
  13721. /|\--- Input Phase ---
  13722. =>WM: (13990: I2 ^dir L)
  13723. =>WM: (13989: I2 ^reward 1)
  13724. =>WM: (13988: I2 ^see 0)
  13725. =>WM: (13987: N996 ^status complete)
  13726. <=WM: (13977: I2 ^dir L)
  13727. <=WM: (13976: I2 ^reward 1)
  13728. <=WM: (13975: I2 ^see 0)
  13729. =>WM: (13991: I2 ^level-1 L0-root)
  13730. <=WM: (13978: I2 ^level-1 L0-root)
  13731. --- END Input Phase ---
  13732. --- Proposal Phase ---
  13733. --- Inner Elaboration Phase, active level 1 (S1) ---
  13734. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  13735. -->
  13736. (S1 ^operator O1991 = -0.208713043145708)
  13737. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  13738. -->
  13739. (S1 ^operator O1992 = 0.6854257503571404)
  13740. Firing prefer*rvt*predict-no*H0*4*H1
  13741. -->
  13742. Firing prefer*rvt*predict-yes*H0*3*H1
  13743. -->
  13744. Firing elaborate*copy-see-to-output-link
  13745. -->
  13746. (I3 ^see 0 +)
  13747. Firing elaborate*reward*based*on*reward
  13748. -->
  13749. (R1000 ^value 1 +)
  13750. (R1 ^reward R1000 +)
  13751. Firing propose*predict-yes
  13752. -->
  13753. (O1993 ^name predict-yes +)
  13754. (S1 ^operator O1993 +)
  13755. Firing propose*predict-no
  13756. -->
  13757. (O1994 ^name predict-no +)
  13758. (S1 ^operator O1994 +)
  13759. Firing rl*prefer*rvt*predict-no*H0*4
  13760. -->
  13761. (S1 ^operator O1992 = 0.3144962005421928)
  13762. Firing rl*prefer*rvt*predict-yes*H0*3
  13763. -->
  13764. (S1 ^operator O1991 = 0.3907782094907327)
  13765. Firing prefer*rvt*predict-yes*H0
  13766. -->
  13767. Firing prefer*rvt*predict-no*H0
  13768. -->
  13769. Firing elaborate*copy-dir-to-output-link
  13770. -->
  13771. (I3 ^dir L +)
  13772. inner elaboration loop at bottom goal.
  13773. Retracting elaborate*copy-see-to-output-link
  13774. -->
  13775. (I3 ^see 0 +)
  13776. Retracting propose*predict-no
  13777. -->
  13778. (O1992 ^name predict-no +)
  13779. (S1 ^operator O1992 +)
  13780. Retracting propose*predict-yes
  13781. -->
  13782. (O1991 ^name predict-yes +)
  13783. (S1 ^operator O1991 +)
  13784. Retracting elaborate*reward*based*on*reward
  13785. -->
  13786. (R999 ^value 1 +)
  13787. (R1 ^reward R999 +)
  13788. Retracting elaborate*copy-dir-to-output-link
  13789. -->
  13790. (I3 ^dir L +)
  13791. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  13792. -->
  13793. (S1 ^operator O1992 = 0.6854257503571404)
  13794. Retracting rl*prefer*rvt*predict-no*H0*4
  13795. -->
  13796. (S1 ^operator O1992 = 0.3144962005421928)
  13797. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  13798. -->
  13799. (S1 ^operator O1991 = -0.208713043145708)
  13800. Retracting rl*prefer*rvt*predict-yes*H0*3
  13801. -->
  13802. (S1 ^operator O1991 = 0.3907782094907327)
  13803. =>WM: (13997: S1 ^operator O1994 +)
  13804. =>WM: (13996: S1 ^operator O1993 +)
  13805. =>WM: (13995: O1994 ^name predict-no)
  13806. =>WM: (13994: O1993 ^name predict-yes)
  13807. =>WM: (13993: R1000 ^value 1)
  13808. =>WM: (13992: R1 ^reward R1000)
  13809. <=WM: (13983: S1 ^operator O1991 +)
  13810. <=WM: (13984: S1 ^operator O1992 +)
  13811. <=WM: (13985: S1 ^operator O1992)
  13812. <=WM: (13979: R1 ^reward R999)
  13813. <=WM: (13982: O1992 ^name predict-no)
  13814. <=WM: (13981: O1991 ^name predict-yes)
  13815. <=WM: (13980: R999 ^value 1)
  13816. --- Inner Elaboration Phase, active level 1 (S1) ---
  13817. Firing prefer*rvt*predict-yes*H0
  13818. -->
  13819. Firing rl*prefer*rvt*predict-yes*H0*3
  13820. -->
  13821. (S1 ^operator O1993 = 0.3907782094907327)
  13822. Firing prefer*rvt*predict-yes*H0*3*H1
  13823. -->
  13824. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  13825. -->
  13826. (S1 ^operator O1993 = -0.208713043145708)
  13827. Firing prefer*rvt*predict-no*H0
  13828. -->
  13829. Firing rl*prefer*rvt*predict-no*H0*4
  13830. -->
  13831. (S1 ^operator O1994 = 0.3144962005421928)
  13832. Firing prefer*rvt*predict-no*H0*4*H1
  13833. -->
  13834. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  13835. -->
  13836. (S1 ^operator O1994 = 0.6854257503571404)
  13837. inner elaboration loop at bottom goal.
  13838. Retracting rl*prefer*rvt*predict-no*H0*4
  13839. -->
  13840. (S1 ^operator O1992 = 0.3144962005421928)
  13841. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  13842. -->
  13843. (S1 ^operator O1992 = 0.6854257503571404)
  13844. Retracting rl*prefer*rvt*predict-yes*H0*3
  13845. -->
  13846. (S1 ^operator O1991 = 0.3907782094907327)
  13847. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  13848. -->
  13849. (S1 ^operator O1991 = -0.208713043145708)
  13850. --- END Proposal Phase ---
  13851. --- Decision Phase ---
  13852. RL update rl*prefer*rvt*predict-no*H0*4 0.478545 -0.164049 0.314496 -> 0.478551 -0.164048 0.314503(R,m,v=1,0.923077,0.071464)
  13853. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521384 0.164042 0.685426 -> 0.521391 0.164042 0.685433(R,m,v=1,1,0)
  13854. =>WM: (13998: S1 ^operator O1994)
  13855. 997: O: O1994 (predict-no)
  13856. --- END Decision Phase ---
  13857. --- Application Phase ---
  13858. --- Firing Productions (PE) For State At Depth 1 ---
  13859. --- Inner Elaboration Phase, active level 1 (S1) ---
  13860. Firing apply*operator
  13861. -->
  13862. (I3 ^predict-no N997 + :O )
  13863. Firing apply*operator*complete
  13864. -->
  13865. (I3 ^predict-no N996 - :O )
  13866. inner elaboration loop at bottom goal.
  13867. --- Change Working Memory (PE) ---
  13868. =>WM: (13999: I3 ^predict-no N997)
  13869. <=WM: (13987: N996 ^status complete)
  13870. <=WM: (13986: I3 ^predict-no N996)
  13871. --- Firing Productions (IE) For State At Depth 1 ---
  13872. --- Inner Elaboration Phase, active level 1 (S1) ---
  13873. Firing monitor*world
  13874. -->
  13875. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13876. --- Change Working Memory (IE) ---
  13877. --- END Application Phase ---
  13878. --- Output Phase ---
  13879. ENV: Agent did: predict-no for direction L in state State-A
  13880. In State-A moving L
  13881. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13882. predict error 0
  13883. dir: dir isU
  13884. --- END Output Phase ---
  13885. -/--- Input Phase ---
  13886. =>WM: (14003: I2 ^dir U)
  13887. =>WM: (14002: I2 ^reward 1)
  13888. =>WM: (14001: I2 ^see 0)
  13889. =>WM: (14000: N997 ^status complete)
  13890. <=WM: (13990: I2 ^dir L)
  13891. <=WM: (13989: I2 ^reward 1)
  13892. <=WM: (13988: I2 ^see 0)
  13893. =>WM: (14004: I2 ^level-1 L0-root)
  13894. <=WM: (13991: I2 ^level-1 L0-root)
  13895. --- END Input Phase ---
  13896. --- Proposal Phase ---
  13897. --- Inner Elaboration Phase, active level 1 (S1) ---
  13898. Firing elaborate*copy-see-to-output-link
  13899. -->
  13900. (I3 ^see 0 +)
  13901. Firing elaborate*reward*based*on*reward
  13902. -->
  13903. (R1001 ^value 1 +)
  13904. (R1 ^reward R1001 +)
  13905. Firing propose*predict-yes
  13906. -->
  13907. (O1995 ^name predict-yes +)
  13908. (S1 ^operator O1995 +)
  13909. Firing propose*predict-no
  13910. -->
  13911. (O1996 ^name predict-no +)
  13912. (S1 ^operator O1996 +)
  13913. Firing rl*prefer*rvt*predict-no*H0*2
  13914. -->
  13915. (S1 ^operator O1994 = 1.)
  13916. Firing rl*prefer*rvt*predict-yes*H0*1
  13917. -->
  13918. (S1 ^operator O1993 = 0.)
  13919. Firing prefer*rvt*predict-yes*H0
  13920. -->
  13921. Firing prefer*rvt*predict-no*H0
  13922. -->
  13923. Firing elaborate*copy-dir-to-output-link
  13924. -->
  13925. (I3 ^dir U +)
  13926. inner elaboration loop at bottom goal.
  13927. Retracting elaborate*copy-see-to-output-link
  13928. -->
  13929. (I3 ^see 0 +)
  13930. Retracting propose*predict-no
  13931. -->
  13932. (O1994 ^name predict-no +)
  13933. (S1 ^operator O1994 +)
  13934. Retracting propose*predict-yes
  13935. -->
  13936. (O1993 ^name predict-yes +)
  13937. (S1 ^operator O1993 +)
  13938. Retracting elaborate*reward*based*on*reward
  13939. -->
  13940. (R1000 ^value 1 +)
  13941. (R1 ^reward R1000 +)
  13942. Retracting elaborate*copy-dir-to-output-link
  13943. -->
  13944. (I3 ^dir L +)
  13945. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  13946. -->
  13947. (S1 ^operator O1994 = 0.6854332700385593)
  13948. Retracting rl*prefer*rvt*predict-no*H0*4
  13949. -->
  13950. (S1 ^operator O1994 = 0.3145026510346156)
  13951. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  13952. -->
  13953. (S1 ^operator O1993 = -0.208713043145708)
  13954. Retracting rl*prefer*rvt*predict-yes*H0*3
  13955. -->
  13956. (S1 ^operator O1993 = 0.3907782094907327)
  13957. =>WM: (14011: S1 ^operator O1996 +)
  13958. =>WM: (14010: S1 ^operator O1995 +)
  13959. =>WM: (14009: I3 ^dir U)
  13960. =>WM: (14008: O1996 ^name predict-no)
  13961. =>WM: (14007: O1995 ^name predict-yes)
  13962. =>WM: (14006: R1001 ^value 1)
  13963. =>WM: (14005: R1 ^reward R1001)
  13964. <=WM: (13996: S1 ^operator O1993 +)
  13965. <=WM: (13997: S1 ^operator O1994 +)
  13966. <=WM: (13998: S1 ^operator O1994)
  13967. <=WM: (13969: I3 ^dir L)
  13968. <=WM: (13992: R1 ^reward R1000)
  13969. <=WM: (13995: O1994 ^name predict-no)
  13970. <=WM: (13994: O1993 ^name predict-yes)
  13971. <=WM: (13993: R1000 ^value 1)
  13972. --- Inner Elaboration Phase, active level 1 (S1) ---
  13973. Firing prefer*rvt*predict-yes*H0
  13974. -->
  13975. Firing rl*prefer*rvt*predict-yes*H0*1
  13976. -->
  13977. (S1 ^operator O1995 = 0.)
  13978. Firing prefer*rvt*predict-no*H0
  13979. -->
  13980. Firing rl*prefer*rvt*predict-no*H0*2
  13981. -->
  13982. (S1 ^operator O1996 = 1.)
  13983. inner elaboration loop at bottom goal.
  13984. Retracting rl*prefer*rvt*predict-no*H0*2
  13985. -->
  13986. (S1 ^operator O1994 = 1.)
  13987. Retracting rl*prefer*rvt*predict-yes*H0*1
  13988. -->
  13989. (S1 ^operator O1993 = 0.)
  13990. --- END Proposal Phase ---
  13991. --- Decision Phase ---
  13992. RL update rl*prefer*rvt*predict-no*H0*4 0.478551 -0.164048 0.314503 -> 0.478556 -0.164048 0.314508(R,m,v=1,0.923567,0.0710436)
  13993. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521391 0.164042 0.685433 -> 0.521396 0.164043 0.685439(R,m,v=1,1,0)
  13994. =>WM: (14012: S1 ^operator O1996)
  13995. 998: O: O1996 (predict-no)
  13996. --- END Decision Phase ---
  13997. --- Application Phase ---
  13998. --- Firing Productions (PE) For State At Depth 1 ---
  13999. --- Inner Elaboration Phase, active level 1 (S1) ---
  14000. Firing apply*operator
  14001. -->
  14002. (I3 ^predict-no N998 + :O )
  14003. Firing apply*operator*complete
  14004. -->
  14005. (I3 ^predict-no N997 - :O )
  14006. inner elaboration loop at bottom goal.
  14007. --- Change Working Memory (PE) ---
  14008. =>WM: (14013: I3 ^predict-no N998)
  14009. <=WM: (14000: N997 ^status complete)
  14010. <=WM: (13999: I3 ^predict-no N997)
  14011. --- Firing Productions (IE) For State At Depth 1 ---
  14012. --- Inner Elaboration Phase, active level 1 (S1) ---
  14013. Firing monitor*world
  14014. -->
  14015. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14016. --- Change Working Memory (IE) ---
  14017. --- END Application Phase ---
  14018. --- Output Phase ---
  14019. ENV: Agent did: predict-no for direction U in state State-A
  14020. In State-A moving U
  14021. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14022. predict error 0
  14023. dir: dir isR
  14024. --- END Output Phase ---
  14025. |\--- Input Phase ---
  14026. =>WM: (14017: I2 ^dir R)
  14027. =>WM: (14016: I2 ^reward 1)
  14028. =>WM: (14015: I2 ^see 0)
  14029. =>WM: (14014: N998 ^status complete)
  14030. <=WM: (14003: I2 ^dir U)
  14031. <=WM: (14002: I2 ^reward 1)
  14032. <=WM: (14001: I2 ^see 0)
  14033. =>WM: (14018: I2 ^level-1 L0-root)
  14034. <=WM: (14004: I2 ^level-1 L0-root)
  14035. --- END Input Phase ---
  14036. --- Proposal Phase ---
  14037. --- Inner Elaboration Phase, active level 1 (S1) ---
  14038. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  14039. -->
  14040. (S1 ^operator O1995 = 0.8783951706845293)
  14041. Firing prefer*rvt*predict-yes*H0*5*H1
  14042. -->
  14043. Firing elaborate*copy-see-to-output-link
  14044. -->
  14045. (I3 ^see 0 +)
  14046. Firing elaborate*reward*based*on*reward
  14047. -->
  14048. (R1002 ^value 1 +)
  14049. (R1 ^reward R1002 +)
  14050. Firing propose*predict-yes
  14051. -->
  14052. (O1997 ^name predict-yes +)
  14053. (S1 ^operator O1997 +)
  14054. Firing propose*predict-no
  14055. -->
  14056. (O1998 ^name predict-no +)
  14057. (S1 ^operator O1998 +)
  14058. Firing rl*prefer*rvt*predict-no*H0*6
  14059. -->
  14060. (S1 ^operator O1996 = 0.9999888743986174)
  14061. Firing rl*prefer*rvt*predict-yes*H0*5
  14062. -->
  14063. (S1 ^operator O1995 = 0.1215989443698621)
  14064. Firing prefer*rvt*predict-yes*H0
  14065. -->
  14066. Firing prefer*rvt*predict-no*H0
  14067. -->
  14068. Firing elaborate*copy-dir-to-output-link
  14069. -->
  14070. (I3 ^dir R +)
  14071. inner elaboration loop at bottom goal.
  14072. Retracting elaborate*copy-see-to-output-link
  14073. -->
  14074. (I3 ^see 0 +)
  14075. Retracting propose*predict-no
  14076. -->
  14077. (O1996 ^name predict-no +)
  14078. (S1 ^operator O1996 +)
  14079. Retracting propose*predict-yes
  14080. -->
  14081. (O1995 ^name predict-yes +)
  14082. (S1 ^operator O1995 +)
  14083. Retracting elaborate*reward*based*on*reward
  14084. -->
  14085. (R1001 ^value 1 +)
  14086. (R1 ^reward R1001 +)
  14087. Retracting elaborate*copy-dir-to-output-link
  14088. -->
  14089. (I3 ^dir U +)
  14090. Retracting rl*prefer*rvt*predict-no*H0*2
  14091. -->
  14092. (S1 ^operator O1996 = 1.)
  14093. Retracting rl*prefer*rvt*predict-yes*H0*1
  14094. -->
  14095. (S1 ^operator O1995 = 0.)
  14096. =>WM: (14025: S1 ^operator O1998 +)
  14097. =>WM: (14024: S1 ^operator O1997 +)
  14098. =>WM: (14023: I3 ^dir R)
  14099. =>WM: (14022: O1998 ^name predict-no)
  14100. =>WM: (14021: O1997 ^name predict-yes)
  14101. =>WM: (14020: R1002 ^value 1)
  14102. =>WM: (14019: R1 ^reward R1002)
  14103. <=WM: (14010: S1 ^operator O1995 +)
  14104. <=WM: (14011: S1 ^operator O1996 +)
  14105. <=WM: (14012: S1 ^operator O1996)
  14106. <=WM: (14009: I3 ^dir U)
  14107. <=WM: (14005: R1 ^reward R1001)
  14108. <=WM: (14008: O1996 ^name predict-no)
  14109. <=WM: (14007: O1995 ^name predict-yes)
  14110. <=WM: (14006: R1001 ^value 1)
  14111. --- Inner Elaboration Phase, active level 1 (S1) ---
  14112. Firing prefer*rvt*predict-yes*H0
  14113. -->
  14114. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  14115. -->
  14116. (S1 ^operator O1997 = 0.8783951706845293)
  14117. Firing rl*prefer*rvt*predict-yes*H0*5
  14118. -->
  14119. (S1 ^operator O1997 = 0.1215989443698621)
  14120. Firing prefer*rvt*predict-yes*H0*5*H1
  14121. -->
  14122. Firing prefer*rvt*predict-no*H0
  14123. -->
  14124. Firing rl*prefer*rvt*predict-no*H0*6
  14125. -->
  14126. (S1 ^operator O1998 = 0.9999888743986174)
  14127. inner elaboration loop at bottom goal.
  14128. Retracting rl*prefer*rvt*predict-no*H0*6
  14129. -->
  14130. (S1 ^operator O1996 = 0.9999888743986174)
  14131. Retracting rl*prefer*rvt*predict-yes*H0*5
  14132. -->
  14133. (S1 ^operator O1995 = 0.1215989443698621)
  14134. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  14135. -->
  14136. (S1 ^operator O1995 = 0.8783951706845293)
  14137. --- END Proposal Phase ---
  14138. --- Decision Phase ---
  14139. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14140. =>WM: (14026: S1 ^operator O1997)
  14141. 999: O: O1997 (predict-yes)
  14142. --- END Decision Phase ---
  14143. --- Application Phase ---
  14144. --- Firing Productions (PE) For State At Depth 1 ---
  14145. --- Inner Elaboration Phase, active level 1 (S1) ---
  14146. Firing apply*operator
  14147. -->
  14148. (I3 ^predict-yes N999 + :O )
  14149. Firing apply*operator*complete
  14150. -->
  14151. (I3 ^predict-no N998 - :O )
  14152. inner elaboration loop at bottom goal.
  14153. --- Change Working Memory (PE) ---
  14154. =>WM: (14027: I3 ^predict-yes N999)
  14155. <=WM: (14014: N998 ^status complete)
  14156. <=WM: (14013: I3 ^predict-no N998)
  14157. --- Firing Productions (IE) For State At Depth 1 ---
  14158. --- Inner Elaboration Phase, active level 1 (S1) ---
  14159. Firing monitor*world
  14160. -->
  14161. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14162. --- Change Working Memory (IE) ---
  14163. --- END Application Phase ---
  14164. --- Output Phase ---
  14165. ENV: Agent did: predict-yes for direction R in state State-A
  14166. In State-A moving R
  14167. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  14168. predict error 0
  14169. dir: dir isU
  14170. --- END Output Phase ---
  14171. -/|--- Input Phase ---
  14172. =>WM: (14031: I2 ^dir U)
  14173. =>WM: (14030: I2 ^reward 1)
  14174. =>WM: (14029: I2 ^see 1)
  14175. =>WM: (14028: N999 ^status complete)
  14176. <=WM: (14017: I2 ^dir R)
  14177. <=WM: (14016: I2 ^reward 1)
  14178. <=WM: (14015: I2 ^see 0)
  14179. =>WM: (14032: I2 ^level-1 R1-root)
  14180. <=WM: (14018: I2 ^level-1 L0-root)
  14181. --- END Input Phase ---
  14182. --- Proposal Phase ---
  14183. --- Inner Elaboration Phase, active level 1 (S1) ---
  14184. Firing elaborate*copy-see-to-output-link
  14185. -->
  14186. (I3 ^see 1 +)
  14187. Firing elaborate*reward*based*on*reward
  14188. -->
  14189. (R1003 ^value 1 +)
  14190. (R1 ^reward R1003 +)
  14191. Firing propose*predict-yes
  14192. -->
  14193. (O1999 ^name predict-yes +)
  14194. (S1 ^operator O1999 +)
  14195. Firing propose*predict-no
  14196. -->
  14197. (O2000 ^name predict-no +)
  14198. (S1 ^operator O2000 +)
  14199. Firing rl*prefer*rvt*predict-no*H0*2
  14200. -->
  14201. (S1 ^operator O1998 = 1.)
  14202. Firing rl*prefer*rvt*predict-yes*H0*1
  14203. -->
  14204. (S1 ^operator O1997 = 0.)
  14205. Firing prefer*rvt*predict-yes*H0
  14206. -->
  14207. Firing prefer*rvt*predict-no*H0
  14208. -->
  14209. Firing elaborate*copy-dir-to-output-link
  14210. -->
  14211. (I3 ^dir U +)
  14212. inner elaboration loop at bottom goal.
  14213. Retracting elaborate*copy-see-to-output-link
  14214. -->
  14215. (I3 ^see 0 +)
  14216. Retracting propose*predict-no
  14217. -->
  14218. (O1998 ^name predict-no +)
  14219. (S1 ^operator O1998 +)
  14220. Retracting propose*predict-yes
  14221. -->
  14222. (O1997 ^name predict-yes +)
  14223. (S1 ^operator O1997 +)
  14224. Retracting elaborate*reward*based*on*reward
  14225. -->
  14226. (R1002 ^value 1 +)
  14227. (R1 ^reward R1002 +)
  14228. Retracting elaborate*copy-dir-to-output-link
  14229. -->
  14230. (I3 ^dir R +)
  14231. Retracting rl*prefer*rvt*predict-no*H0*6
  14232. -->
  14233. (S1 ^operator O1998 = 0.9999888743986174)
  14234. Retracting rl*prefer*rvt*predict-yes*H0*5
  14235. -->
  14236. (S1 ^operator O1997 = 0.1215989443698621)
  14237. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  14238. -->
  14239. (S1 ^operator O1997 = 0.8783951706845293)
  14240. =>WM: (14040: S1 ^operator O2000 +)
  14241. =>WM: (14039: S1 ^operator O1999 +)
  14242. =>WM: (14038: I3 ^dir U)
  14243. =>WM: (14037: O2000 ^name predict-no)
  14244. =>WM: (14036: O1999 ^name predict-yes)
  14245. =>WM: (14035: R1003 ^value 1)
  14246. =>WM: (14034: R1 ^reward R1003)
  14247. =>WM: (14033: I3 ^see 1)
  14248. <=WM: (14024: S1 ^operator O1997 +)
  14249. <=WM: (14026: S1 ^operator O1997)
  14250. <=WM: (14025: S1 ^operator O1998 +)
  14251. <=WM: (14023: I3 ^dir R)
  14252. <=WM: (14019: R1 ^reward R1002)
  14253. <=WM: (13964: I3 ^see 0)
  14254. <=WM: (14022: O1998 ^name predict-no)
  14255. <=WM: (14021: O1997 ^name predict-yes)
  14256. <=WM: (14020: R1002 ^value 1)
  14257. --- Inner Elaboration Phase, active level 1 (S1) ---
  14258. Firing prefer*rvt*predict-yes*H0
  14259. -->
  14260. Firing rl*prefer*rvt*predict-yes*H0*1
  14261. -->
  14262. (S1 ^operator O1999 = 0.)
  14263. Firing prefer*rvt*predict-no*H0
  14264. -->
  14265. Firing rl*prefer*rvt*predict-no*H0*2
  14266. -->
  14267. (S1 ^operator O2000 = 1.)
  14268. inner elaboration loop at bottom goal.
  14269. Retracting rl*prefer*rvt*predict-no*H0*2
  14270. -->
  14271. (S1 ^operator O1998 = 1.)
  14272. Retracting rl*prefer*rvt*predict-yes*H0*1
  14273. -->
  14274. (S1 ^operator O1997 = 0.)
  14275. --- END Proposal Phase ---
  14276. --- Decision Phase ---
  14277. RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.864407,0.117874)
  14278. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.46547 0.412925 0.878395 -> 0.465471 0.412925 0.878396(R,m,v=1,1,0)
  14279. =>WM: (14041: S1 ^operator O2000)
  14280. 1000: O: O2000 (predict-no)
  14281. --- END Decision Phase ---
  14282. --- Application Phase ---
  14283. --- Firing Productions (PE) For State At Depth 1 ---
  14284. --- Inner Elaboration Phase, active level 1 (S1) ---
  14285. Firing apply*operator
  14286. -->
  14287. (I3 ^predict-no N1000 + :O )
  14288. Firing apply*operator*complete
  14289. -->
  14290. (I3 ^predict-yes N999 - :O )
  14291. inner elaboration loop at bottom goal.
  14292. --- Change Working Memory (PE) ---
  14293. =>WM: (14042: I3 ^predict-no N1000)
  14294. <=WM: (14028: N999 ^status complete)
  14295. <=WM: (14027: I3 ^predict-yes N999)
  14296. --- Firing Productions (IE) For State At Depth 1 ---
  14297. --- Inner Elaboration Phase, active level 1 (S1) ---
  14298. Firing monitor*world
  14299. -->
  14300. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14301. --- Change Working Memory (IE) ---
  14302. --- END Application Phase ---
  14303. --- Output Phase ---
  14304. ENV: Agent did: predict-no for direction U in state State-B
  14305. In State-B moving U
  14306. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14307. predict error 0
  14308. dir: dir isU
  14309. --- END Output Phase ---
  14310. \-/|\-/|--- Input Phase ---
  14311. =>WM: (14046: I2 ^dir U)
  14312. =>WM: (14045: I2 ^reward 1)
  14313. =>WM: (14044: I2 ^see 0)
  14314. =>WM: (14043: N1000 ^status complete)
  14315. <=WM: (14031: I2 ^dir U)
  14316. <=WM: (14030: I2 ^reward 1)
  14317. <=WM: (14029: I2 ^see 1)
  14318. =>WM: (14047: I2 ^level-1 R1-root)
  14319. <=WM: (14032: I2 ^level-1 R1-root)
  14320. --- END Input Phase ---
  14321. --- Proposal Phase ---
  14322. --- Inner Elaboration Phase, active level 1 (S1) ---
  14323. Firing elaborate*copy-see-to-output-link
  14324. -->
  14325. (I3 ^see 0 +)
  14326. Firing elaborate*reward*based*on*reward
  14327. -->
  14328. (R1004 ^value 1 +)
  14329. (R1 ^reward R1004 +)
  14330. Firing propose*predict-yes
  14331. -->
  14332. (O2001 ^name predict-yes +)
  14333. (S1 ^operator O2001 +)
  14334. Firing propose*predict-no
  14335. -->
  14336. (O2002 ^name predict-no +)
  14337. (S1 ^operator O2002 +)
  14338. Firing rl*prefer*rvt*predict-no*H0*2
  14339. -->
  14340. (S1 ^operator O2000 = 1.)
  14341. Firing rl*prefer*rvt*predict-yes*H0*1
  14342. -->
  14343. (S1 ^operator O1999 = 0.)
  14344. Firing prefer*rvt*predict-yes*H0
  14345. -->
  14346. Firing prefer*rvt*predict-no*H0
  14347. -->
  14348. Firing elaborate*copy-dir-to-output-link
  14349. -->
  14350. (I3 ^dir U +)
  14351. inner elaboration loop at bottom goal.
  14352. Retracting elaborate*copy-see-to-output-link
  14353. -->
  14354. (I3 ^see 1 +)
  14355. Retracting propose*predict-no
  14356. -->
  14357. (O2000 ^name predict-no +)
  14358. (S1 ^operator O2000 +)
  14359. Retracting propose*predict-yes
  14360. -->
  14361. (O1999 ^name predict-yes +)
  14362. (S1 ^operator O1999 +)
  14363. Retracting elaborate*reward*based*on*reward
  14364. -->
  14365. (R1003 ^value 1 +)
  14366. (R1 ^reward R1003 +)
  14367. Retracting elaborate*copy-dir-to-output-link
  14368. -->
  14369. (I3 ^dir U +)
  14370. Retracting rl*prefer*rvt*predict-no*H0*2
  14371. -->
  14372. (S1 ^operator O2000 = 1.)
  14373. Retracting rl*prefer*rvt*predict-yes*H0*1
  14374. -->
  14375. (S1 ^operator O1999 = 0.)
  14376. =>WM: (14054: S1 ^operator O2002 +)
  14377. =>WM: (14053: S1 ^operator O2001 +)
  14378. =>WM: (14052: O2002 ^name predict-no)
  14379. =>WM: (14051: O2001 ^name predict-yes)
  14380. =>WM: (14050: R1004 ^value 1)
  14381. =>WM: (14049: R1 ^reward R1004)
  14382. =>WM: (14048: I3 ^see 0)
  14383. <=WM: (14039: S1 ^operator O1999 +)
  14384. <=WM: (14040: S1 ^operator O2000 +)
  14385. <=WM: (14041: S1 ^operator O2000)
  14386. <=WM: (14034: R1 ^reward R1003)
  14387. <=WM: (14033: I3 ^see 1)
  14388. <=WM: (14037: O2000 ^name predict-no)
  14389. <=WM: (14036: O1999 ^name predict-yes)
  14390. <=WM: (14035: R1003 ^value 1)
  14391. --- Inner Elaboration Phase, active level 1 (S1) ---
  14392. Firing prefer*rvt*predict-yes*H0
  14393. -->
  14394. Firing rl*prefer*rvt*predict-yes*H0*1
  14395. -->
  14396. (S1 ^operator O2001 = 0.)
  14397. Firing prefer*rvt*predict-no*H0
  14398. -->
  14399. Firing rl*prefer*rvt*predict-no*H0*2
  14400. -->
  14401. (S1 ^operator O2002 = 1.)
  14402. inner elaboration loop at bottom goal.
  14403. Retracting rl*prefer*rvt*predict-no*H0*2
  14404. -->
  14405. (S1 ^operator O2000 = 1.)
  14406. Retracting rl*prefer*rvt*predict-yes*H0*1
  14407. -->
  14408. (S1 ^operator O1999 = 0.)
  14409. --- END Proposal Phase ---
  14410. --- Decision Phase ---
  14411. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14412. =>WM: (14055: S1 ^operator O2002)
  14413. 1001: O: O2002 (predict-no)
  14414. --- END Decision Phase ---
  14415. --- Application Phase ---
  14416. --- Firing Productions (PE) For State At Depth 1 ---
  14417. --- Inner Elaboration Phase, active level 1 (S1) ---
  14418. Firing apply*operator
  14419. -->
  14420. (I3 ^predict-no N1001 + :O )
  14421. Firing apply*operator*complete
  14422. -->
  14423. (I3 ^predict-no N1000 - :O )
  14424. inner elaboration loop at bottom goal.
  14425. --- Change Working Memory (PE) ---
  14426. =>WM: (14056: I3 ^predict-no N1001)
  14427. <=WM: (14043: N1000 ^status complete)
  14428. <=WM: (14042: I3 ^predict-no N1000)
  14429. --- Firing Productions (IE) For State At Depth 1 ---
  14430. --- Inner Elaboration Phase, active level 1 (S1) ---
  14431. Firing monitor*world
  14432. -->
  14433. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14434. --- Change Working Memory (IE) ---
  14435. --- END Application Phase ---
  14436. --- Output Phase ---
  14437. ENV: Agent did: predict-no for direction U in state State-B
  14438. In State-B moving U
  14439. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14440. predict error 0
  14441. dir: dir isU
  14442. --- END Output Phase ---
  14443. \--- Input Phase ---
  14444. =>WM: (14060: I2 ^dir U)
  14445. =>WM: (14059: I2 ^reward 1)
  14446. =>WM: (14058: I2 ^see 0)
  14447. =>WM: (14057: N1001 ^status complete)
  14448. <=WM: (14046: I2 ^dir U)
  14449. <=WM: (14045: I2 ^reward 1)
  14450. <=WM: (14044: I2 ^see 0)
  14451. =>WM: (14061: I2 ^level-1 R1-root)
  14452. <=WM: (14047: I2 ^level-1 R1-root)
  14453. --- END Input Phase ---
  14454. --- Proposal Phase ---
  14455. --- Inner Elaboration Phase, active level 1 (S1) ---
  14456. Firing elaborate*copy-see-to-output-link
  14457. -->
  14458. (I3 ^see 0 +)
  14459. Firing elaborate*reward*based*on*reward
  14460. -->
  14461. (R1005 ^value 1 +)
  14462. (R1 ^reward R1005 +)
  14463. Firing propose*predict-yes
  14464. -->
  14465. (O2003 ^name predict-yes +)
  14466. (S1 ^operator O2003 +)
  14467. Firing propose*predict-no
  14468. -->
  14469. (O2004 ^name predict-no +)
  14470. (S1 ^operator O2004 +)
  14471. Firing rl*prefer*rvt*predict-no*H0*2
  14472. -->
  14473. (S1 ^operator O2002 = 1.)
  14474. Firing rl*prefer*rvt*predict-yes*H0*1
  14475. -->
  14476. (S1 ^operator O2001 = 0.)
  14477. Firing prefer*rvt*predict-yes*H0
  14478. -->
  14479. Firing prefer*rvt*predict-no*H0
  14480. -->
  14481. Firing elaborate*copy-dir-to-output-link
  14482. -->
  14483. (I3 ^dir U +)
  14484. inner elaboration loop at bottom goal.
  14485. Retracting elaborate*copy-see-to-output-link
  14486. -->
  14487. (I3 ^see 0 +)
  14488. Retracting propose*predict-no
  14489. -->
  14490. (O2002 ^name predict-no +)
  14491. (S1 ^operator O2002 +)
  14492. Retracting propose*predict-yes
  14493. -->
  14494. (O2001 ^name predict-yes +)
  14495. (S1 ^operator O2001 +)
  14496. Retracting elaborate*reward*based*on*reward
  14497. -->
  14498. (R1004 ^value 1 +)
  14499. (R1 ^reward R1004 +)
  14500. Retracting elaborate*copy-dir-to-output-link
  14501. -->
  14502. (I3 ^dir U +)
  14503. Retracting rl*prefer*rvt*predict-no*H0*2
  14504. -->
  14505. (S1 ^operator O2002 = 1.)
  14506. Retracting rl*prefer*rvt*predict-yes*H0*1
  14507. -->
  14508. (S1 ^operator O2001 = 0.)
  14509. =>WM: (14067: S1 ^operator O2004 +)
  14510. =>WM: (14066: S1 ^operator O2003 +)
  14511. =>WM: (14065: O2004 ^name predict-no)
  14512. =>WM: (14064: O2003 ^name predict-yes)
  14513. =>WM: (14063: R1005 ^value 1)
  14514. =>WM: (14062: R1 ^reward R1005)
  14515. <=WM: (14053: S1 ^operator O2001 +)
  14516. <=WM: (14054: S1 ^operator O2002 +)
  14517. <=WM: (14055: S1 ^operator O2002)
  14518. <=WM: (14049: R1 ^reward R1004)
  14519. <=WM: (14052: O2002 ^name predict-no)
  14520. <=WM: (14051: O2001 ^name predict-yes)
  14521. <=WM: (14050: R1004 ^value 1)
  14522. --- Inner Elaboration Phase, active level 1 (S1) ---
  14523. Firing prefer*rvt*predict-yes*H0
  14524. -->
  14525. Firing rl*prefer*rvt*predict-yes*H0*1
  14526. -->
  14527. (S1 ^operator O2003 = 0.)
  14528. Firing prefer*rvt*predict-no*H0
  14529. -->
  14530. Firing rl*prefer*rvt*predict-no*H0*2
  14531. -->
  14532. (S1 ^operator O2004 = 1.)
  14533. inner elaboration loop at bottom goal.
  14534. Retracting rl*prefer*rvt*predict-no*H0*2
  14535. -->
  14536. (S1 ^operator O2002 = 1.)
  14537. Retracting rl*prefer*rvt*predict-yes*H0*1
  14538. -->
  14539. (S1 ^operator O2001 = 0.)
  14540. --- END Proposal Phase ---
  14541. --- Decision Phase ---
  14542. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14543. =>WM: (14068: S1 ^operator O2004)
  14544. 1002: O: O2004 (predict-no)
  14545. --- END Decision Phase ---
  14546. --- Application Phase ---
  14547. --- Firing Productions (PE) For State At Depth 1 ---
  14548. --- Inner Elaboration Phase, active level 1 (S1) ---
  14549. Firing apply*operator
  14550. -->
  14551. (I3 ^predict-no N1002 + :O )
  14552. Firing apply*operator*complete
  14553. -->
  14554. (I3 ^predict-no N1001 - :O )
  14555. inner elaboration loop at bottom goal.
  14556. --- Change Working Memory (PE) ---
  14557. =>WM: (14069: I3 ^predict-no N1002)
  14558. <=WM: (14057: N1001 ^status complete)
  14559. <=WM: (14056: I3 ^predict-no N1001)
  14560. --- Firing Productions (IE) For State At Depth 1 ---
  14561. --- Inner Elaboration Phase, active level 1 (S1) ---
  14562. Firing monitor*world
  14563. -->
  14564. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14565. --- Change Working Memory (IE) ---
  14566. --- END Application Phase ---
  14567. --- Output Phase ---
  14568. ENV: Agent did: predict-no for direction U in state State-B
  14569. In State-B moving U
  14570. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14571. predict error 0
  14572. dir: dir isR
  14573. --- END Output Phase ---
  14574. -/--- Input Phase ---
  14575. =>WM: (14073: I2 ^dir R)
  14576. =>WM: (14072: I2 ^reward 1)
  14577. =>WM: (14071: I2 ^see 0)
  14578. =>WM: (14070: N1002 ^status complete)
  14579. <=WM: (14060: I2 ^dir U)
  14580. <=WM: (14059: I2 ^reward 1)
  14581. <=WM: (14058: I2 ^see 0)
  14582. =>WM: (14074: I2 ^level-1 R1-root)
  14583. <=WM: (14061: I2 ^level-1 R1-root)
  14584. --- END Input Phase ---
  14585. --- Proposal Phase ---
  14586. --- Inner Elaboration Phase, active level 1 (S1) ---
  14587. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  14588. -->
  14589. (S1 ^operator O2003 = -0.04253361215288998)
  14590. Firing prefer*rvt*predict-yes*H0*5*H1
  14591. -->
  14592. Firing elaborate*copy-see-to-output-link
  14593. -->
  14594. (I3 ^see 0 +)
  14595. Firing elaborate*reward*based*on*reward
  14596. -->
  14597. (R1006 ^value 1 +)
  14598. (R1 ^reward R1006 +)
  14599. Firing propose*predict-yes
  14600. -->
  14601. (O2005 ^name predict-yes +)
  14602. (S1 ^operator O2005 +)
  14603. Firing propose*predict-no
  14604. -->
  14605. (O2006 ^name predict-no +)
  14606. (S1 ^operator O2006 +)
  14607. Firing rl*prefer*rvt*predict-no*H0*6
  14608. -->
  14609. (S1 ^operator O2004 = 0.9999888743986174)
  14610. Firing rl*prefer*rvt*predict-yes*H0*5
  14611. -->
  14612. (S1 ^operator O2003 = 0.1215994207949702)
  14613. Firing prefer*rvt*predict-yes*H0
  14614. -->
  14615. Firing prefer*rvt*predict-no*H0
  14616. -->
  14617. Firing elaborate*copy-dir-to-output-link
  14618. -->
  14619. (I3 ^dir R +)
  14620. inner elaboration loop at bottom goal.
  14621. Retracting elaborate*copy-see-to-output-link
  14622. -->
  14623. (I3 ^see 0 +)
  14624. Retracting propose*predict-no
  14625. -->
  14626. (O2004 ^name predict-no +)
  14627. (S1 ^operator O2004 +)
  14628. Retracting propose*predict-yes
  14629. -->
  14630. (O2003 ^name predict-yes +)
  14631. (S1 ^operator O2003 +)
  14632. Retracting elaborate*reward*based*on*reward
  14633. -->
  14634. (R1005 ^value 1 +)
  14635. (R1 ^reward R1005 +)
  14636. Retracting elaborate*copy-dir-to-output-link
  14637. -->
  14638. (I3 ^dir U +)
  14639. Retracting rl*prefer*rvt*predict-no*H0*2
  14640. -->
  14641. (S1 ^operator O2004 = 1.)
  14642. Retracting rl*prefer*rvt*predict-yes*H0*1
  14643. -->
  14644. (S1 ^operator O2003 = 0.)
  14645. =>WM: (14081: S1 ^operator O2006 +)
  14646. =>WM: (14080: S1 ^operator O2005 +)
  14647. =>WM: (14079: I3 ^dir R)
  14648. =>WM: (14078: O2006 ^name predict-no)
  14649. =>WM: (14077: O2005 ^name predict-yes)
  14650. =>WM: (14076: R1006 ^value 1)
  14651. =>WM: (14075: R1 ^reward R1006)
  14652. <=WM: (14066: S1 ^operator O2003 +)
  14653. <=WM: (14067: S1 ^operator O2004 +)
  14654. <=WM: (14068: S1 ^operator O2004)
  14655. <=WM: (14038: I3 ^dir U)
  14656. <=WM: (14062: R1 ^reward R1005)
  14657. <=WM: (14065: O2004 ^name predict-no)
  14658. <=WM: (14064: O2003 ^name predict-yes)
  14659. <=WM: (14063: R1005 ^value 1)
  14660. --- Inner Elaboration Phase, active level 1 (S1) ---
  14661. Firing prefer*rvt*predict-yes*H0
  14662. -->
  14663. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  14664. -->
  14665. (S1 ^operator O2005 = -0.04253361215288998)
  14666. Firing rl*prefer*rvt*predict-yes*H0*5
  14667. -->
  14668. (S1 ^operator O2005 = 0.1215994207949702)
  14669. Firing prefer*rvt*predict-yes*H0*5*H1
  14670. -->
  14671. Firing prefer*rvt*predict-no*H0
  14672. -->
  14673. Firing rl*prefer*rvt*predict-no*H0*6
  14674. -->
  14675. (S1 ^operator O2006 = 0.9999888743986174)
  14676. inner elaboration loop at bottom goal.
  14677. Retracting rl*prefer*rvt*predict-no*H0*6
  14678. -->
  14679. (S1 ^operator O2004 = 0.9999888743986174)
  14680. Retracting rl*prefer*rvt*predict-yes*H0*5
  14681. -->
  14682. (S1 ^operator O2003 = 0.1215994207949702)
  14683. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  14684. -->
  14685. (S1 ^operator O2003 = -0.04253361215288998)
  14686. --- END Proposal Phase ---
  14687. --- Decision Phase ---
  14688. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14689. =>WM: (14082: S1 ^operator O2006)
  14690. 1003: O: O2006 (predict-no)
  14691. --- END Decision Phase ---
  14692. --- Application Phase ---
  14693. --- Firing Productions (PE) For State At Depth 1 ---
  14694. --- Inner Elaboration Phase, active level 1 (S1) ---
  14695. Firing apply*operator
  14696. -->
  14697. (I3 ^predict-no N1003 + :O )
  14698. Firing apply*operator*complete
  14699. -->
  14700. (I3 ^predict-no N1002 - :O )
  14701. inner elaboration loop at bottom goal.
  14702. --- Change Working Memory (PE) ---
  14703. =>WM: (14083: I3 ^predict-no N1003)
  14704. <=WM: (14070: N1002 ^status complete)
  14705. <=WM: (14069: I3 ^predict-no N1002)
  14706. --- Firing Productions (IE) For State At Depth 1 ---
  14707. --- Inner Elaboration Phase, active level 1 (S1) ---
  14708. Firing monitor*world
  14709. -->
  14710. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14711. --- Change Working Memory (IE) ---
  14712. --- END Application Phase ---
  14713. --- Output Phase ---
  14714. ENV: Agent did: predict-no for direction R in state State-B
  14715. In State-B moving R
  14716. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14717. predict error 0
  14718. dir: dir isR
  14719. --- END Output Phase ---
  14720. |\---- Input Phase ---
  14721. =>WM: (14087: I2 ^dir R)
  14722. =>WM: (14086: I2 ^reward 1)
  14723. =>WM: (14085: I2 ^see 0)
  14724. =>WM: (14084: N1003 ^status complete)
  14725. <=WM: (14073: I2 ^dir R)
  14726. <=WM: (14072: I2 ^reward 1)
  14727. <=WM: (14071: I2 ^see 0)
  14728. =>WM: (14088: I2 ^level-1 R0-root)
  14729. <=WM: (14074: I2 ^level-1 R1-root)
  14730. --- END Input Phase ---
  14731. --- Proposal Phase ---
  14732. --- Inner Elaboration Phase, active level 1 (S1) ---
  14733. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  14734. -->
  14735. (S1 ^operator O2005 = -0.1512366769350551)
  14736. Firing prefer*rvt*predict-yes*H0*5*H1
  14737. -->
  14738. Firing elaborate*copy-see-to-output-link
  14739. -->
  14740. (I3 ^see 0 +)
  14741. Firing elaborate*reward*based*on*reward
  14742. -->
  14743. (R1007 ^value 1 +)
  14744. (R1 ^reward R1007 +)
  14745. Firing propose*predict-yes
  14746. -->
  14747. (O2007 ^name predict-yes +)
  14748. (S1 ^operator O2007 +)
  14749. Firing propose*predict-no
  14750. -->
  14751. (O2008 ^name predict-no +)
  14752. (S1 ^operator O2008 +)
  14753. Firing rl*prefer*rvt*predict-no*H0*6
  14754. -->
  14755. (S1 ^operator O2006 = 0.9999888743986174)
  14756. Firing rl*prefer*rvt*predict-yes*H0*5
  14757. -->
  14758. (S1 ^operator O2005 = 0.1215994207949702)
  14759. Firing prefer*rvt*predict-yes*H0
  14760. -->
  14761. Firing prefer*rvt*predict-no*H0
  14762. -->
  14763. Firing elaborate*copy-dir-to-output-link
  14764. -->
  14765. (I3 ^dir R +)
  14766. inner elaboration loop at bottom goal.
  14767. Retracting elaborate*copy-see-to-output-link
  14768. -->
  14769. (I3 ^see 0 +)
  14770. Retracting propose*predict-no
  14771. -->
  14772. (O2006 ^name predict-no +)
  14773. (S1 ^operator O2006 +)
  14774. Retracting propose*predict-yes
  14775. -->
  14776. (O2005 ^name predict-yes +)
  14777. (S1 ^operator O2005 +)
  14778. Retracting elaborate*reward*based*on*reward
  14779. -->
  14780. (R1006 ^value 1 +)
  14781. (R1 ^reward R1006 +)
  14782. Retracting elaborate*copy-dir-to-output-link
  14783. -->
  14784. (I3 ^dir R +)
  14785. Retracting rl*prefer*rvt*predict-no*H0*6
  14786. -->
  14787. (S1 ^operator O2006 = 0.9999888743986174)
  14788. Retracting rl*prefer*rvt*predict-yes*H0*5
  14789. -->
  14790. (S1 ^operator O2005 = 0.1215994207949702)
  14791. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  14792. -->
  14793. (S1 ^operator O2005 = -0.04253361215288998)
  14794. =>WM: (14094: S1 ^operator O2008 +)
  14795. =>WM: (14093: S1 ^operator O2007 +)
  14796. =>WM: (14092: O2008 ^name predict-no)
  14797. =>WM: (14091: O2007 ^name predict-yes)
  14798. =>WM: (14090: R1007 ^value 1)
  14799. =>WM: (14089: R1 ^reward R1007)
  14800. <=WM: (14080: S1 ^operator O2005 +)
  14801. <=WM: (14081: S1 ^operator O2006 +)
  14802. <=WM: (14082: S1 ^operator O2006)
  14803. <=WM: (14075: R1 ^reward R1006)
  14804. <=WM: (14078: O2006 ^name predict-no)
  14805. <=WM: (14077: O2005 ^name predict-yes)
  14806. <=WM: (14076: R1006 ^value 1)
  14807. --- Inner Elaboration Phase, active level 1 (S1) ---
  14808. Firing prefer*rvt*predict-yes*H0
  14809. -->
  14810. Firing rl*prefer*rvt*predict-yes*H0*5
  14811. -->
  14812. (S1 ^operator O2007 = 0.1215994207949702)
  14813. Firing prefer*rvt*predict-yes*H0*5*H1
  14814. -->
  14815. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  14816. -->
  14817. (S1 ^operator O2007 = -0.1512366769350551)
  14818. Firing prefer*rvt*predict-no*H0
  14819. -->
  14820. Firing rl*prefer*rvt*predict-no*H0*6
  14821. -->
  14822. (S1 ^operator O2008 = 0.9999888743986174)
  14823. inner elaboration loop at bottom goal.
  14824. Retracting rl*prefer*rvt*predict-no*H0*6
  14825. -->
  14826. (S1 ^operator O2006 = 0.9999888743986174)
  14827. Retracting rl*prefer*rvt*predict-yes*H0*5
  14828. -->
  14829. (S1 ^operator O2005 = 0.1215994207949702)
  14830. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  14831. -->
  14832. (S1 ^operator O2005 = -0.1512366769350551)
  14833. --- END Proposal Phase ---
  14834. --- Decision Phase ---
  14835. RL update rl*prefer*rvt*predict-no*H0*6 0.999989 0 0.999989 -> 0.999991 0 0.999991(R,m,v=1,0.938202,0.0583064)
  14836. =>WM: (14095: S1 ^operator O2008)
  14837. 1004: O: O2008 (predict-no)
  14838. --- END Decision Phase ---
  14839. --- Application Phase ---
  14840. --- Firing Productions (PE) For State At Depth 1 ---
  14841. --- Inner Elaboration Phase, active level 1 (S1) ---
  14842. Firing apply*operator
  14843. -->
  14844. (I3 ^predict-no N1004 + :O )
  14845. Firing apply*operator*complete
  14846. -->
  14847. (I3 ^predict-no N1003 - :O )
  14848. inner elaboration loop at bottom goal.
  14849. --- Change Working Memory (PE) ---
  14850. =>WM: (14096: I3 ^predict-no N1004)
  14851. <=WM: (14084: N1003 ^status complete)
  14852. <=WM: (14083: I3 ^predict-no N1003)
  14853. --- Firing Productions (IE) For State At Depth 1 ---
  14854. --- Inner Elaboration Phase, active level 1 (S1) ---
  14855. Firing monitor*world
  14856. -->
  14857. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14858. --- Change Working Memory (IE) ---
  14859. --- END Application Phase ---
  14860. --- Output Phase ---
  14861. ENV: Agent did: predict-no for direction R in state State-B
  14862. In State-B moving R
  14863. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14864. predict error 0
  14865. dir: dir isU
  14866. --- END Output Phase ---
  14867. /|\--- Input Phase ---
  14868. =>WM: (14100: I2 ^dir U)
  14869. =>WM: (14099: I2 ^reward 1)
  14870. =>WM: (14098: I2 ^see 0)
  14871. =>WM: (14097: N1004 ^status complete)
  14872. <=WM: (14087: I2 ^dir R)
  14873. <=WM: (14086: I2 ^reward 1)
  14874. <=WM: (14085: I2 ^see 0)
  14875. =>WM: (14101: I2 ^level-1 R0-root)
  14876. <=WM: (14088: I2 ^level-1 R0-root)
  14877. --- END Input Phase ---
  14878. --- Proposal Phase ---
  14879. --- Inner Elaboration Phase, active level 1 (S1) ---
  14880. Firing elaborate*copy-see-to-output-link
  14881. -->
  14882. (I3 ^see 0 +)
  14883. Firing elaborate*reward*based*on*reward
  14884. -->
  14885. (R1008 ^value 1 +)
  14886. (R1 ^reward R1008 +)
  14887. Firing propose*predict-yes
  14888. -->
  14889. (O2009 ^name predict-yes +)
  14890. (S1 ^operator O2009 +)
  14891. Firing propose*predict-no
  14892. -->
  14893. (O2010 ^name predict-no +)
  14894. (S1 ^operator O2010 +)
  14895. Firing rl*prefer*rvt*predict-no*H0*2
  14896. -->
  14897. (S1 ^operator O2008 = 1.)
  14898. Firing rl*prefer*rvt*predict-yes*H0*1
  14899. -->
  14900. (S1 ^operator O2007 = 0.)
  14901. Firing prefer*rvt*predict-yes*H0
  14902. -->
  14903. Firing prefer*rvt*predict-no*H0
  14904. -->
  14905. Firing elaborate*copy-dir-to-output-link
  14906. -->
  14907. (I3 ^dir U +)
  14908. inner elaboration loop at bottom goal.
  14909. Retracting elaborate*copy-see-to-output-link
  14910. -->
  14911. (I3 ^see 0 +)
  14912. Retracting propose*predict-no
  14913. -->
  14914. (O2008 ^name predict-no +)
  14915. (S1 ^operator O2008 +)
  14916. Retracting propose*predict-yes
  14917. -->
  14918. (O2007 ^name predict-yes +)
  14919. (S1 ^operator O2007 +)
  14920. Retracting elaborate*reward*based*on*reward
  14921. -->
  14922. (R1007 ^value 1 +)
  14923. (R1 ^reward R1007 +)
  14924. Retracting elaborate*copy-dir-to-output-link
  14925. -->
  14926. (I3 ^dir R +)
  14927. Retracting rl*prefer*rvt*predict-no*H0*6
  14928. -->
  14929. (S1 ^operator O2008 = 0.9999906741383352)
  14930. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  14931. -->
  14932. (S1 ^operator O2007 = -0.1512366769350551)
  14933. Retracting rl*prefer*rvt*predict-yes*H0*5
  14934. -->
  14935. (S1 ^operator O2007 = 0.1215994207949702)
  14936. =>WM: (14108: S1 ^operator O2010 +)
  14937. =>WM: (14107: S1 ^operator O2009 +)
  14938. =>WM: (14106: I3 ^dir U)
  14939. =>WM: (14105: O2010 ^name predict-no)
  14940. =>WM: (14104: O2009 ^name predict-yes)
  14941. =>WM: (14103: R1008 ^value 1)
  14942. =>WM: (14102: R1 ^reward R1008)
  14943. <=WM: (14093: S1 ^operator O2007 +)
  14944. <=WM: (14094: S1 ^operator O2008 +)
  14945. <=WM: (14095: S1 ^operator O2008)
  14946. <=WM: (14079: I3 ^dir R)
  14947. <=WM: (14089: R1 ^reward R1007)
  14948. <=WM: (14092: O2008 ^name predict-no)
  14949. <=WM: (14091: O2007 ^name predict-yes)
  14950. <=WM: (14090: R1007 ^value 1)
  14951. --- Inner Elaboration Phase, active level 1 (S1) ---
  14952. Firing prefer*rvt*predict-yes*H0
  14953. -->
  14954. Firing rl*prefer*rvt*predict-yes*H0*1
  14955. -->
  14956. (S1 ^operator O2009 = 0.)
  14957. Firing prefer*rvt*predict-no*H0
  14958. -->
  14959. Firing rl*prefer*rvt*predict-no*H0*2
  14960. -->
  14961. (S1 ^operator O2010 = 1.)
  14962. inner elaboration loop at bottom goal.
  14963. Retracting rl*prefer*rvt*predict-no*H0*2
  14964. -->
  14965. (S1 ^operator O2008 = 1.)
  14966. Retracting rl*prefer*rvt*predict-yes*H0*1
  14967. -->
  14968. (S1 ^operator O2007 = 0.)
  14969. --- END Proposal Phase ---
  14970. --- Decision Phase ---
  14971. RL update rl*prefer*rvt*predict-no*H0*6 0.999991 0 0.999991 -> 0.999992 0 0.999992(R,m,v=1,0.938547,0.0580001)
  14972. =>WM: (14109: S1 ^operator O2010)
  14973. 1005: O: O2010 (predict-no)
  14974. --- END Decision Phase ---
  14975. --- Application Phase ---
  14976. --- Firing Productions (PE) For State At Depth 1 ---
  14977. --- Inner Elaboration Phase, active level 1 (S1) ---
  14978. Firing apply*operator
  14979. -->
  14980. (I3 ^predict-no N1005 + :O )
  14981. Firing apply*operator*complete
  14982. -->
  14983. (I3 ^predict-no N1004 - :O )
  14984. inner elaboration loop at bottom goal.
  14985. --- Change Working Memory (PE) ---
  14986. =>WM: (14110: I3 ^predict-no N1005)
  14987. <=WM: (14097: N1004 ^status complete)
  14988. <=WM: (14096: I3 ^predict-no N1004)
  14989. --- Firing Productions (IE) For State At Depth 1 ---
  14990. --- Inner Elaboration Phase, active level 1 (S1) ---
  14991. Firing monitor*world
  14992. -->
  14993. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14994. --- Change Working Memory (IE) ---
  14995. --- END Application Phase ---
  14996. --- Output Phase ---
  14997. ENV: Agent did: predict-no for direction U in state State-B
  14998. In State-B moving U
  14999. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15000. predict error 0
  15001. dir: dir isU
  15002. --- END Output Phase ---
  15003. -/|--- Input Phase ---
  15004. =>WM: (14114: I2 ^dir U)
  15005. =>WM: (14113: I2 ^reward 1)
  15006. =>WM: (14112: I2 ^see 0)
  15007. =>WM: (14111: N1005 ^status complete)
  15008. <=WM: (14100: I2 ^dir U)
  15009. <=WM: (14099: I2 ^reward 1)
  15010. <=WM: (14098: I2 ^see 0)
  15011. =>WM: (14115: I2 ^level-1 R0-root)
  15012. <=WM: (14101: I2 ^level-1 R0-root)
  15013. --- END Input Phase ---
  15014. --- Proposal Phase ---
  15015. --- Inner Elaboration Phase, active level 1 (S1) ---
  15016. Firing elaborate*copy-see-to-output-link
  15017. -->
  15018. (I3 ^see 0 +)
  15019. Firing elaborate*reward*based*on*reward
  15020. -->
  15021. (R1009 ^value 1 +)
  15022. (R1 ^reward R1009 +)
  15023. Firing propose*predict-yes
  15024. -->
  15025. (O2011 ^name predict-yes +)
  15026. (S1 ^operator O2011 +)
  15027. Firing propose*predict-no
  15028. -->
  15029. (O2012 ^name predict-no +)
  15030. (S1 ^operator O2012 +)
  15031. Firing rl*prefer*rvt*predict-no*H0*2
  15032. -->
  15033. (S1 ^operator O2010 = 1.)
  15034. Firing rl*prefer*rvt*predict-yes*H0*1
  15035. -->
  15036. (S1 ^operator O2009 = 0.)
  15037. Firing prefer*rvt*predict-yes*H0
  15038. -->
  15039. Firing prefer*rvt*predict-no*H0
  15040. -->
  15041. Firing elaborate*copy-dir-to-output-link
  15042. -->
  15043. (I3 ^dir U +)
  15044. inner elaboration loop at bottom goal.
  15045. Retracting elaborate*copy-see-to-output-link
  15046. -->
  15047. (I3 ^see 0 +)
  15048. Retracting propose*predict-no
  15049. -->
  15050. (O2010 ^name predict-no +)
  15051. (S1 ^operator O2010 +)
  15052. Retracting propose*predict-yes
  15053. -->
  15054. (O2009 ^name predict-yes +)
  15055. (S1 ^operator O2009 +)
  15056. Retracting elaborate*reward*based*on*reward
  15057. -->
  15058. (R1008 ^value 1 +)
  15059. (R1 ^reward R1008 +)
  15060. Retracting elaborate*copy-dir-to-output-link
  15061. -->
  15062. (I3 ^dir U +)
  15063. Retracting rl*prefer*rvt*predict-no*H0*2
  15064. -->
  15065. (S1 ^operator O2010 = 1.)
  15066. Retracting rl*prefer*rvt*predict-yes*H0*1
  15067. -->
  15068. (S1 ^operator O2009 = 0.)
  15069. =>WM: (14121: S1 ^operator O2012 +)
  15070. =>WM: (14120: S1 ^operator O2011 +)
  15071. =>WM: (14119: O2012 ^name predict-no)
  15072. =>WM: (14118: O2011 ^name predict-yes)
  15073. =>WM: (14117: R1009 ^value 1)
  15074. =>WM: (14116: R1 ^reward R1009)
  15075. <=WM: (14107: S1 ^operator O2009 +)
  15076. <=WM: (14108: S1 ^operator O2010 +)
  15077. <=WM: (14109: S1 ^operator O2010)
  15078. <=WM: (14102: R1 ^reward R1008)
  15079. <=WM: (14105: O2010 ^name predict-no)
  15080. <=WM: (14104: O2009 ^name predict-yes)
  15081. <=WM: (14103: R1008 ^value 1)
  15082. --- Inner Elaboration Phase, active level 1 (S1) ---
  15083. Firing prefer*rvt*predict-yes*H0
  15084. -->
  15085. Firing rl*prefer*rvt*predict-yes*H0*1
  15086. -->
  15087. (S1 ^operator O2011 = 0.)
  15088. Firing prefer*rvt*predict-no*H0
  15089. -->
  15090. Firing rl*prefer*rvt*predict-no*H0*2
  15091. -->
  15092. (S1 ^operator O2012 = 1.)
  15093. inner elaboration loop at bottom goal.
  15094. Retracting rl*prefer*rvt*predict-no*H0*2
  15095. -->
  15096. (S1 ^operator O2010 = 1.)
  15097. Retracting rl*prefer*rvt*predict-yes*H0*1
  15098. -->
  15099. (S1 ^operator O2009 = 0.)
  15100. --- END Proposal Phase ---
  15101. --- Decision Phase ---
  15102. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15103. =>WM: (14122: S1 ^operator O2012)
  15104. 1006: O: O2012 (predict-no)
  15105. --- END Decision Phase ---
  15106. --- Application Phase ---
  15107. --- Firing Productions (PE) For State At Depth 1 ---
  15108. --- Inner Elaboration Phase, active level 1 (S1) ---
  15109. Firing apply*operator
  15110. -->
  15111. (I3 ^predict-no N1006 + :O )
  15112. Firing apply*operator*complete
  15113. -->
  15114. (I3 ^predict-no N1005 - :O )
  15115. inner elaboration loop at bottom goal.
  15116. --- Change Working Memory (PE) ---
  15117. =>WM: (14123: I3 ^predict-no N1006)
  15118. <=WM: (14111: N1005 ^status complete)
  15119. <=WM: (14110: I3 ^predict-no N1005)
  15120. --- Firing Productions (IE) For State At Depth 1 ---
  15121. --- Inner Elaboration Phase, active level 1 (S1) ---
  15122. Firing monitor*world
  15123. -->
  15124. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15125. --- Change Working Memory (IE) ---
  15126. --- END Application Phase ---
  15127. --- Output Phase ---
  15128. ENV: Agent did: predict-no for direction U in state State-B
  15129. In State-B moving U
  15130. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15131. predict error 0
  15132. dir: dir isL
  15133. --- END Output Phase ---
  15134. \-/--- Input Phase ---
  15135. =>WM: (14127: I2 ^dir L)
  15136. =>WM: (14126: I2 ^reward 1)
  15137. =>WM: (14125: I2 ^see 0)
  15138. =>WM: (14124: N1006 ^status complete)
  15139. <=WM: (14114: I2 ^dir U)
  15140. <=WM: (14113: I2 ^reward 1)
  15141. <=WM: (14112: I2 ^see 0)
  15142. =>WM: (14128: I2 ^level-1 R0-root)
  15143. <=WM: (14115: I2 ^level-1 R0-root)
  15144. --- END Input Phase ---
  15145. --- Proposal Phase ---
  15146. --- Inner Elaboration Phase, active level 1 (S1) ---
  15147. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  15148. -->
  15149. (S1 ^operator O2012 = -0.1984300550322165)
  15150. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  15151. -->
  15152. (S1 ^operator O2011 = 0.6091150129894595)
  15153. Firing prefer*rvt*predict-no*H0*4*H1
  15154. -->
  15155. Firing prefer*rvt*predict-yes*H0*3*H1
  15156. -->
  15157. Firing elaborate*copy-see-to-output-link
  15158. -->
  15159. (I3 ^see 0 +)
  15160. Firing elaborate*reward*based*on*reward
  15161. -->
  15162. (R1010 ^value 1 +)
  15163. (R1 ^reward R1010 +)
  15164. Firing propose*predict-yes
  15165. -->
  15166. (O2013 ^name predict-yes +)
  15167. (S1 ^operator O2013 +)
  15168. Firing propose*predict-no
  15169. -->
  15170. (O2014 ^name predict-no +)
  15171. (S1 ^operator O2014 +)
  15172. Firing rl*prefer*rvt*predict-no*H0*4
  15173. -->
  15174. (S1 ^operator O2012 = 0.3145079413521559)
  15175. Firing rl*prefer*rvt*predict-yes*H0*3
  15176. -->
  15177. (S1 ^operator O2011 = 0.3907782094907327)
  15178. Firing prefer*rvt*predict-yes*H0
  15179. -->
  15180. Firing prefer*rvt*predict-no*H0
  15181. -->
  15182. Firing elaborate*copy-dir-to-output-link
  15183. -->
  15184. (I3 ^dir L +)
  15185. inner elaboration loop at bottom goal.
  15186. Retracting elaborate*copy-see-to-output-link
  15187. -->
  15188. (I3 ^see 0 +)
  15189. Retracting propose*predict-no
  15190. -->
  15191. (O2012 ^name predict-no +)
  15192. (S1 ^operator O2012 +)
  15193. Retracting propose*predict-yes
  15194. -->
  15195. (O2011 ^name predict-yes +)
  15196. (S1 ^operator O2011 +)
  15197. Retracting elaborate*reward*based*on*reward
  15198. -->
  15199. (R1009 ^value 1 +)
  15200. (R1 ^reward R1009 +)
  15201. Retracting elaborate*copy-dir-to-output-link
  15202. -->
  15203. (I3 ^dir U +)
  15204. Retracting rl*prefer*rvt*predict-no*H0*2
  15205. -->
  15206. (S1 ^operator O2012 = 1.)
  15207. Retracting rl*prefer*rvt*predict-yes*H0*1
  15208. -->
  15209. (S1 ^operator O2011 = 0.)
  15210. =>WM: (14135: S1 ^operator O2014 +)
  15211. =>WM: (14134: S1 ^operator O2013 +)
  15212. =>WM: (14133: I3 ^dir L)
  15213. =>WM: (14132: O2014 ^name predict-no)
  15214. =>WM: (14131: O2013 ^name predict-yes)
  15215. =>WM: (14130: R1010 ^value 1)
  15216. =>WM: (14129: R1 ^reward R1010)
  15217. <=WM: (14120: S1 ^operator O2011 +)
  15218. <=WM: (14121: S1 ^operator O2012 +)
  15219. <=WM: (14122: S1 ^operator O2012)
  15220. <=WM: (14106: I3 ^dir U)
  15221. <=WM: (14116: R1 ^reward R1009)
  15222. <=WM: (14119: O2012 ^name predict-no)
  15223. <=WM: (14118: O2011 ^name predict-yes)
  15224. <=WM: (14117: R1009 ^value 1)
  15225. --- Inner Elaboration Phase, active level 1 (S1) ---
  15226. Firing prefer*rvt*predict-yes*H0
  15227. -->
  15228. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  15229. -->
  15230. (S1 ^operator O2013 = 0.6091150129894595)
  15231. Firing rl*prefer*rvt*predict-yes*H0*3
  15232. -->
  15233. (S1 ^operator O2013 = 0.3907782094907327)
  15234. Firing prefer*rvt*predict-yes*H0*3*H1
  15235. -->
  15236. Firing prefer*rvt*predict-no*H0
  15237. -->
  15238. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  15239. -->
  15240. (S1 ^operator O2014 = -0.1984300550322165)
  15241. Firing rl*prefer*rvt*predict-no*H0*4
  15242. -->
  15243. (S1 ^operator O2014 = 0.3145079413521559)
  15244. Firing prefer*rvt*predict-no*H0*4*H1
  15245. -->
  15246. inner elaboration loop at bottom goal.
  15247. Retracting rl*prefer*rvt*predict-no*H0*4
  15248. -->
  15249. (S1 ^operator O2012 = 0.3145079413521559)
  15250. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  15251. -->
  15252. (S1 ^operator O2012 = -0.1984300550322165)
  15253. Retracting rl*prefer*rvt*predict-yes*H0*3
  15254. -->
  15255. (S1 ^operator O2011 = 0.3907782094907327)
  15256. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  15257. -->
  15258. (S1 ^operator O2011 = 0.6091150129894595)
  15259. --- END Proposal Phase ---
  15260. --- Decision Phase ---
  15261. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15262. =>WM: (14136: S1 ^operator O2013)
  15263. 1007: O: O2013 (predict-yes)
  15264. --- END Decision Phase ---
  15265. --- Application Phase ---
  15266. --- Firing Productions (PE) For State At Depth 1 ---
  15267. --- Inner Elaboration Phase, active level 1 (S1) ---
  15268. Firing apply*operator
  15269. -->
  15270. (I3 ^predict-yes N1007 + :O )
  15271. Firing apply*operator*complete
  15272. -->
  15273. (I3 ^predict-no N1006 - :O )
  15274. inner elaboration loop at bottom goal.
  15275. --- Change Working Memory (PE) ---
  15276. =>WM: (14137: I3 ^predict-yes N1007)
  15277. <=WM: (14124: N1006 ^status complete)
  15278. <=WM: (14123: I3 ^predict-no N1006)
  15279. --- Firing Productions (IE) For State At Depth 1 ---
  15280. --- Inner Elaboration Phase, active level 1 (S1) ---
  15281. Firing monitor*world
  15282. -->
  15283. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15284. --- Change Working Memory (IE) ---
  15285. --- END Application Phase ---
  15286. --- Output Phase ---
  15287. ENV: Agent did: predict-yes for direction L in state State-B
  15288. In State-B moving L
  15289. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  15290. predict error 0
  15291. dir: dir isR
  15292. --- END Output Phase ---
  15293. |\---- Input Phase ---
  15294. =>WM: (14141: I2 ^dir R)
  15295. =>WM: (14140: I2 ^reward 1)
  15296. =>WM: (14139: I2 ^see 1)
  15297. =>WM: (14138: N1007 ^status complete)
  15298. <=WM: (14127: I2 ^dir L)
  15299. <=WM: (14126: I2 ^reward 1)
  15300. <=WM: (14125: I2 ^see 0)
  15301. =>WM: (14142: I2 ^level-1 L1-root)
  15302. <=WM: (14128: I2 ^level-1 R0-root)
  15303. --- END Input Phase ---
  15304. --- Proposal Phase ---
  15305. --- Inner Elaboration Phase, active level 1 (S1) ---
  15306. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  15307. -->
  15308. (S1 ^operator O2013 = 0.8784140715701729)
  15309. Firing prefer*rvt*predict-yes*H0*5*H1
  15310. -->
  15311. Firing elaborate*copy-see-to-output-link
  15312. -->
  15313. (I3 ^see 1 +)
  15314. Firing elaborate*reward*based*on*reward
  15315. -->
  15316. (R1011 ^value 1 +)
  15317. (R1 ^reward R1011 +)
  15318. Firing propose*predict-yes
  15319. -->
  15320. (O2015 ^name predict-yes +)
  15321. (S1 ^operator O2015 +)
  15322. Firing propose*predict-no
  15323. -->
  15324. (O2016 ^name predict-no +)
  15325. (S1 ^operator O2016 +)
  15326. Firing rl*prefer*rvt*predict-no*H0*6
  15327. -->
  15328. (S1 ^operator O2014 = 0.9999921813761182)
  15329. Firing rl*prefer*rvt*predict-yes*H0*5
  15330. -->
  15331. (S1 ^operator O2013 = 0.1215994207949702)
  15332. Firing prefer*rvt*predict-yes*H0
  15333. -->
  15334. Firing prefer*rvt*predict-no*H0
  15335. -->
  15336. Firing elaborate*copy-dir-to-output-link
  15337. -->
  15338. (I3 ^dir R +)
  15339. inner elaboration loop at bottom goal.
  15340. Retracting elaborate*copy-see-to-output-link
  15341. -->
  15342. (I3 ^see 0 +)
  15343. Retracting propose*predict-no
  15344. -->
  15345. (O2014 ^name predict-no +)
  15346. (S1 ^operator O2014 +)
  15347. Retracting propose*predict-yes
  15348. -->
  15349. (O2013 ^name predict-yes +)
  15350. (S1 ^operator O2013 +)
  15351. Retracting elaborate*reward*based*on*reward
  15352. -->
  15353. (R1010 ^value 1 +)
  15354. (R1 ^reward R1010 +)
  15355. Retracting elaborate*copy-dir-to-output-link
  15356. -->
  15357. (I3 ^dir L +)
  15358. Retracting rl*prefer*rvt*predict-no*H0*4
  15359. -->
  15360. (S1 ^operator O2014 = 0.3145079413521559)
  15361. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  15362. -->
  15363. (S1 ^operator O2014 = -0.1984300550322165)
  15364. Retracting rl*prefer*rvt*predict-yes*H0*3
  15365. -->
  15366. (S1 ^operator O2013 = 0.3907782094907327)
  15367. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  15368. -->
  15369. (S1 ^operator O2013 = 0.6091150129894595)
  15370. =>WM: (14150: S1 ^operator O2016 +)
  15371. =>WM: (14149: S1 ^operator O2015 +)
  15372. =>WM: (14148: I3 ^dir R)
  15373. =>WM: (14147: O2016 ^name predict-no)
  15374. =>WM: (14146: O2015 ^name predict-yes)
  15375. =>WM: (14145: R1011 ^value 1)
  15376. =>WM: (14144: R1 ^reward R1011)
  15377. =>WM: (14143: I3 ^see 1)
  15378. <=WM: (14134: S1 ^operator O2013 +)
  15379. <=WM: (14136: S1 ^operator O2013)
  15380. <=WM: (14135: S1 ^operator O2014 +)
  15381. <=WM: (14133: I3 ^dir L)
  15382. <=WM: (14129: R1 ^reward R1010)
  15383. <=WM: (14048: I3 ^see 0)
  15384. <=WM: (14132: O2014 ^name predict-no)
  15385. <=WM: (14131: O2013 ^name predict-yes)
  15386. <=WM: (14130: R1010 ^value 1)
  15387. --- Inner Elaboration Phase, active level 1 (S1) ---
  15388. Firing prefer*rvt*predict-yes*H0
  15389. -->
  15390. Firing rl*prefer*rvt*predict-yes*H0*5
  15391. -->
  15392. (S1 ^operator O2015 = 0.1215994207949702)
  15393. Firing prefer*rvt*predict-yes*H0*5*H1
  15394. -->
  15395. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  15396. -->
  15397. (S1 ^operator O2015 = 0.8784140715701729)
  15398. Firing prefer*rvt*predict-no*H0
  15399. -->
  15400. Firing rl*prefer*rvt*predict-no*H0*6
  15401. -->
  15402. (S1 ^operator O2016 = 0.9999921813761182)
  15403. inner elaboration loop at bottom goal.
  15404. Retracting rl*prefer*rvt*predict-no*H0*6
  15405. -->
  15406. (S1 ^operator O2014 = 0.9999921813761182)
  15407. Retracting rl*prefer*rvt*predict-yes*H0*5
  15408. -->
  15409. (S1 ^operator O2013 = 0.1215994207949702)
  15410. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  15411. -->
  15412. (S1 ^operator O2013 = 0.8784140715701729)
  15413. --- END Proposal Phase ---
  15414. --- Decision Phase ---
  15415. RL update rl*prefer*rvt*predict-yes*H0*3 0.472324 -0.0815458 0.390778 -> 0.472332 -0.0815445 0.390787(R,m,v=1,0.944099,0.0531056)
  15416. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527585 0.0815301 0.609115 -> 0.527593 0.0815315 0.609125(R,m,v=1,1,0)
  15417. =>WM: (14151: S1 ^operator O2015)
  15418. 1008: O: O2015 (predict-yes)
  15419. --- END Decision Phase ---
  15420. --- Application Phase ---
  15421. --- Firing Productions (PE) For State At Depth 1 ---
  15422. --- Inner Elaboration Phase, active level 1 (S1) ---
  15423. Firing apply*operator
  15424. -->
  15425. (I3 ^predict-yes N1008 + :O )
  15426. Firing apply*operator*complete
  15427. -->
  15428. (I3 ^predict-yes N1007 - :O )
  15429. inner elaboration loop at bottom goal.
  15430. --- Change Working Memory (PE) ---
  15431. =>WM: (14152: I3 ^predict-yes N1008)
  15432. <=WM: (14138: N1007 ^status complete)
  15433. <=WM: (14137: I3 ^predict-yes N1007)
  15434. --- Firing Productions (IE) For State At Depth 1 ---
  15435. --- Inner Elaboration Phase, active level 1 (S1) ---
  15436. Firing monitor*world
  15437. -->
  15438. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15439. --- Change Working Memory (IE) ---
  15440. --- END Application Phase ---
  15441. --- Output Phase ---
  15442. ENV: Agent did: predict-yes for direction R in state State-A
  15443. In State-A moving R
  15444. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  15445. predict error 0
  15446. dir: dir isL
  15447. --- END Output Phase ---
  15448. /|--- Input Phase ---
  15449. =>WM: (14156: I2 ^dir L)
  15450. =>WM: (14155: I2 ^reward 1)
  15451. =>WM: (14154: I2 ^see 1)
  15452. =>WM: (14153: N1008 ^status complete)
  15453. <=WM: (14141: I2 ^dir R)
  15454. <=WM: (14140: I2 ^reward 1)
  15455. <=WM: (14139: I2 ^see 1)
  15456. =>WM: (14157: I2 ^level-1 R1-root)
  15457. <=WM: (14142: I2 ^level-1 L1-root)
  15458. --- END Input Phase ---
  15459. --- Proposal Phase ---
  15460. --- Inner Elaboration Phase, active level 1 (S1) ---
  15461. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  15462. -->
  15463. (S1 ^operator O2016 = -0.168718511744511)
  15464. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  15465. -->
  15466. (S1 ^operator O2015 = 0.6093091841289463)
  15467. Firing prefer*rvt*predict-no*H0*4*H1
  15468. -->
  15469. Firing prefer*rvt*predict-yes*H0*3*H1
  15470. -->
  15471. Firing elaborate*copy-see-to-output-link
  15472. -->
  15473. (I3 ^see 1 +)
  15474. Firing elaborate*reward*based*on*reward
  15475. -->
  15476. (R1012 ^value 1 +)
  15477. (R1 ^reward R1012 +)
  15478. Firing propose*predict-yes
  15479. -->
  15480. (O2017 ^name predict-yes +)
  15481. (S1 ^operator O2017 +)
  15482. Firing propose*predict-no
  15483. -->
  15484. (O2018 ^name predict-no +)
  15485. (S1 ^operator O2018 +)
  15486. Firing rl*prefer*rvt*predict-no*H0*4
  15487. -->
  15488. (S1 ^operator O2016 = 0.3145079413521559)
  15489. Firing rl*prefer*rvt*predict-yes*H0*3
  15490. -->
  15491. (S1 ^operator O2015 = 0.3907869885089824)
  15492. Firing prefer*rvt*predict-yes*H0
  15493. -->
  15494. Firing prefer*rvt*predict-no*H0
  15495. -->
  15496. Firing elaborate*copy-dir-to-output-link
  15497. -->
  15498. (I3 ^dir L +)
  15499. inner elaboration loop at bottom goal.
  15500. Retracting elaborate*copy-see-to-output-link
  15501. -->
  15502. (I3 ^see 1 +)
  15503. Retracting propose*predict-no
  15504. -->
  15505. (O2016 ^name predict-no +)
  15506. (S1 ^operator O2016 +)
  15507. Retracting propose*predict-yes
  15508. -->
  15509. (O2015 ^name predict-yes +)
  15510. (S1 ^operator O2015 +)
  15511. Retracting elaborate*reward*based*on*reward
  15512. -->
  15513. (R1011 ^value 1 +)
  15514. (R1 ^reward R1011 +)
  15515. Retracting elaborate*copy-dir-to-output-link
  15516. -->
  15517. (I3 ^dir R +)
  15518. Retracting rl*prefer*rvt*predict-no*H0*6
  15519. -->
  15520. (S1 ^operator O2016 = 0.9999921813761182)
  15521. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  15522. -->
  15523. (S1 ^operator O2015 = 0.8784140715701729)
  15524. Retracting rl*prefer*rvt*predict-yes*H0*5
  15525. -->
  15526. (S1 ^operator O2015 = 0.1215994207949702)
  15527. =>WM: (14164: S1 ^operator O2018 +)
  15528. =>WM: (14163: S1 ^operator O2017 +)
  15529. =>WM: (14162: I3 ^dir L)
  15530. =>WM: (14161: O2018 ^name predict-no)
  15531. =>WM: (14160: O2017 ^name predict-yes)
  15532. =>WM: (14159: R1012 ^value 1)
  15533. =>WM: (14158: R1 ^reward R1012)
  15534. <=WM: (14149: S1 ^operator O2015 +)
  15535. <=WM: (14151: S1 ^operator O2015)
  15536. <=WM: (14150: S1 ^operator O2016 +)
  15537. <=WM: (14148: I3 ^dir R)
  15538. <=WM: (14144: R1 ^reward R1011)
  15539. <=WM: (14147: O2016 ^name predict-no)
  15540. <=WM: (14146: O2015 ^name predict-yes)
  15541. <=WM: (14145: R1011 ^value 1)
  15542. --- Inner Elaboration Phase, active level 1 (S1) ---
  15543. Firing prefer*rvt*predict-yes*H0
  15544. -->
  15545. Firing rl*prefer*rvt*predict-yes*H0*3
  15546. -->
  15547. (S1 ^operator O2017 = 0.3907869885089824)
  15548. Firing prefer*rvt*predict-yes*H0*3*H1
  15549. -->
  15550. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  15551. -->
  15552. (S1 ^operator O2017 = 0.6093091841289463)
  15553. Firing prefer*rvt*predict-no*H0
  15554. -->
  15555. Firing rl*prefer*rvt*predict-no*H0*4
  15556. -->
  15557. (S1 ^operator O2018 = 0.3145079413521559)
  15558. Firing prefer*rvt*predict-no*H0*4*H1
  15559. -->
  15560. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  15561. -->
  15562. (S1 ^operator O2018 = -0.168718511744511)
  15563. inner elaboration loop at bottom goal.
  15564. Retracting rl*prefer*rvt*predict-no*H0*4
  15565. -->
  15566. (S1 ^operator O2016 = 0.3145079413521559)
  15567. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  15568. -->
  15569. (S1 ^operator O2016 = -0.168718511744511)
  15570. Retracting rl*prefer*rvt*predict-yes*H0*3
  15571. -->
  15572. (S1 ^operator O2015 = 0.3907869885089824)
  15573. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  15574. -->
  15575. (S1 ^operator O2015 = 0.6093091841289463)
  15576. --- END Proposal Phase ---
  15577. --- Decision Phase ---
  15578. RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.865169,0.117311)
  15579. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465486 0.412928 0.878414 -> 0.465485 0.412928 0.878413(R,m,v=1,1,0)
  15580. =>WM: (14165: S1 ^operator O2017)
  15581. 1009: O: O2017 (predict-yes)
  15582. --- END Decision Phase ---
  15583. --- Application Phase ---
  15584. --- Firing Productions (PE) For State At Depth 1 ---
  15585. --- Inner Elaboration Phase, active level 1 (S1) ---
  15586. Firing apply*operator
  15587. -->
  15588. (I3 ^predict-yes N1009 + :O )
  15589. Firing apply*operator*complete
  15590. -->
  15591. (I3 ^predict-yes N1008 - :O )
  15592. inner elaboration loop at bottom goal.
  15593. --- Change Working Memory (PE) ---
  15594. =>WM: (14166: I3 ^predict-yes N1009)
  15595. <=WM: (14153: N1008 ^status complete)
  15596. <=WM: (14152: I3 ^predict-yes N1008)
  15597. --- Firing Productions (IE) For State At Depth 1 ---
  15598. --- Inner Elaboration Phase, active level 1 (S1) ---
  15599. Firing monitor*world
  15600. -->
  15601. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15602. --- Change Working Memory (IE) ---
  15603. --- END Application Phase ---
  15604. --- Output Phase ---
  15605. ENV: Agent did: predict-yes for direction L in state State-B
  15606. In State-B moving L
  15607. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  15608. predict error 0
  15609. dir: dir isL
  15610. --- END Output Phase ---
  15611. \-/--- Input Phase ---
  15612. =>WM: (14170: I2 ^dir L)
  15613. =>WM: (14169: I2 ^reward 1)
  15614. =>WM: (14168: I2 ^see 1)
  15615. =>WM: (14167: N1009 ^status complete)
  15616. <=WM: (14156: I2 ^dir L)
  15617. <=WM: (14155: I2 ^reward 1)
  15618. <=WM: (14154: I2 ^see 1)
  15619. =>WM: (14171: I2 ^level-1 L1-root)
  15620. <=WM: (14157: I2 ^level-1 R1-root)
  15621. --- END Input Phase ---
  15622. --- Proposal Phase ---
  15623. --- Inner Elaboration Phase, active level 1 (S1) ---
  15624. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  15625. -->
  15626. (S1 ^operator O2017 = -0.2062723012911647)
  15627. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  15628. -->
  15629. (S1 ^operator O2018 = 0.685530273786795)
  15630. Firing prefer*rvt*predict-no*H0*4*H1
  15631. -->
  15632. Firing prefer*rvt*predict-yes*H0*3*H1
  15633. -->
  15634. Firing elaborate*copy-see-to-output-link
  15635. -->
  15636. (I3 ^see 1 +)
  15637. Firing elaborate*reward*based*on*reward
  15638. -->
  15639. (R1013 ^value 1 +)
  15640. (R1 ^reward R1013 +)
  15641. Firing propose*predict-yes
  15642. -->
  15643. (O2019 ^name predict-yes +)
  15644. (S1 ^operator O2019 +)
  15645. Firing propose*predict-no
  15646. -->
  15647. (O2020 ^name predict-no +)
  15648. (S1 ^operator O2020 +)
  15649. Firing rl*prefer*rvt*predict-no*H0*4
  15650. -->
  15651. (S1 ^operator O2018 = 0.3145079413521559)
  15652. Firing rl*prefer*rvt*predict-yes*H0*3
  15653. -->
  15654. (S1 ^operator O2017 = 0.3907869885089824)
  15655. Firing prefer*rvt*predict-yes*H0
  15656. -->
  15657. Firing prefer*rvt*predict-no*H0
  15658. -->
  15659. Firing elaborate*copy-dir-to-output-link
  15660. -->
  15661. (I3 ^dir L +)
  15662. inner elaboration loop at bottom goal.
  15663. Retracting elaborate*copy-see-to-output-link
  15664. -->
  15665. (I3 ^see 1 +)
  15666. Retracting propose*predict-no
  15667. -->
  15668. (O2018 ^name predict-no +)
  15669. (S1 ^operator O2018 +)
  15670. Retracting propose*predict-yes
  15671. -->
  15672. (O2017 ^name predict-yes +)
  15673. (S1 ^operator O2017 +)
  15674. Retracting elaborate*reward*based*on*reward
  15675. -->
  15676. (R1012 ^value 1 +)
  15677. (R1 ^reward R1012 +)
  15678. Retracting elaborate*copy-dir-to-output-link
  15679. -->
  15680. (I3 ^dir L +)
  15681. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  15682. -->
  15683. (S1 ^operator O2018 = -0.168718511744511)
  15684. Retracting rl*prefer*rvt*predict-no*H0*4
  15685. -->
  15686. (S1 ^operator O2018 = 0.3145079413521559)
  15687. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  15688. -->
  15689. (S1 ^operator O2017 = 0.6093091841289463)
  15690. Retracting rl*prefer*rvt*predict-yes*H0*3
  15691. -->
  15692. (S1 ^operator O2017 = 0.3907869885089824)
  15693. =>WM: (14177: S1 ^operator O2020 +)
  15694. =>WM: (14176: S1 ^operator O2019 +)
  15695. =>WM: (14175: O2020 ^name predict-no)
  15696. =>WM: (14174: O2019 ^name predict-yes)
  15697. =>WM: (14173: R1013 ^value 1)
  15698. =>WM: (14172: R1 ^reward R1013)
  15699. <=WM: (14163: S1 ^operator O2017 +)
  15700. <=WM: (14165: S1 ^operator O2017)
  15701. <=WM: (14164: S1 ^operator O2018 +)
  15702. <=WM: (14158: R1 ^reward R1012)
  15703. <=WM: (14161: O2018 ^name predict-no)
  15704. <=WM: (14160: O2017 ^name predict-yes)
  15705. <=WM: (14159: R1012 ^value 1)
  15706. --- Inner Elaboration Phase, active level 1 (S1) ---
  15707. Firing prefer*rvt*predict-yes*H0
  15708. -->
  15709. Firing rl*prefer*rvt*predict-yes*H0*3
  15710. -->
  15711. (S1 ^operator O2019 = 0.3907869885089824)
  15712. Firing prefer*rvt*predict-yes*H0*3*H1
  15713. -->
  15714. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  15715. -->
  15716. (S1 ^operator O2019 = -0.2062723012911647)
  15717. Firing prefer*rvt*predict-no*H0
  15718. -->
  15719. Firing rl*prefer*rvt*predict-no*H0*4
  15720. -->
  15721. (S1 ^operator O2020 = 0.3145079413521559)
  15722. Firing prefer*rvt*predict-no*H0*4*H1
  15723. -->
  15724. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  15725. -->
  15726. (S1 ^operator O2020 = 0.685530273786795)
  15727. inner elaboration loop at bottom goal.
  15728. Retracting rl*prefer*rvt*predict-no*H0*4
  15729. -->
  15730. (S1 ^operator O2018 = 0.3145079413521559)
  15731. Ret