trait ActionValueFn[Obs, A, T] extends AnyRef
Along with StateValueFn, this is the main trait in tabular reinforcement learning for tracking the value of an (observation, action) pair.
- Obs
Observation returned by the State instances tracked by ActionValueFn.
- A
Action type on the State instances tracked by ActionValueFn.
- T
Type of the values tracked by ActionValueFn.
- Self Type
- ActionValueFn[Obs, A, T]
- Source
- ActionValueFn.scala
Abstract Value Members
-
abstract
def
actionValue(obs: Obs, a: A): T
Returns the stored value associated with the given (obs, a) pair.
-
abstract
def
seen(obs: Obs): Iterable[A]
Returns an Iterable of all actions from the supplied observation that are associated with some tracked value T.
-
abstract
def
seenStates: Iterable[Obs]
Returns an Iterable of all observations associated with some internally tracked value T.
-
abstract
def
update(obs: Obs, action: A, value: T): ActionValueFn[Obs, A, T]
Absorbs a new value for the supplied (obs, action) pair. The behavior of this function is implementation dependent: some implementations might ignore the value, some might merge it into an existing set of values, and some might completely replace the stored state.
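To make the contract of the four abstract members concrete, here is a minimal sketch of one possible implementation. It is not taken from ScalaRL: `SketchActionValueFn` is a local stand-in that only mirrors the signatures listed above, and `MapActionValueFn`, its `table` and `default` fields, and `ReplaceDemo` are hypothetical names introduced for illustration. Of the behaviors that `update` is allowed to have, this sketch picks "completely replace the stored value".

```scala
// Local stand-in mirroring the four abstract members documented above.
// Assumption: this is NOT the ScalaRL trait, only a copy of its signatures.
trait SketchActionValueFn[Obs, A, T] {
  def seenStates: Iterable[Obs]
  def seen(obs: Obs): Iterable[A]
  def actionValue(obs: Obs, a: A): T
  def update(obs: Obs, action: A, value: T): SketchActionValueFn[Obs, A, T]
}

// Hypothetical immutable implementation backed by a nested Map.
// `default` is what actionValue returns for pairs that were never updated.
final case class MapActionValueFn[Obs, A, T](
    table: Map[Obs, Map[A, T]],
    default: T
) extends SketchActionValueFn[Obs, A, T] {

  // Every observation that has at least one tracked value.
  def seenStates: Iterable[Obs] = table.keys

  // Every action with a tracked value under the given observation.
  def seen(obs: Obs): Iterable[A] =
    table.getOrElse(obs, Map.empty[A, T]).keys

  def actionValue(obs: Obs, a: A): T =
    table.get(obs).flatMap(_.get(a)).getOrElse(default)

  // "Replace" semantics: the new value overwrites whatever was stored.
  def update(obs: Obs, action: A, value: T): MapActionValueFn[Obs, A, T] = {
    val row = table.getOrElse(obs, Map.empty[A, T]).updated(action, value)
    copy(table = table.updated(obs, row))
  }
}

object ReplaceDemo {
  val q0 = MapActionValueFn[String, Int, Double](Map.empty, 0.0)
  // update returns a new instance, so calls chain without mutation.
  val q1 = q0.update("s0", 1, 0.5).update("s0", 2, 1.5)
}
```

In this sketch `ReplaceDemo.q1.actionValue("s0", 2)` yields the stored `1.5`, while an unseen pair such as `("s1", 1)` falls back to the `default` of `0.0`. Whether a real implementation uses a default, throws, or does something else for unseen pairs is not specified by the page above.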
Concrete Value Members
-
final
def
!=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
##(): Int
- Definition Classes
- AnyRef → Any
-
final
def
==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
asInstanceOf[T0]: T0
- Definition Classes
- Any
-
def
clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()
-
final
def
eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
def
equals(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
def
finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( classOf[java.lang.Throwable] )
-
def
fold[U](prepare: (U) ⇒ T, present: (T) ⇒ U): ActionValueFn[Obs, A, U]
Transforms this ActionValueFn into a new instance that applies the supplied prepare function to all incoming values before they're learned, and presents tracked T instances using the present function before returning them via actionValue.
-
final
def
getClass(): Class[_]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
def
hashCode(): Int
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
final
def
isInstanceOf[T0]: Boolean
- Definition Classes
- Any
-
def
mergeable(implicit T: Semigroup[T]): ActionValueFn[Obs, A, T]
Returns an ActionValueFn instance that uses the supplied semigroup T to merge values into this current ActionValueFn.
- T
Semigroup instance used to merge values.
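The wrapping behavior described for fold and mergeable can be sketched as follows. This is an illustration under stated assumptions, not ScalaRL's implementation: `SketchFn` is a local stand-in for the documented trait, `Combine` is a hypothetical one-method stand-in for whichever Semigroup type the library uses, and `base`, `WrapperDemo` and the free-standing `fold` and `mergeable` helpers are invented names. In particular, merging the new value with the result of `actionValue` for a pair that has never been seen is a choice made by this sketch.

```scala
// Local stand-in for the documented trait (signatures only).
trait SketchFn[Obs, A, T] {
  def seenStates: Iterable[Obs]
  def seen(obs: Obs): Iterable[A]
  def actionValue(obs: Obs, a: A): T
  def update(obs: Obs, action: A, value: T): SketchFn[Obs, A, T]
}

// Hypothetical stand-in for a Semigroup: one associative combine operation.
trait Combine[T] { def plus(l: T, r: T): T }

object SketchFn {
  // Replace-on-update base table, used here only so the wrappers have something to wrap.
  def base[Obs, A, T](table: Map[(Obs, A), T], default: T): SketchFn[Obs, A, T] =
    new SketchFn[Obs, A, T] {
      def seenStates: Iterable[Obs] = table.keys.map(_._1).toSet
      def seen(obs: Obs): Iterable[A] = table.keys.collect { case (o, a) if o == obs => a }
      def actionValue(obs: Obs, a: A): T = table.getOrElse((obs, a), default)
      def update(obs: Obs, action: A, value: T): SketchFn[Obs, A, T] =
        base(table.updated((obs, action), value), default)
    }

  // mergeable-style wrapper: combine the stored value with the incoming one.
  def mergeable[Obs, A, T](inner: SketchFn[Obs, A, T])(implicit T: Combine[T]): SketchFn[Obs, A, T] =
    new SketchFn[Obs, A, T] {
      def seenStates: Iterable[Obs] = inner.seenStates
      def seen(obs: Obs): Iterable[A] = inner.seen(obs)
      def actionValue(obs: Obs, a: A): T = inner.actionValue(obs, a)
      def update(obs: Obs, action: A, value: T): SketchFn[Obs, A, T] =
        mergeable(inner.update(obs, action, T.plus(inner.actionValue(obs, action), value)))
    }

  // fold-style wrapper: `prepare` on the way in, `present` on the way out.
  def fold[Obs, A, T, U](inner: SketchFn[Obs, A, T])(prepare: U => T, present: T => U): SketchFn[Obs, A, U] =
    new SketchFn[Obs, A, U] {
      def seenStates: Iterable[Obs] = inner.seenStates
      def seen(obs: Obs): Iterable[A] = inner.seen(obs)
      def actionValue(obs: Obs, a: A): U = present(inner.actionValue(obs, a))
      def update(obs: Obs, action: A, value: U): SketchFn[Obs, A, U] =
        fold(inner.update(obs, action, prepare(value)))(prepare, present)
    }
}

object WrapperDemo {
  implicit val sum: Combine[Double] = new Combine[Double] {
    def plus(l: Double, r: Double): Double = l + r
  }
  val empty = SketchFn.base[String, Int, Double](Map.empty, 0.0)

  // Two updates to the same pair are summed rather than overwritten.
  val merged = SketchFn.mergeable(empty).update("s", 1, 2.0).update("s", 1, 3.0)

  // Callers see Int values while the table keeps storing Double.
  val folded = SketchFn
    .fold(empty)((i: Int) => i.toDouble, (d: Double) => d.toInt)
    .update("s", 1, 7)
}
```

Both wrappers return a new instance from `update` and re-wrap the updated inner table, so the merging or conversion behavior persists across a chain of updates.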
-
final
def
ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
final
def
notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
final
def
notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
final
def
synchronized[T0](arg0: ⇒ T0): T0
- Definition Classes
- AnyRef
-
def
toEvaluator[R, S[_]]: ActionValue[Obs, A, R, T, S]
TODO fill in.
-
def
toString(): String
- Definition Classes
- AnyRef → Any
-
final
def
wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()
ScalaRL
This is the API documentation for the ScalaRL functional reinforcement learning library.