Actor Temperature

back

For a history of actor events of length $n$ , we want to calculate an actor’s temperature $T$ , where $T$ is a value form $0$ to $1$ .

When $T=0$ , the actor has not sent any messages that have been classified as toxic. When $T=1$ , all of the messages sent by the actor have been classified as toxic.

We want to control how much the oldest event impacts the temperature compared to the most recent so we define a discount factor $\gamma$ that is a value from $0$ to $1$ such that a value of $0$ means only the most recent event impacts the temperature and a value of $1$ means all events impact the temperature equally.

We define $\vec{d}$ to be the vector of discount coefficients of length $n$ such that:

$\begin{aligned} \vec{d} &= \left[ \gamma^0, \gamma^1, \gamma^2, \dots, \gamma^{n-1} \right] \\ &= \left[ 1, \gamma, \gamma^2, \dots, \gamma^{n-1} \right] \\ \end{aligned}$

We also define $\vec{x}$ to be the vector of actor events where each element is either $0$ or $1$ . A $1$ is an event where a message was classified as toxic and a $0$ is any other event.

This leaves us with a calculation for $T$ .

$T = \frac{ \vec{d} \cdot \vec{x} } { \sum_{i=1}^n \vec{d}_i }$

Ok, so this gives us a weighted average. So this is really straight forward, but let’s find a calculation for $T$ when we only know the current temperature and the state of the latest event. This will allow us to update the temperature value without having to store the full events vector.

First, let’s expand this calculation of $T$ and treat the sequence of events as timesteps where $t$ is the current timestep and $t-4$ is four timesteps in the past.

$T_t = \frac{ x_t + \gamma x_{t-1} + \gamma^2 x_{t-2} + \dots + \gamma^{n-1} x_{t-(n-1)} } { 1 + \gamma + \gamma^2 + \dots + \gamma^{n-1} }$

Let’s also expand the calculation for $T_{t+1}$ .

$T_{t+1} = \frac{ x_{t+1} + \gamma x_{t} + \gamma^2 x_{t-1} + \dots + \gamma^{n-1} x_{t-(n-2)} } { 1 + \gamma + \gamma^2 + \dots + \gamma^{n-1} }$

If we can define $T_{t+1}$ in terms of $T_t$ , $x_t$ , and $\gamma$ then we are finished.

Let’s start by finding an expression that lets us define $\gamma x_{t} + \gamma^2 x_{t-1} + \dots + \gamma^{n-1} x_{t-(n-2)}$ in terms of $T_t$ .

$\begin{aligned} T_t &= \frac{ x_t + \gamma x_{t-1} + \gamma^2 x_{t-2} + \dots + \gamma^{n-1} x_{t-(n-1)} } { \sum_{i=1}^n \vec{d}_i } \\ T_t \sum_{i=1}^n \vec{d}_i &= x_t + \gamma x_{t-1} + \gamma^2 x_{t-2} + \dots + \gamma^{n-1} x_{t-(n-1)} \\ \gamma T_t \sum_{i=1}^n \vec{d}_i &= \gamma x_t + \gamma^2 x_{t-1} + \gamma^3 x_{t-2} + \dots + \gamma^n x_{t-(n-1)} \\ \left( \gamma T_t \sum_{i=1}^n \vec{d}_i \right) - \gamma^n x_{t-(n-1)} &= \gamma x_t + \gamma^2 x_{t-1} + \gamma^3 x_{t-2} + \dots + \gamma^{n-1} x_{t-(n-2)} \\ \end{aligned}$

We can now rewrite $T_{t+1}$ as follows:

$T_{t+1} = \frac{ x_{t+1} + \left( \gamma T_t \sum_{i=1}^n \vec{d}_i \right) - \gamma^n x_{t-(n-1)} } { \sum_{i=1}^n \vec{d}_i }$

This leaves us with a bit of a problem since we can’t know the value of $x_{t-(n-1)}$ unless we store the whole event vector.

The question now is can we get to a “good enough” aproximation of $T_{t+1}$ given $T_t$ , $x_t$ , and $\gamma$ .

We know that $x_{t-(n-1)}$ is either $0$ or $1$ . We also know that $\lim_{n\to\infty} \gamma^n = 0$ for $\gamma < 1$ . In practice, we won’t be calculating a temperature based on an infinite history but as long as $n$ is sufficiently large $\gamma^n$ will be small. We can then assume that $x_{t-(n-1)}$ represents a toxic message and replace it with $1$ . This introduces a non-zero error that lowers the temperature value but will be a very small error as long as $n$ is sufficiently large.

With this compromise, we can rewrite $T_{t+1}$ as:

$T_{t+1} = \frac{ x_{t+1} + \left( \gamma T_t \sum_{i=1}^n \vec{d}_i \right) - \gamma^n } { \sum_{i=1}^n \vec{d}_i }$

If $T_t$ is sufficiently small, we will get a negative temperature which breaks our requiremnets. A small change to the expression allows us to prevent this outcome.

$T_{t+1} = \frac{ x_{t+1} + \max \left( \gamma T_t \sum_{i=1}^n \vec{d}_i \right) - \gamma^n, 0 } { \sum_{i=1}^n \vec{d}_i }$

Granted, we have had to make a compromise to get to this point but the result is we are able to update $T$ without having to know the full history of the actor’s events.

Now let’s double check our requirements.

When $x_{t+1}$ and $T_t$ are both $1$ , we want $T_{t+1}$ to also be $1$ .

$\begin{aligned} T_{t+1} &= \frac{ 1 + \max \left( \gamma (1) \sum_{i=1}^n \vec{d}_i \right) - \gamma^n, 0 } { \sum_{i=1}^n \vec{d}_i } \\ &= \frac{ 1 + \left( \gamma \left( 1 + \gamma + \gamma^2 + \dots + \gamma^{n-1} \right) \right) - \gamma^n } { 1 + \gamma + \gamma^2 + \dots + \gamma^{n-1} } \\ &= \frac{ 1 + \left( \gamma + \gamma^2 + \dots + \gamma^n \right) - \gamma^n } { 1 + \gamma + \gamma^2 + \dots + \gamma^{n-1} } \\ &= \frac{ 1 + \left( \gamma + \gamma^2 + \dots + \gamma^{n-1} + \gamma^n \right) - \gamma^n } { 1 + \gamma + \gamma^2 + \dots + \gamma^{n-1} } \\ &= \frac{ 1 + \gamma + \gamma^2 + \dots + \gamma^{n-1} } { 1 + \gamma + \gamma^2 + \dots + \gamma^{n-1} } \\ &= 1 \\ \end{aligned}$

When $x_{t+1}$ and $T_t$ are both $0$ , we want $T_{t+1}$ to also be $0$ .

$\begin{aligned} T_{t+1} &= \frac{ 0 + \max \left( \gamma (0) \sum_{i=1}^n \vec{d}_i \right) - \gamma^n, 0 } { \sum_{i=1}^n \vec{d}_i } \\ &= \frac{ 0 + \max {-\gamma^n}, 0 } { \sum_{i=1}^n \vec{d}_i } \\ &= \frac{ 0 + 0 } { \sum_{i=1}^n \vec{d}_i } \\ &= 0 \\ \end{aligned}$

Let’s also test that values of $0$ and $1$ for $x_{t+1}$ have an impact on $T$ that match our intuition of temperature for “real” values of $\gamma$ and $n$ .

Let’s assume $n = 30$ and $\gamma = 0.9$ .

This results in $\sum_{i=1}^n \vec{d}_i = 9.576088417247838$ and $\gamma^n = 0.04239115827521624$

If we start with $T_t = 0.5$ we end up with the following when the latest event is a toxic message:

$\begin{aligned} T_{t+1} &= \frac{ 1 + \max \left( (0.9) (0.5) (9.576088417247838) \right) - 0.04239115827521624, 0 } { 9.576088417247838 } \\ &= \frac{ 1 + \max 4.309239787761528, 0 } { 9.576088417247838 } \\ &= \frac{ 5.309239787761528 } { 9.576088417247838 } \\ &= 0.5544267718120548 \\ \end{aligned}$

And when the letest event is not a toxic message.

$\begin{aligned} T_{t+1} &= \frac{ 0 + \max \left( (0.9) (0.5) (9.576088417247838) \right) - 0.04239115827521624, 0 } { 9.576088417247838 } \\ &= \frac{ \max 4.309239787761528, 0 } { 9.576088417247838 } \\ &= \frac{ 4.309239787761528 } { 9.576088417247838 } \\ &= 0.45 \\ \end{aligned}$

The smaller $T_t$ is, the more impact “warming” has and the hotter $T_t$ is, the more impact “cooling” has. This matches our intuition for temperature.

The following chart illustrates consecutive “warming” and “cooling” events.

Consecutive 'warming' and 'cooling' events.

It is worth noting that due to the compromise to prevent negative temperatures and handle the unknown value of $x_{t-(n-1)}$ , the temperature can reach 0 after $n$ consecutive “cooling” events, but the temperature will only ever asymptotically approach $1$ as the number of consecutive “warming” events approaches infinity.

TODO: Add coefficients of cooling and heating to enable finer grained control over the relationship between the cooling and heating curves.

NB: The reason I’ve described this in terms of events rather than simply message classifications is that we can define an event as either a message classification or the tick of a clock. This would allow actors to “cool off” by either sending messages that are deemed non-toxic or by simply refraining from sending messages.

Actor Temperature

Ed.Minnett.AI

Error

Templates (for web app):

Error