Reactive Systems using Actors

Dear all,

Welcome to Actors - a Python library for creating Reactive Systems based on the Actors model.

Status

The library is currently a work in progress and may change without any notice. It's a proof of concept version that provides the basic Actors functionality and most of the features discussed below. The library has been tested informally on a

Beaglebone Black (Debian 10.3, Python v3.6)
Raspberry Pi 4 (Ubuntu 20,04 server)
Portable PC (i7-9750H CPU, Ubuntu 22.04, Python v3.10)
High end PC (i9-12900k CPU, Fedora 37, Python v3.11)

Idea

The idea with this library is to create a framework for creating highly performant, scalable and maintainable code. My inspiration for this project originates from the following sources:

Actors Model (https://en.wikipedia.org/wiki/Actor_model)
ROS (https://www.ros.org)
RxROS (https://github.com/rosin-project/rxros)
RxROS2 (https://github.com/rosin-project/rxros2)
Reactive Programming (https://reactivex.io)
Reactive systems (https://www.reactivemanifesto.org)
CPP_Actors (https://github.com/henrik7264/CPP_Actors)

Although there are several references to ROS, the library has as such nothing to do with programming robots. It is a general purpose library/framework for creating reactive systems using Actors.

Approach

I have always found the following items central for producing high quality software

Low coupling/high cohesion
Low coupling/high cohesion is a central part of the Actors model and therefore also a central part of this library. The only way Actors can communicate with each other is through messages. An actor can publish a message and scribe and react to published messages. Each and every actor lives it own life. It will newer break due to major changes in other Actors. The developer can concentrate on developing the best Actor in the wold without ever having to think of how other Actors are implemented. An Actor may start other actors to perform its task. The only dependency between Actors are the messages and the properties/data they carries – so designing a system based on Actors is all about creating proper messages that can be distributed and processed by other Actors.
Reusability
Reusability of code is the use of existing software, or software knowledge, to build new software (ref. https://en.wikipedia.org/wiki/Code_reuse). This is of course the fundamental idea with library. I just hope the library has the quality that is needed to be used over and over again in many projects. The big question is however if the produced code using the library will result in more reusable code? This is my hope, but it will have to be seen.
Keep it simple
Keep it simple! Well, well, well - I could talk hours about this subject, but this is not the purpose of this section. I think you know what I am talking about, especially if you have tried to take over the maintenance of some code your "dear" colleague produced just before he left the company. Even code that I produced myself is hard to read and understand after having been away from it for say ½ a year. I have put a lot of work in making this code easy to read and understand, but it is in the usage of the library that the "keep it simple" statement really should shine through.

It is my hope that the above items are reflected in the code and especially the usage it, and that the library provides a solid foundation for creating reactive systems based on Actors.

Project phases

I foresee at least two phases for the library. Phase1 is related to only adding features to the library. This includes:

Logging
Logging is one of the most fundamental debugging facilities a library like this must provide. It shall be easy to enable and use. A log entry shall contain a time stamp, severity, which Actor created the entry and a text that describes a problem or information about the state of the Actor.
Http
A Web page shall be available for each Actor. The purpose of the web page is to provide monitoring of an Actor. More advanced web pages that collects information on the overall application (Actors), will be added later.
Statistics
Statistics is like logging essential for debugging a program. It shall be possible to see how many messages of a given type have been published and how long time has gone sine we received the last message. Queue sizes and number of worker threads should also be information that easily can be shown. All these information could in principle be shown in a web page. This Will be decided at a later time.
Scheduling and Timers
Scheduling and timers are fundamental of all systems and shall as such be part of this library.
State machines
State machines are part of nearly all systems. However, they come in many variants/implementations and it is always complicated to understand how they work and what they do. A simple approach to state machines will be presented in this library. The implementation will be very close to the definition of a state machine.
Behavior trees
Experimental feature that will be added to the library.
Reactive programming
Experimental feature that will be added to the library. Instead of looking at a system that is based on parsing messages from one Actor to another, we could also look at it as system of message streams that are processed by the Actors. When we take this view reactive programming becomes a natural way of processing the messages. See some of the advantages of using reactive programming on ttps://reactivex.io - they are awesome.

Phase1: Focus is to provide different features for Actors.

The second phase is only related to create a distributed system of Actors that can communicate with each other on multiple platforms and hosts.

Phase2: Focus is to provide a distributed environment for sharing messages between Actors.

The progress of this library depends a lot on the interest for it and if the overall goals actually are met.

Required software

The Actors library depends on the following software:

Python3 (Seen it run on a Python v3.6)
RxPY v4 (see https://github.com/ReactiveX/RxPY)

Installation and setup

The Actors library depends on the ReactiveX extensions RxPY This extensions must be installed prior to installing the library. The installation process is as follows:

Install RxPy
Install Actors library

Installation of RxPY on Linux

pip3 install reactivex

Installation of the Actors library on Linux

git clone https://github.com/henrik7264/PY_Actors.git

Testing the Python library on Linux

cd PY_Actors/py_actors
export PYTHONPATH=`pwd`
python3 example_publisher_subscriber/main.py
python3 example_statemachine/main.py

Using the Actors library in your own project

Now to the more fun part of using the Actors library.

Project setup

There is currently no installation packages for the Actors library. The code is simply indented to be included directly in your project. Copy the lib_actors folder directly into your project and start to create new actors as described below:

YouProject/
  lib_actors/
  ...
  actor1/
  actor2/
  ...
  main.py

A number of examples are provided as part of the Actors library. They should provide enough information to setup your development environment.

Features of the Actor library

The following sections describe messages, actors, schedulers, timers and state machines. As the library expands new features will be added

Messages

Messages are one of the most important concepts of the Actors library. A message is simply a class!

The most simple message consists of nothing but a class definition:

class SimpleMsg:
  pass

Messages may carry data and even functions that can be executed when a message is received:

class MyMessage:
    def __init__(self, name: str, count: int):
        self.name: str = name
        self.count: int = count

Operations on messages

There are three operations which can be applied on messages: This is to subscribe, unsubscribe and to publish a message.

Subscribe to a Message

An Actor subscribes to a specific message type by providing a callback function that is executed each time a message of that type is published.

The 'subscribe' function

def subscribe(self, func, msg_type) -> int:

# func: A lambda or callback function. The function must take a message argument of the specified type.
# msg_type: A reference to a class/message.
# return a subscription id (int).

Example

self.message.subscribe(self.func, MyMessage)

def func(self, msg: MyMessage):
  self.logger.debug("Received a MyMessage: " + msg.name + ", " + str(msg.count))

The 'unsubscribe' function

def unsubscribe(self, sub_id: int, msg_type) -> None:

# sub_id: subscription id (int) returned from a subscription
# msg_type: A reference to a class/message.

Example

...
sub_id = self.message.subscribe(self.func, MyMessage)

self.message.unsubscribe(sub_id, MyMessage) ...

Publish a Message

An Actor publishes messages by means of the publish function.

The 'publish' function

def publish(self, msg) -> None:

# msg: The message (instance of a class) to be published.

Example

self.message.publish(MyMessage("Hello world", 1234))

The sequence diagram below shows how the subscription and publishing of messages work. The Actor starts by subscribing to a number of messages (message types). A callback function is associated to each subscription. Each time a message is published the set of callback functions that have subscribed to the message type will be executed. This takes place in the Dispatcher where a number of Worker threads will take care of the execution.

Sequence diagram showing the subscribe and publish mechanism of the Actors Library.

There are some problems related to this execution model/architecture:

While executing one callback function another message may be published and trigger another callback function. This could in worse case lead to thread synchronization problems. The Actors library solves this problem by allowing only one callback function per Actor to be executed at a time, i.e. 100 Actors can concurrently execute 100 callback functions, but one Actor can only execute one callback function at a time.
A heavy message load may create the situation described in item 1. The actors library accommodates for this by providing a number of threads (equals to the number of CPU cores) that will execute the callback functions. If the messages cannot be handled as fast as they arrive the queues associated to the threads will start to grow and the system will become unresponsive and unpredictable.

The problems described above are common for this kind of architecture. There is no real solution to the problem except that the architect and programmer must ensure that the hardware platform is dimensioned to the message load of the system and that the callback functions are fast and responsive. Avoid sleep, wait and I/O operations in callback functions that are called often.

Actors

Actors are, like messages, a central part of the Actors library. All Actors are sub-classes of the Actor class:

Creation of an Actor

class MyActor(Actor):
        def __init__(self):
            super().__init__("MyActor", logging.NOTSET)

It is as simple as that! The Actor takes as argument the name of the Actor. It must be a unique name that is easy to identify in ex. log message. The second argument is the log level. The default log level is set to CRITICAL. Set it to logging.NOTSET to log everything.

Initialization of an Actor consist of creating an instance of it. It can be done from anywhere and at any time - even an Actor may create new Actors. The instance of an Actor must exists throughout the lifetime of the application.

if __name__ == "__main__":
    # Initialize actors
    actors = [MyActor(), ASecondActor(), AThirdActor()]
    ...

An Actor is implemented as a facade. As soon we are in the scope of an Actor a set of functions becomes available. This includes:

self.message.subscribe(...)
self.message.unsubscribe(...)
self.message.publish(...)
self.message.stream(...)

self.logger.debug(...)
self.logger.info(...)
self.logger.warning(...)
self.logger.error(...)
self.logger.critical(...)

self.scheduler.once(...)
self.scheduler.repeat(...)
self.scheduler.remove(...)
self.scheduler.timer(...)

self.sm = Statemachine(...)

Observe how the functions are organized into logical groups. This makes it very easy to understand and use them. Only Statemachine is a bit different due to its nature.

Logging

The logging interface of the Actors library is based on the Python logging library. The Python library has been slightly adapted so that the name of the Actor is included in the log message. Default is to log to a terminal and a file named "actors.log", and the log level is set to CRITICAL. The log level can be changed during creation of the Actor. In fact, all features of the Python logging library are available and can be changed if needed. It is however recommended to use the following interface to log messages:

Logging Interface

self.logger.debug(msg, *args, **kwargs)
self.logger.info(msg, *args, **kwargs)
self.logger.warning(msg, *args, **kwargs)
self.logger.error(msg, *args, **kwargs)
self.logger.critical(msg, *args, **kwargs)

The msg is the message format string, and the args are the arguments which are merged into msg using the string formatting operator (source https://docs.python.org/3/library/logging.html).

Example

self.logger.info("Received a MyMessage: " + msg.name + ", " + str(msg.count))

The code will produce the following log entry:

2023-01-01 23:19:49,175 MyActor INFO: Received a MyMessage: Hello World, 1234

Scheduler

A Scheduler can be used to execute a task (function call) at a given time. The task can be executed once or repeated until it is removed. The scheduled tasks are executed by a single Worker/Thread. If the Worker is not able to execute the tasks as requested by the Scheduler the thread queue will start to grow. The situation is not different from handling messages (see above section) - in fact it is exactly the same, and the Actors library will behave the same way:

While executing one task another task may be triggered by a scheduler timeout. This could in worse case lead to thread synchronization problems. The Actors library solves this problem by allowing only one task per Actor to execute at a time, i.e. 100 Actors can concurrently execute 100 tasks, but one Actor can only execute one task at a time.
A heavy scheduler load may create the situation described in item 1. If the tasks cannot be handled as fast as they are scheduled the queues associated with the Worker will start to grow and the system will become unresponsive and unpredictable.
Scheduled tasks and message handling works under the same principles as described above. Only one task/callback function can be executed at time per Actor to avoid synchronization problems.

Again, the Actors library will do what it can to solve the load problems, but the root cause of the problem is an insufficient hardware platform and/or poor implementations of the tasks/callback functions. It is in these two areas the problem should be resolved.

The scheduler interface is defined as follows:

Schedule a task once

A task can be executed once at a given time by the Scheduler. The once function will return a job id that can be used to cancel/remove the scheduled job.

The 'once' function

def once(self, msec: int, func) -> int:

# msec: timeout in milliseconds.
# func: call back function to be executed when the job times out.
# return: job_id (int)

Example

job_id = self.scheduler.once(1000, self.task)

def task(self):
  self.logger.debug("The scheduled job timedout.")

Schedule a repeating task

A job can be scheduled to repeat a task. The repeat function will return a job id that can be used to cancel/remove the scheduled job.

The 'repeat' function

def repeat(self, msec: int, func) -> int:

# msec: timeout in milliseconds.
# func: call back function to be executed when the job times out.
# return: job_id (int)

Example

job_id = self.scheduler.repeat(1000, self.task)

def task(self):
  self.logger.debug("The scheduled job timedout.")

Remove a scheduled job

A scheduled job can at any time be canceled/removed.

The 'remove' function

def remove(self, job_id: int) -> None:

# job_id: job to be removed.

Example

job_id = self.scheduler.repeat(1000, self.task)  # A new job has been scheduled.
...
self.scheduler.remove(job_id)  # The job is canceled and removed.

Timers

Timers are similar to schedulers, except a timer must be started before it is activated. A timer can at anytime be stopped or restarted if needed. The timer has a timeout and a callback function. The timer is activated when it is started, and when it times out the callback function will be executed.

Create a timer

A timer is created by calling the scheduler.timer function. It takes a timeout time and a callback function as argument. The Timer is activated at the moment it is started.

t1 = self.scheduler.timer(1000, self.func)  # Create a timer
...
t1.start()  # Start the timer. It will timeout 1000ms from this moment.
...
t1.stop()  # Stop the timer.
...
t1.start() # Restart the timer.  It will timeout 1000ms from this moment.

Start a timer

A Timer is activated at the moment it is started. If needed it can at any time be restarted by calling the start function.

The 'start' function

def start(self) -> None

Example

t1 = self.scheduler.timer(1000, self.func)  # Create a timer
t1.start()  # Start the timer. It will timeout 1000ms from this moment.

def func(self):
  self.logger.debug("The timer timedout.")

Stop a timer

A timer can at any time be stopped by calling the stop function. The stop function will inactivate the timer and preventing it from timing out.

The 'stop' function

def stop(self) -> None:

Example

def func(self):
  self.logger.debug("The timer timedout.")

t1 = self.scheduler.timer(1000, self.func)  # Create a timer
...
t1.start()  # Start the timer. It will timeout 1000ms from this moment.
...
t1.stop()  # Stop the timer. The timer is inactivated and it will not time out.

State Machines

State machines comes in many forms and is always hard to understand. The state machine is in itself not that hard to understand: It can be in one of a number of states and it will initially be in a predefined state. The state machine can change from one state to another in response to an event. The change from one state to another is called a transition. An state machine is in other words defined by list of states, an initial state, and the events that trigger the transitions.

Creating a State Machine

A state machine is created, not surprisingly, as an instance of a Statemachine:

self.sm = Statemachine(initial_state,
            State(state1, ...),
            State(state2, ...)
            ...
            State(stateN, ...))

It takes as argument an initial state and a number of states. Each state is identified by a unique state id. It can be a number, string, enumeration etc. Use enumerations as shown in the example below. This is a nice way to define state ids.

Example

The following example shows the definition of a state machine of a door. The door can be in state DOOR_CLOSED or DOOR_OPENED. The door is initially in state DOOR_CLOSED.

class States(Enum):
    DOOR_OPENED = 0,
    DOOR_CLOSED = 1
self.sm = Statemachine(States.DOOR_CLOSED,
               State(States.DOOR_CLOSED, ...),
               State(States.DOOR_OPENED, ...))

Creating a State

A State is defined by its id and a number of transitions that each is triggered by an event.

class States(Enum):
    DOOR_OPENED = 0,
    DOOR_CLOSED = 1
self.sm = Statemachine(States.DOOR_CLOSED,
                State(States.DOOR_CLOSED,
                      Transition(...),
                      ...
                      Transition(...)),
                State(States.DOOR_OPENED,
                      Transition(...),
                      ...
                      Transition(...)))

Two types of transitions can be used. That is the Message transition which is triggered by a Message that is published by an Actor, and the Timer transition that is triggered by a timer timeout.

Example

class States(Enum):
    DOOR_OPENED = 0,
    DOOR_CLOSED = 1
self.sm = Statemachine(States.DOOR_CLOSED,
                State(States.DOOR_CLOSED,
                      Message(OpenDoorMsg, ...)),
                State(States.DOOR_OPENED,
                      Message(CloseDoorMsg, ...),
                      Timer(1000, ...)))

Creating a Transition

To get the full picture of the state machine's capabilities we have to understand transitions. A transition is triggered by an event. When it is triggered it will perform an action (optional), and finally it will change to a new/next state (optional).

class States(Enum):
    DOOR_OPENED = 0,
    DOOR_CLOSED = 1
self.sm = Statemachine(States.DOOR_CLOSED,
                       State(States.DOOR_CLOSED,
                             Message(OpenDoorMsg, action=self.open_door, next_state=States.DOOR_OPENED)),
                       State(States.DOOR_OPENED,
                             Message(CloseDoorMsg, action=self.close_door, next_state=States.DOOR_CLOSED),
                             Timer(1000, action=self.auto_close_door, next_state=States.DOOR_CLOSED)))

The action part is simply a function or lambda expression that is executed as part handling the transition. When the operation is complete the transition will set the new/next state of the state machine. Observe that both the action and next_state are optional.

Observations related to State Machines

The state machine is tricky construction, but care has been taken to make it simple and safe to use:

A state machine is tightly coupled to an Actor. In fact, the state machine can not be used outside the scope of an Actor.
A transition is an "atomic" operation. It works as follows:
1. A message is published by an Actor.
2. The state machine will check if it has a transition that is triggered by the message event.
3. If this is the case, the state machine will be locked until the transition has been executed and the next state has been set.
4. The state machine is unlocked and new events can be handled.
An Actor can define more state machines. The state machines are updated concurrently and there is no way to determine if one state machine is updated before the other.
The state machine can slow down due to constraints with execution of transitions.

Name		Name	Last commit message	Last commit date
Latest commit History 107 Commits
images		images
py_actors		py_actors
LICENSE		LICENSE
README.md		README.md

License

henrik7264/PY_Actors

Folders and files

Latest commit

History

Repository files navigation

Reactive Systems using Actors

Status

Idea

Approach

Project phases

Required software

Installation and setup

Installation of RxPY on Linux

Installation of the Actors library on Linux

Testing the Python library on Linux

Using the Actors library in your own project

Project setup

Features of the Actor library

Messages

Operations on messages

Subscribe to a Message

The 'subscribe' function

Example

The 'unsubscribe' function

Example

Publish a Message

The 'publish' function

Example

Actors

Creation of an Actor

Logging

Logging Interface

Example

Scheduler

Schedule a task once

The 'once' function

Example

Schedule a repeating task

The 'repeat' function

Example

Remove a scheduled job

The 'remove' function

Example

Timers

Create a timer

Start a timer

The 'start' function

Example

Stop a timer

The 'stop' function

Example

State Machines

Creating a State Machine

Example

Creating a State

Example

Creating a Transition

Observations related to State Machines

Message Streams

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages