Note
The reason for doing file I/O in a thread pool is that there have historically been poor cross-platform APIs for non-blocking file I/O. While it’s true that many runtimes choose to relegate this task to a thread pool making blocking calls to the OS, it might not be true in the future as the OS APIs evolve over time.
Creating a thread pool to handle these cases is outside the scope of this example (even mio considers this outside its scope, just to be clear). We’ll focus on showing how epoll works and mention these topics in the text, even though we won’t actually implement a solution for them in this example.
Now that we’ve covered a lot of basic information about epoll, mio, and the design of our example, it’s time to write some code and see for ourselves how this all works in practice.
The ffi module
Let’s start with the modules that don’t depend on any others and work our way from there. The ffi module contains mappings to the syscalls and data structures we need to communicate with the operating system. We’ll also explain how epoll works in detail once we have presented the syscalls.
It’s only a few lines of code, so I’ll place the first part here so it’s easier to keep track of where we are in the file since there’s quite a bit to explain. Open the ffi.rs file and write the following lines of code:
ch04/a-epoll/src/ffi.rs
pub const EPOLL_CTL_ADD: i32 = 1;
pub const EPOLLIN: i32 = 0x1;
pub const EPOLLET: i32 = 1 << 31;
#[link(name = “c”)]
extern “C” {
pub fn epoll_create(size: i32) -> i32;
pub fn close(fd: i32) -> i32;
pub fn epoll_ctl(epfd: i32, op: i32, fd: i32, event: *mut Event) -> i32;
pub fn epoll_wait(epfd: i32, events: *mut Event, maxevents: i32, timeout: i32) -> i32;
}
The first thing you’ll notice is that we declare a few constants called EPOLL_CTL_ADD, EPOLLIN, and EPOLLET.
I’ll get back to explaining what these constants are in a moment. Let’s first take a look at the syscalls we need to make. Fortunately, we’ve already covered syscalls in detail, so you already know the basics of ffi and why we link to C in the preceding code:
• epoll_create is the syscall we make to create an epoll queue. You can find the documentation for it at https://man7.org/linux/man-pages/man2/epoll_create.2.html. This method accepts one argument called size, but size is there only for historical reasons. The argument will be ignored but must have a value larger than 0.
• close is the syscall we need to close the file descriptor we get when we create our epoll instance, so we release our resources properly. You can read the documentation for the syscall at https://man7.org/linux/man-pages/man2/close.2.html.
• epoll_ctl is the control interface we use to perform operations on our epoll instance. This is the call we use to register interest in events on a source. It supports three main operations: add, modify, or delete. The first argument, epfd, is the epoll file descriptor we want to perform operations on. The second argument, op, is the argument where we specify whether we want to perform an add, modify, or delete operation
• In our case, we’re only interested in adding interest for events, so we’ll only pass in EPOLL_CTL_ADD, which is the value to indicate that we want to perform an add operation. epoll_event is a little more complicated, so we’ll discuss it in more detail. It does two important things for us: first, the events field indicates what kind of events we want to be notified of and it can also modify the behavior of how and when we get notified. Second, the data field passes on a piece of data to the kernel that it will return to us when an event occurs. The latter is important since we need this data to identify exactly what event occurred since that’s the only information we’ll receive in return that can identify what source we got the notification for. You can find the documentation for this syscall here: https://man7.org/linux/man-pages/man2/epoll_ctl.2.html.
• epoll_wait is the call that will block the current thread and wait until one of two things happens: we receive a notification that an event has occurred or it times out. epfd is the epoll file descriptor identifying the queue we made with epoll_create. events is an array of the same Event structure we used in epoll_ctl. The difference is that the events field now gives us information about what event did occur, and importantly the data field contains the same data that we passed in when we registered interest
• For example, the data field lets us identify which file descriptor has data that’s ready to be read. The maxevents arguments tell the kernel how many events we have reserved space for in our array. Lastly, the timeout argument tells the kernel how long we will wait for events before it will wake us up again so we don’t potentially block forever. You can read the documentation for epoll_wait at https://man7.org/linux/man-pages/man2/epoll_wait.2.html.
Leave a Reply