Understanding Smart Pointers in Rust

Published on July 29th, 2024

Smart Pointers in Rust

When I first learned Rust, smart pointers were extremely confusing. 😬 If you're struggling too, don't worry, I got you.

In this blog post, we're going to talk about smart pointers and interior mutability in detail.

First, What are Pointers in Rust?

A Pointer is a variable that stores memory address which points to some other data in memory. The most common pointer is references (&) in rust. They don’t have any special capabilities other than referring to data, and have no overhead(additional operational cost).

fn main(){
    let x = 5;
    let y = &x; // y is pointing towards x
    println!("x: {}", x); // prints: x: 5
    println!("y: {}", y); // prints: y: 5
}

What are Smart Pointers in Rust and why do we need them?

Smart pointers is basically a way to do more things on top of references(&). They implement the Deref and Drop traits. Moreover, smart pointers can also have ownership of the data they are pointing to.

Deref Trait:- The Deref trait is used to treat smart pointers as a reference, enabling easy access to the data stored behind the smart pointers.

fn main() {
    let x = 5;
    let y = &x; //setting y equal to a reference to x

    assert_eq!(5, x);
    assert_eq!(5, *y); //dereferencing y so that compiler can access the real value
    //assert_eq!(5,y); this will give error as we are comparing i32 to a reference
}

Drop Trait:- The Drop trait lets you decide what to do when a value is going out of scope.

struct Cars  {
        size: i32
}

impl Drop for Cars {
        fn drop(&mut self) {
                println!("This instance of Car has being dropped: {}", self.size);
}
}

fn main() {
        let first_instance = Cars{size: 10};
        let second_instance = Cars{size: 5};
        println!("car instance is created.");

       //At the end of main, our instances will go out of scope and Rust will call the code we put in the drop method, printing our final message
}

/*
output :-
car instance is created.
This instance of Car has being dropped: 10
This instance of Car has being dropped: 5

*/

Now that our basics are cleared. I’ll be talking about several types of smart pointers in rust and their usecases.

1. `Box<T>`

Box is the simplest smart pointer in rust. It is used to dictate that enclosed data needs to be stored on the heap instead of stack.

fn main() {
    let boxed_value = Box::new(10); //boxed_value owns 10 here
    println!("Boxed value: {}", boxed_value);

    // `boxed_value` is deallocated here automatically when it goes out of scope
}
//10 is stored in heap

Usecases:-

When you need to store data on the heap, use Box.
When you have a data type whose size is not defined, use Box.
When you want to create recursive types. Eg- Binary Tree

Usefull Resources:-

2. `Rc<T>` (Reference Counting)

Rc comes into the picture when you have multiple references to a memory but you are not sure in which order they are going out of scope. Rc counts the number of references to the memory and it keeps the references alive until the last reference goes out of scope.

#[derive(Debug)]
//This is how we bring Rc into scope
use std::rc::Rc;

struct Person {
    name: String,
    age: u32,
}

fn main() {
    let person1 = Rc::new(Person {
        name: "Alice".to_string(),
        age: 25,
    });

    // Clone the Rc pointer to create additional references
    let person2 = Rc::clone(&person1);
    let person3 = Rc::clone(&person1);
    /*
    note:- here the clone method does not clone the data it
    wraps but instead makes another Rc that points to the data on the heap.
    */
    println!("Reference Count: {}", Rc::strong_count(&person1));
}

Usecases:-

When you need multiple parts of your code to own and manage access to the same data, use Rc.
When you don't want to clone data while creating new references of the data, use Rc.
When your program is single threaded because Rc is not thread-safe.

Useful Links:-

3. `Arc<T>` (Atomic Reference Count)

The Arc smart pointer is just like the Rc smart pointer but with extra benefits. It lets you use atomic operations to manage reference count, making it thread-safe.

what are Atomic operations?

Atomic operations ensure that a set of operations on shared data are completed without interruption, providing thread safety and consistency.

use std::sync::Arc;
use std::thread;

let val = Arc::new(0);
for i in 0..10 {
    let val = Arc::clone(&val);

    // You could not do this with "Rc"
    thread::spawn(move || {
        println!(
            "Value: {:?} / Active pointers: {}",
            *val+i,
            Arc::strong_count(&val)
        );
    });
}
/*
output:-
Value: 0 / Active pointers: 11
Value: 1 / Active pointers: 11
...
*/

Usecases:-

Arc should be used when you need to share data between multiple threads as it provides thread safety.

Useful Link:-

Note:- It is slower than Rc when dealing with single threads.

4. `Refcell<T>` (Reference Cell)

Refcell is used when you want to mutate data even when there are immutable references to that data and this pattern is often referred to as interior mutability in Rust.

Note:- RefCell itself is not a smart pointer as it holds data instead of referring to it, but it offers two smart pointers, Ref and RefMut, to access the contained data.

How Refcell works?

So basically, Refcell implements runtime borrow checking, rather than compile-time checking similar to Rust’s usual borrowing rules. This allows certain memory-safe scenarios to be executed, which would have been disallowed by the compile-time checks.

use std::cell::RefCell;

fn main() {
    // Create a RefCell containing a vector
    let vec = RefCell::new(vec![1, 2, 3]);

    // Get a mutable reference to the vector and modify it
    {
        let mut vec_ref = vec.borrow_mut();
        vec_ref.push(4); // Mutate the vector by adding an element
    }

    // Get an immutable reference to the vector and read it
    {
        let vec_ref = vec.borrow();
        println!("Vector contents: {:?}", *vec_ref); // Output: Vector contents: [1, 2, 3, 4]
    }
}

Usecases:-

It should be used when you’re sure your code follows the borrowing rules but the compiler is unable to understand and guarantee that.
It should be used when you need to manage mutable state in a single-threaded context.

Useful Links:-

5. `Mutex<T>` (Mutual Exclusion)

Mutex is helpful when we want to be able to mutate shared data in multiple threads safely. It allows only one thread to access some data at any given time whether it is a writer or reader. It is used with Arc to ensure shared ownership among multi threaded programs.

Note:- Mutex itself is not a smart pointer as it holds data instead of referring to it but the call to lock returns a smart pointer called MutexGuard.

use std::sync::{Arc, Mutex};
use std::thread;

fn main() {
    let counter = Arc::new(Mutex::new(0));
    let mut handles = vec![];

    for _ in 0..10 {
        let counter = Arc::clone(&counter);
        let handle = thread::spawn(move || {
            let mut num = counter.lock().unwrap();

            *num += 1;
        });
        handles.push(handle);
    }

    for handle in handles {
        handle.join().unwrap();
    }

    println!("Result: {}", *counter.lock().unwrap());
}
//result : 10

Usecases:-

Mutex should be used when you need to exclusively access data majorly dealing with high writing operations keeping overhead of locking and unlocking in mind.

Useful Links:-

6. `RwLock<T>` (Reader-Writer Lock)

RwLock is similar to Mutex but unlike Mutex it allows multiple threads to read mutable data while only allowing one thread to write to it at a time.

Note:- RwLock itself is not a smart pointer as it holds data instead of referring to it, but it offers two smart pointers, RwLockReadGuard and RwLockWriteGuard, to access the contained data.

use std::sync::RwLock;

let lock = RwLock::new(5);

// many reader locks can be held at once
{
    let r1 = lock.read().unwrap();
    let r2 = lock.read().unwrap();
    assert_eq!(*r1, 5);
    assert_eq!(*r2, 5);
} // read locks are dropped at this point

// only one write lock may be held, however
{
    let mut w = lock.write().unwrap();
    *w += 1;
    assert_eq!(*w, 6);
} // write lock is dropped here

Usecases:-

Rwlock should be used when you have frequent reads and rare write operations.
When you want to allow multiple readers to access the data concurrently.

Useful Links:-

Rust Docs

7. `Cow<T>` (Clone on Write)

Cow provides a flexible way of working with borrowed and owned data. It acts as an enum that lets you use either an owned instance of something or a borrowed instance, rather than acting like a smart pointer, using the same code.

Note:- Cow isn't a smart pointer as It doesn't implement Drop trait. Cows are used to abstract over owned vs references data.

fn get_potion_message(num: usize) -> Cow<'static, str> {
    match num {
        0 => Cow::Borrowed("out of potions"),
        1 => Cow::Borrowed("last potion"),
        _ => Cow::Owned(format!("{} potions remaining", num)),
    }
}
//static cases avoid allocation using cow borrowed
//Dynamic cases use Cow::Owned for formatted strings, allowing mutation if necessary.

Usecases:-

It should be used when you want to leverage efficient and smart memory allocation and data handling.

Useful Links:-

Rust Docs

8. `Cell<T>`

Cell is similar to Refcell but it provides zero-cost interior mutability only for Copy types. It's basically a RefCell with less features as you can only take/replace values, but it has the upside of no runtime cost or ability to panic.

let x = Cell::new(1);
let y = &x;
let z = &x;
x.set(2);
y.set(3);
z.set(4);
println!("{}", x.get());
//Note that here we were able to mutate the same value from various immutable references.

Usecases:-

In cases where mutation is an implementation detail, use Cell.

Useful Links:-

Rust Docs

First, What are Pointers in Rust?

What are Smart Pointers in Rust and why do we need them?

1. Box<T>

Usecases:-

Usefull Resources:-

2. Rc<T> (Reference Counting)

Usecases:-

Useful Links:-

3. Arc<T> (Atomic Reference Count)

Usecases:-

Useful Link:-

4. Refcell<T> (Reference Cell)

Usecases:-

Useful Links:-

5. Mutex<T> (Mutual Exclusion)

Usecases:-

Useful Links:-

6. RwLock<T> (Reader-Writer Lock)

Usecases:-

Useful Links:-

7. Cow<T> (Clone on Write)

Usecases:-

Useful Links:-

8. Cell<T>

Usecases:-

Useful Links:-

1. `Box<T>`

2. `Rc<T>` (Reference Counting)

3. `Arc<T>` (Atomic Reference Count)

4. `Refcell<T>` (Reference Cell)

5. `Mutex<T>` (Mutual Exclusion)

6. `RwLock<T>` (Reader-Writer Lock)

7. `Cow<T>` (Clone on Write)

8. `Cell<T>`