4 ways to filter has_many associations

You have a model that has_many associations, and you’d like to get a collection of these models with their associations that satisfies a list of conditions both on the model and the association records. You’ve tried a few things, but whatever you’ve tried only “sort of” works (meaning, not really) and leaves you feeling like you need a push in the right direction.

Filtering has_many associations can be one of those problems which makes you scratch your head and want to do some extra reading to brush up on your SQL and ActiveRecord knowledge.

For example, let’s say you have User and Project models in your system, and you want to write a query that retrieves a collection of users with projects that were created in a given date range.

Depending on what you want your query to return, here are 4 different ways to approach this problem:

  1. The simplest thing you can do is to combine the joins and where methods that ActiveRecord gives you, like so:

       User.joins(:projects).where(projects: { zipcode: 30332 })
    

    This technique is a good choice when you need to filter your records by one or more attributes. This will give you back a collection of User records which all have projects with a zipcode of 30332. One gotcha to keep in mind is that since you’re doing a INNER JOIN with the joins method, this can return duplicates. Read on to point 2 to see how you can get around this.

  2. More often than not, you might have scopes defined on your association models which you want to re-use. So when you want to use these scopes to do your filtering, get acquainted with ActiveRecord's merge function. According to the ActiveRecord documentation, merge returns an array representing the intersection of the ActiveRecord::Relation that you call merge on, and the ActiveRecord::Relation that you pass in.
    So, for example, if you have a scope in your Project model called opened_recently, which returns all projects which have been created within the last 10 days, you can do something like:
      User.joins(:projects).merge(Project.opened_recently)
    

    This will return to you a list of User objects which all satisfy the condition of having projects which were opened recently. Like I said before, one thing to keep in mind when you’re using joins for a has_many association is that since you are doing an INNER JOIN, it is possible that you will get back duplicate User (in our example) objects – basically one object for every ‘match’. It’s easy enough to get around this with a call to the uniq method.

     User.joins(:projects).merge(Project.opened_recently).uniq
    

    The uniq method works by changing your query from something like SELECT `users` from ... to something like SELECT DISTINCT `users` from...

  3. If you’re going to be needing to retrieve information from the records you filter, especially information stored in the associations of these records, another option to consider is eager loading and the includes method. As you probably know, the benefit of eager loading is that you get to avoid the N + 1 queries problem. It’s pretty straightforward to filter by attributes or scopes on the association: User.includes(:projects).where(projects: { zipcode: ‘30332’ }) This will give us all users who have projects in Atlanta, and also eager load their projects. If in your .where you want you use an SQL fragment, then you need to call the references method: User.includes(:projects).where(‘projects.deleted_at IS NOT NULL’).references(:projects) You can also use the merge method in conjunction with includes and references to make use of any scopes you might have defined in your association models, like so: User.includes(:projects).merge(Project.opened_recently).references(:projects) You won’t need to use uniq when you use includes because ActiveRecord will take care of it for you automatically.

  4. Now, what should you do if you want your collection to only have those associations which meet your filter criteria? This situation arose for me when I had to write an API endpoint which let app users pass in parameters to filter associations. There are a couple of ways you can approach this. In my case, since I had to return an object that would eventually be converted to JSON, the simplest way was to manually construct a hash which contained the record and the filtered associations. So I did something like this in my controller:
 def show 
   user_attributes.merge(projects: filtered_projects) 
 end
 
 
 def user_attributes 
   # returns a hash containing attributes for the given user 
   # something like this: # { id: 1, first_name: 'Rick', last_name: 'Sanchez' } 
 end
 
 def filtered_projects 
   # here's where I used any Project related params to filter down my projects 
   # something like Project.opened_after(params[:project_date]).as_json 
   # I use as_json because I need this method to return a hash 
 end
 Note that the `merge` method used here is the one for `Hash`, and not the `ActiveRecord` method. You should be able to extend this to return multiple users as well.

Another way to do this is by defining a scoped has_many association.

Want to be notified when I publish a new article?