The post Refactoring a wall of JavaScript from dev.to project first appeared on Ruby clarity.
]]>Also, we're experimenting with the naming process, inspired by
Naming as a process.This video was originally streamed on my Twitch channel. If you'd like to be notified, when I stream, you can subscribe on Twitch or on Twitter
This video was originally streamed on my Twitch channel. If you'd like to be notified, when I stream, you can subscribe on Twitch or on Twitter.
Happy hacking!
The post Refactoring a wall of JavaScript from dev.to project first appeared on Ruby clarity.
]]>The post Is it always a good idea to split long methods into smaller ones? An experiment. first appeared on Ruby clarity.
]]>So, I want to do an experiment and try to split a long method that's difficult to split into smaller methods. I know that it's easy to extract methods when the code uses several levels of abstraction or a low level of abstraction. In such a case it's easy to achieve better readability by extracting some methods. So, the hardest long method I can think of would use a high level of abstraction, from which it wouldn't be easy to go to a level higher.
Such a method can be found in Rails' ActiveRecord::Persistence
module. It's the #touch
method, and it deals with timestamp attributes that need to be updated with current time, scopes, primary keys, locks and SQL UPDATE. In other words, the level of abstraction is high enough and it's the level that deals with database-level concepts. I expect it won't be that easy to go to a higher level of abstraction here.
def touch(*names, time: nil) unless persisted? raise ActiveRecordError, <<-MSG.squish cannot touch on a new or destroyed record object. Consider using persisted?, new_record?, or destroyed? before touching MSG end time ||= current_time_from_proper_timezone attributes = timestamp_attributes_for_update_in_model attributes.concat(names) unless attributes.empty? changes = {} attributes.each do |column| column = column.to_s changes[column] = write_attribute(column, time) end primary_key = self.class.primary_key scope = self.class.unscoped.where(primary_key => _read_attribute(primary_key)) if locking_enabled? locking_column = self.class.locking_column scope = scope.where(locking_column => _read_attribute(locking_column)) changes[locking_column] = increment_lock end clear_attribute_changes(changes.keys) result = scope.update_all(changes) == 1 if !result && locking_enabled? raise ActiveRecord::StaleObjectError.new(self, "touch") end @_trigger_update_callback = result result else true end end
At 42 lines, #touch
is quite long and requires some explaining. Arguments #touch
accepts are names
(meaning timestamp attributes to update) and time
(the time to set timestamps to). Then we decline to work on non-persisted records (lines 2-7). Then we set up the default time
value to current time and merge standard timestamp attributes and passed timestamp attributes (names
) into attributes
array (lines 9-11).
Then goes the most interesting part, when there are some attributes
to update (i.e. attributes
isn't empty). We instantiate changes
hash to pass to #update_all
later, and fill it up with attribute
keys and time
values, and set timestamp attributes on the record to time
(lines 14-19). Then we setup a scope we'll be using to match this very record that we #touch
, using primary key (lines 21-22). Then we deal with the case when locking is on, updating both record, changes
hash and the scope (lines 24-28).
And then clear #changed
, so that after we've updated/touched the record, the stuff we've updated in db is no longer marked as changed (line 30). Then we run #update_all
. If we've failed to find the record, because of locking issues, we raise a StaleObjectError
(lines 33-35). And at last, we setup a special flag telling Rails whether we've actually changed data in the db.
Now, an easy way to split a method is to see how it's structured, and split at the seams. I'd expect a junior developer to do just that. And that's what I did:
unless attributes.empty? changes = {} update_record_and_changes_with_time(attributes, time, changes) scope = scope_by_primary_key if locking_enabled? scope = extend_scope_to_match_locking_column_value(scope) update_record_and_changes_with_lock_increment(changes) end clear_attribute_changes(changes.keys) result = scope.update_all(changes) == 1 if !result && locking_enabled? raise ActiveRecord::StaleObjectError.new(self, "touch") end @_trigger_update_callback = result result else true end
As you can see, I just replaced more verbose code with explanations. The same level of abstractions is used. The main problem here is whether it's still clear what's going on. I've shown this code to a person and he says it's still clear.
However, this is really my 3rd take on refactoring #touch
. The first two refactoring were pretty bad. It took me awhile to understand all the intricacies of this method.
There are hidden concepts in the original code that the reader has to figure out by themselves. I describe them beneath the code (no need to read the code below without explanations).
unless attributes.empty? changes = {} attributes.each do |column| column = column.to_s changes[column] = write_attribute(column, time) end primary_key = self.class.primary_key scope = self.class.unscoped.where(primary_key => _read_attribute(primary_key)) if locking_enabled? locking_column = self.class.locking_column scope = scope.where(locking_column => _read_attribute(locking_column)) changes[locking_column] = increment_lock end clear_attribute_changes(changes.keys) result = scope.update_all(changes) == 1 if !result && locking_enabled? raise ActiveRecord::StaleObjectError.new(self, "touch") end @_trigger_update_callback = result result else true end
So, the hidden concepts are:
changes
and record attributes are updated with the same information at the same time (lines 4-7 and 15). It follows that the record attributes and changes
hold the same information afterwards.scope
is used to match the record #touch
is called on (lines 9-10 and 14).#increment_lock
is called.Also, there is understanding of why we need #clear_attribute_changes
(on the line 18) after we're done with adding locking information to changes
. In fact, that statement used to be placed before the code that deals with locking, and was fixed later. The bug was that locking column was updated, but not cleared off #changed
. Of course, now the tests prevent from regressions, but it'd be great to understand all the intricacies easily.
I believe that if we could put all the code that deals with updating record attributes into one place, and place #clear_attribute_changes
afterwards, it'll be a bit clearer. This brings us to another refactoring take:
In this take I've clearly separated scope
building and updating of record attributes and changes
hash:
unless attributes.empty? scope = prepare_scope_to_match_this_record changes = {} update_record_and_changes_with_same_data(attributes, time, changes) result = scope.update_all(changes) == 1 if !result && locking_enabled? raise ActiveRecord::StaleObjectError.new(self, "touch") end @_trigger_update_callback = result result else true end private def update_record_and_changes_with_same_data(attributes, time, changes) attributes.each do |column| column = column.to_s changes[column] = write_attribute(column, time) end if locking_enabled? changes[self.class.locking_column] = increment_lock end clear_attribute_changes(changes.keys) end
I think the line 2 is as good as it gets, but the line 5 could be named better. The level of abstraction used is essentially the same as the original code, dealing with records and scopes.
As for revealing of the hidden concepts, we have:
changes
are updated with the same data (see the line 5).scope
is meant to match the record #touch
is called on.scope
should be prepared before changing record, at least not in a prominent way. I considered naming scope preparing method #prepare_scope_to_match_this_unchanged_record
, but _unchanged_
would just add cognitive overhead. It's not every day that you're moving things around. And, match this record hints that values used for that should be such that the record can be found, i.e. unchanged.The first two takes haven't really deviated from the existing code structure, the first take especially. But there's another way to look at it, from the domain perspective. You could say it's thinking out of the box.
What does the code tell us about domain? Do we see domain concepts manifesting in the code?
If I look at #touch
with that in mind, I can't help noticing that domain logic for #touch
should be:
time
. Standard timestamp attributes and passed timestamp attributes are updated.That's the level of abstraction that should be used, when looking from the domain context perspective:
unless attributes.empty? attributes.each do |column| column = column.to_s write_attribute(column, time) end touch_columns(*attributes) else true end
So, here we update the standard and passed timestamp attributes on the record and tell #touch_columns
to save them in db.
I like this take the most. There's no word about locking as it doesn't really belong in the domain logic, and we don't know how the attributes are going to get saved, all we care about is that they get saved.
The name #touch_columns
is not very good. In Rails, there is #update_attributes
that #save
s the record, and there is #update_columns
that just executes SQL and doesn't call any callbacks. #touch_columns
is in-between, skipping validations, but calling callbacks. I just don't know Rails enough to come up with a better name. But otherwise, it's a good take.
I've selected code that I considered to be hard to split into smaller methods. It had high level of abstraction, which increased the difficulty level.
So, Is it always a good idea to split a long method into smaller ones? The conclusion I've made is that it depends on your refactoring skills and tenacity. I could have stopped after my first two attempts (take 1 is actually my 3rd attempt) and declared it impossible to split #touch
, because the first two attempts were bad.
All three takes described in this post are fitting as a replacement of the original code. Take 1 is better than the original code because it allows to understand what's going on faster, and provides the same level of understanding of the implementation. Take 2 is better than the original code because it highlights hidden concepts that the reader would have to take more time to get otherwise. And finally, take 3 is better than the original code because it uses domain-level concepts, and that makes #touch
much easier to reason about.
I would also like to note that doing just extract methods as the original question seems to imply, is very limiting. Refactoring is much more than just extracting methods.
The post Is it always a good idea to split long methods into smaller ones? An experiment. first appeared on Ruby clarity.
]]>The post acts_as_list refactoring part 3 first appeared on Ruby clarity.
]]>You don't need to read part 2 and part 1 to understand this article.
acts_as_list is a Rails gem. It allows you to treat Rails model records as part of an ordered list and offers methods like #move_to_bottom
and #move_higher
.
.update_all_with_touch
method caught my attention as it's a somewhat long (11 lines) and hairy method. This method executes passed SQL, as Rails' #update_all
does, but also updates standard timestamps like updated_at
.
define_singleton_method :update_all_with_touch do |updates| record = new attrs = record.send(:timestamp_attributes_for_update_in_model) now = record.send(:current_time_from_proper_timezone) attrs.each do |attr| updates << ", #{connection.quote_column_name(attr)} = #{connection.quote(connection.quoted_date(now))}" end update_all(updates) end
Let's have a look. On the line 2 (see above ↑) it creates a new model instance (acts_as_list is supposed to extend ActiveRecord
models, so naturally new
would create one). Then it sends two messages to the created model instance record
via #send
. The reason it uses #send
is because both those methods are private, so it can't just say record.timestamp_attributes_for_update_in_model
. Now, this is some cognitive load, because every time I read these lines I can't help think of why #send
has to be used. But I'll get to it later, let's look at the rest of the method now.
One the lines 6-10 (see above ↑), SQL is built and appended to updates
argument, modifying it. Each of the timestamp_attributes_for_update_in_model
is updated with current time. And after the SQL was built, it's executed with Rails' standard #update_all
.
So, this method does two things - build SQL and execute it. And most of the method is taken up by building SQL.
Is anything wrong with this method? For my taste, it's too hairy, and it goes into too much detail about details of building SQL. So, the first thing I want to do is to go to a higher level of abstraction on building SQL:
define_singleton_method :update_all_with_touch do |updates| update_all(updates << touch_record_sql) end private define_singleton_method :touch_record_sql do record = new attrs = record.send(:timestamp_attributes_for_update_in_model) now = record.send(:current_time_from_proper_timezone) updates = "" attrs.each do |attr| updates << ", #{connection.quote_column_name(attr)} = #{connection.quote(connection.quoted_date(now))}" end updates end
The result on the line 2 (see above ↑) allows us to grasp what's going on much faster. updates
is being appended with SQL of touch_record_sql
. It took me more than two pomodoro to figure out a decent name for the method. I've tried many, including update_standard_timestamps_to_current_time_sql
(if only it wasn't that long). I prefer touch_record_sql
because it uses a well known term touch, which is inherited from Unix touch(1)
command and Rails' #touch
. Touch means update appropriate timestamps.
I've copied the code from above for easier reference:
define_singleton_method :update_all_with_touch do |updates| update_all(updates << touch_record_sql) end private define_singleton_method :touch_record_sql do record = new attrs = record.send(:timestamp_attributes_for_update_in_model) now = record.send(:current_time_from_proper_timezone) updates = "" attrs.each do |attr| updates << ", #{connection.quote_column_name(attr)} = #{connection.quote(connection.quoted_date(now))}" end updates end
On the lines 12-17 (see above ↑), updates
variable we inherited from .update_all_with_touch
method, doesn't explain what's going on well enough. It may mean updates we want to do to db records, but it's far from being obvious. It's not a bad name, but I prefer sql
, to be in tune with the method's name touch_record_sql
:
define_singleton_method :touch_record_sql do record = new attrs = record.send(:timestamp_attributes_for_update_in_model) now = record.send(:current_time_from_proper_timezone) sql = "" attrs.each do |attr| sql << ", #{connection.quote_column_name(attr)} = #{connection.quote(connection.quoted_date(now))}" end sql end
One the lines 6-11 (see above ↑) #each
loops over timestamp attributes and collects SQL fragments into sql
variable. It's a typical misuse of #each
and it could be replaced with #map(...).join(", ")
if we didn't need the leading ,
. In this case, #each
can be replaced with #inject
:
define_singleton_method :touch_record_sql do record = new attrs = record.send(:timestamp_attributes_for_update_in_model) now = record.send(:current_time_from_proper_timezone) attrs.inject("") do |sql, attr| sql << ", #{connection.quote_column_name(attr)} = #{connection.quote(connection.quoted_date(now))}" end end
As I mentioned previously, #send
is used here to run private methods on a model instance (see the lines 3-4 above ↑). And, it incurs cognitive load, because you have to wonder why #send
is used here. So, I chose to move this code to an instance method:
define_singleton_method :touch_record_sql do new.touch_record_sql end ... define_method :touch_record_sql do connection = self.class.connection attrs = timestamp_attributes_for_update_in_model now = current_time_from_proper_timezone attrs.inject("") do |sql, attr| sql << ", #{connection.quote_column_name(attr)} = #{connection.quote(connection.quoted_date(now))}" end end
The line 2 (see above ↑) raises a question though: Why do we need to create model instance to build SQL?. Oh well, it's not perfect.
On the line 8 (see above ↑) I had to add connection
variable, so even though record = new
is no longer needed, we've not reduced the number of lines. But #send
is gone! (see the lines 9-10 above ↑).
On the line 9 (see above ↑) there's a variable attrs
that is used on the line 12 only. And, since we no longer have #send
, i.e. the right side of the variable assignment isn't hairy, we can just do away with it. After all, what new does attrs
tells us that timestamp_attributes_for_update_in_model
does not?
define_method :touch_record_sql do connection = self.class.connection now = current_time_from_proper_timezone timestamp_attributes_for_update_in_model.inject("") do |sql, attr| sql << ", #{connection.quote_column_name(attr)} = #{connection.quote(connection.quoted_date(now))}" end end
On the line 3 (see above ↑), there's now
variable that is only used in one place, at the line 6. It could be said that there's a performance benefit to keeping current_time_from_proper_timezone
call out of the #inject
loop, but it also could be said that it's premature optimisation.
It looks like a stalemate, but thankfully, there's another angle we can use here - readability. From readability perspective, having now
out of the loop clarifies that the now
value doesn't depend on the loop.
But on the line 6 there's also some code that doesn't depend on the loop - #{connection.quote(connection.quoted_date(now))}
, and it's hard to reason about two parts of the value that doesn't depend on the loop. So, I'm still going to inline now
, and see how it goes:
define_method :touch_record_sql do connection = self.class.connection timestamp_attributes_for_update_in_model.inject("") do |sql, attr| sql << ", #{connection.quote_column_name(attr)} = #{connection.quote(connection.quoted_date(current_time_from_proper_timezone))}" end end
Have to say, I like less lines here, but the right value in the SQL assignment doesn't depend on loop values, so it should be moved out of the loop for clarity:
define_method :touch_record_sql do connection = self.class.connection quoted_now = connection.quote(connection.quoted_date( current_time_from_proper_timezone)) timestamp_attributes_for_update_in_model.inject("") do |sql, attr| sql << ", #{connection.quote_column_name(attr)} = #{quoted_now}" end end
At this point, when looking at the lines 3-4 (see above ↑) I'm asking myself why not extract quoted_now
into a method. I did just that:
define_method :touch_record_sql do connection = self.class.connection quoted_now = quoted_current_time_from_proper_timezone timestamp_attributes_for_update_in_model.inject("") do |sql, attr| sql << ", #{connection.quote_column_name(attr)} = #{quoted_now}" end end private def quoted_current_time_from_proper_timezone self.class.connection.quote(self.class.connection.quoted_date( current_time_from_proper_timezone)) end
On the lines 3-7 (see above ↑) we can see that quoted_now
is used only on the line 6, and a question arises Why not inline it and live happily ever after?. We already discussed that quoted_now
must be the same for all its uses within SQL, but I've failed to encode this knowledge into words. So, I'm going to use a variable name that clearly explains that - cached_quoted_now
:
define_method :touch_record_sql do connection = self.class.connection cached_quoted_now = quoted_current_time_from_proper_timezone timestamp_attributes_for_update_in_model.inject("") do |sql, attr| sql << ", #{connection.quote_column_name(attr)} = #{cached_quoted_now}" end end
I quite like the result. Have to say, I first got the idea of using a different name for quoted_now
when I imagined that the extracted method would be called #quoted_now
, so I had to invent a new name for the variable as quoted_now = self.quoted_now
would suck.
On the line 2 (see above ↑) you can see a remnant of the beginning of this refactoring, connection
variable. Now that it's used only once on the line 6, it can be inlined. So, typing self.class.connection
sucks, so why not have #connection
method? I'd say that it goes agains the Single Responsibility Principle, but convenience trumps it in this case!
define_method :touch_record_sql do cached_quoted_now = quoted_current_time_from_proper_timezone timestamp_attributes_for_update_in_model.inject("") do |sql, attr| sql << ", #{connection.quote_column_name(attr)} = #{cached_quoted_now}" end end private delegate :connection, to: self
Not that bad.
I got a suggestion from reader Alex Piechowski that #map
and #join
can be used here, instead of #inject
:
define_method :touch_record_sql do cached_quoted_now = quoted_current_time_from_proper_timezone timestamp_attributes_for_update_in_model.map do |attr| ", #{connection.quote_column_name(attr)} = #{cached_quoted_now}" end.join end
Thank you, Alex!
I planned on doing more stuff in this refactoring, but it's quite a lot as it is. And most impressive to me, it all came from refactoring a single method. Just how much can you draw from a single method? Turns out, quite a lot.
Happy hacking!
P.S. my PR was accepted by acts_as_list project!
The post acts_as_list refactoring part 3 first appeared on Ruby clarity.
]]>The post acts_as_list refactoring part 2 first appeared on Ruby clarity.
]]>As you might remember, I've split .acts_as_list
method into several modules, each module dedicated to an option passed to the method. E.g. ColumnMethodDefiner
module defines methods related to the column
option (the option defines column name for storing record's list position).
This post is dedicated to refactoring of the ColumnMethodDefiner
module.
So, I've extracted code related to column
option of .acts_as_list
to ColumnMethodDefiner
. Here's an excerpt:
module ActiveRecord::Acts::List::ColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, column) | |
caller_class.class_eval do | |
attr_reader :position_changed | |
define_method :position_column do | |
column | |
end | |
define_method :"#{column}=" do |position| | |
write_attribute(column, position) | |
@position_changed = true | |
end | |
# only add to attr_accessible | |
# if the class has some mass_assignment_protection | |
if defined?(accessible_attributes) and !accessible_attributes.blank? | |
attr_accessible :"#{column}" | |
end | |
... |
Line 7 (see above ↑) references column
, but what column is that? Line 6 hints that we're talking about position column, i.e. column
means "name of the column that holds record's position in the list". I.e. position_column_name
. Unfortunately, it's too hard to read, so I opted for position_column
, which is easier to read:
module ActiveRecord::Acts::List::ColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
caller_class.class_eval do | |
attr_reader :position_changed | |
define_method :position_column do | |
position_column | |
end | |
define_method :"#{position_column}=" do |position| | |
write_attribute(position_column, position) | |
@position_changed = true | |
end | |
# only add to attr_accessible | |
# if the class has some mass_assignment_protection | |
if defined?(accessible_attributes) and !accessible_attributes.blank? | |
attr_accessible :"#{position_column}" | |
end | |
... |
I like that the method defined on the line 6 (see above ↑) has the same name as #position_column
method. Earlier, we had to reason as to why column
argument and #position_column
method contained the same data, were named differently. But no more! One concept less!
At this point, ColumnMethodDefiner
module's mission is to define methods related to position_column
, but the module is named as if it works with just Column
. It is inconsistent, so I'm going to rename it to PositionColumnMethodDefiner
:
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
caller_class = self | |
PositionColumnMethodDefiner.call(caller_class, column) | |
... |
On the line 4 (see above ↑), we still use column
argument though, but from the module name, we can infer that we talk about position column.
I would have liked to deprecate the column
argument and introduce position_column
to replace it, but that would be changing functionality, and refactoring is all about restructuring code and keeping functionality intact.
PositionColumnMethodDefiner.call
is 46 lines long and starts with defining some instance methods:
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
caller_class.class_eval do | |
attr_reader :position_changed | |
define_method :position_column do | |
position_column | |
end | |
define_method :"#{position_column}=" do |position| | |
write_attribute(position_column, position) | |
@position_changed = true | |
end | |
... |
Since the method is too long, I'm going to extract #define_instance_methods
:
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
define_instance_methods(caller_class, position_column) | |
caller_class.class_eval do | |
... | |
end | |
private | |
def self.define_instance_methods(caller_class, position_column) | |
caller_class.class_eval do | |
attr_reader :position_changed | |
... |
Because in part 1 I've chosen to extract stuff related to position column to a separate module, I can now extract methods from .call
method and not be afraid to pollute the namespace (as opposed to a single module for all .acts_as_list
options).
An interesting thing to note is that line 3 (see above ↑) doesn't need to be inside .class_eval
block that starts on line 5. At first, I made a mistake of putting the .define_instance_methods
method call inside the block, and it led to a problem. The problem was that inside .class_eval
block, self
points not to the PositionColumnMethodDefiner
module, and I had to do a hack to call .define_instance_methods
. It was ugly! Feast your eyes on this:
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
SELF = self | |
def self.call(caller_class, position_column) | |
caller_class.class_eval do | |
SELF.define_instance_methods(caller_class, position_column) | |
... | |
end |
Yuck!
Starting at the line 12 (see below ↓), there are several class methods defined via #define_singleton_method
:
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
define_instance_methods(caller_class, position_column) | |
caller_class.class_eval do | |
# only add to attr_accessible | |
# if the class has some mass_assignment_protection | |
if defined?(accessible_attributes) and !accessible_attributes.blank? | |
attr_accessible :"#{position_column}" | |
end | |
define_singleton_method :quoted_position_column do | |
@_quoted_position_column ||= connection.quote_column_name(position_column) | |
end | |
... |
I'm going to extract those class method definitions into a method:
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
define_class_methods(caller_class, position_column) | |
define_instance_methods(caller_class, position_column) | |
... |
It was my first time encountering #define_singleton_method
, and the docs didn't explain it well: "Defines a singleton method in the receiver". WTF is a singleton method? I know the singleton pattern, but that doesn't make any sense here.
It turns out, a singleton method is a method defined on an object instance. A class, for example, Object
class, is an instance of class Class
, so a class method foo
on Object
(Object.foo
) is a singleton method too. As well as a method defined on a string:
s = "abc" s.define_singleton_method :foo s.foo
So, in Ruby def self.foo
method is a class method, and at the same time, a singleton method.
If you feel like diving into this a bit more, there's a great article Ways to Define Singleton Methods in Ruby.
After I've extracted class and instance method definition we're left with adding position_column
as an accessible attribute on line 10 (see below ↓). attr_accessible
allows to specify a white list of model attributes that can be set via mass-assignment.
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
define_class_methods(caller_class, position_column) | |
define_instance_methods(caller_class, position_column) | |
caller_class.class_eval do | |
# only add to attr_accessible | |
# if the class has some mass_assignment_protection | |
if defined?(accessible_attributes) and !accessible_attributes.blank? | |
attr_accessible :"#{position_column}" | |
end | |
end | |
end | |
... |
At the line 10 (see above ↑), position_column
is interpolated and then converted to a Symbol
. We can do away with the interpolation here (see the line 10 below ↓):
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
define_class_methods(caller_class, position_column) | |
define_instance_methods(caller_class, position_column) | |
caller_class.class_eval do | |
# only add to attr_accessible | |
# if the class has some mass_assignment_protection | |
if defined?(accessible_attributes) and !accessible_attributes.blank? | |
attr_accessible position_column.to_sym | |
end | |
end | |
end | |
... |
One of the worst things you can find in code is comments, and I hate them with passion. Sometimes you can't help but have comments, sometimes it's a necessary evil, but not in this case. On the lines 7-8 (see above ↑) the comments explain that we only protect position_column
from mass-assignment if the user already uses mass-assignment protection. Can we say the same thing without comments? Absolutely!
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
define_class_methods(caller_class, position_column) | |
define_instance_methods(caller_class, position_column) | |
if mass_assignment_protection_was_used_by_user?(caller_class) | |
caller_class.class_eval do | |
attr_accessible position_column.to_sym | |
end | |
end | |
end | |
... |
So, instead of a long conditional, we have a method call .mass_assignment_protection_was_used_by_user?
, that is much easier to understand and is at the right level of abstraction.
However, lines 7-9 (see above ↑) are still at the wrong level of abstraction, so I'm going to extract them into a method:
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
define_class_methods(caller_class, position_column) | |
define_instance_methods(caller_class, position_column) | |
if mass_assignment_protection_was_used_by_user?(caller_class) | |
protect_attributes_from_mass_assignment(caller_class, position_column) | |
end | |
end | |
... |
So, I've extracted protecting position_column
attribute into .protect_attributes_from_mass_assignment
method (see line 7 above ↑).
I feel it reads much better without any comments now.
Let's see whether the code that I've extracted can be improved:
def self.mass_assignment_protection_was_used_by_user?(caller_class) | |
caller_class.class_eval do | |
defined?(accessible_attributes) and !accessible_attributes.blank? | |
end | |
end |
On the line 3 (see above ↑) we check whether accessible_attributes
is defined. But what is accessible_attributes
? It seems that it's an undocumented Rails method.
In Rails 2.3.8 accessible_attributes
used to reference attr_accessible
attribute (used to store those attributes that allow mass assignment). In Rails 4, attr_accessible
was removed in favour of strong parameters and thus, would no longer be defined.
This explains why accessible_attributes
may not be defined, and I will not dive deeper into undocumented Rails stuff.
defined?(accessible_attributes)
returns a truthful value if . accessible_attributes
is defined. However, it would also return a truthful value if a variable named accessible_attributes
was defined. It's not very likely that such variable would be defined, but for somebody reading it thoroughly, it makes code harder to understand. "Did the author really mean that accessible_attributes
variable counts as mass protection defined?". Thus, it's better to replace defined?
with #respond_to?
:
def self.mass_assignment_protection_was_used_by_user?(caller_class) | |
caller_class.class_eval do | |
respond_to?(:accessible_attributes) and !accessible_attributes.blank? | |
end | |
end |
In this way, it's clear that we're looking for a method .accessible_attributes
, and there are no further questions.
But we're not done with the .mass_assignment_protection_was_used_by_user?
method yet. On the line 3 (see above ↑) we check whether accessible_attributes
is not #blank?
. It's probably always better to avoid using negation. In this case, we can use #present?
:
def self.mass_assignment_protection_was_used_by_user?(caller_class) | |
caller_class.class_eval do | |
respond_to?(:accessible_attributes) and accessible_attributes.present? | |
end | |
end |
Now I'm happy with the method.
To remind you what the state of .call
method is:
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
define_class_methods(caller_class, position_column) | |
define_instance_methods(caller_class, position_column) | |
if mass_assignment_protection_was_used_by_user?(caller_class) | |
protect_attributes_from_mass_assignment(caller_class, position_column) | |
end | |
end | |
... |
We are passing caller_class
to each method call here. We could define a class instance variable and reference it in class methods later:
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
@caller_class = caller_class | |
define_class_methods(position_column) | |
define_instance_methods(position_column) | |
if mass_assignment_protection_was_used_by_user? | |
protect_attributes_from_mass_assignment(position_column) | |
end | |
end | |
... | |
def self.mass_assignment_protection_was_used_by_user?(caller_class) | |
@caller_class.class_eval do | |
respond_to?(:accessible_attributes) and accessible_attributes.present? | |
end | |
end | |
... |
Voila! Reads much better!
But alas, using a class instance variable is not thread safe :(
I have two choices here:
Long story short, I've refactored to this:
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
Definer.new(caller_class, position_column).call | |
end | |
class Definer | |
def initialize(caller_class, position_column) | |
@caller_class, @position_column = caller_class, position_column | |
end | |
def call | |
define_class_methods(@position_column) | |
define_instance_methods(@position_column) | |
if mass_assignment_protection_was_used_by_user? | |
protect_attributes_from_mass_assignment(@position_column) | |
end | |
end | |
... |
And, I can't stand it. The cure is worse than the disease. In the #call
method (see the lines 12-17 above ↑) I'm passing an instance variable @position_column
as a method argument. It's awful, but it's that or I have to say something like position_column = @ position_column
for the variable to be picked up by a #class_eval
block. Neither of the options are good. So, it's a no-go.
So, I've refactored to use a thread variable:
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
self.caller_class = caller_class | |
define_class_methods(position_column) | |
define_instance_methods(position_column) | |
if mass_assignment_protection_was_used_by_user? | |
protect_attributes_from_mass_assignment(position_column) | |
end | |
end | |
private | |
def self.caller_class=(value) | |
Thread.current.thread_variable_set :acts_as_list_caller_class, value | |
end | |
def self.caller_class | |
Thread.current.thread_variable_get :acts_as_list_caller_class | |
end | |
def self.define_class_methods(position_column) | |
caller_class.class_eval do | |
define_singleton_method :quoted_position_column do | |
@_quoted_position_column ||= connection.quote_column_name(position_column) | |
end | |
... |
Much better than service object, but the cognitive load is there. It's just far from being standard to say self.caller_class = caller_class
. And thread variable instead of just another method argument? That takes much more thinking. "Why a thread variable?", "What does self.caller_class = caller_class
assignment mean?". It's a no-go either.
So, in the end I was unable to improve on this:
module ActiveRecord::Acts::List::PositionColumnMethodDefiner #:nodoc: | |
def self.call(caller_class, position_column) | |
define_class_methods(caller_class, position_column) | |
define_instance_methods(caller_class, position_column) | |
if mass_assignment_protection_was_used_by_user?(caller_class) | |
protect_attributes_from_mass_assignment(caller_class, position_column) | |
end | |
end | |
... |
Can you think of a way to improve it?
In part 3 I'll dive into methods defined with #define_singleton_method
in .define_class_methods
. Some of them use class instance variables, so they may not be thread safe. I'm looking forward to finding out.
That's all for today, and, happy hacking!
P.S. my PR was accepted by acts_as_list project!
The post acts_as_list refactoring part 2 first appeared on Ruby clarity.
]]>The post acts_as_list refactoring part 1 first appeared on Ruby clarity.
]]>#move_to_bottom
and #move_higher
.
.acts_as_list
is available as a class method in ActiveRecord::Base
when acts_as_list
gem is loaded. Here's an excerpt from .acts_as_list
definition:
module ClassMethods | |
def acts_as_list(options = {}) | |
configuration = { column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom} | |
configuration.update(options) if options.is_a?(Hash) | |
... |
Using ClassMethods
module is customary in Rails, but it's not a requirement to be familiar with it to read this article. All you need to know is that .acts_as_list
is a class method when used in a Rails model.
As you can see on the line 3 above ↑, there are 4 options that can be passed to .acts_as_list
:
column
: db column to store position in the list.scope
: restricts what is to be considered a list. For example, enabled = true
SQL could be used as scope
, to limit list items to those that are enabled.top_of_list
: a number the first element of the list will have as position.add_new_at
: specifies whether new items get added to the :top or :bottom of the list.Options are passed as options
argument, and a hash is expected (see the line 2 above ↑). Then, the default configuration
hash is updated with the passed options
on the line 4, thus overriding defaults with the passed options.
The problem here is that the caller can make mistakes:
Hash
, but something else:acts_as_list :column acts_as_list 1
:columm
instead of :column
):acts_as_list columm: "order"
In both cases, .acts_as_list
will fail silently, leaving the user to figure out what went wrong by themselves.
Using Ruby 2 keyword arguments solves both described problems:
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
configuration = { column: column, scope: scope, top_of_list: top_of_list, add_new_at: add_new_at } |
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
configuration = { column: column, scope: scope, top_of_list: top_of_list, add_new_at: add_new_at } | |
if configuration[:scope].is_a?(Symbol) && configuration[:scope].to_s !~ /_id$/ | |
configuration[:scope] = :"#{configuration[:scope]}_id" | |
end |
Using configuration
variable after using keyword arguments does look confusing, and it's so much shorter without it:
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
if scope.is_a?(Symbol) && scope.to_s !~ /_id$/ | |
scope = :"#{scope}_id" | |
end |
I realise that it puts cognitive load on us, to figure out that scope
is part of configuration, but if the method is short (and currently it's not short), it'll be ok. Meanwhile, I'll enjoy shorter names :)
As you can see on the line 3 above ↑, _id
suffix is added to scope
. The problem with this line is twofold:
I thought of extracting that into a method (thus, solving the 1st problem), but fortunately, I guessed that there must be a method out there doing that already. And indeed, there is: ActiveSupport::Inflector.foreign_key. So, I'm going to use it:
module ClassMethods | |
include ActiveSupport::Inflector | |
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
if scope.is_a?(Symbol) && scope.to_s !~ /_id$/ | |
scope = foreign_key(scope).to_sym | |
end |
The #foreign_key
method fits perfectly here, because, scope
is described in the comments as Given a symbol, it'll attach _id and use that as the foreign key restriction. Not only it's a standard solution, the story it tells, fits well into what .acts_as_list
does.
As you can see on the line 2 above ↑, I've chosen to include ActiveSupport::Inflector
into ClassMethods
, thus polluting all classes ClassMethods
will be extending. But this is temporary, and I'll figure out later, how to fix that.
On the lines 5-7 (see above ↑), we add _id
suffix to scope
if it's a Symbol
and doesn't end with _id
already. This code is ripe for extracting a method:
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
scope = idify(scope) if scope.is_a?(Symbol) | |
.... | |
end | |
def idify(name) | |
return name.to_sym if name.to_s =~ /_id$/ | |
foreign_key(name).to_sym | |
end |
On the line 2 (see above ↑) you can see that I haven't extracted the check of whether scope
is a Symbol
. I believe, it would be less readable to have just scope = idify(scope)
as it'd look like we add _id
suffix always. But this is not the case, the suffix is added only for symbols (strings are left untouched).
However, there's one problem with this setup. Having #idify
in the module ClassMethods
pollutes namespace of ActiveRecord::Base
.
At this stage, .acts_as_list
method is 118 lines long. Here's a short snippet:
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
scope = idify(scope) if scope.is_a?(Symbol) | |
caller_class = self | |
class_eval do | |
define_singleton_method :acts_as_list_top do | |
top_of_list.to_i | |
end | |
define_method :acts_as_list_top do | |
top_of_list.to_i | |
end | |
define_method :acts_as_list_class do | |
caller_class | |
end | |
... |
The code in .acts_as_list
defines methods and Rails callbacks, related to column
, scope
, top_of_list
, add_new_at
arguments. It seems like a good idea to group code by those arguments, putting scope
-related stuff into one place and column
-related, into some other place.
I see 3 approaches to split .acts_as_list
, and I'm going to describe them below.
To avoid polluting ClassMethods
namespace, add a module AuxMethods
and split .acts_as_list
into multiple methods. It'd look something like this:
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
caller_class = self | |
AuxMethods.define_column_methods(caller_class, column) | |
AuxMethods.define_scope_methods(caller_class, scope) | |
AuxMethods.define_top_of_list_methods(caller_class, top_of_list) | |
AuxMethods.define_add_new_at_methods(caller_class, att_new_at) | |
... |
The problem with this approach is that methods names aren't very readable. Also, since we can't include AuxMethods
to ClassMethods
, we can't get rid of AuxMethods.
prefix. And it doesn't read that well too.
A service object could look like this:
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
caller_class = self | |
definer = MethodDefiner.new(caller_class, column, scope, top_of_list, add_new_at) | |
definer.define_column_methods | |
definer.define_scope_methods | |
definer.define_top_of_list_methods | |
definer.define_add_new_at_methods | |
... |
I think, it's even worse than the approach 1. It looks like the methods that are defined when #define_column_methods
is called, are defined on the definer
object. And, it's one line longer.
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
caller_class = self | |
ColumnMethodDefiner.call(caller_class, column) | |
ScopeMethodDefiner.call(caller_class, scope) | |
TopOfListMethodDefiner.call(caller_class, top_of_list) | |
AddNewAtMethodDefiner.call(caller_class, add_new_at) | |
... |
This is my favourite of the three, because:
ColumnMethodDefiner
..call
methods only take the arguments the modules need (better than the approach 2).After the module extraction (I chose the approach 3), .acts_as_list
looks like this:
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
caller_class = self | |
ColumnMethodDefiner.call(caller_class, column) | |
ScopeMethodDefiner.call(caller_class, scope) | |
TopOfListMethodDefiner.call(caller_class, top_of_list) | |
AddNewAtMethodDefiner.call(caller_class, add_new_at) | |
class_eval do | |
define_method :acts_as_list_class do | |
caller_class | |
end | |
end | |
before_validation :check_top_position | |
before_destroy :lock! | |
after_destroy :decrement_positions_on_lower_items | |
before_update :check_scope | |
after_update :update_positions | |
after_commit :clear_scope_changed | |
if add_new_at.present? | |
before_create "add_to_list_#{add_new_at}".to_sym | |
end | |
include ::ActiveRecord::Acts::List::InstanceMethods | |
end |
So, instead of 118 lines, .acts_as_list
is 30 lines now, and fits into a page.
Exactly because I have reduced the number of lines, I can now pay more attention to what's left. And, on the line 9 (see above ↑) there's a redundant .class_eval
call. This call changes execution context from self
to, well, self
. That's why it's redundant. After removal, we get (see the lines 9-11 below ↓):
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
caller_class = self | |
ColumnMethodDefiner.call(caller_class, column) | |
ScopeMethodDefiner.call(caller_class, scope) | |
TopOfListMethodDefiner.call(caller_class, top_of_list) | |
AddNewAtMethodDefiner.call(caller_class, add_new_at) | |
define_method :acts_as_list_class do | |
caller_class | |
end | |
before_validation :check_top_position | |
before_destroy :lock! | |
after_destroy :decrement_positions_on_lower_items | |
before_update :check_scope | |
after_update :update_positions | |
after_commit :clear_scope_changed | |
if add_new_at.present? | |
before_create "add_to_list_#{add_new_at}".to_sym | |
end | |
include ::ActiveRecord::Acts::List::InstanceMethods | |
end |
On the lines 13-25 (see above ↑), there are lots of Rails callbacks created. I've already added ColumnMethodDefiner.call
, etc, so having callback code here breaks Single Level of Abstraction. I've extracted the Rails callbacks into a separate module (see the line 13 below ↓):
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
caller_class = self | |
ColumnMethodDefiner.call(caller_class, column) | |
ScopeMethodDefiner.call(caller_class, scope) | |
TopOfListMethodDefiner.call(caller_class, top_of_list) | |
AddNewAtMethodDefiner.call(caller_class, add_new_at) | |
define_method :acts_as_list_class do | |
caller_class | |
end | |
CallbackDefiner.call(caller_class, add_new_at) | |
include ::ActiveRecord::Acts::List::InstanceMethods | |
end |
If Rails callbacks break Single Level of Abstraction, doesn't code on the lines 9-11 (see above ↑) break it too? It does. Because it's so small, it seems that there's no harm in having it there as it is, but I don't really care to read that #acts_as_list_class
is added, I'd rather read a high-leveled description of what kind of functionality it provides.
So, I've looked up the rest of the code and, #acts_as_list_class
is just used internally by the gem. So, it's an auxiliary method. I've extracted it into its own module (see the line 9 below ↓):
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
caller_class = self | |
ColumnMethodDefiner.call(caller_class, column) | |
ScopeMethodDefiner.call(caller_class, scope) | |
TopOfListMethodDefiner.call(caller_class, top_of_list) | |
AddNewAtMethodDefiner.call(caller_class, add_new_at) | |
AuxMethodDefiner.call(caller_class) | |
CallbackDefiner.call(caller_class, add_new_at) | |
include ::ActiveRecord::Acts::List::InstanceMethods | |
end |
I could possibly treat definers as plugins and load them with:
def acts_as_list(column: "position", scope: "1 = 1", top_of_list: 1, add_new_at: :bottom) | |
load_definer_plugins dir: "definers" | |
include ::ActiveRecord::Acts::List::InstanceMethods | |
end |
But I think it'd be an overkill. My main argument against that is that these modules aren't really plugins. If there was a standard way to add plugins in Ruby, that might have been plausible, but adding an ad-hoc plugin system would only make things more complicated. And instead of reading a number of .call
s, reader would have to figure out the plugin system. A no-go.
So, that's the best I can do with this method (as per step 4.5).
In part 2 I'll reap the consequences of choosing the approach 3 to split .acts_as_list
into modules, and will refactor one of those modules. I've already started on that, so I can say that it's interesting to see how the choice to use a separate module allowed to further improve the code by extracting methods. Single Responsibility Principle isn't there for nothing after all :)
If you want to know when the part 2 is out, sign up for my email list.
Happy hacking!
The post acts_as_list refactoring part 1 first appeared on Ruby clarity.
]]>