Saturday, May 9, 2015

Django Signals Performance Implications

Django signals are cool and very useful sometimes. They allow certain receivers get notified when certain actions occur elsewhere in the framework. For example, if we have registered a receiver function save_contribution() to listen on the post_save signal, when an article is saved, the save_contribution() will be notified and executed without explicitly calling it in the articles module. Cool, right?

However, the performance implications regarding the Django signals can be easily overlooked. An optimization I did recently reduced the number of unnecessary signals being fired from 3525 to 36 in one of our applications, which significantly speed up the sites.

Take a look at the post_init signal below:

def your_awesome_function(...):
    ...

post_init.connect(your_awesome_function)    # bad

This is bad unless you really want your_awesome_function to be called every time an object is created. What's the problem here? Well, this signal doesn't specify the Sender. As a result,  if you have a large application as we do, say, your app has 10000 objects, the function will be called 10000 times, faithfully!

post_init.connect(your_awesome_function, sender=YourModel)  # better

Now, it's better. It only gets fired when an instance of YourModel is created.

There are other cases that signals can be avoided altogether. For instance, I found a couple of unnecessary post_init signals in our applications that can be easily replaced with the functions in the base class because they really mean only for the subclasses of that base class.  

Keep in mind that signals are asynchronous, or blocking. For the performance consideration, the rule of thumb is:

1) Specify the Sender if possible.

2) Eliminate unnecessary signals.

Saturday, January 24, 2015

Using django-pyodbc to Connect to MS SQL Server Database on Ubuntu


Django doesn't officially support Microsoft SQL Server. To connect to SQL Server in your Django projects, you can use django-pyodbc as the DB backend.

1. Install required packages on your Ubuntu server:

sudo apt-get install unixodbc unixodbc-dev freetds-dev tdsodbc

In the virtualenv of your django project, install pyodbc and django-pyodbc:

pip install --allow-external pyodbc --allow-unverified pyodbc pyodbc
pip install django-pyodbc 
 
2. Edit /etc/freetds/freetds.conf to update the server for SQL Server .

# A typical Microsoft server
[sqlserver]
        host = <server ip>
        port = 1433
        tds version = 7.0


3. Set up FreeTDS driver in etc/odbcinst.ini (Run odbcinst -j to check where the configuration files are located and make sure the paths for the Driver and Setup are valid):

[FreeTDS]
Description = TDS driver (Sybase/MS SQL)
Driver = /usr/lib/x86_64-linux-gnu/odbc/libtdsodbc.so
Setup = /usr/lib/x86_64-linux-gnu/odbc/libtdsS.so
CPTimeout =
CPReuse =
FileUsage = 1

4. Update the DATABASES setting for your django project:

DATABASES = { 
     'default': {
         'ENGINE': "django_pyodbc", 
         'HOST': "sqlserver", 
         'USER': "mssql_user", 
         'PASSWORD': "mssql_password", 
         'NAME': "database_name", 
         'OPTIONS': {
             'host_is_server': False, 
             'driver': 'FreeTDS', 
         }, 
     } 
}

5. Test your connection and celebrate.