SQL Performance Tuning Interview Questions

  • Performance Monitor\System Monitor - Tool to capture macro level performance metrics.
  • Profiler - Tool to capture micro level performance metrics based on the statements issued by a login, against a database or from host name.
  • Server Side Trace - System objects to write the detailed statement metrics to a table or file, similar to Profiler.
  • Dynamic Management Views and Functions - SQL Server objects with low level metrics to provide insight into a specific portion of SQL Server i.e. the database engine, query plans, Service Broker, etc.
  • Management Studio's Built-In Performance Reports - Ability to capture point in time metrics as pre-defined by Microsoft.
  • Custom scripts - Custom scripts can be developed to monitor performance, determine IO usage, monitor fragmentation, etc. all in an effort to improve performance.
  • Third party applications - Performance monitoring and tuning applications from vendors in the SQL Server community.
2.Explain how the hardware running SQL Server can help or hinder performance.
  • Taxed CPUs will queue requests and hinder query performance.
  • Insufficient memory could cause paging resulting in slow query performance.
  • Incorrectly configured disk drives could exacerbate IO problems.
3.Why is it important to avoid functions in the WHERE clause?
Because SQL Server will scan the index or the table as opposed to seeking the data.  The scan operation is a much more costly operation than a seek.
Often a slightly different approach can be used to prevent using the function in the WHERE clause yielding a favorable query plan and high performance.

4.How is it possible to capture the IO and time statistics for your queries?
Use the SET STATISTICS IO and SET STATISTICS TIME settings in your queries or enable the settings in your Management Studio session.

5.True or False - It is possible to correlate the Performance Monitor metrics with Profiler data in a single SQL Server native product?
True - This functionality is possible with SQL Server Profiler.

6.How can I/O statistics be gathered and reviewed for individual database files?
By using the fn_virtualfilestats function to capture the metrics.This process can be automated with a script to determine the file usage with numerous samples.

7.What is a query plan and what is the value from a performance tuning perspective?
  • A query plan is the physical break down of the code being passed to the SQL Server optimizer.
  • The value from a performance tuning perspective is that each component of the query can be understood and the percentage of resource utilization can be determined at a micro level.  As query tuning is being conducted, the detailed metrics can be reviewed to compare the individual coding techniques to determine the best alternative.
8.True or False - It is always beneficial to configure TempDB with an equal number of fixed sized files as the number of CPU cores.
  • False - With alwaysbeing the operative word in the question.
  • Depending on the version of SQL Server, the disk subsystem, load, queries, etc., a to  ratio of files to cores may be necessary on high end SQL Servers with intense processing.
  • If you do not have that luxury, a starting point may to be have half the number of tempdb files as compared to CPU cores.
  • This is a configuration to load test and monitor closely depending on the type of processing, load, hardware, etc. that your SQL Server is expected to support.
9.Explain the NOLOCK optimizer hint and some pros\cons of using the hint.
  • The NOLOCK query hint allows SQL Server to ignore the normal locks that are placed and held for a transaction allowing the query to complete without having to wait for the first transaction to finish and therefore release the locks.
  • This is one short term fix to help prevent locking, blocking or deadlocks.
  • However, when the NOLOCK hint is used, dirty data is read which can compromise the results returned to the user.
10.Explain three different approaches to capture a query plan.
SHOWPLAN_TEXT
SHOWPLAN_ALL
Graphical Query Plan
sys.dm_exec_query_optimizer_info
sys.dm_exec_query_plan
sys.dm_exec_query_stats

11.True or False - A LEFT OUTER JOIN is always faster than a NOT EXISTS statement.
False - With always being the operative word.  Depending on the situation the OUTER JOIN may or may not be faster than a NOT EXISTS statement.  It is necessary to test the techniques, review the query plans and tune the queries accordingly.

12.Name three different options to capture the input (code) for a query in SQL Server.
DBCC INPUTBUFFER
fn_get_sql
sys.dm_exec_sql_text

13.Explain why the NORECOMPUTE option of UPDATE STATISTICS is used.
  • This command is used on a per table basis to prevent the table from having statistics automatically updated based on the 'Auto Update Statistics' database configuration.
  • Taking this step will prevent UPDATE STATISTICS from running during an unexpected time of the day and cause performance problems.
  • By setting this configuration it is necessary to manually UPDATE STATISTICS on a regular basis.
14.Explain a SQL Server deadlock, how a deadlock can be identified, how it is a performance problem and some techniques to correct deadlocks.
  • A deadlock is a situation where 2 spids have data locked and cannot release their lock until the opposing spid releases their lock. Depending on the severity of the deadlock, meaning the amount of data that is locked and the number of spids that are trying to access the same data, an entire chain of spids can have locks and cause a number of deadlocks, resulting in a performance issue.
  • Deadlocks can be identified by Profiler in either textual, graphical or XML format.
  • Deadlocks are a performance problem because they can prevent 2 or more processes from being able to process data.  A deadlock chain can occur and impact hundreds of spids based on the data access patterns, number of users, object dependencies, etc.
  • Deadlocks could require a database design change, T-SQL coding change to access the objects in the same order, separating reporting and OLTP applications, including NOLOCK statements in SELECT queries that can accept dirty data, etc.
15.Please explain why SQL Server does not select the same query plan every time for the same code (with different parameters) and how SQL Server can be forced to use a specific query plan.
The query plan is chosen based on the parameters and code being issued to the SQL Server optimizer.  Unfortunately, a slightly different query plan can cause the query to execute much longer and use more resources than another query with exactly the same code and only parameter differences.
The OPTIMIZE FOR hint can be used to specify what parameter value we want SQL Server to use when creating the execution plan. This is a SQL Server 2005 and beyond hint.

16.How do you tune SQL queries to improve performance?
Tuning your SQL queries can have a significantly positive impact on performance. And understanding how your specific RDBMS works can help tremendously as well. But here we will go over some tips on how to tune SQL queries in general, non-RDBMS specific terms.

17.What is a query execution plan?
  • Understanding query execution plans is one of the first steps to properly tuning SQL queries.
18.What is a query execution plan – also known as an explain plan? 
Well, a query execution plan lists all the details of how that particular RDBMS plans on processing a particular query. Inside this plan are details on how the index will be used, how joins will be performed (and their associated logic), and also an estimate of the resource cost. Understanding the explain plan utility for your particular RDBMS is critical if you want to successfully tune SQL queries.
  • Here are some non-RDBMS specific things to keep in mind when tuning your SQL queries to improve performance:
  • Reduce the rows that are returned by a query
  • This is fairly obvious – if a query returns less rows, then clearly the query will be more efficient.
  • Get rid of unnecessary columns and tables
  • Even though this isn’t a change made to your actual query, the less unused space in your database, the more efficient your queries will be. This is another pretty obvious one.
  • GROUP BY may be better to use than DISTINCT. In some DBMS’s GROUP BY is a more efficient way of retrieving unique rows than DISTINCT. This is because GROUP BY performs the sort that finds duplicates earlier in the processing of a query than DISTINCT. The DISTINCT clause will perform the sort at the very last step, and will do this against the final result set. Why does it matter when the sort is performed? Well, if duplicate rows are eliminated earlier on in the processing of a query, then it means that the rest of the processing of the query will be more efficient because there will presumably be less rows to perform the rest of processing on, since the duplicates have already been eliminated. For your particular RDBMS, you should look at the explain plans for running a query with GROUP BY or DISTINCT to see how they compare.
  • Hints might help you tune your SQL queries.
FabGreen Technologies-Madivala
FabGreenTech.com

No comments:

Post a Comment