- SQL Server Reference Guide
- Introduction
- SQL Server Reference Guide Overview
- Table of Contents
- Microsoft SQL Server Defined
- SQL Server Editions
- SQL Server Access
- Informit Articles and Sample Chapters
- Online Resources
- Microsoft SQL Server Features
- SQL Server Books Online
- Clustering Services
- Data Transformation Services (DTS) Overview
- Replication Services
- Database Mirroring
- Natural Language Processing (NLP)
- Analysis Services
- Microsot SQL Server Reporting Services
- XML Overview
- Notification Services for the DBA
- Full-Text Search
- SQL Server 2005 - Service Broker
- Using SQL Server as a Web Service
- SQL Server Encryption Options Overview
- SQL Server 2008 Overview
- SQL Server 2008 R2 Overview
- SQL Azure
- The Utility Control Point and Data Application Component, Part 1
- The Utility Control Point and Data Application Component, Part 2
- Microsoft SQL Server Administration
- The DBA Survival Guide: The 10 Minute SQL Server Overview
- Preparing (or Tuning) a Windows System for SQL Server, Part 1
- Preparing (or Tuning) a Windows System for SQL Server, Part 2
- Installing SQL Server
- Upgrading SQL Server
- SQL Server 2000 Management Tools
- SQL Server 2005 Management Tools
- SQL Server 2008 Management Tools
- SQL Azure Tools
- Automating Tasks with SQL Server Agent
- Run Operating System Commands in SQL Agent using PowerShell
- Automating Tasks Without SQL Server Agent
- Storage – SQL Server I/O
- Service Packs, Hotfixes and Cumulative Upgrades
- Tracking SQL Server Information with Error and Event Logs
- Change Management
- SQL Server Metadata, Part One
- SQL Server Meta-Data, Part Two
- Monitoring - SQL Server 2005 Dynamic Views and Functions
- Monitoring - Performance Monitor
- Unattended Performance Monitoring for SQL Server
- Monitoring - User-Defined Performance Counters
- Monitoring: SQL Server Activity Monitor
- SQL Server Instances
- DBCC Commands
- SQL Server and Mail
- Database Maintenance Checklist
- The Maintenance Wizard: SQL Server 2000 and Earlier
- The Maintenance Wizard: SQL Server 2005 (SP2) and Later
- The Web Assistant Wizard
- Creating Web Pages from SQL Server
- SQL Server Security
- Securing the SQL Server Platform, Part 1
- Securing the SQL Server Platform, Part 2
- SQL Server Security: Users and other Principals
- SQL Server Security – Roles
- SQL Server Security: Objects (Securables)
- Security: Using the Command Line
- SQL Server Security - Encrypting Connections
- SQL Server Security: Encrypting Data
- SQL Server Security Audit
- High Availability - SQL Server Clustering
- SQL Server Configuration, Part 1
- SQL Server Configuration, Part 2
- Database Configuration Options
- 32- vs 64-bit Computing for SQL Server
- SQL Server and Memory
- Performance Tuning: Introduction to Indexes
- Statistical Indexes
- Backup and Recovery
- Backup and Recovery Examples, Part One
- Backup and Recovery Examples, Part Two: Transferring Databases to Another System (Even Without Backups)
- SQL Profiler - Reverse Engineering An Application
- SQL Trace
- SQL Server Alerts
- Files and Filegroups
- Partitioning
- Full-Text Indexes
- Read-Only Data
- SQL Server Locks
- Monitoring Locking and Deadlocking
- Controlling Locks in SQL Server
- SQL Server Policy-Based Management, Part One
- SQL Server Policy-Based Management, Part Two
- SQL Server Policy-Based Management, Part Three
- Microsoft SQL Server Programming
- An Outline for Development
- Database
- Database Services
- Database Objects: Databases
- Database Objects: Tables
- Database Objects: Table Relationships
- Database Objects: Keys
- Database Objects: Constraints
- Database Objects: Data Types
- Database Objects: Views
- Database Objects: Stored Procedures
- Database Objects: Indexes
- Database Objects: User Defined Functions
- Database Objects: Triggers
- Database Design: Requirements, Entities, and Attributes
- Business Process Model Notation (BPMN) and the Data Professional
- Business Questions for Database Design, Part One
- Business Questions for Database Design, Part Two
- Database Design: Finalizing Requirements and Defining Relationships
- Database Design: Creating an Entity Relationship Diagram
- Database Design: The Logical ERD
- Database Design: Adjusting The Model
- Database Design: Normalizing the Model
- Creating The Physical Model
- Database Design: Changing Attributes to Columns
- Database Design: Creating The Physical Database
- Database Design Example: Curriculum Vitae
- NULLs
- The SQL Server Sample Databases
- The SQL Server Sample Databases: pubs
- The SQL Server Sample Databases: NorthWind
- The SQL Server Sample Databases: AdventureWorks
- The SQL Server Sample Databases: Adventureworks Derivatives
- UniversalDB: The Demo and Testing Database, Part 1
- UniversalDB: The Demo and Testing Database, Part 2
- UniversalDB: The Demo and Testing Database, Part 3
- UniversalDB: The Demo and Testing Database, Part 4
- Getting Started with Transact-SQL
- Transact-SQL: Data Definition Language (DDL) Basics
- Transact-SQL: Limiting Results
- Transact-SQL: More Operators
- Transact-SQL: Ordering and Aggregating Data
- Transact-SQL: Subqueries
- Transact-SQL: Joins
- Transact-SQL: Complex Joins - Building a View with Multiple JOINs
- Transact-SQL: Inserts, Updates, and Deletes
- An Introduction to the CLR in SQL Server 2005
- Design Elements Part 1: Programming Flow Overview, Code Format and Commenting your Code
- Design Elements Part 2: Controlling SQL's Scope
- Design Elements Part 3: Error Handling
- Design Elements Part 4: Variables
- Design Elements Part 5: Where Does The Code Live?
- Design Elements Part 6: Math Operators and Functions
- Design Elements Part 7: Statistical Functions
- Design Elements Part 8: Summarization Statistical Algorithms
- Design Elements Part 9:Representing Data with Statistical Algorithms
- Design Elements Part 10: Interpreting the Data—Regression
- Design Elements Part 11: String Manipulation
- Design Elements Part 12: Loops
- Design Elements Part 13: Recursion
- Design Elements Part 14: Arrays
- Design Elements Part 15: Event-Driven Programming Vs. Scheduled Processes
- Design Elements Part 16: Event-Driven Programming
- Design Elements Part 17: Program Flow
- Forming Queries Part 1: Design
- Forming Queries Part 2: Query Basics
- Forming Queries Part 3: Query Optimization
- Forming Queries Part 4: SET Options
- Forming Queries Part 5: Table Optimization Hints
- Using SQL Server Templates
- Transact-SQL Unit Testing
- Index Tuning Wizard
- Unicode and SQL Server
- SQL Server Development Tools
- The SQL Server Transact-SQL Debugger
- The Transact-SQL Debugger, Part 2
- Basic Troubleshooting for Transact-SQL Code
- An Introduction to Spatial Data in SQL Server 2008
- Performance Tuning
- Performance Tuning SQL Server: Tools and Processes
- Performance Tuning SQL Server: Tools Overview
- Creating a Performance Tuning Audit - Defining Components
- Creating a Performance Tuning Audit - Evaluation Part One
- Creating a Performance Tuning Audit - Evaluation Part Two
- Creating a Performance Tuning Audit - Interpretation
- Creating a Performance Tuning Audit - Developing an Action Plan
- Understanding SQL Server Query Plans
- Performance Tuning: Implementing Indexes
- Performance Monitoring Tools: Windows 2008 (and Higher) Server Utilities, Part 1
- Performance Monitoring Tools: Windows 2008 (and Higher) Server Utilities, Part 2
- Performance Monitoring Tools: Windows System Monitor
- Performance Monitoring Tools: Logging with System Monitor
- Performance Monitoring Tools: User Defined Counters
- General Transact-SQL (T-SQL) Performance Tuning, Part 1
- General Transact-SQL (T-SQL) Performance Tuning, Part 2
- General Transact-SQL (T-SQL) Performance Tuning, Part 3
- Performance Monitoring Tools: An Introduction to SQL Profiler
- Performance Tuning: Introduction to Indexes
- Performance Monitoring Tools: SQL Server 2000 Index Tuning Wizard
- Performance Monitoring Tools: SQL Server 2005 Database Tuning Advisor
- Performance Monitoring Tools: SQL Server Management Studio Reports
- Performance Monitoring Tools: SQL Server 2008 Activity Monitor
- The SQL Server 2008 Management Data Warehouse and Data Collector
- Performance Monitoring Tools: Evaluating Wait States with PowerShell and Excel
- Practical Applications
- Choosing the Back End
- The DBA's Toolbox, Part 1
- The DBA's Toolbox, Part 2
- Scripting Solutions for SQL Server
- Building a SQL Server Lab
- Using Graphics Files with SQL Server
- Enterprise Resource Planning
- Customer Relationship Management (CRM)
- Building a Reporting Data Server
- Building a Database Documenter, Part 1
- Building a Database Documenter, Part 2
- Data Management Objects
- Data Management Objects: The Server Object
- Data Management Objects: Server Object Methods
- Data Management Objects: Collections and the Database Object
- Data Management Objects: Database Information
- Data Management Objects: Database Control
- Data Management Objects: Database Maintenance
- Data Management Objects: Logging the Process
- Data Management Objects: Running SQL Statements
- Data Management Objects: Multiple Row Returns
- Data Management Objects: Other Database Objects
- Data Management Objects: Security
- Data Management Objects: Scripting
- Powershell and SQL Server - Overview
- PowerShell and SQL Server - Objects and Providers
- Powershell and SQL Server - A Script Framework
- Powershell and SQL Server - Logging the Process
- Powershell and SQL Server - Reading a Control File
- Powershell and SQL Server - SQL Server Access
- Powershell and SQL Server - Web Pages from a SQL Query
- Powershell and SQL Server - Scrubbing the Event Logs
- SQL Server 2008 PowerShell Provider
- SQL Server I/O: Importing and Exporting Data
- SQL Server I/O: XML in Database Terms
- SQL Server I/O: Creating XML Output
- SQL Server I/O: Reading XML Documents
- SQL Server I/O: Using XML Control Mechanisms
- SQL Server I/O: Creating Hierarchies
- SQL Server I/O: Using HTTP with SQL Server XML
- SQL Server I/O: Using HTTP with SQL Server XML Templates
- SQL Server I/O: Remote Queries
- SQL Server I/O: Working with Text Files
- Using Microsoft SQL Server on Handheld Devices
- Front-Ends 101: Microsoft Access
- Comparing Two SQL Server Databases
- English Query - Part 1
- English Query - Part 2
- English Query - Part 3
- English Query - Part 4
- English Query - Part 5
- RSS Feeds from SQL Server
- Using SQL Server Agent to Monitor Backups
- Reporting Services - Creating a Maintenance Report
- SQL Server Chargeback Strategies, Part 1
- SQL Server Chargeback Strategies, Part 2
- SQL Server Replication Example
- Creating a Master Agent and Alert Server
- The SQL Server Central Management System: Definition
- The SQL Server Central Management System: Base Tables
- The SQL Server Central Management System: Execution of Server Information (Part 1)
- The SQL Server Central Management System: Execution of Server Information (Part 2)
- The SQL Server Central Management System: Collecting Performance Metrics
- The SQL Server Central Management System: Centralizing Agent Jobs, Events and Scripts
- The SQL Server Central Management System: Reporting the Data and Project Summary
- Time Tracking for SQL Server Operations
- Migrating Departmental Data Stores to SQL Server
- Migrating Departmental Data Stores to SQL Server: Model the System
- Migrating Departmental Data Stores to SQL Server: Model the System, Continued
- Migrating Departmental Data Stores to SQL Server: Decide on the Destination
- Migrating Departmental Data Stores to SQL Server: Design the ETL
- Migrating Departmental Data Stores to SQL Server: Design the ETL, Continued
- Migrating Departmental Data Stores to SQL Server: Attach the Front End, Test, and Monitor
- Tracking SQL Server Timed Events, Part 1
- Tracking SQL Server Timed Events, Part 2
- Patterns and Practices for the Data Professional
- Managing Vendor Databases
- Consolidation Options
- Connecting to a SQL Azure Database from Microsoft Access
- SharePoint 2007 and SQL Server, Part One
- SharePoint 2007 and SQL Server, Part Two
- SharePoint 2007 and SQL Server, Part Three
- Querying Multiple Data Sources from a Single Location (Distributed Queries)
- Importing and Exporting Data for SQL Azure
- Working on Distributed Teams
- Professional Development
- Becoming a DBA
- Certification
- DBA Levels
- Becoming a Data Professional
- SQL Server Professional Development Plan, Part 1
- SQL Server Professional Development Plan, Part 2
- SQL Server Professional Development Plan, Part 3
- Evaluating Technical Options
- System Sizing
- Creating a Disaster Recovery Plan
- Anatomy of a Disaster (Response Plan)
- Database Troubleshooting
- Conducting an Effective Code Review
- Developing an Exit Strategy
- Data Retention Strategy
- Keeping Your DBA/Developer Job in Troubled Times
- The SQL Server Runbook
- Creating and Maintaining a SQL Server Configuration History, Part 1
- Creating and Maintaining a SQL Server Configuration History, Part 2
- Creating an Application Profile, Part 1
- Creating an Application Profile, Part 2
- How to Attend a Technical Conference
- Tips for Maximizing Your IT Budget This Year
- The Importance of Blue-Sky Planning
- Application Architecture Assessments
- Transact-SQL Code Reviews, Part One
- Transact-SQL Code Reviews, Part Two
- Cloud Computing (Distributed Computing) Paradigms
- NoSQL for the SQL Server Professional, Part One
- NoSQL for the SQL Server Professional, Part Two
- Object-Role Modeling (ORM) for the Database Professional
- Business Intelligence
- BI Explained
- Developing a Data Dictionary
- BI Security
- Gathering BI Requirements
- Source System Extracts and Transforms
- ETL Mechanisms
- Business Intelligence Landscapes
- Business Intelligence Layouts and the Build or Buy Decision
- A Single Version of the Truth
- The Operational Data Store (ODS)
- Data Marts – Combining and Transforming Data
- Designing Data Elements
- The Enterprise Data Warehouse — Aggregations and the Star Schema
- On-Line Analytical Processing (OLAP)
- Data Mining
- Key Performance Indicators
- BI Presentation - Client Tools
- BI Presentation - Portals
- Implementing ETL - Introduction to SQL Server 2005 Integration Services
- Building a Business Intelligence Solution, Part 1
- Building a Business Intelligence Solution, Part 2
- Building a Business Intelligence Solution, Part 3
- Tips and Troubleshooting
- SQL Server and Microsoft Excel Integration
- Tips for the SQL Server Tools: SQL Server 2000
- Tips for the SQL Server Tools – SQL Server 2005
- Transaction Log Troubles
- SQL Server Connection Problems
- Orphaned Database Users
- Additional Resources
- Tools and Downloads
- Utilities (Free)
- Tool Review (Free): DBDesignerFork
- Aqua Data Studio
- Microsoft SQL Server Best Practices Analyzer
- Utilities (Cost)
- Quest Software's TOAD for SQL Server
- Quest Software's Spotlight on SQL Server
- SQL Server on Microsoft's Virtual PC
- Red Gate SQL Bundle
- Microsoft's Visio for Database Folks
- Quest Capacity Manager
- SQL Server Help
- Visual Studio Team Edition for Database Professionals
- Microsoft Assessment and Planning Solution Accelerator
- Aggregating Server Data from the MAPS Tool
In the previous article in this series, I explained a little about the latest IT buzzword: NoSQL. I described where it came from, and why the data professional, specifically the SQL Server professional needs to care about it.
No doubt your developers are already using a NoSQL offering, even if they are only testing it out. It's best for you to understand where it does and does not fit. In that last article I explained its primary strengths and weaknesses. Most NoSQL excels at ease of programming for the developer, and does a good job of horizontally scaling with distributed systems. Where NoSQL has some disadvantages is in the ACID properties it maintains and the Programmatic Referential Integrity requirements. Again, for many developers this is acceptable.
I'll have to start out this article as I did the last one — with a caveat. There are so many NoSQL implementations and the term has such a loose meaning that the moment this article is published it will be out of date. Rather than let that happen, I'll point to a location that keeps up on the latest implementations for a more authoritative list. What I will do here is break the NoSQL offerings into the largest, most popular categories — at least as of this writing. You can take the information here and use it to apply to the latest products that exist in the NoSQL World.
As before, since there is no official governing body for the NoSQL products, each is left to define itself as NoSQL or something else. That means some of the information in this overview might not be applicable to a given product, but the concepts should apply at least broadly.
NoSQL Uses and Implementations
NoSQL databases, because of the distributed nature of their architectures (see my last article in this series) do well in column-style heavy read operations involving a search. They can distribute the read operations across multiple nodes in the system and filter the data accordingly.
Also, some kinds of writes — especially small, few-columned writes (such as a Tweet in Twitter) do well with a NoSQL implementation.
Although some folks throw around the term "Large" data as a use-case for NoSQL, I think that requires a definition. What is considered "large" data one year is a normal size of data in many organizations the next. And SQL Server, Oracle and other RDBMS systems routinely store Petabytes of data, which at the moment most folks consider a large data set.
The large data argument normally involves the locking mechanism and transactions. In an RDBMS, to ensure that someone isn't selling inventory while another person is selling the same item, locks are enforced by the RDBMS platform. In NoSQL implementations — some of them, anyway — locks are relaxed, or managed at the program level. This allows greater scale, but can also create "Eventual Consistency", meaning that a time-stamp is used to overwrite the latest datum. Sometimes the distributed nature means that the system might need to "catch up" as the reads happen faster than the writes. Again, in narrower columns and quick writes, this isn't always an issue.
NoSQL implementations handle indexing differently, depending on the product. In some cases the indexes are easier to design and deal with, and in others they are more difficult.
At the risk of over-generalizing, I'll categorize the NoSQL products into three large groups: Key/Value, Document, and Graph-Oriented databases. I'll mention a fourth which is certainly not an SQL implementation, but not always talked about as a NoSQL offering.
For the comparison, it works well for me to think about the storage arrangements of data. In a Relational system, a simple set of table relationships might look something like this for a names-to-addresses setup:
Name
NameID, PK, INT
FName, VARCHAR(50)
LName, VARCHAR(50
Address
AddressID, PK, INT
NameID, FK, INT
Street, VARCHAR(150)
City, VARCHAR(50)
State, CHAR(2)
Zip, VARCHAR(12)
DateStart, DATETIME
In an RDBMS we would set up a foreign key from Address that points to the Primary Key in Name, allowing us to have multiple addresses for a single person, in this case, like this:
Name
1234
Buck
Woody
Address
1
1234
784 Overton Lane
Seattle
WA
98052
07/07/2005
2
1234
453 Bellevue Square
Melbourne
FL
32935
06/13/2000
Using the second value as the "key" that points back, we relate the data, forming the heart of an RDBMS system. I'll use that same example as we go.
Key/Value Pair Storage — Columnar Databases
This type of NoSQL offering is exactly what its name implies — there is a value used as a "Key" (like our Primary Keys), but a generic container called the "Value" holds everything else. It's as if there are only two columns in a database, and data is available only for one.
The interesting thing is that the "Key" serves not only as a unique indicator of some kind, but often is the way the data is physically partitioned or laid out on disk. That means you can get incredible performance by merely using a lot of Key values — hundreds or thousands of them.
So one possible structure for the data above is:
NameTable
Key: 1234
Value: ["Buck Woody", "784 Overton Lane, Seattle, WA 98052, 07/07/2005'] [453 Bellevue Square, Melbourne, FL, 32935, 06/13/2000]
Sometimes the items I show separated here by commas have a prefix format and even other mechanisms. In most cases the programming language used to obtain the data defines the data store format.
Although a formal "JOIN" operation isn't always that common in a NoSQL database, it is possible. I've worked with some that defined the data above this way:
Key: 1234
Value: ["Buck Woody", "784 Overton Lane, Seattle, WA 98052, 06/13/2000]
Key: 1456
Value: ["Buck Woody", 453 Bing Place, 07/12/2005]
Using functions found in some language constructs, the two data sets are brought back using an associative array. I this case, it used the name (a bad choice, in my opinion) to show that Buck Woody moved from one street to another in Seattle on the 12th of July in 2005.
Using this construct, you would have something that looks more like a Dimension Table in a Star Schema. If you used multiple values similar to the first example, it feels more like a de-normalized table in SQL Server. And of course other structures are possible.
I'll show you where to find some examples of Key/Value NoSQL databases at the end of this article.
Document Structures
http://en.wikipedia.org/wiki/Document-oriented_database
Document Databases deal heavily with semi-structured data. Rather than working with small, discrete columns, these databases can also deal with larger sets of even binary information. But they shine best as a place to store differing "columns" within multiple "rows" — even when they are not structured that way. In fact, XML Databases are often referred to as "Document-Oriented Databases".
The structure in this example might look something like this:
Name="BuckWoody", Address="784 Overton Lane, Seattle, WA 98052", Profession=[{Name:"Computer Technology",YearsExperience:25}, {Name:"Marjorie", YearsExperience:15}, {Name:"Christina Woody", Address="784 Overton Lane, Seattle, WA 98052"}, {Name:"Bill Wilson", Age:27}
Notice how the structure of the data "wanders" all over the place, even in line with a single element. This is probably the most confusing structure to an RDBMS data professional.
Most often these databases are accessed using JSON. More on that here.
Graph Oriented Data
http://en.wikipedia.org/wiki/Graph_database
While an RDBMS follows a relational calculus from math, Graph databases follow a different mathematical model. This model deals with structures called Nodes or sometimes Vertices, and the connections between them are called Edges.
A Graph database is useful for a quick routing lookup structure, especially when data has multiple relationships to each other. The interesting thing is that you can quickly query via the relationship or the data. In other words, you're able to quickly find out how many relationships a datum has, or how much data is attached to a relationship. Although not a perfect comparison, you can think of the functions you can perform on a HierarchyID data type in SQL Server. In fact, Graph database Nodes and Edges are very similar to the HierarchyID examples.
You'll deal much more with relationships in this type of NoSQL offering, but not in the same way you would with an RDBMS. For our example, the structure might look like this:
GraphObject Name = createGraphObject(); // create a node called GraphObject
GraphObject Address = createGraphObject(); // create second node
Name.relate( Address ); // creates the edge or relationship between nodes
Now to work with this data, the pseudo-code looks something like this:
Name.traverseToNeighbors().getSingleMember(); // returns a single address
Address.traverseToNeighbors().getSingleMember(); // returns a single name
Working with Graph databases you not only get good relationship management, you can also have a better control over integrity in the data, since the functions handle the transaction-like processing.
Object-Oriented Databases
Most research I've done does not place an Object-Oriented Database (OODB) in the NoSQL camp, and I tend to agree with that categorization. The reason is that for the most part, an OODB can be implemented by simply combining Object-Oriented Programming (OOP) over almost any data store or structure. There are full OODB systems in use, but they are not ubiquitous. I mention it here only for completeness, and to show that I don't think it belongs in the NoSQL category. You can learn more about OODB's here, so I won't cover the concepts in great depth.
You'll find that in an OODB the data structure is similar (sometimes identical) to programming classes. For that reason, developers can usually navigate and work with these structures more quickly and easily than having to map out the structures from an RDBMS.
One of the most interesting things I've found in an OODB is the locking mechanisms. Because each data element is an object, they can be grouped in multiple ways, and then locked accordingly. This allows a great mix of low and high level locking based on the need of the program, but can also cause clashes between program elements requiring access to the data. For those reasons, and because of the increased programmatic complexity, I sometimes see a much more relaxed level of "Dirty Reads" — meaning that developers sometimes only lock an object or series of objects for writes. This is something to be aware of when your developers look into this paradigm, and one of the reasons I include OODB in this article.
NoSQL Resources
I'm leery of using Wikipedia as a standard resource tool, but in this case it is useful. You can start here: http://en.wikipedia.org/wiki/NoSQL
In as much as there is a standard body of work around the NoSQL offerings, this one seems to be the most up to date: http://nosql-database.org/
Even Windows Azure Table Storage can be used in a NoSQL like fashion. More on that here: http://blog.smarx.com/posts/sample-code-for-batch-transactions-in-windows-azure-tables