FILESTREAM – What and When – Sharing Experiences

If you have to store files (BLOBs) in a Microsoft SQL Server database, consider FILESTREAM, available since SQL Server 2008. Using it essentially means enabling the FILESTREAM attribute on a varbinary(max) column. Some setup is necessary before that: FILESTREAM has to be enabled on the instance, and the database needs a FILESTREAM filegroup.
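As a rough illustration, the setup might look like the T-SQL sketch below. The database name, file paths, table name, and column names are all hypothetical, and the sketch assumes FILESTREAM has already been enabled for the Windows service in SQL Server Configuration Manager.

```sql
-- Allow both Transact-SQL and Win32 streaming access on the instance.
-- (0 = disabled, 1 = T-SQL only, 2 = T-SQL and Win32 streaming)
EXEC sp_configure 'filestream access level', 2;
RECONFIGURE;
GO

-- Hypothetical database with a dedicated FILESTREAM filegroup.
-- C:\Data must already exist; the DocumentStore_files folder must not.
CREATE DATABASE DocumentStore
ON PRIMARY
    (NAME = DocumentStore_data,  FILENAME = 'C:\Data\DocumentStore.mdf'),
FILEGROUP DocumentStore_fs CONTAINS FILESTREAM
    (NAME = DocumentStore_files, FILENAME = 'C:\Data\DocumentStore_files')
LOG ON
    (NAME = DocumentStore_log,   FILENAME = 'C:\Data\DocumentStore.ldf');
GO

USE DocumentStore;
GO

-- A FILESTREAM table needs a unique, non-null uniqueidentifier ROWGUIDCOL column.
CREATE TABLE dbo.Documents
(
    DocumentId UNIQUEIDENTIFIER ROWGUIDCOL NOT NULL UNIQUE DEFAULT NEWID(),
    FileName   NVARCHAR(260)  NOT NULL,
    Content    VARBINARY(MAX) FILESTREAM NULL  -- stored as a file on NTFS
);
GO
```

Only the Content column lives on the file system; DocumentId and FileName remain ordinary row data in the database files.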

FILESTREAM combines SQL Server with the NTFS file system. A database is good at reading, writing, and processing small pieces of information (rows and columns) scattered around the database files, while a file system is good at reading and writing files as a whole. The file system starts to perform better once files are larger than about 1 MB.

By combining the two you get the best of both worlds: you get past the 2 GB size limit of a regular varbinary(max) value and are bounded only by the size of the NTFS volume; you get the good streaming performance of the file system; the files stay integrated with the structured data (transactional consistency); fragmentation is easier to manage; and manageability improves, since backups and similar tasks cover both together.

However, FILESTREAM performs poorly if you make frequent small updates. It has more work to do in that case: it must guarantee transactional consistency and also write to the file system, where an update essentially writes a new copy of the file and deletes the old one. Backups also take longer.

FILESTREAM data can be accessed in two ways: through standard Transact-SQL and through the Win32 APIs. For the best file system performance you need to use the Win32 APIs. There are provisions for the Win32 APIs to work within the context of a database transaction.
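The two access paths can be sketched in T-SQL as follows. This assumes a hypothetical table dbo.Documents with a uniqueidentifier ROWGUIDCOL column that has a default, an NVARCHAR column FileName, and a varbinary(max) FILESTREAM column named Content; none of these names come from the text above.

```sql
-- Path 1: plain Transact-SQL. The FILESTREAM column behaves like any
-- other varbinary(max) column.
INSERT INTO dbo.Documents (FileName, Content)
VALUES (N'hello.txt', CAST('Hello, FILESTREAM' AS VARBINARY(MAX)));

SELECT FileName, DATALENGTH(Content) AS SizeInBytes
FROM dbo.Documents
WHERE FileName = N'hello.txt';

-- Path 2: Win32 streaming. Inside a transaction, ask SQL Server for the
-- logical file path and a transaction context token.
BEGIN TRANSACTION;

SELECT Content.PathName()                    AS FilePath,
       GET_FILESTREAM_TRANSACTION_CONTEXT()  AS TxContext
FROM dbo.Documents
WHERE FileName = N'hello.txt';

-- A client (for example the middle tier) would now pass FilePath and
-- TxContext to OpenSqlFilestream, or to the .NET SqlFileStream class,
-- stream the file, close the handle, and only then commit.

COMMIT TRANSACTION;
```

The transaction context is what ties the file handle opened through Win32 to the database transaction, so the file read or write commits or rolls back together with the row.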

FILESTREAM BLOBs use the file system cache and do not use the database buffer pool. This leaves the buffer pool free for query processing and so helps overall database performance.

Note, however, that certain features, such as Transparent Data Encryption (TDE), are not supported with FILESTREAM.

To summarize: FILESTREAM is a good choice if the files you save in the database are 1 MB or larger, are read a lot with few small updates, and are accessed through a middle tier where you can use the Win32 APIs.


Author: Gopal Das

Director (Data Science) @ CrimsonLogic, Singapore; BS in CSE from Khulna University; ME in Internet Science & Engineering from Indian Institute of Science (IISc); Publications on Query Optimization in RDBMS in ACM SIGMOD, IEEE ICDE etc.; Founding team member and VP Technical Staff at iTwin, a spinoff from A*STAR; Software engineer/data scientist for 22 years; Software, Database, ML; Father of 3 (two @ NUS High School and one is too little!); www.linkedin.com/in/dasgopal; https://github.com/gopalcdas
