Salesforce Summaries – BigObjects — Bring Big Data to

By Andy Hitchings

SalesforceSummaries: a series delivering key insights from Salesforce YouTube videos, to save you time as you keep up to date with the latest technological changes within the Salesforce ecosystem.


Since the Winter ’18 release, Big Objects are now GA (generally available). This summary highlights the key takeaways from the excellent ‘BigObjects’ video from the Salesforce Developers YouTube channel and details why this big data functionality was built and how you can best use it. This summary is linked to this one on Async SOQL; which is the query language used for Big Objects.

Details: ‘BigObjects — Bring Big Data to

Presenter: James Ferguson

Time: 47 minutes

Key Terms: BigObjects, big data, Async SOQL


1.@1.00 — BigObjects is now GA (generally available) as per the Winter 18 release.

2.@1.30 — The agenda for this talk is below:

3.@1.30 — Customers are trying to capture hundreds of millions and billions of records. This is the ‘data challenge’ because more customers = more data = more time. This is a growing data challenge. In order to process and manage such large data sets, the right tools and needed.

4.@2.40 — Customers want to collect and store data for various reasons. The primary use cases for this is to enhance customer 360 (know more about their customers), to capture user activity and also to retain historical data (sometimes for regulatory purposes).

5.@3.30 — We will now discuss the following:

6.@3.50 — Under Setup | Create, you’ll notice a new tab ‘Big Objects’:

7.@4.30 — In this demo, a Big Object has been created to store ~ 160 million weather related records. As per the screenshot, you’ll notice that the Big Object looks very similar to a custom object. And that is the point: to create new functionality that looks very much like existing functionality that Salesforce Developers are aware of.

You can create custom relationships with other standard or custom objects too.

8.@7.00 — You can use SOQL to return a set of results from Big Objects too (although to process all of the records from a Big Object, you would use Async SOQL).

9.@9.00 — The metadata for Big Objects is very similar to that of standard and custom objects however, the major difference is that the metadata for Big Objects contains indexes.

10.@9.10 — A good analogy here is that indexes are like looking up a surname in a phone directory. If you know the first few characters of a surname, then you can find the surname quickly regardless of how many other surnames there are in the phone directory.

11.@10.00 — You can leverage indexes with filters in standard SOQL queries:

In this example, using filters, out of the ~ 160 million records, 9 records have been left that match the filter criteria:

12.@13.00 — You can also build Lightning components with data from Big Objects and display them on a page in the Lightning App Builder. In this way, you can have Big Object data being displayed directly on a page.

Looking at the metadata of the Big Object, we can see that it is very important which fields are indexed.

When you are building a query from a Big Object that displays data immediately, you can only filter on those indexes.

13.@14.50 — Let’s take a quick recap:

14.@15.30 — There is a huge amount of innovation that allows developers to build apps and solve problems that couldn’t have been possible 5 years ago. However, despite this, the technology that falls under ‘Big Data’ is very segmented. There isn’t just one ‘Big Data’ stack that you can get and install. Developers are having to be system integrators to take advantage of big data. The integration and operational costs of having to maintain multiple different stacks and APIs and technologies slows us down from doing what we want to do exactly with our big data.

15.@17.00 — So, with that in mind, Big Objects is an answer to that problem as Salesforce says ‘let us take care of the integration and operational aspects’.

16.@19.30 — Async SOQL works with Big Objects to process millions and billions of records. Please read this Salesforce Summary to learn more about Async SOQL.

17.@25.00 — You can also use Einstein Analytics (Wave) to pull data from Big Objects. This is powerful because it enables you to visualize the data quickly and effectively.

For example, you can visualize all the 160 million records in a heat map format:

18.@28.00 — Another use for Big Objects is to use its functionality as a means to store field history, audit trail information and historical records which you want to keep but don’t want to be detrimental to the Org speed performance.

19.@29.00 — A common use case is to use Big Objects to store (archive) millions (and sometimes billions) of tasks. You can leverage Batch Apex with Big Objects to copy the data from the tasks and create an archived record in a Big Object.

20.@30.40 — You would leverage Database.insertImmediate, because the archived records aren’t in the set of standard sObjects. You can also leverage Queueable with Batch Apex for Big Objects; which might be helpful for you.

21.@32.00 — Big Objects are built on a set of proven big data technologies, with Apache HBase at its core. Big Objects are not built on brand new technology in Salesforce. In actual fact, the big data technology stack on HBase has been running Salesforce GA (generally available) products for 2 years now. In particular, the field audit portion of Salesforce Shield uses the same technology stack. At Salesforce, we like to say that field audit was the first ‘big object’.

22.@34.00 — Finally, Salesforce have baked in best practices around big data into the way that Big Objects work. If your application has a thousand records today but 1 billion records tomorrow, your application will still work exactly as it should.

23.@42.00 — You can use Platform Events and the Bulk API with Big Objects too.

The Author

Andy Hitchings

Andy is a certified Salesforce and DocuSign admin and a certified Salesforce developer. His favorite technologies right now are the platform and Javascript.

Leave a Reply