Filtered Element Collectors

One of the most important enhancements made in the Revit 2011 API which is guaranteed to affect almost every single conceivable Revit add-in is the new filtered element collector mechanism, and we have already looked at numerous aspects and examples of using it in various ways.

It represents the one and only way to access elements in the Revit database, if you disregard a number of dedicated Document properties providing direct access to specialised collections of various types, which can easily be replaced by filtered element collectors as well.

In addition to all the results published here so far, I was astounded by the wealth of yet more exciting and useful information and benchmarking results presented by Kevin Vandecar, ex Principal Engineer of the Revit API Team, at the AEC DevCamp conference in June 2010 in his presentation on 'Revit Performance Tips and Tricks – Exploiting 2011 API Features'. Kevin has since rejoined the ADN DevTech team, where he specialises on the M & E APIs and is impressed by the powerful and fascinating programming possibilities they offer. Here is his session summary:

In this class we will cover techniques that can be used to efficiently find elements and information from a Revit model. This new framework was introduced in the 2011 release and includes much more flexibility than before. Along with the flexibility comes some complexity, so we will shed light on the best approaches for common element gathering tasks. We will discuss the various new filters including logical and inverted aspects. We will also show how the iteration framework can be used with the .NET Framework LINQ APIs. We will discuss Regeneration topics including "when" regeneration should be done. Finally there are a few auto-join performance tips that we will share.

Here are Kevin's complete session materials AecDevCamp2010-RevitPerformanceTipsTricks.zip, which include the presentation, the hand-out document, a C# .NET sample add-in, and a couple of Revit test models.

Unfortunately, Kevin will not be able to present this material himself at Autodesk University. It would be a great shame if it were not shown at all, so I have been asked to present it for him instead, which I am very much looking forward to doing. It will be part of my class CP234-2. Here are the complete descriptions of all three of my AU 2010 classes.

In the best spirit of Autodesk University, Kevin wrote a hand-out document covering the entire session content in some detail. His handout is presented in full below. In one of the upcoming posts, I would like to present some details of his benchmarking and example application together with an overview of the other existing samples and results published so far on the blog:

Performance Tips and Tricks for the Revit 2011 API

Introduction

This course covers tips and techniques to efficiently filter and iterate elements within the Revit model. The system is designed to ensure that you determine what you really need to find before iterating so that there is an implicit performance gain over previous techniques. By combining multiple criteria, the new framework works efficiently internally to find a subset of the model that can then be iterated to perform common tasks. The course also covers regeneration and auto-join aspects of the API.

If you are not already an Autodesk Developer Network ADN member, see this information about ADN.

We also post periodically SDK updates and other Revit API information to the Revit Developer Center.

Sample Code

The sample code that accompanies this course is meant mainly to illustrate techniques and performance aspects using different techniques. Because of this, there is minimal error checking and exception handling. If reusing any of the sample code in your production code, please remember to add appropriate error handling.

The sample code also uses the .NET framework timing utility class called Stopwatch. This is part of the Microsoft .NET System.Diagnostics namespace and is an easy-to-use facility for timing operations. The Stopwatch class was built by MS using low-level API calls, with less overhead than other .NET methods. If the hardware and Windows version of the computer support a high-resolution performance counter, it will use this counter instead of the standard PC clock.

Revit 2011 API Filtering Framework

The 2011 release introduces a new filtering and iteration framework. The internal implementation is done on native side so matching takes place before the managed wrapper is created. This ensures the greatest performance when finding elements with certain criteria. The built-in filters provide this criteria and then you can iterate or filter further using built-in iteration APIs, or with LINQ. In providing the built-in filters, it removes much of the per-element access that would marshal between native internals and the managed API.

This removes much of the per-element access that marshals between native internals and managed API.

Filtering Elements

The process of finding elements that match your search criteria is easier than ever with Revit 2011. The new filtering framework process can be broken down into three basic steps:

  1. Filtering is the base step – first decide what you want
  2. Choose how to access
  3. Access the found elements

Example Results

To demonstrate the enhanced performance of 2011 over 2010, some internal testing provided performance results for some of the filters as follows, listing the execution time of various filters in milliseconds in 2010 and 2011 and the performance gain determined by the relation between the two:

Example2010 ms2011 msGain
Filter for each API type48286187.8
All elements of ElementType751.4
Elements of Category25887893.2
Elements by Parameter filter49202.4
Elements of StructuralType86184.7
Elements of StructuralUsage210434.8
Elements of Structural MaterialTypes116167.2
Walls of Structural WallUsages70135.4
Enclosures and Tags112205.6
Curve-Driven Elements53317.7
Elements of Design Options51351.5
Elements on Levels74213.5

FilteredElementCollector

At the heart of the system is the FilteredElementCollector class. It provides the collection facility based on applied filters and then provides the interfaces to access the found elements. The documentation is provided here for convenience.

Developers can assign a variety of conditions to filter the elements which are returned. This class requires that at least one condition be set before making the attempt to access the elements.

Revit will attempt to organize the filters in order to minimize expansion of elements regardless of the order in which conditions and filters are applied.

There are three groups of methods which you can use on a given collector once you have applied filter(s) to it. One group provides collections of all passing elements, a second finds the first match of the given filter(s), and a third provides an iterator which is evaluated lazily (each element is tested by the filter only when the iterator reaches it). You should only use one of the methods from these groups at a time; the collector will reset if you call another method to extract elements. Thus, if you have previously obtained an iterator, it will be stopped and traverse no more elements if you call another method to extract elements.

In .NET, this class supports the IEnumerable interface for Elements. You can use this class with LINQ queries and operations to process lists of elements. Note that because the ElementFilters and the shortcut methods offered by this class process elements in native code before their managed wrappers are generated, better performance will be obtained by using as many native filters as possible on the collector before attempting to process the results using LINQ queries.

One special consideration when using this class in .NET: the debugger will attempt to traverse the members of the collector because of its implementation of IEnumerable. You may see strange results if you also attempt to extract the first element or all elements from the collector while the debugger is also looking at the contents of the collector.

Tip: Do NOT use the debugger to inspect the element list during IEnumerable access (i.e. through LINQ). As the statement above mentions, strange results can occur. In fact during the creation of this material, it was observed a crash in Revit simply due to the debugger trying to inspect the contents of the list.

Filter Types

Filters are applied to the collector to define the search criteria. The base class is ElementFilter, and three derived classes provide the different types of filters that can be applied.

Tip: It is better to couple slow filters with at least one ElementQuickFilter (e.g. class filter), to minimize the number of elements expanded in order to evaluate against the criteria set by this filter.

Efficiency Guidelines

There is no special need to apply quick filters before slow ones, since Revit internally optimizes the order automatically, pushing the quick filters first, regardless of the external order specified. Because the actual iteration takes place only once based on the filter criteria, this optimization can be performed internally.

It is however important to use as many quick filters as possible to reduce the number of elements before any slow filters are applied. Some internal reordering may take place, but this still ensures best performance.

When you apply slow filters, the quick ones will already have eliminated a large number of elements, so the slow filters will not have such a performance impact.

After using the built-in filtering techniques to maximum capacity, you may want to consider using LINQ or explicit coding to narrow the results down further. Be aware that slow filters will normally still be faster than explicit iteration over the results and application of LINQ or custom queries.

Tip: Use the shortcut methods on FilteredElementCollector. Because there are currently no shortcuts for slow filters, you can be sure when using a shortcut you are getting a quick filter. Examples:

Logical Filters

There are two logical filters representing Boolean AND and OR statements. Both can combine the results of two or more input filters:

Quick Filters

Here is a list of all quick filters with their passing criteria and shortcut methods, if available:

Slow Filters

Here is a list of all slow filters with their passing criteria. As said, none of these have any shortcut methods defined:

Variances of the Filters – Tips/Tricks

Remember the filters have many capabilities. The follow functionality may not be obvious, but can be very powerful:

ElementParameterFilter

The ElementParameterFilter is a unique filter designed for use with parameters. Benchmarking examples show that a parameter filter may be faster by a factor of two than post-processing results using explicit coding or LINQ to find a specific subset of elements. Although it is a slow filter, it is advised to consider depending on the search criteria.

Tip: Use Revit Lookup to find the API name for the desired parameter.

Using LINQ

The FilteredElementCollector implements the IEnumerable interface which allows it to work with other .NET framework facilities that can use it. For example, the LINQ APIs are very powerful at allowing you to further find specific data. LINQ is short for the .NET framework Language-INtegrated Query, set, and transform operations. For more information see this link.

LINQ can be used to further filter information using a query, or make use of its other features like XML or SQL.

Regeneration and Auto-Join

From the API Developer's Guide:

After new elements are created or elements are modified, regeneration and auto-joining of elements is required to propagate the changes throughout the model. Without a regeneration (and auto-join, when relevant), the Geometry property and the AnalyticalModel for Elements are either unobtainable (in the case of creating a new element) or they may be invalid. It is important to understand how and when regeneration occurs before accessing the Geometry or AnalyticalModel of an Element.

Although regeneration and auto-join are necessary to propagate changes made in the model, it can be time consuming. It is best if these events occur only as often as necessary. When using RegenerationOption.Automatic, regeneration and auto-joining happen automatically after each and every API call that modifies the model. In this case, the Geometry and AnalyticalModel are assigned immediately and no further action is required prior to accessing them. However, due to the slow performance of this option, it is obsolete and will be removed in a future version of Revit.

In RegenerationOption.Manual, regeneration and auto-joining occur automatically when a transaction that modifies the model is committed successfully, or whenever the Document.Regenerate or Document.AutoJoinElements methods are called. Regenerate and AutoJoinElements may only be called inside an open transaction. It should be noted that the Regeneration method can fail, in which case the RegenerationFailedException will be thrown. If this happens, the changes to the document need to be rolled back by rolling back the current transaction or subtransaction.

Regeneration

Regeneration access has been improved in 2011, but it means you need to be more aware of its consequences. In prior releases, regeneration was something that you had to control in, sort-of a 'reverse way'. For example there was a suspend mode, or a special batch create mode that were meant to help performance. With the 2011 release, you should now apply the manual attribute to your Revit API entry point functions and call Document.Regenerate when it is needed. The automatic mode will be removed in the future because it is very inefficient. From the 2011 documentation:

Manual Regeneration Option

For 2011 and the future you should get into the habit of using the manual option and Document.Regenerate. Consider your performance options as you move over to it. The main point is that you should use regenerate only when needed. This will improve the performance of modification routines and allow you to group or queue modifications until necessary to regenerate. Remember that manual mode will never be slower than the old automatic option. The only problem is the risk of missing a regenerate call and retrieving stale data. Regenerating more often than necessary should not always be considered a problem and does not cause too much overhead. The regeneration algorithm is optimized to minimize the regeneration of only those items that need it. However, remember that it is another operation, and it is advisable to experiment with your particular algorithm and typical data sets to understand the best performance. The performance of regeneration will depend on the magnitude of change and the complexity of the model – there is no simple and generic answer about how long regeneration will take.

Tips and Tricks

Auto-Join

As mentioned above, auto-join is something that needs to be done when creating or modifying elements to ensure the geometry aspects are updated appropriately before accessing those elements. The trick is to know when to call it. Generally, you must call it when elements are related by geometry to other elements. For example, when two walls are added and a corner is formed, those elements are related at the corner and need auto-join.

Tip: With RegenerationOption.Automatic, elements that are meant to be auto-joined should automatically be updated. There was a problem with this in the very first version of 2011 that was fixed in the first user release update.

Next Instalment

As said, one of the next topics will be an overview of Kevin's sample code and collector benchmarks, followed by a global overview of all the filtering samples we have studied so far.