Funnelback Features
Our feature rich, high powered, customizable, search engine allows organizations to find accurate information quickly and easily. Funnelback is used by organisations in Australia, North America and the UK. We have an extensive blue chip customer base including Commonwealth and State government agencies, over one third of all Universities in Australia, the Australian Broadcasting Corporation, Westpac Banking Corporation, the Australian Stock Exchange, CareerOne and ninemsn.
Deployment
- Hosted software
- A fully outsourced web site search service. Hardware, software upgrades, network traffic, all managed by Funnelback.
- Licensed software
- Software installed on your own server with a bundled web server for easy setup.
- Operating systems supported
- Windows Server, Linux (Red Hat), Solaris. Funnelback can run on a variety of platforms allowing you to leverage off of your existing systems.
Capability
- Combined full-text & metadata search
- Allows search over unstructured data with a sophisticated ranking algorithm returning high quality results. Use metadata to up weight results or deliver faceted and geospatial searching.
- Scalability
- Search many millions of documents with fast query response times.
- Federated Enterprise search
- Search many repositories enterprise-wide with a single query via a simple web interface. Separate indexes for each repository are created and users are able to choose which repositories they search over.
- Integration
- Integrates with Content Management Systems and Portals.
- Geospatial search
- Search results with spatial data presented on a map.
- Multimedia search
- Search audio and video metadata
- RSS or subscription search
- Allows users to keep up to date.
Content Repositories
- Intranet & internet
- Crawls HTTP and HTTPS sites with support for cookies.
- Different repository types
- Also indexes Oracle, MySQL, MS SQL Server, Lotus Notes, Domino, TRIM and SharePoint. Support for additional document sources can be developed as required.
- File shares / Shared drives
- Windows, NFS, Novell.
- Multiple file types supported
- Including HTML, XML, Text, PDF, Word, Power Point, Excel and RTF. Ability to plug in custom filters.

- Multiple western languages supported
- ISO-8859-I languages.
Search Features
- Search Key word searching
- Available across document content and metadata.
- Features Refine queries
- Modify the existing query by adding or removing words, search on the current result set only or use the word stemming operator (include plurals and other word forms) on the current query.
- Spelling suggestions
- Provides alternatives from the Funnelback dictionary.
- Thesaurus suggestions
- Provides a list of synonyms.
- Featured pages
- Specified pages are promoted when triggered by specific keywords.
- Ranking
- Documents are ranked by their relevance score which takes into account factors such as the relative rarity of the query terms, the frequency with which they occur in the document, the length of the document, URL length and anchor text.
- Customise ranking
- Boost the rankings of important sites or pages. Up-weight or down-weight metadata, search a query term in a URL or use the same site suppression feature to create a valuable diversity of results from different content.
Query Language
- Fielded search
- Search metadata fields sourced from the indexed documents. DC (Dublin Core), AGLS (Australian Government Locator Service) and Netscape metadata are supported by default or define your own which correspond to a URL prefix.
- Scoped search
- A customised search enabling results to be restricted based on a set of inclusion and exclusion patterns.
- Boolean search
- Boolean operators provide the user with a simple means to combine words, phrases and sub queries with Boolean operators, to either widen or narrow a result set.
- Phrase searching
- Use the phrase operator which requires component words to appear consecutively and in the order specified.
- Date query
- Allows user to constrain the results to documents which were last modified within a given date range.
- Word stemming
- Stemming (grammatical variations) allow the users to search for common variants of the specified word(s).
- Advanced querying
- A wildcard search uses truncation operators (‘*’) to provide results containing variations of the search terms. Proximity search shows results where query words appear near each other.
Security / Access
- Collection level security
- Security controls that restrict access to sensitive information in repositories and business applications to authorised users.
- TRIM, file share, intranet and document level security
- Funnelback supports NTLM authentications for searching TRIM and NTFS based shared drives.
- Access restriction
- Enables access to individual collections to be restricted on an IP address/hostname basis.
- Collection redirection
- Search users who are unauthorized to access a given collection can be redirected to a ‘public’ collection, which will contain publicly accessible content.
- Accessibility
- Funnelback conforms to AA Web content accessibility guidelines.
Administration Features
- Intuitive administration web interface
- Create, configure and monitor the status of managed collections.
- Crawler and indexer configuration
- Provides the ability to create new collections, specify spider rules and start/stop the crawling process, all from a single administrative web interface.
- HTTPS (for web spidering)
- Secure sites can be crawled by providing user credentials required by the web servers.
- Offline indexing
- An offline view is maintained for indexing purposes. This insures that search operations aren’t affected whilst the collection is being updated. Once it is updated it’s switched to the live view.
- Full/Incremental index update
- Updates to the collections can be triggered through the administrative web interface including; Full update (a complete crawl from scratch); Restart crawl from the latest checkpoint; Filter: extracts textual information from specific file formats; Index: indexes the existing offline crawl data and then swaps the live and offline views; and Swap views: swaps the live and offline views.
- Scheduled updates of indexes
- Provides the ability to schedule the updating of collections on an hourly, daily, weekly and monthly basis.
- Featured pages
- Hard coded or ‘best-bets’ results. The search administrator specifies a set of URLs to appear in the results page whenever specific keywords are entered.
- Query tuning / Expansion
- Provides the ability to expand query terms or create synonyms to match up with an organisation’s terminology.
- Click data
- Provides a mechanism for search users to influence the result relevance over a period of time. By simply monitoring the search patterns of users, and the documents which they go on to access, Funnelback can give a higher relevance ranking to these documents for the particular query.
- Inclusion and exclusion patterns
- Restrictions to the crawl are governed by a set of inclusion and exclusion patterns.
- Meta collections
- A seamless mechanism for searching over multiple collections. Users can transparently search up to 16 primary collections with a single query.
- External metadata indexing
- Assign metadata to distributed files using a central configuration. Assign additional metadata to other pages.
- Index freshness
- Crawls can be set to update only parts of the index which have changed since the last crawl and create a balance between freshness and infrastructure load.
- Design results pages
- Search results can be customised to match your required look and feel.
Advanced Administration
- Templates
- Use templates as a starting point to customize the search interfaces or developers can tailor the pages to suit individual organizations.
- Command line interface
- Administrators are provided with an extensive set of command line tools for advanced functionality, as an alternative to Funnelback’s web based interface.
- Execute external program and custom scripts
- via Funnelback’s search & update workflow process. This provides an excellent opportunity to customize the indexing process for those organizations whose business rules won’t work with the out of the box Funnelback architecture. Funnelback easily integrates with other systems.
Results & Reporting
- Results as HTML, XML or RSS.
- Tiers
- Results are grouped into tiers based on how many search constraints the results have satisfied. Tiers are sorted in descending order with regard to the number of constraints satisfied (e.g. fully matching, matching 1/ 2, etc.).
- Relevant summaries
- Results summaries relate to the current query with optional query highlighting. Fully customisable.
- Metadata summaries
- Summaries can include metadata fields. Extensive log & query reports - Monitor search behaviour and usage over a short or extended period of time. These include: monthly summary of queries, most commonly searched for keywords, top scopes
used in queries, query terms which return zero results, queries per hour and documents per site. Date based reporting is also available.
- Data reports
- Provide statistics and reports on data gathered during the update process.
- Alternate sort specification
- Results are displayed in descending order of relevance, determined by Funnelback’s superior ranking algorithm. Alternatively results can be sorted by date, title, size, URL as well as any other specified metadata class, in ascending or descending order.
- Document level highlighting
- Provides a cached view of the selected result with search term highlighting.
- Document view
- Indexed documents are cached as part of the indexing process and are stored as plain text documents so binary formats such as PDF, Word etc. can be viewed as html through a standard browser. Search terms can also be highlighted with this document cached view.
- Fluster Clustering Engine
- Provides users with a set of related sub headings (clusters) related to their
search query. The clusters allow users to determine query refinement suggestions derived from the
content.
Customer Satisfaction
- Accessibility
- Access all content via a single web interface.
- Speed / Highly efficient
- Fast indexing and query processing.
- Customisable
- Every step of the Funnelback workflow can be expanded and customised.
- Integration
- Seamlessly integrates into your business interface.
- Leading edge technology
- The technology was developed by CSIRO, Australia’s premier scientific and industrial research organisation. Heavy investment in research and development continues.
- Tailored to your needs
- Funnelback resolves complex search problems which are not solved by a one size fits all system, no matter how large or small your organisation.
- Quality customer service and support
- A dedicated support team with access to research and development resources on site. 24/7 support packages available.


