Cisco Launched Cisco Time Collection Mannequin: Their First Open-Weights Basis Mannequin primarily based on Decoder-only Transformer Structure

Cisco and Splunk have launched the Cisco Time Collection Mannequin, a univariate zero shot time sequence basis mannequin designed for observability and safety metrics. It’s launched as an open weight checkpoint on Hugging Face underneath an Apache 2.0 license, and it targets forecasting workloads with out process particular high-quality tuning. The mannequin extends TimesFM 2.0 with an express multiresolution structure that fuses coarse and high-quality historical past in a single context window.

Why observability wants multiresolution context?

Manufacturing metrics will not be easy single scale alerts. Weekly patterns, long run progress and saturation are seen solely at coarse resolutions. Saturation occasions, site visitors spikes and incident dynamics present up at 1 minute or 5 minute decision. The frequent time sequence basis fashions work at a single decision with context home windows between 512 and 4096 factors, whereas TimesFM 2.5 extends this to 16384 factors. For 1 minute information this nonetheless covers at most a few weeks and infrequently much less.

This can be a downside in observability the place information platforms usually retain solely outdated information in aggregated type. Effective grained samples expire and survive solely as 1 hour rollups. Cisco Time Collection Mannequin is constructed for this storage sample. It treats coarse historical past as a firstclass enter that improves forecasts on the high-quality decision. The structure operates instantly on a multiresolution context as an alternative of pretending that each one inputs reside on a single grid.

Multiresolution enter and forecasting goal

Formally, the mannequin consumes a pair of contexts, (x_c, x_f). The coarse context (x_c) and the high-quality context (x_f) every have size as much as 512. The spacing of (x_c) is fastened at 60 occasions the spacing of (x_f). A typical observability setup makes use of 512 hours of 1 hour aggregates and 512 minutes of 1 minute values. Each sequence terminate on the identical forecast reduce level. The mannequin predicts a horizon of 128 factors on the high-quality decision, with a imply and a set of quantiles from 0.1 to 0.9.

Structure, TimesFM core with decision embeddings

Internally, Cisco Time Collection Mannequin reuses the TimesFM patch primarily based decoder stack. The inputs are normalized, patched into non overlapping chunks, and handed by way of a residual embedding block. The transformer core consists of fifty decoder solely layers. A closing residual block maps tokens again to the horizon. The analysis crew take away positional embeddings and as an alternative depend on patch ordering, the multiresolution construction and a brand new decision embedding to encode construction.

Two additions make the structure multiresolution conscious. A particular token, usually referred to as ST within the report, is inserted between the coarse and high-quality token streams. It lives in sequence area and marks the boundary between resolutions. Decision embeddings, usually referred to as RE, are added in mannequin area. One embedding vector is used for all coarse tokens and one other for all high-quality tokens. Ablation research within the paper present that each parts enhance high quality, particularly in lengthy context eventualities.

The decode process can also be multiresolution. The mannequin outputs imply and quantile forecasts for the high-quality decision horizon. Throughout lengthy horizon decoding, newly predicted high-quality factors are appended to the high-quality context. Aggregates of those predictions replace the coarse context. This creates an autoregressive loop through which each resolutions evolve collectively throughout forecasting.

Coaching information and recipe

Cisco Time Collection Mannequin is skilled by continued pretraining on prime of TimesFM weights. The ultimate mannequin has 500 million parameters. Coaching makes use of AdamW for biases, norms and embeddings, and Muon for the hidden layers, with cosine studying charge schedules. The loss combines imply squared error on the imply forecast with quantile loss over the quantiles from 0.1 to 0.9. The crew trains for 20 epochs and picks one of the best checkpoint by validation loss.

The dataset is massive and skewed towards observability. The Splunk crew stories about 400 million metrics time sequence from their very own Splunk Observability Cloud deployments, collected at 1 minute decision over 13 months and partly aggregated to five minute decision. The analysis crew states that the ultimate corpus accommodates greater than 300 billion distinctive information factors, with about 35 % 1 minute observability, 16.5 % 5 minute observability, 29.5 % GIFT Eval pretraining information, 4.5 % Chronos datasets and 14.5 % artificial KernelSynth sequence.

Benchmark outcomes on observability and GIFT Eval

The analysis crew consider the mannequin on two primary benchmarks. The primary is an observability dataset derived from Splunk metrics at 1 minute and 5 minute decision. The second is a filtered model of GIFT Eval, the place datasets that leak TimesFM 2.0 coaching information are eliminated.

On observability information at 1 minute decision with 512 high-quality steps, Cisco Time Collection Mannequin utilizing a 512 multiresolution context reduces imply absolute error from 0.6265 for TimesFM 2.5 and 0.6315 for TimesFM 2.0 to 0.4788, with comparable enhancements in imply absolute scaled error and steady ranked chance rating. Related positive factors seem at 5 minute decision. Throughout each resolutions, the mannequin outperforms Chronos 2, Chronos Bolt, Toto and AutoARIMA baselines underneath the normalized metrics used within the paper.

On the filtered GIFT Eval benchmark, Cisco Time Collection Mannequin matches the bottom TimesFM 2.0 mannequin and performs competitively with TimesFM-2.5, Chronos-2 and Toto. The important thing declare is just not common dominance however preservation of normal forecasting high quality whereas including a robust benefit on lengthy context home windows and observability workloads.

Key Takeaways

Cisco Time Collection Mannequin is a univariate zero shot time sequence basis mannequin that extends the TimesFM 2.0 decoder solely spine with a multiresolution structure for observability and safety metrics.
The mannequin consumes a multiresolution context, with a rough sequence and a high-quality sequence, every as much as 512 steps lengthy, the place the coarse decision is 60 occasions the high-quality decision, and it predicts 128 high-quality decision steps with imply and quantile outputs.
Cisco Time Collection Mannequin is skilled on greater than 300B information factors, with greater than half from observability, mixing Splunk machine information, GIFT Eval, Chronos datasets and artificial KernelSynth sequence, and it has about 0.5B parameters.
On observability benchmarks at 1 minute and 5 minute resolutions, the mannequin achieves decrease error than TimesFM 2.0’s, Chronos and different baselines, whereas retaining aggressive efficiency on the overall objective GIFT Eval benchmark.

Try the Paper, Weblog and Mannequin Card on HF. Be happy to take a look at our GitHub Web page for Tutorials, Codes and Notebooks. Additionally, be happy to observe us on Twitter and don’t overlook to affix our 100k+ ML SubReddit and Subscribe to our Publication. Wait! are you on telegram? now you may be part of us on telegram as nicely.

Asif Razzaq is the CEO of Marktechpost Media Inc.. As a visionary entrepreneur and engineer, Asif is dedicated to harnessing the potential of Synthetic Intelligence for social good. His most up-to-date endeavor is the launch of an Synthetic Intelligence Media Platform, Marktechpost, which stands out for its in-depth protection of machine studying and deep studying information that’s each technically sound and simply comprehensible by a large viewers. The platform boasts of over 2 million month-to-month views, illustrating its recognition amongst audiences.

🙌 Comply with MARKTECHPOST: Add us as a most well-liked supply on Google.

Sample Page Title

Why observability wants multiresolution context?

Multiresolution enter and forecasting goal

Structure, TimesFM core with decision embeddings

Coaching information and recipe

Benchmark outcomes on observability and GIFT Eval

Key Takeaways

Related Articles

XRP Prepared For Subsequent Bull Run? Here is How This Analyst Arrived At $13 Goal

Market Session Meter Indicator – Buying and selling Methods – 26 April 2026

Subsequent Candle Predictor Indicator MT4

LEAVE A REPLY Cancel reply

Latest Articles

XRP Prepared For Subsequent Bull Run? Here is How This Analyst Arrived At $13 Goal

Market Session Meter Indicator – Buying and selling Methods – 26 April 2026

Subsequent Candle Predictor Indicator MT4

Caregiving burnout: What to know

12 Key Issues Christians Ought to Suppose About Earlier than Selecting Cremation

EDITOR PICKS

XRP Prepared For Subsequent Bull Run? Here is How This Analyst...

Market Session Meter Indicator – Buying and selling Methods – 26...

Subsequent Candle Predictor Indicator MT4

POPULAR POSTS

Qubic’s Mining Pool Attacking Monero Falls Beneath Assault

Feedback on the brand new buying and selling dialog in Metatrader...

What’s nano-texture glass and do I would like it?

POPULAR CATEGORY