What is a customer data platform
This guide explains what a customer data platform (CDP) is and how IT managers and venue operations directors can deploy one to convert guest WiFi infrastructure into a first-party data asset. It covers technical architecture, GDPR-compliant data capture, identity resolution, audience segmentation, and activation strategies, with measurable ROI benchmarks from hospitality and retail deployments including Harrods' 57x return on investment.
Listen to this guide
View podcast transcript
- Executive summary
- Technical deep-dive
- What a CDP actually does
- The data capture layer: guest WiFi as the ingestion point
- Identity resolution in practice
- CDP vs CRM vs DMP: choosing the right tool
- The Purple architecture: cloud overlay on existing hardware
- Implementation guide
- Best practices
- Troubleshooting & risk mitigation
- ROI & business impact

Executive summary
Your guest WiFi network already generates two types of data: anonymous presence data from device probe requests, and identified engagement data from captive portal authentications. A customer data platform (CDP) sits at the intersection of these two streams, resolving identities, building unified customer profiles, and activating those profiles across email, SMS, and paid media channels.
For IT managers and venue operations directors, this means the network infrastructure you have already deployed can become the primary engine for first-party data capture. Purple Engage captures verified guest email and phone data at login and automates marketing campaigns, integrating with hardware from Cisco Meraki, HPE Aruba, Ruckus, Juniper Mist, Ubiquiti UniFi, Cambium, Extreme, and Fortinet without requiring a hardware replacement. Harrods achieved a 57x return on investment by marketing to customers acquired through their guest WiFi network (Purple, 2026). This guide provides the technical architecture, implementation steps, and compliance framework to replicate that outcome at your venue.
Technical deep-dive
What a CDP actually does
A customer data platform (CDP) is a centralised system that ingests data from multiple sources, resolves identities, and builds persistent unified customer profiles that marketing and operations teams can activate. The CDP Institute defines a CDP as "packaged software that creates a persistent, unified customer database that is accessible to other systems" (CDP Institute, 2024). In a physical venue, the primary ingestion point is the Guest WiFi network.
The architecture has five functional layers:
| Layer | Function | Venue context |
|---|---|---|
| Data ingestion | Pulls data from WiFi, CRM, POS, mobile app | Captive portal authentication events |
| Identity resolution | Links MAC address, email, CRM ID to one profile | Returning guest recognised across visits |
| Profile unification | Merges all attributes into a single record | Visit history, dwell time, spend data |
| Audience segmentation | Builds cohorts from profile attributes | "Visited 3x in 30 days, dwell > 20 min" |
| Activation | Syncs segments to email, SMS, ad platforms | Automated re-engagement campaign |

The data capture layer: guest WiFi as the ingestion point
When a device enters your venue with its WiFi radio active, it broadcasts probe requests - the device asking the network whether a known access point is nearby. Every access point in range picks up the probe request, recording the device MAC address and signal strength. This is the foundation of presence analytics: anonymous footfall counting and dwell time calculation.
The complication is MAC randomisation. Since iOS 14 and Android 10, mobile devices rotate through temporary MAC addresses for probe requests. Platforms that do not correct for this overstate visitor counts significantly. Purple applies statistical correction models calibrated against camera ground truth, maintaining accuracy within 3% to 7% (Purple, 2026).
Engagement data begins when the user connects through the captive portal. The captive portal is the authentication gateway and the primary mechanism for capturing first-party data. The user provides a verified email address or phone number. The CDP links this identifier to the device session and, via identity resolution, to any existing CRM record. For a deeper look at how this works across multiple network segments, see Three SSIDs to rule them all: guest, Passpoint, and IoT WiFi .
Identity resolution in practice
Identity resolution is the process of linking multiple identifiers - MAC address, email, CRM ID, loyalty number - to a single individual. When a shopper authenticates via WiFi at your Manchester store, then purchases through your mobile app two days later, the CDP recognises both interactions as the same person. It merges the records and updates the unified profile.
This matters because fragmented data is the default state in most venues. The POS system holds transaction history. The CRM holds contact records. The WiFi network holds visit frequency and dwell time. Without identity resolution, these datasets cannot be joined. With it, you build a profile that reflects the full relationship: how often the person visits, how long they stay, what they buy, and which marketing messages they respond to.
CDP vs CRM vs DMP: choosing the right tool
A common source of confusion is the distinction between a CDP, a customer relationship management system (CRM), and a data management platform (DMP). These tools serve different purposes and operate on different data types.

A CRM manages known contacts and transactional relationships. It is built for sales and support teams. A DMP aggregates anonymous, cookie-based data for advertising targeting. It operates on short retention windows and is increasingly constrained by cookie deprecation. A CDP unifies first-party, consent-based data into persistent profiles. It is built for marketing and IT teams working together, and it is the only tool in this group that meets the GDPR standard for consent-based marketing activation.
The practical implication: if you already have a CRM, a CDP does not replace it. The CDP ingests data from the CRM, enriches it with behavioural data from WiFi and other channels, and syncs the enriched profiles back. The two systems work together.
The Purple architecture: cloud overlay on existing hardware
Purple operates as a cloud overlay. You do not replace your access points. You configure your existing Cisco Meraki, HPE Aruba, Ruckus, Juniper Mist, Ubiquiti UniFi, Cambium, Extreme, or Fortinet hardware to direct authentication traffic to the Purple cloud. The platform delivers the captive portal, captures the data, and manages the API integrations with your downstream systems.
The data flows out via standard REST APIs or webhooks into your CRM, POS system, or business intelligence platform. Purple has processed 440 million logins in 2024 across 80,000+ live venues, and operates at 99.999% uptime (Purple, 2026). The platform is ISO 27001 certified, GDPR compliant, and holds Cyber Essentials and B Corp certification.
Implementation guide
Deploying a CDP in a physical venue requires coordination between IT, marketing, and operations teams. The following steps apply regardless of which hardware vendor you use.
Step 1: Audit your current infrastructure. Map every access point in your venue against the supported hardware list. Confirm firmware versions are current. Identify which SSID will carry guest traffic and which will carry staff traffic. These should be separate - see Guest WiFi for SSID segmentation guidance.
Step 2: Design the captive portal flow. The captive portal is your primary data ingestion point. Keep the login flow to three steps or fewer. Ask for the minimum viable data - usually email or phone number. Use Purple Verify to validate email addresses at the point of capture. A validated email list reduces bounce rates and improves deliverability for downstream campaigns.
Step 3: Configure consent management. GDPR requires explicit, granular consent for marketing communications. Your captive portal must present a clear opt-in mechanism that is separate from the terms of service acceptance. The CDP stores the consent state against each profile and enforces it across all activation channels. Do not conflate network access consent with marketing consent - these are two separate legal bases.
Step 4: Integrate downstream systems. Data sitting in a dashboard does not generate value. Configure the API integrations to push data into your CRM and marketing automation platforms. Purple supports standard REST API and webhook integrations. Prioritise the CRM integration first - this is where the identity resolution loop closes. For SMS marketing automation, see How to leverage tools for SMS marketing to increase return visits .
Step 5: Optimise access point placement for analytics. If you want accurate location analytics to enrich CDP profiles, design your network for density, not just coverage. Place access points on zone boundaries to enable signal triangulation. The rule of thumb is one access point per 150 to 200 square metres in open-plan environments. In retail environments with high fixture density, reduce this to one per 100 square metres.
Step 6: Define your activation segments. Before launch, agree with the marketing team on the first three audience segments you will activate. Common starting points are: first-time visitors (trigger a welcome offer), lapsed visitors (trigger a re-engagement campaign), and high-frequency visitors (trigger a loyalty programme invitation). These segments should be defined in the CDP and synced to the marketing automation platform before the network goes live.
Best practices
Use first-party data as your primary acquisition channel. First-party data captured via WiFi drives 2.9x higher marketing ROI than third-party data (Purple, 2026) and is not subject to cookie deprecation. Optimise your captive portal conversion rate before investing in additional data sources.
Automate audience suppression. Excluding known contacts from paid acquisition campaigns is the highest-ROI CDP use case. Sync your CDP audience to Google Ads and Meta Ads as a suppression list. This eliminates 10% to 20% of wasted ad spend from week one.
Validate data at the point of capture. A contact list with 30% invalid email addresses is not an asset - it is a liability. Use Purple Verify to validate email addresses at login. This maintains deliverability rates above 95% and protects your sender reputation.
Segment by behaviour, not demographics. Demographic segments (age, gender) are less predictive than behavioural segments (visit frequency, dwell time, zone affinity). Build your initial segments around behavioural signals. A visitor who has been to your venue four times in the last 30 days is a better loyalty programme candidate than one who matches a demographic profile.
Run a Data Protection Impact Assessment (DPIA) before deploying presence analytics. Presence analytics - tracking anonymous devices across your venue - operates under the legitimate interest legal basis under GDPR. A DPIA is required to document this basis. Purple's compliance team can provide a template. For healthcare and transport venues, additional sector-specific requirements apply.
Troubleshooting & risk mitigation
Low captive portal conversion rate. If fewer than 40% of connected devices complete the portal login, the flow is too complex. Reduce the number of form fields. Test the portal on both iOS and Android devices - rendering differences are a common cause of abandonment. Offer a clear value exchange: free WiFi in exchange for an email address.
Duplicate profiles in the CDP. Duplicate profiles indicate that identity resolution is not working correctly. The most common cause is inconsistent email formatting (e.g., " John.Smith@example.com " vs " john.smith@example.com "). Ensure the CDP normalises email addresses to lowercase before matching. Purple's identity resolution engine applies this normalisation by default.
MAC randomisation inflating visitor counts. If your presence analytics show visitor counts significantly higher than your actual footfall, MAC randomisation is likely the cause. Purple's statistical correction model addresses this, but you should validate against a ground truth source (e.g., door counter or CCTV analytics) during the first 30 days of deployment.
GDPR consent not propagating to activation channels. If a contact has opted out of marketing but is still receiving campaigns, the consent state is not being respected by the activation channel. Audit the API integration between the CDP and your marketing automation platform. Ensure the consent field is included in every sync and that the marketing platform honours opt-out flags.
Data not flowing to CRM. Check the webhook endpoint URL and authentication credentials. Confirm that the CRM API rate limits are not being exceeded. Purple's integration logs provide a per-event audit trail that identifies the point of failure.
ROI & business impact
The business case for a CDP rests on three measurable outcomes: increased marketing ROI, reduced acquisition cost, and improved operational efficiency.
Increased marketing ROI. Harrods achieved a 57x return on investment by marketing to customers acquired through their guest WiFi network (Purple, 2026). This figure reflects the value of a clean, consented, first-party data asset combined with a marketing automation workflow that acts on it. A more conservative baseline for a well-configured deployment is a two to five times return in the first year.
Reduced acquisition cost. Audience suppression - excluding known contacts from paid acquisition campaigns - eliminates wasted spend on customers you already have. A venue with 50,000 known contacts suppressing these from a paid campaign with a 15% overlap saves the cost of 7,500 impressions per campaign cycle.
Improved operational efficiency. Location analytics derived from WiFi data can optimise staffing levels, reduce queue times, and improve the overall visitor experience. For hospitality venues, this translates directly to guest satisfaction scores and repeat booking rates.
Measuring success. Define your KPIs before deployment. The three metrics that matter most are: Captive Portal conversion rate (target: 40% or above), email campaign open rate for WiFi-acquired contacts (target: 25% or above), and repeat visit rate for the first-time visitor cohort (target: 30% within 90 days).
For more on how Purple's WiFi Analytics platform delivers these outcomes, and to explore deployment options for your specific vertical, visit the Guest WiFi product page or speak to our team.
Key Definitions
Customer data platform (CDP)
A centralised system that ingests data from multiple sources, resolves identities, and builds persistent unified customer profiles accessible to marketing and operations teams for activation.
When IT teams need to unify fragmented data from WiFi, CRM, POS, and mobile app into a single actionable record. The CDP Institute defines it as "packaged software that creates a persistent, unified customer database that is accessible to other systems" (CDP Institute, 2024).
Identity resolution
The process of linking multiple identifiers - MAC address, email address, CRM ID, loyalty number - to a single individual across touchpoints and time.
When a customer interacts with a brand across multiple channels (WiFi, mobile app, POS) and the data needs to be merged into one profile. Without identity resolution, cross-channel analytics and personalisation are not possible.
Captive Portal
The authentication gateway that users interact with when connecting to a Guest WiFi network. It is the primary mechanism for capturing first-party data and obtaining marketing consent.
When venues need to capture email addresses and phone numbers at the point of network access. The portal design directly determines the conversion rate and data quality of the CDP.
MAC randomisation
A privacy feature in mobile devices (introduced in iOS 14 and Android 10) that rotates the MAC address used for probe requests, preventing persistent device tracking by access points.
When calculating anonymous footfall and dwell time, MAC randomisation inflates visitor counts if not corrected. Purple applies statistical correction models calibrated against camera ground truth to maintain accuracy within 3% to 7%.
Audience segmentation
The process of dividing a customer base into distinct cohorts based on shared behavioural characteristics, such as visit frequency, dwell time, zone affinity, or purchase history.
When marketing teams need to deliver targeted campaigns to specific groups. Behavioural segments derived from WiFi data are more predictive than demographic segments for re-engagement and loyalty activation.
First-party data
Information collected directly from customers with their explicit consent, via owned channels such as captive portals, mobile apps, and email sign-ups.
The primary data type for a CDP. First-party data is not subject to cookie deprecation and meets GDPR consent requirements. Purple's Engage plan captures verified first-party data at the point of WiFi authentication.
Cloud overlay
A software architecture that integrates with existing hardware infrastructure without requiring replacement. The overlay handles authentication, data capture, and API integrations as a managed cloud service.
Purple's deployment model. You configure your existing Cisco Meraki, HPE Aruba, Ruckus, Juniper Mist, Ubiquiti UniFi, Cambium, Extreme, or Fortinet access points to direct authentication traffic to the Purple cloud.
Audience suppression
The practice of uploading a known customer list to an ad platform (e.g., Google Ads, Meta Ads) to exclude those contacts from paid acquisition campaigns, preventing spend on customers you already have.
Typically the highest-ROI CDP use case in the first weeks of deployment. A venue with 50,000 known contacts and a 15% ad audience overlap saves the cost of 7,500 impressions per campaign cycle.
Presence analytics
The analysis of anonymous device probe requests to calculate footfall, dwell time, and zone occupancy without requiring user authentication.
The baseline analytics capability of a WiFi network. Presence analytics operates under the legitimate interest legal basis under GDPR. A Data Protection Impact Assessment (DPIA) is required before deployment.
Conscious-choice opt-in
An explicit, active consent mechanism where the user makes a deliberate choice to opt in to marketing communications, as distinct from a pre-ticked box or implied consent.
Required by GDPR for marketing communications. Purple's captive portal implements conscious-choice opt-ins as the default consent mechanism, storing the consent state against each profile.
Worked Examples
A 300-room Premier Inn property wants to reduce OTA commission dependency and increase direct bookings. They have an existing HPE Aruba network but no unified view of their guests across visits.
Deploy Purple Engage as a cloud overlay on the HPE Aruba access points. Configure the captive portal to capture guest email addresses with explicit marketing consent. Integrate the CDP with the property management system (PMS) via REST API to link WiFi session data to reservation records. Set up an automated email campaign to trigger three days after checkout, offering a 10% discount on the guest's next direct booking. Create a second segment for guests who have stayed more than twice in the past six months and sync this to the loyalty programme invitation flow. The entire deployment runs on existing hardware - no access point replacement required.
A retail chain with 50 locations wants to understand cross-store shopping behaviour and suppress known customers from paid acquisition campaigns to reduce wasted ad spend.
Implement Purple across all 50 locations using the existing Cisco Meraki access points. Configure the captive portal with a single-field email capture form and Purple Verify validation. Use identity resolution to link MAC addresses to email addresses across all locations, building a unified profile that records which stores each shopper has visited and their dwell time per zone. Create an audience segment for all customers who have authenticated at any location in the past 90 days. Sync this segment to Google Ads and Meta Ads as a suppression list via the Purple API. Separately, create a high-frequency segment (visited 3 or more locations in 30 days) and activate this with a personalised cross-store offer via email. Measure the reduction in cost-per-acquisition against the pre-suppression baseline.
Practice Questions
Q1. A stadium operations director wants to use WiFi analytics to manage crowd flow and reduce queue times at concession stands. The marketing team simultaneously wants to capture fan email addresses for post-event campaigns. The network currently runs on Ruckus access points with no captive portal. How do you design the deployment to satisfy both requirements without compromising either?
Hint: Consider the difference between presence analytics (anonymous) and engagement data (authenticated), and the role of access point density in enabling zone-level analytics.
View model answer
Deploy Purple as a cloud overlay on the existing Ruckus access points. Configure the network for high access point density - one per 100 to 150 square metres in the concourse areas - to enable accurate zone-level dwell time analytics for the operations team. The presence analytics layer operates on anonymous probe requests and does not require fan authentication, so it is available immediately. For the marketing team, configure a captive portal on the Guest WiFi SSID with a single-field email capture form and explicit marketing consent opt-in. The two data streams - anonymous presence and identified engagement - are processed in parallel by the CDP. The operations team gets real-time crowd density dashboards. The marketing team gets a growing first-party contact list for post-event campaigns. Run a DPIA before deployment to document the legitimate interest basis for presence analytics.
Q2. Your venue is experiencing a captive portal conversion rate of 22% - well below the 40% target. Users connect to the network but do not complete the authentication process. What are the three most likely causes and how do you diagnose and resolve each one?
Hint: Think about the user experience on mobile devices, the value exchange, and the technical rendering of the portal across iOS and Android.
View model answer
The three most likely causes are: (1) A complex login flow with too many form fields. Diagnose by reviewing the portal analytics for the step at which users drop off. Resolve by reducing the form to a single email field and removing any non-essential fields. (2) Poor rendering on iOS or Android. Diagnose by testing the portal on both operating systems. iOS and Android handle captive portal detection differently, and rendering issues are common. Resolve by testing across device types and adjusting the portal template. (3) An unclear or weak value exchange. If users do not understand what they receive in exchange for their email address, they will not complete the form. Resolve by adding a clear headline to the portal - for example, 'Connect in seconds. Get exclusive offers.' - and ensuring the benefit is visible above the fold on a mobile screen.
Q3. A retail client operating 30 stores on Ubiquiti UniFi hardware asks you to explain how MAC randomisation will affect their visitor analytics and what Purple does to address it. They are concerned that their footfall data will be unreliable.
Hint: Differentiate between the impact on anonymous presence analytics and the impact on identified engagement data captured via the captive portal.
View model answer
MAC randomisation affects anonymous presence analytics - the footfall and dwell time data derived from device probe requests - because devices rotate their MAC addresses, making it appear that more unique devices are present than there actually are. This inflates visitor counts if not corrected. Purple addresses this with statistical correction models calibrated against camera ground truth, maintaining accuracy within 3% to 7%. For the client's identified engagement data - the profiles built from captive portal authentications - MAC randomisation has no impact. Once a user authenticates, the CDP links their session to their verified email address. On subsequent visits, even if the device presents a different MAC address, the CDP recognises the returning user via the email identifier when they re-authenticate. The practical implication is that presence analytics should be treated as an approximation for footfall counting, while authenticated session data provides the reliable basis for marketing activation and repeat visit tracking.
Q4. A conference centre wants to deploy a CDP to capture attendee data across multiple events throughout the year. Each event has a different organiser, and the centre is concerned about data ownership and GDPR compliance. How do you structure the deployment?
Hint: Consider the data controller and data processor roles under GDPR, and how Purple's Multi-Tenant WiFi capability applies to this scenario.
View model answer
Structure the deployment using Purple's Multi-Tenant WiFi capability. The conference centre is the data controller for the underlying network infrastructure and the venue-level analytics (footfall, dwell time). Each event organiser is a separate data controller for the attendee data captured during their event. Configure separate SSIDs or VLAN segments for each event, with event-specific captive portals that clearly identify the event organiser as the data controller in the consent language. The conference centre retains access to anonymised aggregate analytics. The event organiser receives the first-party contact data captured during their event. Document the data controller and data processor relationships in a Data Processing Agreement (DPA) between the conference centre and each organiser. Run a DPIA for the venue-level presence analytics. This structure ensures GDPR compliance while allowing the conference centre to build its own first-party dataset from repeat attendees who consent to communications from the venue itself.