Skip to main content

Quels types de données client le WiFi peut-il capturer ?

Ce guide faisant autorité détaille les quatre catégories principales de données client capturées par les plateformes WiFi d'entreprise : identité, comportementales, déclarées et métadonnées d'appareil. Il fournit des conseils pratiques en matière d'architecture, de conformité et de déploiement pour les leaders informatiques afin de transformer l'infrastructure de réseau invité en un actif de données de première partie sécurisé.

📖 4 min de lecture📝 986 mots🔧 2 exemples3 questions📚 8 termes clés

🎧 Écouter ce guide

Voir la transcription
What Types of Customer Data Can WiFi Capture? — A Purple Intelligence Briefing [INTRODUCTION — approx. 1 minute] Welcome to the Purple Intelligence Briefing. I'm your host, and today we're cutting straight to a question that comes up in almost every enterprise WiFi conversation: what types of customer data can a WiFi platform actually capture, and how do you turn that raw signal into something commercially useful? Whether you're running a hotel group, a retail estate, a stadium, or a public-sector estate, the answer to that question shapes your entire data strategy. Get it right, and your guest WiFi becomes one of the most valuable first-party data assets in your business. Get it wrong, and you're either leaving intelligence on the table or — worse — creating a compliance liability. So let's get into it. We'll cover the four core data categories, the technical architecture behind the capture, what good looks like in practice, and the pitfalls that catch organisations out. This is a ten-minute briefing, so we'll move at pace. [TECHNICAL DEEP-DIVE — approx. 5 minutes] Let's start with the fundamentals. When a guest or visitor connects to your WiFi network, the interaction creates multiple data signals across four distinct categories. Understanding these categories is the foundation of any intelligent WiFi deployment. The first category is identity data — sometimes called declared identifier data. This is what the user actively provides at the point of authentication. On a guest WiFi platform like Purple, that happens at the captive portal, or splash page. The user sees a branded login screen and chooses how to authenticate: via email, mobile number, or a social login through Facebook, Google, or Apple. Each method yields a different identifier. Email gives you a verified contact address. Phone number gives you an SMS-capable channel. Social login gives you a richer profile — potentially including age range, location, and interests — depending on the permissions the user grants. The key technical point here is that this is first-party data. The user has actively consented to share it with your organisation, in exchange for network access. That consent event is logged with a timestamp, IP address, and the specific terms presented — which is exactly what GDPR Article 7 requires you to be able to demonstrate. Purple's platform handles that consent audit trail automatically, which removes a significant compliance burden from your IT and legal teams. The second category is behavioural data, and this is where WiFi analytics really differentiates itself from other data sources. Behavioural data is derived from the network interactions of connected devices — it doesn't require the user to do anything beyond staying connected. The most commercially valuable behavioural signals are dwell time, visit frequency, and zone-level movement. Dwell time is the duration a device remains associated with the network. In a retail environment, a dwell time of twelve minutes in a specific department correlates strongly with purchase intent. In a hotel lobby, a spike in dwell time at 11pm might indicate a bar revenue opportunity. Visit frequency tells you whether a guest is a first-timer or a loyal returner — and the delta between those two segments is where your CRM strategy lives. Zone-level movement data comes from triangulating signal strength across multiple access points. This is where the architecture matters. A single access point gives you presence data — you know the device is on the network. Multiple access points, properly positioned and calibrated, give you location data — you know which zone of the venue the device is in. This is the foundation of indoor positioning, and it's what separates a basic guest WiFi deployment from a genuine analytics platform. If you want to go deeper on the positioning architecture, there's a detailed guide on the Purple blog covering UWB, BLE, and WiFi-based indoor positioning systems that's worth reading alongside this. The third category is declared data — information the user explicitly provides beyond their login identifier. This typically comes through post-connection surveys, preference capture forms, or in-session prompts. Examples include dietary preferences in a hospitality setting, product category interests in retail, or accessibility requirements in a public-sector venue. Declared data has the highest signal quality of any category because there's no inference involved — the user has told you directly. The challenge is capture rate. You need to design the data collection touchpoint carefully to maximise completion without creating friction that degrades the connection experience. The fourth category is device and network metadata. This is data generated by the device itself during the association process, and it includes the device's MAC address — or a randomised proxy of it, since iOS 14 and Android 10 introduced MAC randomisation — the device type, operating system version, and signal strength readings from each access point. This data is primarily useful for network operations: understanding device mix, diagnosing coverage gaps, and capacity planning. But it also feeds into behavioural analytics — knowing that 68% of your visitors are on iOS, for example, shapes your push notification strategy and your app development roadmap. Now, a word on MAC randomisation, because it's a topic that trips up a lot of network architects. Since 2020, both Apple and Google have implemented per-network MAC randomisation by default. This means the hardware MAC address a device presents to your network changes on each new connection, which breaks the traditional method of using MAC as a persistent device identifier for repeat visit tracking. The workaround is to anchor your persistent identifier to the authenticated user record — the email or phone number captured at the splash page — rather than the device MAC. This is how Purple's platform handles it, and it's the correct architectural approach. The MAC becomes a session-level identifier; the authenticated credential becomes the persistent one. [IMPLEMENTATION RECOMMENDATIONS AND PITFALLS — approx. 2 minutes] Let me give you three implementation principles that separate deployments that deliver ROI from those that don't. First: design your splash page for data quality, not just data volume. It's tempting to ask for everything — name, email, phone, date of birth, preferences — in a single form. Resist that. Conversion rates drop sharply with each additional field. The better approach is progressive profiling: capture the minimum at first connection, then enrich the profile over subsequent visits through targeted prompts. A hotel guest who connects three times in a week is a far better candidate for a preference survey than a first-time visitor. Second: segment your data collection by venue type from day one. A retail deployment and a hospitality deployment have fundamentally different data priorities. In retail, dwell time and zone movement are the primary value drivers. In hospitality, repeat visit frequency and declared preferences drive the most revenue. Configure your analytics dashboards and your CRM integrations to reflect those priorities rather than using a one-size-fits-all template. Third, and this is the one most organisations get wrong: build your GDPR compliance architecture before you go live, not after. The five non-negotiables are: a documented lawful basis for each data type you collect — which for guest WiFi is almost always consent; a data minimisation policy that defines exactly what you capture and why; a retention schedule with automated deletion; a Subject Access Request workflow that can respond within the statutory 30-day window; and a breach notification protocol that meets the 72-hour ICO reporting requirement. Purple's platform automates the consent logging, SAR workflow, and retention scheduling components — but you still need the internal policies and the DPO sign-off. The most common pitfall I see is organisations deploying guest WiFi as an IT project rather than a data strategy project. The network goes live, users connect, and six months later someone in marketing asks "what data do we have?" and the answer is "not much, because nobody configured the analytics layer." Treat the data architecture as a day-one requirement, not a phase-two nice-to-have. [RAPID-FIRE Q&A — approx. 1 minute] Let me run through three questions that come up regularly. "Can we capture data from devices that don't connect to the network?" — No. Passive probe request monitoring was a common technique before MAC randomisation made it unreliable. For any meaningful data capture, the device needs to authenticate to your network. "Does social login give us access to the user's social media posts?" — No. Social login via OAuth gives you the profile fields the user consents to share — typically name, email, and profile picture. It does not give you access to their timeline, messages, or connections. "How does WiFi data integrate with our existing CRM?" — Most enterprise WiFi platforms, including Purple, support API-based CRM integration with platforms like Salesforce, HubSpot, and Microsoft Dynamics. The authenticated identifier — email or phone — is the join key. You push the behavioural and declared data from the WiFi platform into the CRM record, enriching your existing customer profiles with venue-level intelligence. [SUMMARY AND NEXT STEPS — approx. 1 minute] To wrap up: a well-deployed guest WiFi platform captures four categories of customer data — identity, behavioural, declared, and device metadata. Each category serves a different purpose, and the real value comes from combining them: knowing who your visitor is, how they behave in your venue, what they've told you about their preferences, and what device they're using. The architecture decisions that matter most are: anchoring persistent identity to authenticated credentials rather than MAC addresses; designing for progressive data enrichment rather than one-shot capture; and building your compliance framework before you go live. If you're evaluating a guest WiFi platform or looking to get more from an existing deployment, the Purple platform is built specifically around this data architecture. There are detailed guides on the Purple website covering data protection, analytics configuration, and integration patterns — links in the show notes. Thanks for listening. We'll be back with the next briefing shortly.

header_image.png

Résumé Exécutif

Pour les sites d'entreprise — des domaines de Retail aux groupes d' Hospitality — le WiFi invité est passé d'un service de base à un canal d'acquisition de données essentiel. Cependant, de nombreuses organisations déploient encore des réseaux sans fil comme de pures infrastructures informatiques, manquant l'opportunité de capturer des informations client de première partie à haute valeur ajoutée. Ce guide détaille les types exacts de données client qu'une plateforme Guest WiFi d'entreprise peut capturer, l'architecture technique requise pour le faire en toute sécurité, et les cadres de conformité nécessaires pour les protéger. Nous explorons les quatre catégories de données principales : identité, comportementales, déclarées et métadonnées d'appareil. Pour les CTO et les architectes réseau, l'objectif est clair : implémenter une couche WiFi Analytics robuste qui génère un ROI mesurable grâce à l'enrichissement du CRM, tout en adhérant strictement aux principes de minimisation des données et du GDPR.

Approfondissement Technique : Les Quatre Catégories de Données WiFi

Lorsqu'un utilisateur s'associe à un réseau sans fil d'entreprise, la plateforme peut capturer des données dans quatre catégories distinctes. Comprendre les mécanismes techniques et les limitations de chacune est essentiel pour un déploiement efficace.

1. Données d'Identité (Identifiants Déclarés)

Les données d'identité sont explicitement fournies par l'utilisateur pendant le processus d'authentification sur le Captive Portal (page de connexion). C'est le fondement de votre stratégie de données de première partie.

  • Adresse E-mail & Numéro de Téléphone : Capturés via des champs de formulaire standard. Ceux-ci servent d'identifiants persistants principaux pour l'intégration CRM.
  • Profil de Connexion Sociale : Capturé via l'intégration OAuth (par exemple, Facebook, Google, Apple). Selon le consentement de l'utilisateur, cela peut fournir des données de profil riches incluant le nom, la tranche d'âge et l'e-mail vérifié.

Note d'Architecture Technique : La capture des données d'identité doit être associée à un journal de consentement auditable. La plateforme doit enregistrer l'horodatage, l'adresse IP, l'adresse MAC et les Conditions Générales spécifiques présentées à l'utilisateur. L'architecture de Purple automatise cette journalisation pour assurer la conformité à l'Article 7 du GDPR.

data_categories_infographic.png

2. Données Comportementales (Analyse Réseau)

Les données comportementales sont dérivées passivement de l'interaction de l'appareil avec l'infrastructure réseau. Elles ne nécessitent aucune saisie active de l'utilisateur au-delà du maintien d'une connexion.

  • Présence & Temps de Permanence : La durée pendant laquelle un appareil reste associé au réseau. Des temps de permanence élevés dans des zones spécifiques (par exemple, un bar d'hôtel ou un présentoir de vente au détail) sont fortement corrélés à l'intention de conversion.
  • Fréquence & Récence des Visites : Suivi de l'écart entre les visites pour distinguer les nouveaux visiteurs des clients fidèles.
  • Mouvement au Niveau de la Zone : En triangulant les données de l'Indicateur de Force du Signal Reçu (RSSI) à travers plusieurs points d'accès, les plateformes peuvent cartographier les parcours des utilisateurs dans un espace physique. Pour une exploration plus approfondie de la technologie sous-jacente, consultez notre guide sur Système de Positionnement Intérieur : Guide UWB, BLE, & WiFi .

3. Données Déclarées (Profilage Progressif)

Les données déclarées vont au-delà de l'identité de base, capturant les préférences explicites directement de l'utilisateur. Ces données ont la plus haute qualité de signal car elles reposent sur une saisie directe plutôt que sur une inférence.

  • Réponses aux Enquêtes : Enquêtes post-authentification ou post-visite (par exemple, Net Promoter Score, commentaires sur les installations).
  • Capture des Préférences : Invites en session recueillant des intérêts spécifiques (par exemple, exigences alimentaires dans le Healthcare ou intérêts produits dans le commerce de détail).

4. Métadonnées d'Appareil et de Réseau

Ces données sont générées par le matériel et le système d'exploitation de l'appareil pendant le processus d'association 802.11.

  • Adresse MAC : L'identifiant matériel. Contrainte cruciale : Depuis iOS 14 et Android 10, la randomisation MAC par réseau est la valeur par défaut. Les adresses MAC ne peuvent plus être utilisées de manière fiable comme identifiants persistants entre les visites sans un enregistrement utilisateur authentifié.
  • Type d'Appareil & Version d'OS : Extraits de la chaîne HTTP User-Agent lors du rendu du portail ou via le fingerprinting DHCP.
  • Utilisation des Données : Métriques de débit (volume d'upload/download), qui aident à la planification de la capacité et à l'identification des utilisateurs gourmands en bande passante.

Guide d'Implémentation : Architecture pour la Capture de Données

Le déploiement d'un réseau WiFi axé sur les données nécessite des décisions architecturales qui équilibrent l'expérience utilisateur et le rendement des données.

Surmonter la Randomisation MAC

Le changement architectural le plus significatif de ces dernières années est la dépréciation de l'adresse MAC en tant qu'identifiant persistant. Pour suivre précisément les visites répétées, l'architecture doit ancrer le profil utilisateur à l'identifiant authentifié (e-mail/téléphone) plutôt qu'au matériel de l'appareil.

  1. Initiation de Session : L'appareil se connecte avec une MAC randomisée.
  2. Authentification : L'utilisateur fournit son e-mail via le Captive Portal.
  3. Liaison de Profil : La plateforme lie la session MAC randomisée actuelle au profil e-mail persistant.
  4. Visites Ultérieures : Si l'appareil présente une nouvelle MAC randomisée, l'utilisateur doit se réauthentifier (souvent de manière transparente via un flux utilisateur de retour ou une authentification basée sur le profil comme OpenRoaming) pour relier la session à son profil.

Profilage Progressif vs. Friction

Ne demandez pas chaque point de données lors de la première connexion. Les Captive Portals à forte friction souffrent de taux d'abandon élevés. Implémentez le profilage progressif : à mesure quek pour une adresse e-mail lors de la première visite, un numéro de téléphone lors de la troisième visite et une enquête de préférence lors de la cinquième visite.

Pour des conseils spécifiques sur la sécurisation de ces données une fois capturées, consultez Comment protéger les données clients collectées via le WiFi .

Bonnes pratiques et conformité

Considérez le WiFi invité comme un projet de stratégie de données, et non comme un simple déploiement informatique. La conformité doit être intégrée à l'architecture dès le premier jour.

gdpr_compliance_diagram.png

  1. Base légale et consentement : Assurez-vous que le Captive Portal sépare explicitement l'acceptation des conditions d'utilisation du consentement marketing. Les cases pré-cochées ne sont pas conformes au GDPR.
  2. Minimisation des données : Ne collectez que les données pour lesquelles vous avez un cas d'utilisation commercial. Si vous n'avez pas de stratégie de marketing par SMS, ne rendez pas obligatoire la collecte de numéros de téléphone.
  3. Rétention automatisée : Configurez la plateforme pour purger automatiquement les profils inactifs après une période définie (par exemple, 24 mois) afin de respecter les principes de limitation du stockage.
  4. Demandes d'accès des personnes concernées (SAR) : Assurez-vous que votre plateforme dispose d'un flux de travail automatisé pour exporter ou supprimer les données d'un utilisateur dans le délai légal de 30 jours sur demande.

ROI et impact commercial

Le ROI d'une plateforme d'analyse WiFi est mesuré par son intégration avec l'ensemble de la pile martech. En poussant les données d'identité, comportementales et déclarées via l'API vers des plateformes comme Salesforce ou HubSpot, les lieux peuvent déclencher des flux de travail automatisés. Par exemple, un pôle Transport peut automatiquement envoyer par e-mail une réduction pour le salon à un passager dont le temps de présence dépasse 45 minutes. L'impact commercial ultime est la conversion du trafic piéton anonyme en une base de données commercialisable et segmentée.

Termes clés et définitions

Captive Portal

A web page that a user of a public-access network is obliged to view and interact with before access is granted. It is the primary mechanism for capturing identity data and consent.

IT teams configure this to balance security, branding, and data capture requirements.

MAC Randomisation

A privacy feature in modern OSs (iOS, Android) where the device generates a temporary, random MAC address for each specific WiFi network it joins, preventing cross-network tracking.

This forces network architects to rely on authenticated user profiles rather than hardware identifiers for repeat visit tracking.

Dwell Time

The total duration a device remains continuously associated with the WiFi network or a specific zone within the network.

Used by operations and marketing to gauge engagement, queue lengths, or intent to purchase.

Progressive Profiling

The practice of collecting user data incrementally over multiple sessions rather than demanding all information during the initial interaction.

Crucial for maintaining high WiFi connection rates while still building rich customer profiles over time.

First-Party Data

Information a company collects directly from its customers and owns entirely, typically gathered via direct interactions like WiFi authentication.

Highly valuable as third-party cookies deprecate; it provides the most accurate and compliant foundation for marketing.

Received Signal Strength Indicator (RSSI)

A measurement of the power present in a received radio signal. Used in WiFi analytics to estimate the distance between a device and an access point.

The technical metric underlying zone-level movement tracking and indoor positioning.

Subject Access Request (SAR)

A mechanism under GDPR allowing individuals to request a copy of their personal data, or request its deletion.

IT must ensure the WiFi platform can easily query and export or purge specific user records to meet the 30-day compliance window.

Data Minimisation

The principle that a data controller should limit the collection of personal information to what is directly relevant and necessary to accomplish a specified purpose.

A core compliance requirement; prevents venues from hoarding unnecessary data that increases breach liability.

Études de cas

A 200-room hotel needs to increase direct bookings and reduce OTA (Online Travel Agency) commissions. They currently offer open, unauthenticated WiFi.

The hotel deploys a captive portal requiring email or social authentication. They implement progressive profiling: on the first connection, they capture email and marketing consent. On the third connection during the stay, a micro-survey captures the reason for travel (Business/Leisure). Post-checkout, the CRM uses the WiFi identity data to send a targeted 'Book Direct' offer for their next stay, bypassing the OTA.

Notes de mise en œuvre : This approach solves the 'anonymous guest' problem common with OTA bookings. By moving from open WiFi to authenticated access, the hotel captures the first-party data necessary to own the guest relationship. The use of progressive profiling prevents connection friction while still yielding rich segmentation data.

A large retail chain wants to measure the impact of a new store layout on customer engagement, but their current WiFi only tracks total daily connections.

The IT team upgrades the network to support zone-level analytics by calibrating multiple access points. They define virtual zones within the analytics platform corresponding to key departments. They can now measure not just presence, but 'Zone Dwell Time'. By comparing dwell times in the newly laid-out zones against historical benchmarks, they quantify the layout's impact on engagement.

Notes de mise en œuvre : This scenario highlights the shift from basic network metrics (connections) to commercial behavioural metrics (dwell time). It demonstrates how physical network architecture (AP density and placement) directly dictates the granularity of the data captured.

Analyse de scénario

Q1. Your marketing team wants to track how often specific customers return to your stadium over a season. The current network uses open access (no portal) and tracks MAC addresses. Why will this fail, and what must you change?

💡 Astuce :Consider recent changes in mobile operating system privacy features.

Afficher l'approche recommandée

It will fail due to MAC randomisation; modern devices present a different MAC address on subsequent visits, breaking the tracking. You must implement a captive portal to force authentication (e.g., via email or ticketing integration) and anchor the repeat visit tracking to that persistent user credential rather than the hardware MAC.

Q2. A venue director requests that the new WiFi splash page collects Name, Email, Phone, Date of Birth, Postcode, and Dietary Preferences to build a comprehensive CRM database immediately. How should the IT architect respond?

💡 Astuce :Balance data yield against the user experience and connection drop-off rates.

Afficher l'approche recommandée

The architect should advise against this due to the Friction vs. Yield trade-off. A 6-field form will cause massive connection abandonment. Instead, recommend progressive profiling: capture Name and Email on the first visit, and use subsequent visits to prompt for Phone or Dietary Preferences. Furthermore, under data minimisation principles, Date of Birth should not be collected unless there is a strict legal requirement (e.g., age-gated venues).

Q3. During a security audit, the compliance team asks how the WiFi platform proves that a user opted into marketing communications. What specific data points must the system be able to produce?

💡 Astuce :Think about the requirements of GDPR Article 7 regarding the demonstration of consent.

Afficher l'approche recommandée

The system must produce a definitive audit trail for that specific user. This includes the timestamp of the consent action, the IP address and MAC address used during the session, the exact version of the Terms & Conditions/Privacy Policy presented at that time, and the specific checkbox (which must have been actively opted-in, not pre-ticked) that the user interacted with.

Quels types de données client le WiFi peut-il capturer ? | Technical Guides | Purple