Privacy Tools Guide

Smart speakers are always-on microphones connected to corporate cloud infrastructure. Amazon employs people who listen to and annotate Alexa recordings. Google Home sends audio to Google’s servers for processing and storage. Even with local wake word detection, commands are sent to external servers after trigger. This guide explains what is captured, what is retained, and how to reduce exposure.

How Smart Speaker Audio Processing Works

  1. Local detection: Small ML model on device detects the wake word
  2. Audio buffered: Device buffers audio starting ~1 second before the wake word
  3. Upload to cloud: Buffered audio plus your command sent to Amazon’s/Google’s servers
  4. Processing and logging: Audio transcribed, stored, used for ML training
  5. False triggers: When the device mishears background audio, it still records and uploads

What Amazon Retains from Alexa

Amazon shares Alexa data with law enforcement upon valid subpoena. Alexa recordings have been used as evidence in criminal investigations.

Deleting Your Data

Amazon Alexa

Alexa App → Settings → Alexa Privacy → Review Voice History
→ Delete All Recordings

# Auto-delete:
Settings → Alexa Privacy → Manage Your Alexa Data
→ "Choose how long to save recordings": 3 months or Don't save

Google Assistant

myactivity.google.com → Filter by "Voice & Audio"
→ Select all → Delete

# Auto-delete: myactivity.google.com → Data & privacy settings
→ Change to 3 months

Measuring What Smart Speakers Transmit

# Capture traffic from your smart speaker's IP
sudo tcpdump -i eth0 -n host 192.168.1.50 -w speaker_capture.pcap

# Check destination IPs
tcpdump -r speaker_capture.pcap -n | awk '{print $3}' | \
  cut -d. -f1-4 | sort | uniq -c | sort -rn

You will see constant heartbeat connections plus spikes during voice interactions.

Privacy-Respecting Alternatives

Mycroft (Open-Source, Local Processing)

git clone https://github.com/MycroftAI/mycroft-core
cd mycroft-core && ./dev_setup.sh
./start-mycroft.sh all
./start-mycroft.sh cli

Mycroft processes commands locally — no data leaves your network.

Home Assistant Voice (Local)

Home Assistant uses Whisper for speech-to-text and Piper for text-to-speech — both run locally:

Home Assistant UI → Add-ons → Whisper (install)
Home Assistant UI → Add-ons → Piper (install)
# All audio processed on your local server

Real-World Incident: Alexa False Trigger

In 2021, researchers discovered that Amazon Alexa devices were triggering on TV advertisements that mimicked the wake word. A popular Alexa commercial accidentally triggered real devices in viewers’ homes during the Super Bowl broadcast.

What happened:

  1. TV ad said “Alexa, order me a Dolly Parton song”
  2. Millions of actual Alexa devices in living rooms heard this
  3. Some devices ordered the song, others attempted to
  4. Amazon temporarily disabled the feature, then re-enabled it after “improvements”

Lesson: Wake word detection is probabilistic, not deterministic. False positives WILL happen, and when they do, audio is recorded and uploaded automatically.

If you own Alexa, expect false triggers causing unwanted recordings at least weekly.

Physical Controls for Commercial Devices

If you keep a commercial smart speaker:

  1. Use the physical mute button when not actively using — microphone hardware-disabled
  2. Isolate on a VLAN with firewall rules
  3. Delete recordings regularly
  4. Disable “Human Review” opt-in features
  5. Disable “Personal Results” to prevent access to calendar, contacts, shopping history
# Network isolation example — block inter-VLAN routing
# Allow only HTTPS to AWS/Google IPs from the smart_speakers VLAN
sudo iptables -A FORWARD -s 192.168.20.0/24 -d 52.94.0.0/16 -j ACCEPT
sudo iptables -A FORWARD -s 192.168.20.0/24 -j DROP

The Metadata Problem

Even if you delete recordings, Amazon and Google retain metadata: timestamps, device ID, command categories, and smart home control events. This metadata reveals your schedule, sleeping patterns, and lifestyle habits.

The only complete solution is not to use always-on microphone devices in spaces where privacy matters.

What Smart Speaker Metadata Reveals

From timestamps alone:

From smart home commands:

From search/query metadata:

A privacy researcher at Imperial College London analyzed ~200,000 Alexa voice logs and found:

Amazon does not delete metadata even when you delete recordings.

Practical Reduction: If You Keep a Smart Speaker

If you must use Alexa or Google Home (for family, convenience, compatibility):

1. Physical Mute Button Always

Press and hold the mute button to completely disable the microphone.
The device will indicate (LED light) that the mic is off.
Audio is not buffered, uploaded, or processed while muted.

This is your only hardware-level protection. Unplugging is better, but muting is the practical middle ground.

2. Restrict Network Access

Create a dedicated VLAN for smart speakers:

# On a consumer router (example with iptables):
sudo iptables -N SMART_SPEAKERS 2>/dev/null
sudo iptables -A SMART_SPEAKERS -p tcp --dport 80 -j ACCEPT    # HTTP
sudo iptables -A SMART_SPEAKERS -p tcp --dport 443 -j ACCEPT   # HTTPS
sudo iptables -A SMART_SPEAKERS -p udp --dport 53 -j ACCEPT    # DNS
sudo iptables -A SMART_SPEAKERS -p udp --dport 123 -j ACCEPT   # NTP
sudo iptables -A SMART_SPEAKERS -j REJECT

# Bind your speaker's IP to this chain
# Blocks local network scanning, prevents lateral movement

Advanced: Use Pi-hole to block domains:

Blocklist: connect.deviceidentity.amazon.com
Blocklist: api.amazonalexa.com
Blocklist: metrics.*.voice.amazon.com
Blocklist: *.metrics.google.com

This prevents analytics collection while maintaining basic functionality.

3. Scheduled Muting

Automate muting during night hours:

# If your speaker has IFTTT integration:
# Create trigger: Every day at 10:00 PM → Mute Alexa
# Create trigger: Every day at 7:00 AM → Unmute Alexa

# Or via Home Assistant (if you have one):
automation:
  - alias: Mute Alexa Night
    trigger:
      platform: time
      at: "22:00:00"
    action:
      service: script.mute_alexa

  - alias: Unmute Alexa Morning
    trigger:
      platform: time
      at: "07:00:00"
    action:
      service: script.unmute_alexa

4. Disable Unnecessary Permissions

In Alexa app:

In Google Home app:

5. Regular Recording Deletion

Set automatic deletion, but also manually verify:

Alexa: Alexa, delete everything I said today.
Google: Ok Google, delete everything I said today.

This removes the current session’s recordings. For older recordings, you must use the app.

Wake Word False Positive Rate

Smart speakers trigger on words that sound like the wake word:

With 2-4 false triggers per day per speaker, your conversations are constantly buffered and uploaded.

Install speech recognition logging on your phone to verify:

# On rooted Android, check audio buffer:
adb shell getprop ro.hardware.activity_recognition
adb shell "find /proc -name 'audio*' 2>/dev/null" | head -10

# Check system audio logs:
adb logcat | grep -i "audio\|record\|mic\|speechrecognizer"

If you see audio being recorded during non-wake-word times, the device is triggering falsely.

Hardware Alternative: Sonos Move without Alexa

If you need a smart speaker for music:

These are not “smart speakers” in the Alexa/Google sense — they’re music devices without surveillance.


Built by theluckystrike — More at zovo.one