Coding Best Practices
Essential patterns and pitfalls to avoid when developing Archon.
Async/Await Patterns
❌ DON'T: Lambda Functions Returning Unawaited Coroutines
# This will break - the coroutine is never awaited
progress_callback=lambda data: update_progress(progress_id, data)
✅ DO: Use asyncio.create_task for Async Callbacks
# Properly schedule async function execution
progress_callback=lambda data: asyncio.create_task(update_progress(progress_id, data))
❌ DON'T: Call Sync Functions from Async Context
# This causes "event loop already running" errors
async def process_data():
embeddings = create_embeddings_batch(texts) # Sync function in async context
✅ DO: Use Async Versions in Async Context
async def process_data():
embeddings = await create_embeddings_batch_async(texts)
Socket.IO Best Practices
Room-Based Broadcasting
Always broadcast to specific rooms, not all clients:
# ✅ Good - targeted broadcast
await sio.emit('crawl_progress', data, room=progress_id)
# ❌ Bad - broadcasts to everyone
await sio.emit('crawl_progress', data)
Simple Event Handlers
Keep Socket.IO handlers simple and direct:
# ✅ Good - simple, clear purpose
@sio.event
async def crawl_subscribe(sid, data):
progress_id = data.get('progress_id')
if progress_id:
await sio.enter_room(sid, progress_id)
await sio.emit('crawl_subscribe_ack', {'status': 'subscribed'}, to=sid)
Service Layer Patterns
Progress Callbacks
When services need progress callbacks, ensure proper async handling:
# ✅ Good - service accepts optional async callback
async def process_with_progress(data, progress_callback=None):
if progress_callback:
await progress_callback({'status': 'processing', 'percentage': 50})
Error Handling in Services
Always provide fallbacks for external service failures:
# ✅ Good - graceful degradation
try:
embeddings = await create_embeddings_batch_async(texts)
except Exception as e:
logger.warning(f"Embedding creation failed: {e}")
# Return zero embeddings as fallback
return [[0.0] * 1536 for _ in texts]
Common Pitfalls
Event Loop Issues
Problem: "This event loop is already running"
Solution: Check if you're in an async context before using sync functions:
try:
loop = asyncio.get_running_loop()
# Already in async context - use async version
result = await async_function()
except RuntimeError:
# No event loop - safe to use sync version
result = sync_function()
Socket.IO Progress Updates
Problem: Progress updates not reaching the UI
Solution: Ensure the client is in the correct room and progressId is included:
# Always include progressId in the data
data['progressId'] = progress_id
await sio.emit('crawl_progress', data, room=progress_id)
Embedding Service Optimization
Problem: Synchronous embedding calls blocking async operations
Solution: Always use async versions and batch operations:
# Process in batches with rate limiting
async def create_embeddings_for_documents(documents):
batch_size = 20 # OpenAI limit
for i in range(0, len(documents), batch_size):
batch = documents[i:i + batch_size]
embeddings = await create_embeddings_batch_async(batch)
await asyncio.sleep(0.5) # Rate limiting
Testing Async Code
Mock Async Functions Properly
# ✅ Good - proper async mock
mock_update = AsyncMock()
with patch('module.update_progress', mock_update):
await function_under_test()
mock_update.assert_called_once()
Test Socket.IO Events
# ✅ Good - test room management
@patch('src.server.socketio_app.sio')
async def test_subscribe_joins_room(mock_sio):
await crawl_subscribe('sid-123', {'progress_id': 'progress-456'})
mock_sio.enter_room.assert_called_with('sid-123', 'progress-456')
React/Frontend Best Practices
Performance Anti-Patterns to Avoid
❌ DON'T: Update State on Every Keystroke Without Debouncing
// This causes excessive re-renders
<input
value={formData.title}
onChange={(e) => setFormData({ ...formData, title: e.target.value })}
/>
✅ DO: Use Debounced Inputs or Local State
// Use a debounced input component
<DebouncedInput
value={formData.title}
onChange={handleTitleChange}
delay={300}
/>
❌ DON'T: Create New Functions in Render
// Creates new function every render
<button onClick={() => handleAction(item.id)}>Click</button>
✅ DO: Use useCallback for Stable References
const handleClick = useCallback((id) => {
handleAction(id);
}, []);
<button onClick={() => handleClick(item.id)}>Click</button>
Socket.IO Client-Side Patterns
Room Subscription Pattern
useEffect(() => {
const ws = createWebSocketService();
const connect = async () => {
await ws.connect('/'); // Always default namespace
// Join room
ws.send({
type: 'join_project',
data: { project_id: projectId }
});
// Handle messages
ws.addMessageHandler('task_updated', handleTaskUpdate);
};
connect();
return () => ws.disconnect();
}, [projectId]);
State Management Patterns
Batch Updates
// ✅ Good - Single render
setState(prev => ({
...prev,
field1: value1,
field2: value2,
field3: value3
}));
// ❌ Bad - Three renders
setField1(value1);
setField2(value2);
setField3(value3);
Local State for Transient Values
// Keep form inputs local until save
const [localTitle, setLocalTitle] = useState(title);
const handleSave = () => {
onSave({ title: localTitle });
};
Key Takeaways
- Always await async functions - Never let coroutines go unawaited
- Use rooms for Socket.IO - Target specific audiences, not everyone
- Handle async boundaries - Know when you're in an async context
- Fail gracefully - Always have fallbacks for external services
- Test async code properly - Use AsyncMock and proper async test patterns
- Optimize React renders - Use memo, useCallback, and debouncing
- Batch state updates - Minimize renders with single setState calls
- Use local state - Keep transient values local to components