生成假数据以填充数据库

Generating fake data to populate database

我正在尝试使用 factory boy 和 faker 为我正在构建的网站生成一些虚假数据。这是我的 models.py:

# External Imports
from django.db import models
import uuid

# Internal Imports
from applications.models.application import Application
from users.models.user import User
from .session import Session

# Fake data
import factory
import factory.django
import factory.fuzzy
from datetime import datetime
from faker import Faker
from faker.providers import BaseProvider
import random


class ButtonClick(models.Model):
    """**Database model that tracks and saves button clicks for an application**
    """
    # identifier
    id = models.UUIDField(default=uuid.uuid4, primary_key=True, editable=False)

    # info
    button_name = models.CharField(max_length=128, null=True, blank=True)
    application = models.ForeignKey(
        Application, related_name='button_clicks', null=True, blank=True, on_delete=models.CASCADE)
    user = models.ForeignKey(
        User, related_name='button_clicks', null=True, blank=True,      on_delete=models.CASCADE)
    session = models.ForeignKey(
        Session, related_name='button_clicks', null=True, blank=True, on_delete=models.CASCADE)
        timestamp = models.DateTimeField(auto_now=True)

class Meta:
    db_table = 'button_clicks'
    ordering = ('-timestamp', )

def __str__(self):
    return f'{self.application} - {self.button_name}'

fake = Faker()
faker = Factory.create()


class ApplicationFactory(factory.DjangoModelFactory):
    class Meta:
        model = Application

    application = factory.LazyAttribute(lambda _: faker.word())


class FakeButtonClick(factory.django.DjangoModelFactory):
    class Meta:
        model = ButtonClick

    button_name = factory.Faker('first_name')
    application = factory.SubFactory(ApplicationFactory)
    user = factory.Faker('name')
    session = factory.Faker('random_int')
    timestamp = factory.Faker('date')

当我尝试在终端中 运行 以下代码时,出现错误:

>>> from analytics.models.button_click import FakeButtonClick
>>> for _ in range(200): FakeButtonClick.create()
... 

Traceback (most recent call last):
  File "<console>", line 1, in <module>
  File "/Users/ryan/bloks/bloks-backend/venv/lib/python3.7/site-packages/factory/base.py", line 564, in create
    return cls._generate(enums.CREATE_STRATEGY, kwargs)
  File "/Users/ryan/bloks/bloks-backend/venv/lib/python3.7/site-packages/factory/django.py", line 141, in _generate
    return super(DjangoModelFactory, cls)._generate(strategy, params)
  File "/Users/ryan/bloks/bloks-backend/venv/lib/python3.7/site-packages/factory/base.py", line 501, in _generate
    return step.build()
  File "/Users/ryan/bloks/bloks-backend/venv/lib/python3.7/site-packages/factory/builder.py", line 279, in build
    kwargs=kwargs,
  File "/Users/ryan/bloks/bloks-backend/venv/lib/python3.7/site-packages/factory/base.py", line 315, in instantiate
    return self.factory._create(model, *args, **kwargs)
  File "/Users/ryan/bloks/bloks-backend/venv/lib/python3.7/site-packages/factory/django.py", line 185, in _create
    return manager.create(*args, **kwargs)
  File "/Users/ryan/bloks/bloks-backend/venv/lib/python3.7/site-packages/django/db/models/manager.py", line 82, in manager_method
    return getattr(self.get_queryset(), name)(*args, **kwargs)
  File "/Users/ryan/bloks/bloks-backend/venv/lib/python3.7/site-packages/django/db/models/query.py", line 431, in create
    obj = self.model(**kwargs)
  File "/Users/ryan/bloks/bloks-backend/venv/lib/python3.7/site-packages/django/db/models/base.py", line 482, in __init__
    _setattr(self, field.name, rel_obj)
  File "/Users/ryan/bloks/bloks-backend/venv/lib/python3.7/site-packages/django/db/models/fields/related_descriptors.py", line 219, in __set__
    self.field.remote_field.model._meta.object_name,
ValueError: Cannot assign "9714": "ButtonClick.application" must be a "Application" instance.

我过去使用 factory boy 和 faker 创建了一些非常简单的数据,但回溯似乎暗示我需要在我的 FakeButtonClick class 中创建一个应用程序实例?我检查了文档,应用程序似乎不是工厂 boy/faker 的可用实例。我需要自己创建实例吗?也许是一个子工厂?

这是我的数据库填充文件,可能会对您有所帮助(我认为比您的文件简单得多):

# Don't change the format. Order matters!

import os
os.environ.setdefault('DJANGO_SETTINGS_MODULE', 'project.settings')

import django
django.setup()

import random
from faker import Faker

from todo.models import Todo


fakegen = Faker()

def populate(N = 10):
    for entry in range(N):
        fake_tmp = fakegen.catch_phrase()
        levels = ['important', 'normal', 'unimportant']

        fake_title = fake_tmp if len(fake_tmp) <= 40 else (fake_tmp[:37] + '...')
        fake_desc = fakegen.sentence(nb_words=70)
        fake_level = levels[random.randint(0, 2)]

        todo_item = Todo.objects.get_or_create(title=fake_title, desc=fake_desc, level=fake_level)

if __name__ == '__main__':
    print('Populating data...')
    populate(20)
    print('Populating complete')

您的 ButtonClick 模型有 3 个字段定义为 ForeignKeyapplicationusersession

当你想创建一个 ButtonClick 实例时,Django 要求你为定义为 ForeignKey 的每个字段提供一个有效值——在这里,这意味着提供模型实例或 None(因为那些 ForeignKey 可以为空)。

对于 FactoryBoy,这意味着您必须:

  1. 为每个模型定义一个 Factory class。
  2. 使用 factory.SubFactory 指向每个字段的工厂。

例如:


class UserFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = User
    username = factory.Faker('username')

class SessionFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = Session
    uuid = factory.Faker('uuid4')
    user = factory.SubFactory(UserFactory)

class ApplicationFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = Application
    name = factory.Faker('name')

class ButtonClickFactory(factory.django.DjangoModelFactory):
    class Meta:
        model = ButtonClick
    user = factory.SubFactory(UserFactory)
    # Ensure that click.user == click.session.user
    session = factory.SubFactory(SessionFactory, user=factory.SelfAttribute('..user'))
    application = factory.SubFactory(ApplicationFactory)

你可以看看at the docs.

顺便说一下,使用 FactoryBoy 的 faker integration,您不需要直接导入它:factory.Faker('uuid4') 等同于 faker.Faker().uuid4().

为了未来的读者! 要为 django 生成虚假数据,您可以使用 django-seed。 这是一个简单的过程,因为

  • pip install django-seed(安装django-seed

  • settings.py 文件中的应用程序中添加 django_seed

    INSTALLED_APPS = (
        ...
        'django_seed',
    )
    
  • python manage.py seed <app-name>

例如:播种 api django 应用 python manage.py seed api --number=15

如果需要,您还可以指定特定字段应具有的值。例如,如果你想为 MyModel 做 15 个种子,但你需要 my_field 对所有这些都是相同的,你可以这样做:

python manage.py seed api --number=15 --seeder "MyModel.my_field" "1.1.1.1"

这就是我如何将假数据生成到 Django sqlite

Mackaroo website 转到此网站并填写详细信息并下载任何格式的文件(sqljsoncsv)任何格式

关于本网站的优点,您可以在您的列空值和任何format 用于数字、日期等

然后 下载 或将其转储到您的 database