To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 衆??衆??伎??題詐??杖?衆??伎?? 1000111101001111001111110011111110001111010011110011111100111111100010101110101000111111001111111001000111101000100011011011110000111111001111111000111111110001001111111000111101001111001111110011111110001010111010100011111100111111 8f4f3f3f8f4f3f3f8aea3f3f91e88dbc3f3f8ff13f8f4f3f3f8aea3f3f
EUC-JP 衆??衆??伎??題詐??杖?衆??伎?? 1011110110110000001111110011111110111101101100000011111100111111101101001110110000111111001111111100001011101010101110101011111000111111001111111011111011110011001111111011110110110000001111110011111110110100111011000011111100111111 bdb03f3fbdb03f3fb4ec3f3fc2eababe3f3fbef33fbdb03f3fb4ec3f3f
UTF-8 衆累걋衆肋렱伎렚렣題詐렰렦杖렱衆肋렱伎렚렣 111010001010000110000110111011111010010110001111111010101011000110001011111010001010000110000110111011111010010110010011111010111010000010110001111001001011110010001110111010111010000010011010111010111010000010100011111010011010000110001100111010001010100110010000111010111010000010110000111010111010000010100110111001101001110110010110111010111010000010110001111010001010000110000110111011111010010110010011111010111010000010110001111001001011110010001110111010111010000010011010111010111010000010100011 e8a186efa58feab18be8a186efa593eba0b1e4bc8eeba09aeba0a3e9a18ce8a990eba0b0eba0a6e69d96eba0b1e8a186efa593eba0b1e4bc8eeba09aeba0a3
UHC 衆累걋衆肋렱伎렚렣題詐렰렦杖렱衆肋렱伎렚렣 111100011110101111010010111010011011000011000000111100011110101111010010111100011000111010111110110100001110101110001110101011011000111010110100111100001011100111011110111100011000111010111101100011101011010111101101111010001000111010111110111100011110101111010010111100011000111010111110110100001110101110001110101011011000111010110100 f1ebd2e9b0c0f1ebd2f18ebed0eb8ead8eb4f0b9def18ebd8eb5ede88ebef1ebd2f18ebed0eb8ead8eb4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)