To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??誼??楡??鼇???↑?淫??筌?? 11100001100111110011111100111111100010110110001000111111001111111001111010111110001111110011111111101010100001110011111100111111001111111000000110101010001111111000100011111010001111110011111111100010101000110011111100111111 e19f3f3f8b623f3f9ebe3f3fea873f3f3f81aa3f88fa3f3fe2a33f3f
EUC-JP 癲??誼??楡??鼇???↑?淫??筌?? 11100010101000010011111100111111101101011100001100111111001111111101110011000000001111110011111111110011111001110011111100111111001111111010001010101100001111111011000011111100001111110011111111100100101001010011111100111111 e2a13f3fb5c33f3fdcc03f3ff3e73f3f3fa2ac3fb0fc3f3fe4a53f3f
UTF-8 癲곷끂誼뜻쾬楡㏓빒鼇잕퇌劉↑뜌淫롮젞筌뗭닪 111001111001100110110010111010101011001110110111111010111000000110000010111010001010101010111100111010111001110010111011111011001011111010101100111001101010010110100001111000111000111110010011111010111011100110010010111010011011110010000111111011001001111010010101111011011000011110001100111011111010011110000111111000101000011010010001111010111001110010001100111001101011011110101011111010111010000110101110111011001010000010011110111001111010110110001100111010111001011110101101111010111000101110101010 e799b2eab3b7eb8182e8aabceb9cbbecbeace6a5a1e38f93ebb992e9bc87ec9e95ed878cefa787e28691eb9c8ce6b7abeba1aeeca09ee7ad8ceb97adeb8baa
UHC 癲곷끂誼뜻쾬楡㏓빒鼇잕퇌劉↑뜌淫롮젞筌뗭닪 111011111010011010000001111010111000010110111000111010111111111010110110111001101011001010000011111010101111100010100111111010111001010110110110111010001010100010011111111010101011011110011101111010101110010110100001111010001000110110001111111010111110001010001110111011001010000010011000111011111010011110001011111011001000100010100101 efa681eb85b8ebfeb6e6b283eaf8a7eb95b6e8a89feab79deae5a1e88d8febe28eeca098efa78bec88a5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)