To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 娃??玉??臟? 1000100010100001001111110011111110001011110010100011111100111111111001000110011000111111 88a13f3f8bca3f3fe4663f
EUC-JP 娃??玉??臟? 1011000010100011001111110011111110110110110011000011111100111111111001111100011100111111 b0a33f3fb6cc3f3fe7c73f
UTF-8 娃욑슘玉뉐쥞臟뢔 111001011010100010000011111011001001101010010001111011001000101010011000111001111000111010001001111010111000100110010000111011001010010110011110111010001000011110011111111010111010001010010100 e5a883ec9a91ec8a98e78e89eb8990eca59ee8879feba294
UHC 娃욑슘玉뉐쥞臟뢔 11101000110111111001111011101111101111011011011111101000101011001000011111100101101000101001001111101101111101001000111101001111 e8df9eefbdb7e8ac87e5a293edf48f4f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)