To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 賊?將?垣兢??制經工賊?鬱 1001000110101111001111111001101110010010001111111000101001011111100110010101110100111111001111111001000010100111111000110101001110001101010010001001000110101111001111111001111101010100 91af3f9b923f8a5f995d3f3f90a7e3538d4891af3f9f54
EUC-JP 賊?將?垣兢??制經工賊?鬱 1100001010110001001111111101010111110010001111111011001111000000110100011011111000111111001111111100000010101001111001011011010010111001101010011100001010110001001111111101110110110101 c2b13fd5f23fb3c0d1be3f3fc0a9e5b4b9a9c2b13fddb5
UTF-8 賊렠將렚垣兢렩렰制經工賊렠鬱 111010001011001110001010111010111010000010100000111001011011000010000111111010111010000010011010111001011001111010100011111001011000010110100010111010111010000010101001111010111010000010110000111001011000100010110110111001111011011010010011111001011011011110100101111010001011001110001010111010111010000010100000111010011010110010110001 e8b38aeba0a0e5b087eba09ae59ea3e585a2eba0a9eba0b0e588b6e7b693e5b7a5e8b38aeba0a0e9acb1
UHC 賊렠將렚垣兢렩렰制經工賊렠鬱 11101110111001001000111010110001111011011110001010001110101011011110101010101111110100001110011110001110101101111000111010111101111100001010010011001100111010001100110111101111111011101110010010001110101100011110101010100110 eee48eb1ede28eadeaafd0e78eb78ebdf0a4cce8cdefeee48eb1eaa6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)