To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??受b?猷???k6???猷???? 100101110101000100111111001111111000111011110011100000101000001000111111100101110101000100111111001111110011111110000010100010111000001001010101001111110011111100111111100101110101000100111111001111110011111100111111 97513f3f8ef382823f97513f3f3f828b82553f3f3f97513f3f3f3f
EUC-JP 猷??受b?猷???k6???猷???? 110011011011001000111111001111111011110011110101101000111110001000111111110011011011001000111111001111110011111110100011111010111010001110110110001111110011111100111111110011011011001000111111001111110011111100111111 cdb23f3fbcf5a3e23fcdb23f3f3fa3eba3b63f3f3fcdb23f3f3f3f
UTF-8 猷뜹럳受b뒄猷띄퐣裂k6吏묅뒄猷띄퐣裂쥲 111001111000110010110111111010111001110010111001111010111001111110110011111001011000111110010111111011111011110110000010111010111001001010000100111001111000110010110111111010111001110110000100111011011001000010100011111011111010011010100000111011111011110110001011111011111011110010010110111011111010011110011110111010111010110010000101111010111001001010000100111001111000110010110111111010111001110110000100111011011001000010100011111011111010011010100000111011001010010110110010 e78cb7eb9cb9eb9fb3e58f97efbd82eb9284e78cb7eb9d84ed90a3efa6a0efbd8befbc96efa79eebac85eb9284e78cb7eb9d84ed90a3efa6a0eca5b2
UHC 猷뜹럳受b뒄猷띄퐣裂k6吏묅뒄猷띄퐣裂쥲 11101011101000111011011011100101100011101001001111100001111101001010001111100010100010101000001011101011101000111011011011100111101111011000110011100110111100011010001111101011101000111011011011101100101001111001000111100010100010101000001011101011101000111011011011100111101111011000110011100110111100011010001101000010 eba3b6e58e93e1f4a3e28a82eba3b6e7bd8ce6f1a3eba3b6eca791e28a82eba3b6e7bd8ce6f1a342

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
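
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset names to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and so are the helper names encode_row and convert. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes seen in the table.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec."""
    # errors="replace" turns unencodable characters into "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in convert("受").items():
        print(name, bits, hexstr)
```

For the single character 受, this sketch yields e58f97 for UTF-8, 8ef3 for SJIS-WIN, and bcf5 for EUC-JP, which matches the byte sequences for that character in the table rows above.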