To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 永??鎰??猷??z永??鎰??猷??zB 100010010110100100111111001111111110100001001100001111110011111110010111010100010011111100111111011110101000100101101001001111110011111111101000010011000011111100111111100101110101000100111111001111110111101001000010 89693f3fe84c3f3f97513f3f7a89693f3fe84c3f3f97513f3f7a42
EUC-JP 永??鎰??猷??z永??鎰??猷??zB 101100011100101000111111001111111110111110101101001111110011111111001101101100100011111100111111011110101011000111001010001111110011111111101111101011010011111100111111110011011011001000111111001111110111101001000010 b1ca3f3fefad3f3fcdb23f3f7ab1ca3f3fefad3f3fcdb23f3f7a42
UTF-8 永띕엪鎰섇립猷쎈뮣z永띕엪鎰섇립猷쎈뮣zB 111001101011000010111000111010111001110110010101111011001001011110101010111010011000111010110000111011001000010010000111111010111010011010111101111001111000110010110111111011001000111010001000111010111010111010100011011110101110011010110000101110001110101110011101100101011110110010010111101010101110100110001110101100001110110010000100100001111110101110100110101111011110011110001100101101111110110010001110100010001110101110101110101000110111101001000010 e6b0b8eb9d95ec97aae98eb0ec8487eba6bde78cb7ec8e88ebaea37ae6b0b8eb9d95ec97aae98eb0ec8487eba6bde78cb7ec8e88ebaea37a42
UHC 永띕엪鎰섇립猷쎈뮣z永띕엪鎰섇립猷쎈뮣zB 111001111011010110110110111010111001111010000011111011001111000010011000111001011011100010110011111010111010001110111101111010111001001010101111011110101110011110110101101101101110101110011110100000111110110011110000100110001110010110111000101100111110101110100011101111011110101110010010101011110111101001000010 e7b5b6eb9e83ecf098e5b8b3eba3bdeb92af7ae7b5b6eb9e83ecf098e5b8b3eba3bdeb92af7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)