Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 湖??絅?鎖??粟??夭?ゑ粟?????粟往凋 100011001100111000111111001111111110001101000100001111111000110110111101001111110011111110001000101111100011111100111111100110101110111000111111100000101110111110001000101111100011111100111111001111110011111100111111100010001011111010001001100111011001001010011100 8cce3f3fe3443f8dbd3f3f88be3f3f9aee3f82ef88be3f3f3f3f3f88be899d929c
EUC-JP 湖?饔絅?鎖??粟雩?夭?ゑ粟雩?雩??粟往凋 1011100011010000001111111000111111101000111011111110010110100101001111111011101010111111001111110011111110110000110000001000111111100110111110100011111111010100111100000011111110100100111100011011000011000000100011111110011011111010001111111000111111100110111110100011111100111111101100001100000010110001111111011100001111111100 b8d03f8fe8efe5a53fbabf3f3fb0c08fe6fa3fd4f03fa4f1b0c08fe6fa3f8fe6fa3f3fb0c0b1fdc3fc
UTF-8 湖렕饔絅뤈鎖쵌곧粟雩▩夭쳩ゑ粟雩눗雩첁곧粟往凋 111001101011100110010110111010111010000010010101111010011010010110010100111001111011010110000101111010111010010010001000111010011000111010010110111011001011010110001100111010101011001110100111111001111011001010011111111010011001101110101001111000101001011010101001111001011010010010101101111011001011001110101001111000111000001010010001111001111011001010011111111010011001101110101001111010111000100010010111111010011001101110101001111011001011001010000001111010101011001110100111111001111011001010011111111001011011111010000000111001011000011110001011 e6b996eba095e9a594e7b585eba488e98e96ecb58ceab3a7e7b29fe99ba9e296a9e5a4adecb3a9e38291e7b29fe99ba9eb8897e99ba9ecb281eab3a7e7b29fe5be80e5878b
UHC 湖렕饔絅뤈鎖쵌곧粟雩▩夭쳩ゑ粟雩눗雩첁곧粟往凋 11111011110010011000111010101010111010001011110111001100111001111000111110111000111000011111000010101100100011101011000011110000111000011101100011101001111011001010001011001100111010001110110010101011100011101010101011110001111000011101100011101001111011001011010010110000111010011110110010101010100011101011000011110000111000011101100011101000110110011111000010111101 fbc98eaae8bdcce78fb8e1f0ac8eb0f0e1d8e9eca2cce8ecab8eaaf1e1d8e9ecb4b0e9ecaa8eb0f0e1d8e8d9f0bd

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
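
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page, which is not shown here. The codec names in CHARSETS are Python's, assumed to be the closest equivalents of the charsets listed above (cp932 for SJIS-WIN, cp949 for UHC), and encode_row and encode_table are hypothetical helper names. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the ISO-8859-1 row above appears to behave. Python's codecs may not agree with this page on every character.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in encode_table("湖").items():
        print(name, binary, hexadecimal)
```

For the single character 湖, the UTF-8 entry comes out as e6b996, which matches the first three bytes of the UTF-8 row in the table.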