What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???????????????淹??????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111100111111011100100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f9fb93f3f3f3f3f3f3f
EUC-JP ???????????????淹??????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111110111101011101100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fdebb3f3f3f3f3f3f3f
UTF-8 溜삳젛溜븍젨溜블썒溜쀫졋溜볥졋淹ㅻ젡溜뷸뇘溜뢆 111011111010011110001011111011001000001010110011111011001010000010011011111011111010011110001011111010111011100010001101111011001010000010101000111011111010011110001011111010111011100010010100111011001000110110010010111011111010011110001011111011001000000010101011111011001010000110001011111011111010011110001011111010111011001110100101111011001010000110001011111001101011011110111001111000111000010110111011111011001010000010100001111011111010011110001011111010111011011110111000111010111000011110011000111011111010011110001011111010111010001010000110 efa78bec82b3eca09befa78bebb88deca0a8efa78bebb894ec8d92efa78bec80abeca18befa78bebb3a5eca18be6b7b9e385bbeca0a1efa78bebb7b8eb8798efa78beba286
UHC 溜삳젛溜븍젨溜블썒溜쀫졋溜볥졋淹ㅻ젡溜뷸뇘溜뢆 11101010111111101011101111101011101000001001011111101010111111101011101011101011101000001010000011101010111111101011101011101101100110111000010111101010111111101001011111101011101000001011101011101010111111101001001111101011101000001011101011100101111101001010010011101011101000001001101011101010111111101011101011100110100001111000001111101010111111101000111101000010 eafebbeba097eafebaeba0a0eafebaed9b85eafe97eba0baeafe93eba0bae5f4a4eba09aeafebae68783eafe8f42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
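
The conversion shown in the table can be approximated offline with a minimal Python sketch. This is not the page's own implementation. The function name `encode_report` is hypothetical, and the mapping from the table's charset labels to Python codec names (SJIS-WIN as `cp932`, UHC as `cp949`) is an assumption. Characters a charset cannot represent are encoded with `errors="replace"`, which yields `?` (hex 3f), similar to the `3f` runs in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = (
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
)


def encode_report(text, charsets=CHARSETS):
    """Return one (label, character, binary, hex) row per charset."""
    rows = []
    for label, codec in charsets:
        # Unencodable characters become '?' (0x3f) instead of raising an error.
        data = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in data)
        # Decode again to show what the stored bytes represent in that charset.
        rows.append((label, data.decode(codec), bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, chars, bits, hexstr in encode_report("淹"):
        print(label, chars, bits, hexstr)
```

For the single character 淹, the EUC-JP and SJIS-WIN rows come out as `debb` and `9fb9`, the same byte pairs that appear in the table above, while ISO-8859-1 cannot represent it and falls back to `3f`.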