What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 褥λ????陰???????????懿??^ 1110010111110001100000111100100100111111001111110011111100111111100010010100000100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111001110011110010001111110011111101011110 e5f183c93f3f3f3f89413f3f3f3f3f3f3f3f3f3f3f9cf23f3f5e
EUC-JP 褥λ????陰???????????懿??^ 1110101011110011101001101100101100111111001111110011111100111111101100011010001000111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101100011110100001111110011111101011110 eaf3a6cb3f3f3f3fb1a23f3f3f3f3f3f3f3f3f3f3fd8f43f3f5e
UTF-8 褥λ졂輦딅젘陰낆빋溜얗낄溜녕냺溜붿냵懿곕죰^ 111010001010010010100101110011101011101111101100101000011000001011101111101001101001100011101011100101001000010111101100101000001001100011101001100110011011000011101011100000101000011011101011101110011000101111101111101001111000101111101100100101101001011111101011100000101000010011101111101001111000101111101011100001011001010111101011100000111011101011101111101001111000101111101011101101101011111111101011100000111011010111100110100001111011111111101010101100111001010111101100101000111011000001011110 e8a4a5cebbeca182efa698eb9485eca098e999b0eb8286ebb98befa78bec9697eb8284efa78beb8595eb83baefa78bebb6bfeb83b5e687bfeab395eca3b05e
UHC 褥λ졂輦딅젘陰낆빋溜얗낄溜녕냺溜붿냵懿곕죰^ 11101001101100111010010111101011101000001011001111100110111001001000101011101011101000001001010011101011111001001000010111101100100101011011000111101010111111101011111011101001101100111010010111101010111111101011001111100111100001101000101011101010111111101001010011101100100001101000010111101011111100111011000011101011101000011000101101011110 e9b3a5eba0b3e6e48aeba094ebe485ec95b1eafebee9b3a5eafeb3e7868aeafe94ec8685ebf3b0eba18b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
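
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which matches the runs of `3f` in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary bit string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexed) in encode_table("λ^").items():
        print(label, bits, hexed)
```

For the input `λ^`, the UTF-8 row comes out as `cebb5e` and the ISO-8859-1 row as `3f5e`, since Latin-1 has no `λ`.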