Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????º? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011101000111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fba3f
SJIS-WIN 岳??韋??????ル????懿??孃??蚓 10001010011110000011111100111111111010001110100000111111001111110011111100111111001111110011111110000011100010110011111100111111001111110011111110011100111100100011111100111111100110110110111100111111001111111110010101101101 8a783f3fe8e83f3f3f3f3f3f838b3f3f3f3f9cf23f3f9b6f3f3fe56d
EUC-JP 岳??韋??????ル?沅??懿??孃?º蚓 1011001111011001001111110011111111110000111010100011111100111111001111110011111100111111001111111010010111101011001111111000111111000110111010010011111100111111110110001111010000111111001111111101010111010000001111111000111110100010111010111110100111001110 b3d93f3ff0ea3f3f3f3f3f3fa5eb3f8fc6e93f3fd8f43f3fd5d03f8fa2ebe9ce
UTF-8 岳묒빖韋뉏펺栒룔렍曆ル벥沅싮펶懿멸컳孃뉖º蚓 1110010110110010101100111110101110101100100100101110101110111001100101101110100110011111100010111110101110001001100011111110110110001110101110101110011010100000100100101110101110100011100101001110101110100000100011011110111110100110100010111110001110000011101010111110101110110010101001011110011010110010100001011110110010001011101011101110110110001110101101101110011010000111101111111110101110101001101110001110110010111011101100111110010110101101100000111110101110001001100101101100001010111010111010001001101010010011 e5b2b3ebac92ebb996e99f8beb898fed8ebae6a092eba394eba08defa68be383abebb2a5e6b285ec8baeed8eb6e687bfeba9b8ecbbb3e5ad83eb8996c2bae89a93
UHC 岳묒빖韋뉏펺栒룔렍曆ル벥沅싮펶懿멸컳孃뉖º蚓 1110010010111111100100011110110010010101101110001110101011011111100001111110010010111100100010101110001011100011101101111110001110001110101000111110011010110111101010111110101110010011101111011110101010110110100110101110100110111100100001111110101111110011101110001110101010110000100110011110010110111110100001111110101110101000101011001110110011100010 e4bf91ec95b8eadf87e4bc8ae2e3b7e38ea3e6b7abeb93bdeab69ae9bc87ebf3b8eab099e5be87eba8acece2

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
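
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper name `encode_all` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what the table does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text.

    encode_all is a hypothetical helper name, not part of the tool.
    """
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("岳").items():
        print(label, bits, hex_string)
```

For the single character 岳, this sketch gives `e5b2b3` in UTF-8, `8a78` in SJIS-WIN, `b3d9` in EUC-JP, and `3f` in ISO-8859-1, which agrees with the leading bytes of the corresponding rows in the table.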