What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 誤??乳??蘖??揖????????乙??誤 10001100111010110011111100111111100100111111101100111111001111111001111101010000001111110011111110010111010010110011111100111111001111110011111100111111001111110011111100111111100010011011001100111111001111111000110011101011 8ceb3f3f93fb3f3f9f503f3f974b3f3f3f3f3f3f3f3f89b33f3f8ceb
EUC-JP 誤??乳??蘖??揖????????乙??誤 10111000111011010011111100111111110001101111110100111111001111111101110110110001001111110011111111001101101011000011111100111111001111110011111100111111001111110011111100111111101100101011010100111111001111111011100011101101 b8ed3f3fc6fd3f3fddb13f3fcdac3f3f3f3f3f3f3f3fb2b53f3fb8ed
UTF-8 誤곥끃乳쎌쭬蘖뽰궪揖쇠쫨栒밸꺗吏졿릸乙쇱돹誤 111010001010101010100100111010101011001110100101111010111000000110000011111001001011100110110011111011001000111010001100111011001010110110101100111010001001100010010110111010111011110110110000111010101011011010101010111001101000111110010110111011001000011110100000111011001010101110101000111001101010000010010010111010111011000010111000111010101011101010010111111011111010011110011110111011001010000110111111111010111010011010111000111001001011100110011001111011001000011110110001111010111000111110111001111010001010101010100100 e8aaa4eab3a5eb8183e4b9b3ec8e8cecadace89896ebbdb0eab6aae68f96ec87a0ecaba8e6a092ebb0b8eaba97efa79eeca1bfeba6b8e4b999ec87b1eb8fb9e8aaa4
UHC 誤곥끃乳쎌쭬蘖뽰궪揖쇠쫨栒밸꺗吏졿릸乙쇱돹誤 1110100010100110100000011110001110000101101110011110101011100001101111011110110010100111101000001110010111101110100101101110110010000010101111001110101111100111101111001110100010100110100000011110001011100011101110011110101110000011101111011110110010100111101000001110011010010000100101101110101111100000101111001110110010001001101111001110100010100110 e8a681e385b9eae1bdeca7a0e5ee96ec82bcebe7bce8a681e2e3b9eb83bdeca7a0e69096ebe0bcec89bce8a6

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
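
The same kind of table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_table` is hypothetical, and the codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are Python's standard aliases, assumed here to correspond to ISO-8859-1, SJIS-Win, EUC-JP, UTF-8, and UHC. Characters that a charset cannot represent are replaced with "?" (byte 3f), as in the rows above.

```python
def encode_table(text, charsets=("latin-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return one (charset, decoded text, binary string, hex string) row per charset."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        data = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, concatenated without separators
        bits = "".join(f"{byte:08b}" for byte in data)
        # Decode again to show what survives the round trip in that charset
        rows.append((name, data.decode(name), bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, shown, bits, hexed in encode_table("誤"):
        print(name, shown, bits, hexed)
```

For the single character 誤, this sketch yields 3f for latin-1, 8ceb for cp932, b8ed for euc_jp, and e8aaa4 for utf-8, matching the first character of each row in the table.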