To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 弔?醍?暲淨?垣?鬱憺弔?醍?暲淨?垣?鬱憺^ 10010010101000100011111110010001111001110011111111111010110111001001111111000100001111111000101001011111001111111001111101010100100111001110100110010010101000100011111110010001111001110011111111111010110111001001111111000100001111111000101001011111001111111001111101010100100111001110100101011110 92a23f91e73ffadc9fc43f8a5f3f9f549ce992a23f91e73ffadc9fc43f8a5f3f9f549ce95e
EUC-JP 弔?醍?暲淨?垣?鬱憺弔?醍?暲淨?垣?鬱憺^ 110001001010010000111111110000101110100100111111100011111100001011011011110111101100011000111111101100111100000000111111110111011011010111011000111010111100010010100100001111111100001011101001001111111000111111000010110110111101111011000110001111111011001111000000001111111101110110110101110110001110101101011110 c4a43fc2e93f8fc2dbdec63fb3c03fddb5d8ebc4a43fc2e93f8fc2dbdec63fb3c03fddb5d8eb5e
UTF-8 弔렲醍닸暲淨렠垣렖鬱憺弔렲醍닸暲淨렠垣렖鬱憺^ 11100101101111001001010011101011101000001011001011101001100001101000110111101011100010111011100011100110100110101011001011100110101101111010100011101011101000001010000011100101100111101010001111101011101000001001011011101001101011001011000111100110100001101011101011100101101111001001010011101011101000001011001011101001100001101000110111101011100010111011100011100110100110101011001011100110101101111010100011101011101000001010000011100101100111101010001111101011101000001001011011101001101011001011000111100110100001101011101001011110 e5bc94eba0b2e9868deb8bb8e69ab2e6b7a8eba0a0e59ea3eba096e9acb1e686bae5bc94eba0b2e9868deb8bb8e69ab2e6b7a8eba0a0e59ea3eba096e9acb1e686ba5e
UHC 弔렲醍닸暲淨렠垣렖鬱憺弔렲醍닸暲淨렠垣렖鬱憺^ 111100001100000010001110101111111111000010110101101101001110011011101101111001111110111111100100100011101011000111101010101011111000111010101011111010101010011011010011101111001111000011000000100011101011111111110000101101011011010011100110111011011110011111101111111001001000111010110001111010101010111110001110101010111110101010100110110100111011110001011110 f0c08ebff0b5b4e6ede7efe48eb1eaaf8eabeaa6d3bcf0c08ebff0b5b4e6ede7efe48eb1eaaf8eabeaa6d3bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)