To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 弔?醍??淨?垣?鬱憺弔?醍??淨?垣?鬱憺^ 1001001010100010001111111001000111100111001111110011111110011111110001000011111110001010010111110011111110011111010101001001110011101001100100101010001000111111100100011110011100111111001111111001111111000100001111111000101001011111001111111001111101010100100111001110100101011110 92a23f91e73f3f9fc43f8a5f3f9f549ce992a23f91e73f3f9fc43f8a5f3f9f549ce95e
EUC-JP 弔?醍??淨?垣?鬱憺弔?醍??淨?垣?鬱憺^ 1100010010100100001111111100001011101001001111110011111111011110110001100011111110110011110000000011111111011101101101011101100011101011110001001010010000111111110000101110100100111111001111111101111011000110001111111011001111000000001111111101110110110101110110001110101101011110 c4a43fc2e93f3fdec63fb3c03fddb5d8ebc4a43fc2e93f3fdec63fb3c03fddb5d8eb5e
UTF-8 弔렲醍당긺淨렠垣렖鬱憺弔렲醍당긺淨렠垣렖鬱憺^ 11100101101111001001010011101011101000001011001011101001100001101000110111101011100010111011100111101010101110001011101011100110101101111010100011101011101000001010000011100101100111101010001111101011101000001001011011101001101011001011000111100110100001101011101011100101101111001001010011101011101000001011001011101001100001101000110111101011100010111011100111101010101110001011101011100110101101111010100011101011101000001010000011100101100111101010001111101011101000001001011011101001101011001011000111100110100001101011101001011110 e5bc94eba0b2e9868deb8bb9eab8bae6b7a8eba0a0e59ea3eba096e9acb1e686bae5bc94eba0b2e9868deb8bb9eab8bae6b7a8eba0a0e59ea3eba096e9acb1e686ba5e
UHC 弔렲醍당긺淨렠垣렖鬱憺弔렲醍당긺淨렠垣렖鬱憺^ 111100001100000010001110101111111111000010110101101101001110011110110001111001111110111111100100100011101011000111101010101011111000111010101011111010101010011011010011101111001111000011000000100011101011111111110000101101011011010011100111101100011110011111101111111001001000111010110001111010101010111110001110101010111110101010100110110100111011110001011110 f0c08ebff0b5b4e7b1e7efe48eb1eaaf8eabeaa6d3bcf0c08ebff0b5b4e7b1e7efe48eb1eaaf8eabeaa6d3bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)