What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????k?????????k^ 001111110011111100111111001111110011111100111111001111110011111100111111011010110011111100111111001111110011111100111111001111110011111100111111001111110110101101011110 3f3f3f3f3f3f3f3f3f6b3f3f3f3f3f3f3f3f3f6b5e
SJIS-WIN 儀??儀?????k儀??儀?????k^ 10001011010101100011111100111111100010110101011000111111001111110011111100111111001111110110101110001011010101100011111100111111100010110101011000111111001111110011111100111111001111110110101101011110 8b563f3f8b563f3f3f3f3f6b8b563f3f8b563f3f3f3f3f6b5e
EUC-JP 儀??儀?????k儀??儀?????k^ 10110101101101110011111100111111101101011011011100111111001111110011111100111111001111110110101110110101101101110011111100111111101101011011011100111111001111110011111100111111001111110110101101011110 b5b73f3fb5b73f3f3f3f3f6bb5b73f3fb5b73f3f3f3f3f6b5e
UTF-8 儀붾젇儀붻뼧溜뽯젉k儀붾젇儀붻뼧溜뽯젉k^ 111001011000010010000000111010111011011010111110111011001010000010000111111001011000010010000000111010111011011010111011111010111011110010100111111011111010011110001011111010111011110110101111111011001010000010001001011010111110010110000100100000001110101110110110101111101110110010100000100001111110010110000100100000001110101110110110101110111110101110111100101001111110111110100111100010111110101110111101101011111110110010100000100010010110101101011110 e58480ebb6beeca087e58480ebb6bbebbca7efa78bebbdafeca0896be58480ebb6beeca087e58480ebb6bbebbca7efa78bebbdafeca0896b5e
UHC 儀붾젇儀붻뼧溜뽯젉k儀붾젇儀붻뼧溜뽯젉k^ 111010111111000010010100111010111010000010001010111010111111000010010100111010001001011010101010111010101111111010010110111010111010000010001011011010111110101111110000100101001110101110100000100010101110101111110000100101001110100010010110101010101110101011111110100101101110101110100000100010110110101101011110 ebf094eba08aebf094e896aaeafe96eba08b6bebf094eba08aebf094e896aaeafe96eba08b6b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
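
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names is an assumption (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?`, which would account for the `3f` bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These are illustrative guesses; the page's own converter may differ in edge cases.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("儀k^").items():
        print(label, bits, hex_string)
```

For example, `encode_row("儀", "utf-8")` gives the hex string `e58480`, which matches the first three bytes of the UTF-8 row above.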