What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???歪?????[???歪?????[^ 0011111100111111001111111001100001100011001111110011111100111111001111110011111101011011001111110011111100111111100110000110001100111111001111110011111100111111001111110101101101011110 3f3f3f98633f3f3f3f3f5b3f3f3f98633f3f3f3f3f5b5e
EUC-JP 璵??歪?????[璵??歪?????[^ 100011111100110011100110001111110011111111001111110001000011111100111111001111110011111100111111010110111000111111001100111001100011111100111111110011111100010000111111001111110011111100111111001111110101101101011110 8fcce63f3fcfc43f3f3f3f3f5b8fcce63f3fcfc43f3f3f3f3f5b5e
UTF-8 璵붺솈歪득쪛捻뚪궢[璵붺솈歪득쪛捻뚪궢[^ 111001111001001010110101111010111011011010111010111011001000011010001000111001101010110110101010111010111001001110011101111011001010101010011011111011111010011010100100111010111001101010101010111010101011011010100010010110111110011110010010101101011110101110110110101110101110110010000110100010001110011010101101101010101110101110010011100111011110110010101010100110111110111110100110101001001110101110011010101010101110101010110110101000100101101101011110 e792b5ebb6baec8688e6adaaeb939decaa9befa6a4eb9aaaeab6a25be792b5ebb6baec8688e6adaaeb939decaa9befa6a4eb9aaaeab6a25b5e
UHC 璵붺솈歪득쪛捻뚪궢[璵붺솈歪득쪛捻뚪궢[^ 111001101010010110010100111001111001100110001100111010001110000010110101111001101010010110010100111001101111011110001100111010011000001010110101010110111110011010100101100101001110011110011001100011001110100011100000101101011110011010100101100101001110011011110111100011001110100110000010101101010101101101011110 e6a594e7998ce8e0b5e6a594e6f78ce982b55be6a594e7998ce8e0b5e6a594e6f78ce982b55b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
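
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the table, although the exact substitution rules of other converters may differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes b"?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset of CODECS, keyed by the table's labels."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("[^").items():
        print(label, bits, hex_string)
```

For the ASCII input `[^`, every charset listed here yields the same two bytes, `5b5e`, which is why each row of the table ends with that sequence.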