What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????烏g????烏g?諭?????? 001111110011111100111111001111110011111100111111100010010100011110000010100001110011111100111111001111110011111110001001010001111000001010000111001111111001011101000000001111110011111100111111001111110011111100111111 3f3f3f3f3f3f894782873f3f3f3f894782873f97403f3f3f3f3f3f
EUC-JP ??????烏g????烏g?諭?????? 001111110011111100111111001111110011111100111111101100011010100010100011111001110011111100111111001111110011111110110001101010001010001111100111001111111100110110100001001111110011111100111111001111110011111100111111 3f3f3f3f3f3fb1a8a3e73f3f3f3fb1a8a3e73fcda13f3f3f3f3f3f
UTF-8 凉붾젧玲곷젷烏g뎡溜깅졁烏g쭅諭뚨륫溜뺠펹溜 111011111010010110111001111010111011011010111110111011001010000010100111111011111010011010101101111010101011001110110111111011001010000010110111111001111000001110001111111011111011110110000111111010111000111010100001111011111010011110001011111010101011100110000101111011001010000110000001111001111000001110001111111011111011110110000111111011001010110110000101111010001010101110101101111010111001101010101000111010111010010110101011111011111010011110001011111010111011101010100000111011011000111010111001111011111010011110001011 efa5b9ebb6beeca0a7efa6adeab3b7eca0b7e7838fefbd87eb8ea1efa78beab985eca181e7838fefbd87ecad85e8abadeb9aa8eba5abefa78bebbaa0ed8eb9efa78b
UHC 凉붾젧玲곷젷烏g뎡溜깅졁烏g쭅諭뚨륫溜뺠펹溜 1110010110111100100101001110101110100000100111111110011110111111100000011110101110100000101010111110100010100001101000111110011110110101101100101110101011111110101100011110101110100000101100101110100010100001101000111110011110100111100000011110101110110001100011001110011110111000101000011110101011111110100101011110100010111100100010011110101011111110 e5bc94eba09fe7bf81eba0abe8a1a3e7b5b2eafeb1eba0b2e8a1a3e7a781ebb18ce7b8a1eafe95e8bc89eafe

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
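
The conversion this page performs can be approximated with a short Python sketch. It is not the page's actual implementation. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the helper name encode_all. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the results above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note says
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_all("あ").items():
        print(label, binary, hexadecimal)
```

If the input arrives already garbled (decoded with the wrong charset), the output will mostly consist of replacement bytes, so it may help to confirm the input encoding before comparing rows.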