To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??や?}??ヂ□???ァ?朞?ヒ??ヒ? 0011111100111111100000101110001000111111100000010111000000111111001111111000001101100001100000011010000000111111001111110011111110000011010000000011111110011110010011010011111110000011011100010011111100111111100000110111000100111111 3f3f82e23f81703f3f836181a03f3f3f83403f9e4d3f83713f3f83713f
EUC-JP ??や?}??ヂ□???ァ?朞?ヒ??ヒ? 0011111100111111101001001110010000111111101000011101000100111111001111111010010111000010101000101010001000111111001111110011111110100101101000010011111111011011101011100011111110100101110100100011111100111111101001011101001000111111 3f3fa4e43fa1d13f3fa5c2a2a23f3f3fa5a13fdbae3fa5d23f3fa5d23f
UTF-8 룵卽や룶}룴횕ヂ□▩룶죴ァ룵朞卽ヒ룶첂ヒ룶 111010111010001110110101111001011000110110111101111000111000001010000100111010111010001110110110111011111011110110011101111010111010001110110100111011011001101010010101111000111000001110000010111000101001011010100001111000101001011010101001111010111010001110110110111011001010001110110100111000111000001010100001111010111010001110110101111001101001110010011110111001011000110110111101111000111000001110010010111010111010001110110110111011001011001010000010111000111000001110010010111010111010001110110110 eba3b5e58dbde38284eba3b6efbd9deba3b4ed9a95e38382e296a1e296a9eba3b6eca3b4e382a1eba3b5e69c9ee58dbde38392eba3b6ecb282e38392eba3b6
UHC 룵卽や룶}룴횕ヂ□▩룶죴ァ룵朞卽ヒ룶첂ヒ룶 100011111010101011110001111011011010101011100100100011111010101110100011111111011000111110101001110000111000111110101011110000101010000111100000101000101100110010001111101010111010000110001111101010111010000110001111101010101101000110100001111100011110110110101011110100101000111110101011101010101000111110101011110100101000111110101011 8faaf1edaae48faba3fd8fa9c38fabc2a1e0a2cc8faba18faba18faad1a1f1edabd28fabaa8fabd28fab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)