To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????L?????????L^ 001111110011111100111111001111110011111100111111001111110011111100111111010011000011111100111111001111110011111100111111001111110011111100111111001111110100110001011110 3f3f3f3f3f3f3f3f3f4c3f3f3f3f3f3f3f3f3f4c5e
SJIS-WIN ?リ?苑?????L?リ?苑?????L^ 00111111100000111000101000111111100010011001000100111111001111110011111100111111001111110100110000111111100000111000101000111111100010011001000100111111001111110011111100111111001111110100110001011110 3f838a3f89913f3f3f3f3f4c3f838a3f89913f3f3f3f3f4c5e
EUC-JP ?リ?苑??洧??L?リ?苑??洧??L^ 0011111110100101111010100011111110110001111100010011111100111111100011111100011110110100001111110011111101001100001111111010010111101010001111111011000111110001001111110011111110001111110001111011010000111111001111110100110001011110 3fa5ea3fb1f13f3f8fc7b43f3f4c3fa5ea3fb1f13f3f8fc7b43f3f4c5e
UTF-8 曆リ퀣苑뚩뵯洧욧퐶L曆リ퀣苑뚩뵯洧욧퐶L^ 111011111010011010001011111000111000001110101010111011011000000010100011111010001000101110010001111010111001101010101001111010111011010110101111111001101011010010100111111011001001101010100111111011011001000010110110010011001110111110100110100010111110001110000011101010101110110110000000101000111110100010001011100100011110101110011010101010011110101110110101101011111110011010110100101001111110110010011010101001111110110110010000101101100100110001011110 efa68be383aaed80a3e88b91eb9aa9ebb5afe6b4a7ec9aa7ed90b64cefa68be383aaed80a3e88b91eb9aa9ebb5afe6b4a7ec9aa7ed90b64c5e
UHC 曆リ퀣苑뚩뵯洧욧퐶L曆リ퀣苑뚩뵯洧욧퐶L^ 111001101011011110101011111010101011001110010111111010101011110110001100111010001001010010101101111010101111101110111111111010101011110110011111010011001110011010110111101010111110101010110011100101111110101010111101100011001110100010010100101011011110101011111011101111111110101010111101100111110100110001011110 e6b7abeab397eabd8ce894adeafbbfeabd9f4ce6b7abeab397eabd8ce894adeafbbfeabd9f4c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)