Which bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????b[?????????b[^ 0011111100111111001111110011111100111111001111110011111100111111001111110110001001011011001111110011111100111111001111110011111100111111001111110011111100111111011000100101101101011110 3f3f3f3f3f3f3f3f3f625b3f3f3f3f3f3f3f3f3f625b5e
SJIS-WIN ???澳??艶g?b[???澳??艶g?b[^ 0011111100111111001111111110000001010011001111110011111110001001100100001000001010000111001111110110001001011011001111110011111100111111111000000101001100111111001111111000100110010000100000101000011100111111011000100101101101011110 3f3f3fe0533f3f899082873f625b3f3f3fe0533f3f899082873f625b5e
EUC-JP ???澳??艶g?b[???澳??艶g?b[^ 0011111100111111001111111101111110110100001111110011111110110001111100001010001111100111001111110110001001011011001111110011111100111111110111111011010000111111001111111011000111110000101000111110011100111111011000100101101101011110 3f3f3fdfb43f3fb1f0a3e73f625b3f3f3fdfb43f3fb1f0a3e73f625b5e
UTF-8 怜붺윢澳뉒컮艶g왃b[怜붺윢澳뉒컮艶g왃b[^ 1110111110100110101011001110101110110110101110101110110010011100101000101110011010111110101100111110101110001001100100101110110010111011101011101110100010001001101101101110111110111101100001111110110010011001100000110110001001011011111011111010011010101100111010111011011010111010111011001001110010100010111001101011111010110011111010111000100110010010111011001011101110101110111010001000100110110110111011111011110110000111111011001001100110000011011000100101101101011110 efa6acebb6baec9ca2e6beb3eb8992ecbbaee889b6efbd87ec9983625befa6acebb6baec9ca2e6beb3eb8992ecbbaee889b6efbd87ec9983625b5e
UHC 怜붺윢澳뉒컮艶g왃b[怜붺윢澳뉒컮艶g왃b[^ 1110011110110000100101001110011110011111101000111110011111111110100001111110011110110000100101001110011011111101101000111110011110011110101101100110001001011011111001111011000010010100111001111001111110100011111001111111111010000111111001111011000010010100111001101111110110100011111001111001111010110110011000100101101101011110 e7b094e79fa3e7fe87e7b094e6fda3e79eb6625be7b094e79fa3e7fe87e7b094e6fda3e79eb6625b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
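
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the code behind this page: the helper names `encode_bits` and `convert` are hypothetical, and it assumes that Python's `cp932` and `cp949` codecs are close enough to SJIS-WIN and UHC for illustration (individual mappings may differ). Characters that a charset cannot represent are replaced with `?` (0x3f), which is why runs of `3f` appear in the rows above.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_bits(text, codec) for label, codec in CODECS.items()}


# Print one row per charset: label, binary bit string, hexadecimal bit string.
for label, (binary, hexadecimal) in convert("b[^").items():
    print(label, binary, hexadecimal)
```

For the ASCII tail `b[^` every charset yields the same bytes, `625b5e`, matching the end of each row in the table.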