To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ?審?厓ぉ???n}?審?厓ぉ???n{^ 001111111001000001010010001111111111101010001101100000101010011100111111001111110011111101101110011111010011111110010000010100100011111111111010100011011000001010100111001111110011111100111111011011100111101101011110 3f90523ffa8d82a73f3f3f6e7d3f90523ffa8d82a73f3f3f6e7b5e
EUC-JP ?審?厓ぉ???n}?審?厓ぉ???n{^ 0011111110111111101100110011111110001111101101001100011110100100101010010011111100111111001111110110111001111101001111111011111110110011001111111000111110110100110001111010010010101001001111110011111100111111011011100111101101011110 3fbfb33f8fb4c7a4a93f3f3f6e7d3fbfb33f8fb4c7a4a93f3f3f6e7b5e
UTF-8 룶審룶厓ぉ▩룴홸n}룶審룶厓ぉ▩룴홸n{^ 1110101110100011101101101110010110101111101010011110101110100011101101101110010110001110100100111110001110000001100010011110001010010110101010011110101110100011101101001110110110011001101110000110111001111101111010111010001110110110111001011010111110101001111010111010001110110110111001011000111010010011111000111000000110001001111000101001011010101001111010111010001110110100111011011001100110111000011011100111101101011110 eba3b6e5afa9eba3b6e58e93e38189e296a9eba3b4ed99b86e7deba3b6e5afa9eba3b6e58e93e38189e296a9eba3b4ed99b86e7b5e
UHC 룶審룶厓ぉ▩룴홸n}룶審룶厓ぉ▩룴홸n{^ 10001111101010111110001111111011100011111010101111100100111011011010101010101001101000101100110010001111101010011100001101110010011011100111110110001111101010111110001111111011100011111010101111100100111011011010101010101001101000101100110010001111101010011100001101110010011011100111101101011110 8fabe3fb8fabe4edaaa9a2cc8fa9c3726e7d8fabe3fb8fabe4edaaa9a2cc8fa9c3726e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
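
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the converter's actual implementation: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table above does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("ぉ"):
        print(label, bits, hexstr)
```

For the single character ぉ, the UTF-8 row of this sketch yields e38189, the SJIS-WIN row 82a7, and the EUC-JP row a4a9, which are the same byte sequences that appear for that character in the table above.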