What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ?濕?厓ぉ???n}?濕?厓ぉ???n{^ 001111111110000001011011001111111111101010001101100000101010011100111111001111110011111101101110011111010011111111100000010110110011111111111010100011011000001010100111001111110011111100111111011011100111101101011110 3fe05b3ffa8d82a73f3f3f6e7d3fe05b3ffa8d82a73f3f3f6e7b5e
EUC-JP ?濕?厓ぉ???n}?濕?厓ぉ???n{^ 0011111111011111101111000011111110001111101101001100011110100100101010010011111100111111001111110110111001111101001111111101111110111100001111111000111110110100110001111010010010101001001111110011111100111111011011100111101101011110 3fdfbc3f8fb4c7a4a93f3f3f6e7d3fdfbc3f8fb4c7a4a93f3f3f6e7b5e
UTF-8 룶濕룶厓ぉ▩룴홸n}룶濕룶厓ぉ▩룴홸n{^ 1110101110100011101101101110011010111111100101011110101110100011101101101110010110001110100100111110001110000001100010011110001010010110101010011110101110100011101101001110110110011001101110000110111001111101111010111010001110110110111001101011111110010101111010111010001110110110111001011000111010010011111000111000000110001001111000101001011010101001111010111010001110110100111011011001100110111000011011100111101101011110 eba3b6e6bf95eba3b6e58e93e38189e296a9eba3b4ed99b86e7deba3b6e6bf95eba3b6e58e93e38189e296a9eba3b4ed99b86e7b5e
UHC 룶濕룶厓ぉ▩룴홸n}룶濕룶厓ぉ▩룴홸n{^ 10001111101010111110001110100101100011111010101111100100111011011010101010101001101000101100110010001111101010011100001101110010011011100111110110001111101010111110001110100101100011111010101111100100111011011010101010101001101000101100110010001111101010011100001101110010011011100111101101011110 8fabe3a58fabe4edaaa9a2cc8fa9c3726e7d8fabe3a58fabe4edaaa9a2cc8fa9c3726e7b5e

SJIS-Win, EUC-JP: classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
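
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_all` and the mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to be how the table above treats them.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for characters the codec cannot encode.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("ぉn{^").items():
        print(label, bits, hex_string)
```

For plain ASCII input such as `n{^`, every charset listed here yields the same bytes (`6e7b5e`), which is why each row of the table ends with that sequence.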