To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ???匡????汽n}???匡????汽n{^ 001111110011111100111111100010111010011100111111001111110011111100111111100010110100010001101110011111010011111100111111001111111000101110100111001111110011111100111111001111111000101101000100011011100111101101011110 3f3f3f8ba73f3f3f3f8b446e7d3f3f3f8ba73f3f3f3f8b446e7b5e
EUC-JP ???匡????汽n}???匡????汽n{^ 001111110011111100111111101101101010100100111111001111110011111100111111101101011010010101101110011111010011111100111111001111111011011010101001001111110011111100111111001111111011010110100101011011100111101101011110 3f3f3fb6a93f3f3f3fb5a56e7d3f3f3fb6a93f3f3f3fb5a56e7b5e
UTF-8 렻렑렺匡뗘렑렺뗑汽n}렻렑렺匡뗘렑렺뗑汽n{^ 1110101110100000101110111110101110100000100100011110101110100000101110101110010110001100101000011110101110010111100110001110101110100000100100011110101110100000101110101110101110010111100100011110011010110001101111010110111001111101111010111010000010111011111010111010000010010001111010111010000010111010111001011000110010100001111010111001011110011000111010111010000010010001111010111010000010111010111010111001011110010001111001101011000110111101011011100111101101011110 eba0bbeba091eba0bae58ca1eb9798eba091eba0baeb9791e6b1bd6e7deba0bbeba091eba0bae58ca1eb9798eba091eba0baeb9791e6b1bd6e7b5e
UHC 렻렑렺匡뗘렑렺뗑汽n}렻렑렺匡뗘렑렺뗑汽n{^ 1000111011000011100011101010011010001110110000101100111011000100101101101100010110001110101001101000111011000010101101101100010011010001101010010110111001111101100011101100001110001110101001101000111011000010110011101100010010110110110001011000111010100110100011101100001010110110110001001101000110101001011011100111101101011110 8ec38ea68ec2cec4b6c58ea68ec2b6c4d1a96e7d8ec38ea68ec2cec4b6c58ea68ec2b6c4d1a96e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)