Which bit string is a character (or string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 凹??蘂??熬??n}凹??蘂??熬??n{^ 1000100110011010001111110011111111100101010000010011111100111111111000001001001000111111001111110110111001111101100010011001101000111111001111111110010101000001001111110011111111100000100100100011111100111111011011100111101101011110 899a3f3fe5413f3fe0923f3f6e7d899a3f3fe5413f3fe0923f3f6e7b5e
EUC-JP 凹??蘂??熬??n}凹??蘂??熬??n{^ 1011000111111010001111110011111111101001101000100011111100111111110111111111001000111111001111110110111001111101101100011111101000111111001111111110100110100010001111110011111111011111111100100011111100111111011011100111101101011110 b1fa3f3fe9a23f3fdff23f3f6e7db1fa3f3fe9a23f3fdff23f3f6e7b5e
UTF-8 凹좊젵蘂노젾熬곷젲n}凹좊젵蘂노젾熬곷젲n{^ 1110010110000111101110011110110010100010100010101110110010100000101101011110100010011000100000101110101110000101101110001110110010100000101111101110011110000110101011001110101010110011101101111110110010100000101100100110111001111101111001011000011110111001111011001010001010001010111011001010000010110101111010001001100010000010111010111000010110111000111011001010000010111110111001111000011010101100111010101011001110110111111011001010000010110010011011100111101101011110 e587b9eca28aeca0b5e89882eb85b8eca0bee786aceab3b7eca0b26e7de587b9eca28aeca0b5e89882eb85b8eca0bee786aceab3b7eca0b26e7b5e
UHC 凹좊젵蘂노젾熬곷젲n}凹좊젵蘂노젾熬곷젲n{^ 1110100011101010101000001110101110100000101010011110011111011110101100111110101110100000101100001110100010100010100000011110101110100000101001100110111001111101111010001110101010100000111010111010000010101001111001111101111010110011111010111010000010110000111010001010001010000001111010111010000010100110011011100111101101011110 e8eaa0eba0a9e7deb3eba0b0e8a281eba0a66e7de8eaa0eba0a9e7deb3eba0b0e8a281eba0a66e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
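
The conversion in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The function name to_bit_strings and the CODECS mapping are hypothetical, and the Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumed to correspond to the charsets listed above. Unencodable characters are replaced with "?" to mirror the 3f bytes in the table.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
# The correspondence (e.g. SJIS-WIN -> cp932, UHC -> cp949) is an assumption.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "凹"
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the single character 凹, the UTF-8 result is e587b9, which matches the first three bytes of the UTF-8 row in the table.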