What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 厭????????n}厭????????n{^ 10001001011111010011111100111111001111110011111100111111001111110011111100111111011011100111110110001001011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 897d3f3f3f3f3f3f3f3f6e7d897d3f3f3f3f3f3f3f3f6e7b5e
EUC-JP 厭?????洧??n}厭?????洧??n{^ 1011000111011110001111110011111100111111001111110011111110001111110001111011010000111111001111110110111001111101101100011101111000111111001111110011111100111111001111111000111111000111101101000011111100111111011011100111101101011110 b1de3f3f3f3f3f8fc7b43f3f6e7db1de3f3f3f3f3f8fc7b43f3f6e7b5e
UTF-8 厭묐쓷隣ㅷ뼇洧뺞넯n}厭묐쓷隣ㅷ뼇洧뺞넯n{^ 1110010110001110101011011110101110101100100100001110110010010011101101111110111110100111101100011110001110000101101101111110101110111100100001111110011010110100101001111110101110111010100111101110101110000100101011110110111001111101111001011000111010101101111010111010110010010000111011001001001110110111111011111010011110110001111000111000010110110111111010111011110010000111111001101011010010100111111010111011101010011110111010111000010010101111011011100111101101011110 e58eadebac90ec93b7efa7b1e385b7ebbc87e6b4a7ebba9eeb84af6e7de58eadebac90ec93b7efa7b1e385b7ebbc87e6b4a7ebba9eeb84af6e7b5e
UHC 厭묐쓷隣ㅷ뼇洧뺞넯n}厭묐쓷隣ㅷ뼇洧뺞넯n{^ 1110011011110100100100011110101110011101100101001110110011100100101001001110011110010110100100011110101011111011100101011110011010000110101011100110111001111101111001101111010010010001111010111001110110010100111011001110010010100100111001111001011010010001111010101111101110010101111001101000011010101110011011100111101101011110 e6f491eb9d94ece4a4e79691eafb95e686ae6e7de6f491eb9d94ece4a4e79691eafb95e686ae6e7b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
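
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the choice to replace characters a charset cannot represent with "?" (0x3f), which is what the table appears to do. The names CODECS, to_bits and encode_table are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_table(text: str) -> dict:
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        rows[label] = (to_bits(data), data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_digits) in encode_table("n{^").items():
        print(label, bits, hex_digits)
```

Because "n{^" consists only of ASCII characters, every charset listed here should yield the same three bytes, 6e 7b 5e, which matches the tail of each row in the table.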