To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 搖?????矣??艶k?陰??純??曖 1001110110001010001111110011111100111111001111110011111111100001111000010011111100111111100010011001000010000010100010110011111110001001010000010011111100111111100011111000001100111111001111111001111001000010 9d8a3f3f3f3f3fe1e13f3f8990828b3f89413f3f8f833f3f9e42
EUC-JP 搖??靷??矣??艶k?陰??純??曖 11011001111010100011111100111111100011111110011110111101001111110011111111100010111000110011111100111111101100011111000010100011111010110011111110110001101000100011111100111111101111011110001100111111001111111101101110100011 d9ea3f3f8fe7bd3f3fe2e33f3fb1f0a3eb3fb1a23f3fbde33f3fdba3
UTF-8 搖깅ㅏ靷숂뙴矣꾨쿅艶k끏陰얍윜純볧닑曖 111001101001000010010110111010101011100110000101111000111000010110001111111010011001110110110111111011001000100010000010111010111001100110110100111001111001111110100011111010101011111010101000111011001011111110000101111010001000100110110110111011111011110110001011111010111000000110001111111010011001100110110000111011001001011010001101111011001001110010011100111001111011010010010100111010111011001110100111111010111000101110010001111001101001101110010110 e69096eab985e3858fe99db7ec8882eb99b4e79fa3eabea8ecbf85e889b6efbd8beb818fe999b0ec968dec9c9ce7b494ebb3a7eb8b91e69b96
UHC 搖깅ㅏ靷숂뙴矣꾨쿅艶k끏陰얍윜純볧닑曖 1110100011110100101100011110101110100100101111111110110011100110100110011110011110001100101101111110101111111000100001001110101110110010100110101110011011111101101000111110101110000101101111111110101111100100101111101110010110011111100111111110001011101101100100111110110110001000100101101110010011110010 e8f4b1eba4bfece699e78cb7ebf884ebb29ae6fda3eb85bfebe4bee59f9fe2ed93ed8896e4f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)