To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 搖??毅??擬???△?寃ヨ????倭??B 1001110110001010001111110011111110001011010000100011111100111111100010110101101100111111001111110011111110000001101000100011111110011011100000111000001110001000001111110011111100111111001111111001100001100000001111110011111101000010 9d8a3f3f8b423f3f8b5b3f3f3f81a23f9b8383883f3f3f3f98603f3f42
EUC-JP 搖??毅??擬??璵△?寃ヨ????倭??B 11011001111010100011111100111111101101011010001100111111001111111011010110111100001111110011111110001111110011001110011010100010101001000011111111010101111000111010010111101000001111110011111100111111001111111100111111000001001111110011111101000010 d9ea3f3fb5a33f3fb5bc3f3f8fcce6a2a43fd5e3a5e83f3f3f3fcfc13f3f42
UTF-8 搖삳씮毅껇굢擬륁굦璵△뫁寃ヨ떏戮㏐뭍倭얩뒟B 11100110100100001001011011101100100000101011001111101100100101001010111011100110101011111000010111101010101110111000011111101010101101011010001011100110100100111010110011101011101001011000000111101010101101011010011011100111100100101011010111100010100101101011001111101011101010111000000111100101101011111000001111100011100000111010100011101011100101101000111111101111101001111001001011100011100011111001000011101011101011011000110111100101100000001010110111101100100101101010100111101011100100101001111101000010 e69096ec82b3ec94aee6af85eabb87eab5a2e693aceba581eab5a6e792b5e296b3ebab81e5af83e383a8eb968fefa792e38f90ebad8de580adec96a9eb929f42
UHC 搖삳씮毅껇굢擬륁굦璵△뫁寃ヨ떏戮㏐뭍倭얩뒟B 11101000111101001011101111101011100111011011111111101011111101101000001111101000100000101000100111101011111101001000111111101100100000101000110011100110101001011010000111100010100100011010010111101010101100101010101111101000100010111010010111101011101111011010011111101010101110011011011111101000110111101011111011101101100010101001101101000010 e8f4bbeb9dbfebf683e88289ebf48fec828ce6a5a1e291a5eab2abe88ba5ebbda7eab9b7e8debeed8a9b42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)