What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 懿??淹??燿⑦?異?????意③?鎰??B 100111001111001000111111001111111001111110111001001111110011111111100000101000001000011101000110001111111000100011011001001111110011111100111111001111110011111110001000110100111000011101000010001111111110100001001100001111110011111101000010 9cf23f3f9fb93f3fe0a087463f88d93f3f3f3f3f88d387423fe84c3f3f42
EUC-JP 懿??淹??燿??異?????意??鎰??B 11011000111101000011111100111111110111101011101100111111001111111110000010100010001111110011111110110000110110110011111100111111001111110011111100111111101100001101010100111111001111111110111110101101001111110011111101000010 d8f43f3fdebb3f3fe0a23f3fb0db3f3f3f3f3fb0d53f3fefad3f3f42
UTF-8 懿됰ㅆ淹먯뵪燿⑦뭷異싳븠琉뜻뼆意③옩鎰놁텥B 11100110100001111011111111101011100100001011000011100011100001011000011011100110101101111011100111101011101010001010111111101011101101011010101011100111100001111011111111100010100100011010011011101011101011011011011111100111100101011011000011101100100010111011001111101011101110001010000011101111101001111000110011101011100111001011101111101011101111001000011011100110100001001000111111100010100100011010001011101100100110001010100111101001100011101011000011101011100001101000000111101101100001011010010101000010 e687bfeb90b0e38586e6b7b9eba8afebb5aae787bfe291a6ebadb7e795b0ec8bb3ebb8a0efa78ceb9cbbebbc86e6848fe291a2ec98a9e98eb0eb8681ed85a542
UHC 懿됰ㅆ淹먯뵪燿⑦뭷異싳븠琉뜻뼆意③옩鎰놁텥B 11101011111100111000100111101011101001001011011011100101111101001001000011101100100101001010100011101000111111001010100011101101100100101000011011101100101101101001101011101100100101011000100111101011101001001011011011100110100101101001000011101011111100101010100011101001100111101010100011101100111100001000011011101100101101101001101001000010 ebf389eba4b6e5f490ec94a8e8fca8ed9286ecb69aec9589eba4b6e69690ebf2a8e99ea8ecf086ecb69a42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
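
The page does not say how the conversion is implemented, so the following is only a minimal Python sketch of the same idea: encode the input in each charset, then print the bytes as a binary string and as a hexadecimal string. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Python's codecs may differ from the converter in a few edge-case mappings. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("B").items():
        print(label, bits, hex_string)
```

For the ASCII letter `B`, every charset listed above yields the single byte `42` (binary `01000010`), which is why each row of the table ends in `42`.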