To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鴨??宜?????液??宥??汚?????^ 100010101001101100111111001111111000101101011000001111110011111100111111001111110011111110001001011101000011111100111111100101110100011100111111001111111000100110011000001111110011111100111111001111110011111101011110 8a9b3f3f8b583f3f3f3f3f89743f3f97473f3f89983f3f3f3f3f5e
EUC-JP 鴨??宜?????液??宥??汚?????^ 101100111111101100111111001111111011010110111001001111110011111100111111001111110011111110110001110101010011111100111111110011011010100000111111001111111011000111111000001111110011111100111111001111110011111101011110 b3fb3f3fb5b93f3f3f3f3fb1d53f3fcda83f3fb1f83f3f3f3f3f5e
UTF-8 鴨뗫떦宜김읂溜경옗液ㅵ뇣宥뱀벑汚밸젾女앶턄^ 11101001101101001010100011101011100101111010101111101011100101101010011011100101101011101001110011101010101110011000000011101100100111011000001011101111101001111000101111101010101100101011110111101100100110001001011111100110101101101011001011100011100001011011010111101011100001111010001111100101101011101010010111101011101100011000000011101011101100101001000111100110101100011001101011101011101100001011100011101100101000001011111011101111101001101000000111101100100101011011011011101101100001001000010001011110 e9b4a8eb97abeb96a6e5ae9ceab980ec9d82efa78beab2bdec9897e6b6b2e385b5eb87a3e5aea5ebb180ebb291e6b19aebb0b8eca0beefa681ec95b6ed84845e
UHC 鴨뗫떦宜김읂溜경옗液ㅵ뇣宥뱀벑汚밸젾女앶턄^ 11100100111001011000101111101011100010111011100111101011111100011011000111101000100111111011100111101010111111101011000011100110100111101001110111100100111110111010010011100101100001111000101111101010111010011011100111101100100100111011000111100111111111011011100111101011101000001011000011100101111111001001110111101001101101011010000001011110 e4e58beb8bb9ebf1b1e89fb9eafeb0e69e9de4fba4e5878beae9b9ec93b1e7fdb9eba0b0e5fc9de9b5a05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)