To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鴨??宜?????液??宥??餓?????^ 100010101001101100111111001111111000101101011000001111110011111100111111001111110011111110001001011101000011111100111111100101110100011100111111001111111000100111101100001111110011111100111111001111110011111101011110 8a9b3f3f8b583f3f3f3f3f89743f3f97473f3f89ec3f3f3f3f3f5e
EUC-JP 鴨??宜?????液??宥??餓?????^ 101100111111101100111111001111111011010110111001001111110011111100111111001111110011111110110001110101010011111100111111110011011010100000111111001111111011001011101110001111110011111100111111001111110011111101011110 b3fb3f3fb5b93f3f3f3f3fb1d53f3fcda83f3fb2ee3f3f3f3f3f5e
UTF-8 鴨뗫떦宜긴뻣溜경옗液ㅵ뇣宥뱀벑餓삳젾女앶턄^ 11101001101101001010100011101011100101111010101111101011100101101010011011100101101011101001110011101010101110001011010011101011101110111010001111101111101001111000101111101010101100101011110111101100100110001001011111100110101101101011001011100011100001011011010111101011100001111010001111100101101011101010010111101011101100011000000011101011101100101001000111101001101001001001001111101100100000101011001111101100101000001011111011101111101001101000000111101100100101011011011011101101100001001000010001011110 e9b4a8eb97abeb96a6e5ae9ceab8b4ebbba3efa78beab2bdec9897e6b6b2e385b5eb87a3e5aea5ebb180ebb291e9a493ec82b3eca0beefa681ec95b6ed84845e
UHC 鴨뗫떦宜긴뻣溜경옗液ㅵ뇣宥뱀벑餓삳젾女앶턄^ 11100100111001011000101111101011100010111011100111101011111100011011000111100100101110111011101111101010111111101011000011100110100111101001110111100100111110111010010011100101100001111000101111101010111010011011100111101100100100111011000111100100101110111011101111101011101000001011000011100101111111001001110111101001101101011010000001011110 e4e58beb8bb9ebf1b1e4bbbbeafeb0e69e9de4fba4e5878beae9b9ec93b1e4bbbbeba0b0e5fc9de9b5a05e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
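
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the mapping of the table's labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "\u9d28^"  # the character 鴨 followed by '^'
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("^", "utf-8")` returns `("01011110", "5e")`, the same trailing byte that ends every row of the table.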