To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鴨??奄?????液??宥??暎?????^ 100010101001101100111111001111111000100110000010001111110011111100111111001111110011111110001001011101000011111100111111100101110100011100111111001111111001110111110011001111110011111100111111001111110011111101011110 8a9b3f3f89823f3f3f3f3f89743f3f97473f3f9df33f3f3f3f3f5e
EUC-JP 鴨??奄?????液??宥??暎?????^ 101100111111101100111111001111111011000111100010001111110011111100111111001111110011111110110001110101010011111100111111110011011010100000111111001111111101101011110101001111110011111100111111001111110011111101011110 b3fb3f3fb1e23f3f3f3f3fb1d53f3fcda83f3fdaf53f3f3f3f3f5e
UTF-8 鴨뗫떩奄사꺌溜경옗液ㅵ뇣宥삥뵽暎㏓젾女앶턄^ 11101001101101001010100011101011100101111010101111101011100101101010100111100101101001011000010011101100100000101010110011101010101110101000110011101111101001111000101111101010101100101011110111101100100110001001011111100110101101101011001011100011100001011011010111101011100001111010001111100101101011101010010111101100100000101010010111101011101101011011110111100110100110101000111011100011100011111001001111101100101000001011111011101111101001101000000111101100100101011011011011101101100001001000010001011110 e9b4a8eb97abeb96a9e5a584ec82aceaba8cefa78beab2bdec9897e6b6b2e385b5eb87a3e5aea5ec82a5ebb5bde69a8ee38f93eca0beefa681ec95b6ed84845e
UHC 鴨뗫떩奄사꺌溜경옗液ㅵ뇣宥삥뵽暎㏓젾女앶턄^ 11100100111001011000101111101011100010111011101111100101111100101011101111100111101100101010011111101010111111101011000011100110100111101001110111100100111110111010010011100101100001111000101111101010111010011011101111100110100101001011101111100111101100101010011111101011101000001011000011100101111111001001110111101001101101011010000001011110 e4e58beb8bbbe5f2bbe7b2a7eafeb0e69e9de4fba4e5878beae9bbe694bbe7b2a7eba0b0e5fc9de9b5a05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)