To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????鴨??????り????鴨???? 0011111100111111001111110011111100111111001111111000101010011011001111110011111100111111001111110011111100111111100000101110100000111111001111110011111100111111100010101001101100111111001111110011111100111111 3f3f3f3f3f3f8a9b3f3f3f3f3f3f82e83f3f3f3f8a9b3f3f3f3f
EUC-JP ??????鴨??????り????鴨???? 0011111100111111001111110011111100111111001111111011001111111011001111110011111100111111001111110011111100111111101001001110101000111111001111110011111100111111101100111111101100111111001111110011111100111111 3f3f3f3f3f3fb3fb3f3f3f3f3f3fa4ea3f3f3f3fb3fb3f3f3f3f
UTF-8 玲곴낏溜곕젒鴨뗫떩溜띯궗溜り낏溜곕젒鴨뗫떩溜띊 111011111010011010101101111010101011001110110100111010111000001010001111111011111010011110001011111010101011001110010101111011001010000010010010111010011011010010101000111010111001011110101011111010111001011010101001111011111010011110001011111010111001110110101111111010101011011010010111111011111010011110001011111000111000001010001010111010111000001010001111111011111010011110001011111010101011001110010101111011001010000010010010111010011011010010101000111010111001011110101011111010111001011010101001111011111010011110001011111010111001110110001010 efa6adeab3b4eb828fefa78beab395eca092e9b4a8eb97abeb96a9efa78beb9dafeab697efa78be3828aeb828fefa78beab395eca092e9b4a8eb97abeb96a9efa78beb9d8a
UHC 玲곴낏溜곕젒鴨뗫떩溜띯궗溜り낏溜곕젒鴨뗫떩溜띊 11100111101111111000000111101010101100111010100011101010111111101011000011101011101000001001000111100100111001011000101111101011100010111011101111101010111111101000110111100010100000101010110011101010111111101010101011101010101100111010100011101010111111101011000011101011101000001001000111100100111001011000101111101011100010111011101111101010111111101000110111000011 e7bf81eab3a8eafeb0eba091e4e58beb8bbbeafe8de282aceafeaaeab3a8eafeb0eba091e4e58beb8bbbeafe8dc3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)