To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 娃??????よ?歟??節h?崖?????^ 10001000101000010011111100111111001111110011111100111111001111111000001011100110001111111001111101100010001111110011111110010000110111111000001010001000001111111000101001010010001111110011111100111111001111110011111101011110 88a13f3f3f3f3f3f82e63f9f623f3f90df82883f8a523f3f3f3f3f5e
EUC-JP 娃??邕???よ?歟??節h?崖??邕??^ 1011000010100011001111110011111110001111111000011110110100111111001111110011111110100100111010000011111111011101110000110011111100111111110000001110000110100011111010000011111110110011101100110011111100111111100011111110000111101101001111110011111101011110 b0a33f3f8fe1ed3f3f3fa4e83fddc33f3fc0e1a3e83fb3b33f3f8fe1ed3f3f5e
UTF-8 娃륅숲邕뤷뮧力よ퍘歟쀩걦節h퍘崖꿩나邕뤸쥥^ 11100101101010001000001111101011101001011000010111101100100010001011001011101001100000101001010111101011101001001011011111101011101011101010011111101111101001101000101011100011100000101000100011101101100011011001100011100110101011011001111111101100100000001010100111101010101100011010011011100111101011111000000011101111101111011000100011101101100011011001100011100101101101001001011011101010101111111010100111101011100000101001100011101001100000101001010111101011101001001011100011101100101001011010010101011110 e5a883eba585ec88b2e98295eba4b7ebaea7efa68ae38288ed8d98e6ad9fec80a9eab1a6e7af80efbd88ed8d98e5b496eabfa9eb8298e98295eba4b8eca5a55e
UHC 娃륅숲邕뤷뮧力よ퍘歟쀩걦節h퍘崖꿩나邕뤸쥥^ 11101000110111111000111111101111101111011010001111101000101110111000111111100101100100101011001011100110101100111010101011101000101110111000111111100110101000101001011111101001100000011000111111101111101111011010001111101000101110111000111111100100111100001011001011100110101100111010101011101000101110111000111111100110101000101001011101011110 e8df8fefbda3e8bb8fe592b2e6b3aae8bb8fe6a297e9818fefbda3e8bb8fe4f0b2e6b3aae8bb8fe6a2975e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)