To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 雅??幼??癒ル?繞??鎰??受??繹 100010011110101100111111001111111001011101100011001111110011111110010110111111001000001110001011001111111110001110000101001111110011111111101000010011000011111100111111100011101111001100111111001111111110001110001000 89eb3f3f97633f3f96fc838b3fe3853f3fe84c3f3f8ef33f3fe388
EUC-JP 雅??幼??癒ル?繞??鎰??受??繹 101100101110110100111111001111111100110111000100001111110011111111001100111111101010010111101011001111111110010111100101001111110011111111101111101011010011111100111111101111001111010100111111001111111110010111101000 b2ed3f3fcdc43f3fccfea5eb3fe5e53f3fefad3f3fbcf53f3fe5e8
UTF-8 雅붴뼹幼쀦벴癒ル젗繞볥쑚鎰뤶틪受쇰뙒繹 111010011001101110000101111010111011011010110100111010111011110010111001111001011011100110111100111011001000000010100110111010111011001010110100111001111001100110010010111000111000001110101011111011001010000010010111111001111011100110011110111010111011001110100101111011001001000110011010111010011000111010110000111010111010010010110110111011011000101110101010111001011000111110010111111011001000011110110000111010111001100110010010111001111011100110111001 e99b85ebb6b4ebbcb9e5b9bcec80a6ebb2b4e79992e383abeca097e7b99eebb3a5ec919ae98eb0eba4b6ed8baae58f97ec87b0eb9992e7b9b9
UHC 雅붴뼹幼쀦벴癒ル젗繞볥쑚鎰뤶틪受쇰뙒繹 1110010010111010100101001110001010010110101111001110101011101010100101111110011010111010101010111110101110101000101010111110101110100000100100111110100110100100100100111110101110011100101110011110110011110000100011111110010010111010100101001110000111110100101111001110101110001100100101111110011010111010 e4ba94e296bceaea97e6baabeba8abeba093e9a493eb9cb9ecf08fe4ba94e1f4bceb8c97e6ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)