To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN ??????癲??D??????癲??D^ 0011111100111111001111110011111100111111001111111110000110011111001111110011111101000100001111110011111100111111001111110011111100111111111000011001111100111111001111110100010001011110 3f3f3f3f3f3fe19f3f3f443f3f3f3f3f3fe19f3f3f445e
EUC-JP 薏?????癲??D薏?????癲??D^ 100011111101100111011110001111110011111100111111001111110011111111100010101000010011111100111111010001001000111111011001110111100011111100111111001111110011111100111111111000101010000100111111001111110100010001011110 8fd9de3f3f3f3f3fe2a13f3f448fd9de3f3f3f3f3fe2a13f3f445e
UTF-8 薏앾쬉硫쏆씅癲잛셾D薏앾쬉硫쏆씅癲잛셾D^ 111010001001011010001111111011001001010110111110111011001010110010001001111011111010011110001110111011001000111110000110111011001001010010000101111001111001100110110010111011001001111010011011111011001000010110111110010001001110100010010110100011111110110010010101101111101110110010101100100010011110111110100111100011101110110010001111100001101110110010010100100001011110011110011001101100101110110010011110100110111110110010000101101111100100010001011110 e8968fec95beecac89efa78eec8f86ec9485e799b2ec9e9bec85be44e8968fec95beecac89efa78eec8f86ec9485e799b2ec9e9bec85be445e
UHC 薏앾쬉硫쏆씅癲잛셾D薏앾쬉硫쏆씅癲잛셾D^ 111010111111101110011101111011111010011010011111111010111010100110011011111011001001110110011101111011111010011010011111111011001001100110000011010001001110101111111011100111011110111110100110100111111110101110101001100110111110110010011101100111011110111110100110100111111110110010011001100000110100010001011110 ebfb9defa69feba99bec9d9defa69fec998344ebfb9defa69feba99bec9d9defa69fec9983445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)