What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????M?????????M???^ 001111110011111100111111001111110011111100111111010011010011111100111111001111110011111100111111001111110011111100111111001111110100110100111111001111110011111101011110 3f3f3f3f3f3f4d3f3f3f3f3f3f3f3f3f4d3f3f3f5e
SJIS-WIN ?閥?漿??M????閥?漿??M???^ 00111111100101001011010000111111100111111111011100111111001111110100110100111111001111110011111100111111100101001011010000111111100111111111011100111111001111110100110100111111001111110011111101011110 3f94b43f9ff73f3f4d3f3f3f3f94b43f9ff73f3f4d3f3f3f5e
EUC-JP ?閥?漿??M????閥?漿??M???^ 00111111110010001011011000111111110111101111100100111111001111110100110100111111001111110011111100111111110010001011011000111111110111101111100100111111001111110100110100111111001111110011111101011110 3fc8b63fdef93f3f4d3f3f3f3fc8b63fdef93f3f4d3f3f3f5e
UTF-8 뤶閥쟉漿탮적M렟롍롚뤶閥쟉漿탮적M렟롍롘^ 111010111010010010110110111010011001011010100101111011001001111110001001111001101011110010111111111011011000001110101110111011001010000010000001010011011110101110100000100111111110101110100001100011011110101110100001100110101110101110100100101101101110100110010110101001011110110010011111100010011110011010111100101111111110110110000011101011101110110010100000100000010100110111101011101000001001111111101011101000011000110111101011101000011001100001011110 eba4b6e996a5ec9f89e6bcbfed83aeeca0814deba09feba18deba19aeba4b6e996a5ec9f89e6bcbfed83aeeca0814deba09feba18deba1985e
UHC 뤶閥쟉漿탮적M렟롍롚뤶閥쟉漿탮적M렟롍롘^ 100011111110010011011011111011001100000011110001111011011110110010110101100011101100000011111011010011011000111010110000100011101101001110001110110111101000111111100100110110111110110011000000111100011110110111101100101101011000111011000000111110110100110110001110101100001000111011010011100011101101110001011110 8fe4dbecc0f1edecb58ec0fb4d8eb08ed38ede8fe4dbecc0f1edecb58ec0fb4d8eb08ed38edc5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
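
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The helper name `encode_rows` and the mapping from the page's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the rows above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = []
    for label, codec in CHARSETS.items():
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first.
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hex_string in encode_rows("閥M^"):
        print(label, binary, hex_string)
```

For the input `閥M^`, the ISO-8859-1 row comes out as `3f4d5e` (the kanji is not representable, so it becomes `?`), while the UTF-8 row comes out as `e996a54d5e`.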