To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 霍ォ閧イ貍芽キ帛匀霍ォ鈞イ貍芽キ帶怏^ 11101000101101111010101111101000100000101011001011100110101111001000100111101000101101111001101111100101111110101000100111101000101101111010101111100111111000001011001011100110101111001000100111101000101101111001101111100110100111001000100101011110 e8b7abe882b2e6bc89e8b79be5fa89e8b7abe7e0b2e6bc89e8b79be69c895e
EUC-JP 霍ォ閧イ貍芽キ帛匀霍ォ鈞イ貍芽キ帶怏^ 1111000010111001100011101010101111101111111000101000111010110010111011001011111010110010111010101000111010110111110101101110011110001111101100111111101111110000101110011000111010101011111011101110001010001110101100101110110010111110101100101110101010001110101101111101011011101000110101111110100101011110 f0b98eabefe28eb2ecbeb2ea8eb7d6e78fb3fbf0b98eabeee28eb2ecbeb2ea8eb7d6e8d7e95e
UTF-8 霍ォ閧イ貍芽キ帛匀霍ォ鈞イ貍芽キ帶怏^ 11101001100111001000110111101111101111011010101111101001100101101010011111101111101111011011001011101000101100101000110111101000100010101011110111101111101111011011011111100101101110001001101111100101100011001000000011101001100111001000110111101111101111011010101111101001100010001001111011101111101111011011001011101000101100101000110111101000100010101011110111101111101111011011011111100101101110001011011011100110100000001000111101011110 e99c8defbdabe996a7efbdb2e8b28de88abdefbdb7e5b89be58c80e99c8defbdabe9889eefbdb2e8b28de88abdefbdb7e5b8b6e6808f5e
UHC ?????芽?帛???鈞??芽?帶怏^ 00111111001111110011111100111111001111111110010010110100001111111101101111011001001111110011111100111111110100001011011100111111001111111110010010110100001111111101001111100001111001001110100001011110 3f3f3f3f3fe4b43fdbd93f3f3fd0b73f3fe4b43fd3e1e4e85e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)