What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 狸賊即狸賊側狸賊即狸属他狸賊即狸属他狸短 10010010010010111001000110101111100100011010011010010010010010111001000110101111100100011010010010010010010010111001000110101111100100011010011010010010010010111001000110101110100100011011110010010010010010111001000110101111100100011010011010010010010010111001000110101110100100011011110010010010010010111001001001011010 924b91af91a6924b91af91a4924b91af91a6924b91ae91bc924b91af91a6924b91ae91bc924b925a
EUC-JP 狸賊即狸賊側狸賊即狸属他狸賊即狸属他狸短 11000011101011001100001010110001110000101010100011000011101011001100001010110001110000101010011011000011101011001100001010110001110000101010100011000011101011001100001010110000110000101011111011000011101011001100001010110001110000101010100011000011101011001100001010110000110000101011111011000011101011001100001110111011 c3acc2b1c2a8c3acc2b1c2a6c3acc2b1c2a8c3acc2b0c2bec3acc2b1c2a8c3acc2b0c2bec3acc3bb
UTF-8 狸賊即狸賊側狸賊即狸属他狸賊即狸属他狸短 111001111000101110111000111010001011001110001010111001011000110110110011111001111000101110111000111010001011001110001010111001011000000110110100111001111000101110111000111010001011001110001010111001011000110110110011111001111000101110111000111001011011000110011110111001001011101110010110111001111000101110111000111010001011001110001010111001011000110110110011111001111000101110111000111001011011000110011110111001001011101110010110111001111000101110111000111001111001111110101101 e78bb8e8b38ae58db3e78bb8e8b38ae581b4e78bb8e8b38ae58db3e78bb8e5b19ee4bb96e78bb8e8b38ae58db3e78bb8e5b19ee4bb96e78bb8e79fad
UHC 狸賊?狸賊側狸賊?狸?他狸賊?狸?他狸短 1101011111100001111011101110010000111111110101111110000111101110111001001111011010110000110101111110000111101110111001000011111111010111111000010011111111110110111000101101011111100001111011101110010000111111110101111110000100111111111101101110001011010111111000011101001110101101 d7e1eee43fd7e1eee4f6b0d7e1eee43fd7e13ff6e2d7e1eee43fd7e13ff6e2d7e1d3ad

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
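
The rows of the table can be reproduced offline with a few lines of Python. This is only a minimal sketch, not the code behind this page: the function name encode_row and the CHARSETS mapping are hypothetical, and the choice of Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption about which codecs correspond to the charsets listed here. Characters that a charset cannot represent are replaced with "?" (0x3f), as in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return shown, binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        shown, binary, hexa = encode_row("狸", codec)
        print(label, shown, binary, hexa)
```

For a single character such as 狸, the UTF-8 row comes out as three bytes (e78bb8), the SJIS-WIN and EUC-JP rows as two bytes each (924b and c3ac), and the ISO-8859-1 row as a single "?" (3f), which matches the leading bytes of the corresponding rows in the table.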