To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 撓????幽??巍ル?裕??循???l?弛 1001110110011010001111110011111100111111001111111001011101001000001111110011111110011011110110011000001110001011001111111001011101010100001111110011111110001111011110100011111100111111001111111000001010001100001111111001001001101111 9d9a3f3f3f3f97483f3f9bd9838b3f97543f3f8f7a3f3f3f828c3f926f
EUC-JP 撓????幽??巍ル?裕??循???l?弛 1101100111111010001111110011111100111111001111111100110110101001001111110011111111010110110110111010010111101011001111111100110110110101001111110011111110111101110110110011111100111111001111111010001111101100001111111100001111010000 d9fa3f3f3f3fcda93f3fd6dba5eb3fcdb53f3fbddb3f3f3fa3ec3fc3d0
UTF-8 撓눸뼘뷴뵱幽귢뭍巍ル쵑裕곻쫮循뗫룏力l꼻弛 111001101001001010010011111010111000100010111000111010111011110010011000111010111011011110110100111010111011010110110001111001011011100110111101111010101011011110100010111010111010110110001101111001011011011110001101111000111000001110101011111011001011010110010001111010001010001110010101111010101011001110111011111011001010101110101110111001011011111010101010111010111001011110101011111010111010001110001111111011111010011010001010111011111011110110001100111010101011110010111011111001011011110010011011 e69293eb88b8ebbc98ebb7b4ebb5b1e5b9bdeab7a2ebad8de5b78de383abecb591e8a395eab3bbecabaee5beaaeb97abeba38fefa68aefbd8ceabcbbe5bc9b
UHC 撓눸뼘뷴뵱幽귢뭍巍ル쵑裕곻쫮循뗫룏力l꼻弛 111010001111010110000111110011101011101111000010101110101110010110010100101011111110101011101011100000101110101010111001101101111110100011100100101010111110101110101100100100111110101110101110100000011110111110100110100001101110001011100000100010111110101110001111100011011110011010110011101000111110110010000100100100111110110010101100 e8f587cebbc2bae594afeaeb82eab9b7e8e4abebac93ebae81efa686e2e08beb8f8de6b3a3ec8493ecac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)