What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 荳ケ蟾ス螻樔クケ蜈カ騾滉クケ驕懃カ夊エ 1110010010111000101110011110010110110111101111011110010110110001100111101110010010111000101110011110010110000101101101101110100110000000100111111110010010111000101110011110100110000001100111001110011110110110100110101110100010110100 e4b8b9e5b7bde5b19ee4b8b9e585b6e9809fe4b8b9e9819ce7b69ae8b4
EUC-JP 荳ケ蟾ス螻樔クケ蜈カ騾滉クケ驕懃カ夊エ 1110100010111010100011101011100111101010101110011000111010111101111010101011001111011100111001101000111010111000100011101011100111101001111001011000111010110110111100011110000011011110111001101000111010111000100011101011100111110001111000011101100011101001100011101011011011010100111010101000111010110100 e8ba8eb9eab98ebdeab3dce68eb88eb9e9e58eb6f1e0dee68eb88eb9f1e1d8e98eb6d4ea8eb4
UTF-8 荳ケ蟾ス螻樔クケ蜈カ騾滉クケ驕懃カ夊エ 111010001000110110110011111011111011110110111001111010001001111110111110111011111011110110111101111010001001111010111011111001101010100010010100111011111011110110111000111011111011110110111001111010001001110010001000111011111011110110110110111010011010100010111110111001101011101110001001111011111011110110111000111011111011110110111001111010011010100110010101111001101000011110000011111011111011110110110110111001011010010010001010111011111011110110110100 e88db3efbdb9e89fbeefbdbde89ebbe6a894efbdb8efbdb9e89c88efbdb6e9a8bee6bb89efbdb8efbdb9e9a995e68783efbdb6e5a48aefbdb4
UHC 荳?蟾?????蜈??滉??驕懃??? 11010100111001010011111111100000111010100011111100111111001111110011111100111111111010001010010100111111001111111111110011010001001111110011111111001110111101101101000011000100001111110011111100111111 d4e53fe0ea3f3f3f3f3fe8a53f3ffcd13f3fcef6d0c43f3f3f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
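
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The names `CHARSETS`, `to_bit_strings`, and `convert` are hypothetical. The mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, and Python's codecs may differ from this page's converter for some characters. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches what the ISO-8859-1 row in the table shows.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("A", "utf-8")` returns `("01000001", "41")`, and `to_bit_strings("あ", "iso-8859-1")` returns `("00111111", "3f")` because ISO-8859-1 has no hiragana.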