To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 迪匁ケソ遽晉扱迪夜梵迪夜揆迪匁ケソ遽晉扱B 11100111100011001001011011100110101110011011111111100111101011111001110111100111100010001011010111100111100011001001011011101001100111101001000011100111100011001001011011101001100111011000010011100111100011001001011011100110101110011011111111100111101011111001110111100111100010001011010101000010 e78c96e6b9bfe7af9de788b5e78c96e99e90e78c96e99d84e78c96e6b9bfe7af9de788b542
EUC-JP 迪匁ケソ遽晉扱迪夜梵迪夜揆迪匁ケソ遽晉扱B 1110110111101100110011001110100010001110101110011000111010111111111011101011000111011010111010011011000010110111111011011110110011001100111010111101101111110000111011011110110011001100111010111101100111100100111011011110110011001100111010001000111010111001100011101011111111101110101100011101101011101001101100001011011101000010 edeccce88eb98ebfeeb1dae9b0b7edecccebdbf0edecccebd9e4edeccce88eb98ebfeeb1dae9b0b742
UTF-8 迪匁ケソ遽晉扱迪夜梵迪夜揆迪匁ケソ遽晉扱B 11101000101111111010101011100101100011001000000111101111101111011011100111101111101111011011111111101001100000011011110111100110100110011000100111100110100010011011000111101000101111111010101011100101101001001001110011100110101000101011010111101000101111111010101011100101101001001001110011100110100011111000011011101000101111111010101011100101100011001000000111101111101111011011100111101111101111011011111111101001100000011011110111100110100110011000100111100110100010011011000101000010 e8bfaae58c81efbdb9efbdbfe981bde69989e689b1e8bfaae5a49ce6a2b5e8bfaae5a49ce68f86e8bfaae58c81efbdb9efbdbfe981bde69989e689b142
UHC 迪???遽晉扱迪夜梵迪夜揆迪???遽晉扱B 1110111011101000001111110011111100111111110010111110100011110010110010111101000011100010111011101110100011100101101010001101101111101111111011101110100011100101101010001101000010100110111011101110100000111111001111110011111111001011111010001111001011001011110100001110001001000010 eee83f3f3fcbe8f2cbd0e2eee8e5a8dbefeee8e5a8d0a6eee83f3f3fcbe8f2cbd0e242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)