To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 丈ヲシ」セス酌シェv丈ヲシ」セス酌シェvB 10001111111001001010011010111100101000111011111010111101100011101101111010111100101010100111011010001111111001001010011010111100101000111011111010111101100011101101111010111100101010100111011001000010 8fe4a6bca3bebd8edebcaa768fe4a6bca3bebd8edebcaa7642
EUC-JP 丈ヲシ」セス酌シェv丈ヲシ」セス酌シェvB 101111101110011010001110101001101000111010111100100011101010001110001110101111101000111010111101101111001110000010001110101111001000111010101010011101101011111011100110100011101010011010001110101111001000111010100011100011101011111010001110101111011011110011100000100011101011110010001110101010100111011001000010 bee68ea68ebc8ea38ebe8ebdbce08ebc8eaa76bee68ea68ebc8ea38ebe8ebdbce08ebc8eaa7642
UTF-8 丈ヲシ」セス酌シェv丈ヲシ」セス酌シェvB 111001001011100010001000111011111011110110100110111011111011110110111100111011111011110110100011111011111011110110111110111011111011110110111101111010011000010110001100111011111011110110111100111011111011110110101010011101101110010010111000100010001110111110111101101001101110111110111101101111001110111110111101101000111110111110111101101111101110111110111101101111011110100110000101100011001110111110111101101111001110111110111101101010100111011001000010 e4b888efbda6efbdbcefbda3efbdbeefbdbde9858cefbdbcefbdaa76e4b888efbda6efbdbcefbda3efbdbeefbdbde9858cefbdbcefbdaa7642
UHC 丈?????酌??v丈?????酌??vB 11101101110110110011111100111111001111110011111100111111111011011100110000111111001111110111011011101101110110110011111100111111001111110011111100111111111011011100110000111111001111110111011001000010 eddb3f3f3f3f3fedcc3f3f76eddb3f3f3f3f3fedcc3f3f7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)