To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???援??蹂l?孃る????蹂??? 00111111001111110011111110001001100001110011111100111111111001101111100010000010100011000011111110011011011011111000001011101001001111110011111100111111001111111110011011111000001111110011111100111111 3f3f3f89873f3fe6f8828c3f9b6f82e93f3f3f3fe6f83f3f3f
EUC-JP 艅??援??蹂l?孃る????蹂??獒 1000111111010110111111010011111100111111101100011110011100111111001111111110110011111010101000111110110000111111110101011101000010100100111010110011111100111111001111110011111111101100111110100011111100111111100011111100101110111011 8fd6fd3f3fb1e73f3fecfaa3ec3fd5d0a4eb3f3f3f3fecfa3f3f8fcbbb
UTF-8 艅덈퀩援좄짆蹂l맦孃る뜄痢싨찄蹂앷턄獒 111010001000100110000101111010111000110110001000111011011000000010101001111001101000111110110100111011001010001010000100111011001010011110000110111010001011100110000010111011111011110110001100111010111010011110100110111001011010110110000011111000111000001010001011111010111001110010000100111011111010011110100101111011001000101110101000111011001011000010000100111010001011100110000010111011001001010110110111111011011000010010000100111001111000110110010010 e88985eb8d88ed80a9e68fb4eca284eca786e8b982efbd8ceba7a6e5ad83e3828beb9c84efa7a5ec8ba8ecb084e8b982ec95b7ed8484e78d92
UHC 艅덈퀩援좄짆蹂l맦孃る뜄痢싨찄蹂앷턄獒 1110011010101001100010001110101110110011100111011110101010110101101000001110100010100011100101011110101110110011101000111110110010010000101011111110010110111110101010101110101110001101100010001110110010111000100110101110011010101001100010001110101110110011100111011110101010110101101000001110100010100011 e6a988ebb39deab5a0e8a395ebb3a3ec90afe5beaaeb8d88ecb89ae6a988ebb39deab5a0e8a3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)