To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????蹂??獄??臾ョ?酉?ず額?? 00111111001111110011111100111111001111110011111111100110111110000011111100111111100011011001011000111111001111111110010001101011100000111000011100111111100100111101000100111111100000101011100010001010011110100011111100111111 3f3f3f3f3f3fe6f83f3f8d963f3fe46b83873f93d13f82b88a7a3f3f
EUC-JP ??????蹂??獄??臾ョ?酉?ず額?? 00111111001111110011111100111111001111110011111111101100111110100011111100111111101110011111011000111111001111111110011111001100101001011110011100111111110001101101001100111111101001001011101010110011110110110011111100111111 3f3f3f3f3f3fecfa3f3fb9f63f3fe7cca5e73fc6d33fa4bab3db3f3f
UTF-8 掠뗰퐣杻믭쭓蹂잙젡獄쏄퀗臾ョ뛾酉쒕ず額됲꼩 111011111010010110110101111010111001011110110000111011011001000010100011111011111010011110001000111010111010111110101101111011001010110110010011111010001011100110000010111011001001111010011001111011001010000010100001111001111000110110000100111011001000111110000100111011011000000010010111111010001000011110111110111000111000001110100111111010111001101110111110111010011000010110001001111011001001001010010101111000111000000110011010111010011010000110001101111010111001000010110010111010101011110010101001 efa5b5eb97b0ed90a3efa788ebafadecad93e8b982ec9e99eca0a1e78d84ec8f84ed8097e887bee383a7eb9bbee98589ec9295e3819ae9a18deb90b2eabca9
UHC 掠뗰퐣杻믭쭓蹂잙젡獄쏄퀗臾ョ뛾酉쒕ず額됲꼩 111001011011000110001011111011111011110110001100111010101111010010010010111011111010011110001011111010111011001110011111111010111010000010011010111010001010101110011011111010101011001110001100111010111010110010101011111001111000110110000100111010111011011110011100111010111010101010111010111001001111111010001001111011011000010010000110 e5b18befbd8ceaf492efa78bebb39feba09ae8ab9beab38cebacabe78d84ebb79cebaabae4fe89ed8486

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)