Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????E 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN ???淫????吟????淫????淫?E 00111111001111110011111110001000111110100011111100111111001111110011111110001011111000010011111100111111001111110011111110001000111110100011111100111111001111110011111110001000111110100011111101000101 3f3f3f88fa3f3f3f3f8be13f3f3f3f88fa3f3f3f3f88fa3f45
EUC-JP ???淫????吟????淫????淫?E 00111111001111110011111110110000111111000011111100111111001111110011111110110110111000110011111100111111001111110011111110110000111111000011111100111111001111110011111110110000111111000011111101000101 3f3f3fb0fc3f3f3f3fb6e33f3f3f3fb0fc3f3f3f3fb0fc3f45
UTF-8 溜깅젡淫륱溜깅젡吟퀳溜깅젡淫륱溜깅젡淫웪E 11101111101001111000101111101010101110011000010111101100101000001010000111100110101101111010101111101011101001011011000111101111101001111000101111101010101110011000010111101100101000001010000111100101100100001001111111101101100000001011001111101111101001111000101111101010101110011000010111101100101000001010000111100110101101111010101111101011101001011011000111101111101001111000101111101010101110011000010111101100101000001010000111100110101101111010101111101100100110111010101001000101 efa78beab985eca0a1e6b7abeba5b1efa78beab985eca0a1e5909fed80b3efa78beab985eca0a1e6b7abeba5b1efa78beab985eca0a1e6b7abec9baa45
UHC 溜깅젡淫륱溜깅젡吟퀳溜깅젡淫륱溜깅젡淫웪E 1110101011111110101100011110101110100000100110101110101111100010100100000101001011101010111111101011000111101011101000001001101011101011111000011011010001000110111010101111111010110001111010111010000010011010111010111110001010010000010100101110101011111110101100011110101110100000100110101110101111100010100111110111010001000101 eafeb1eba09aebe29052eafeb1eba09aebe1b446eafeb1eba09aebe29052eafeb1eba09aebe29f7445
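The columns of the table above can be reproduced with a short Python sketch. This is not the page's own implementation. It assumes that unencodable characters are replaced with `?` (0x3f), as the ISO-8859-1 row suggests, and that the page's charset labels correspond to the Python codec names given in `CODECS`. The names `CODECS`, `encode_row`, and `convert` are hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# SJIS-WIN is taken to be CP932 and UHC to be CP949.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary string, hex string) for one charset."""
    # Characters the charset cannot represent become "?" (assumption).
    raw = text.encode(codec, errors="replace")
    # Eight binary digits per byte, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    # Decode the bytes again to show what survived the conversion.
    return raw.decode(codec), binary, raw.hex()


def convert(text):
    """Build one result row per charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in convert("淫E").items():
        print(label, shown, bits, hexstr)
```

For the input `淫E`, the ISO-8859-1 row comes out as `?E` with hex `3f45`, while the UTF-8 row keeps both characters as `e6b7ab45`. This matches the pattern in the table, where every character outside a charset's repertoire appears as `3f`.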

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.