To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????h???? 001111110011111100111111001111110110100000111111001111110011111100111111 3f3f3f3f683f3f3f3f
SJIS-WIN 瀨叱宿術h瀨叱宿術 1111101101010000100011101011011010001111011010001000111101110000011010001111101101010000100011101011011010001111011010001000111101110000 fb508eb68f688f7068fb508eb68f688f70
EUC-JP ?叱宿術h?叱宿術 001111111011110010111000101111011100100110111101110100010110100000111111101111001011100010111101110010011011110111010001 3fbcb8bdc9bdd1683fbcb8bdc9bdd1
UTF-8 瀨叱宿術h瀨叱宿術 11100111100000001010100011100101100011111011000111100101101011101011111111101000101000011001001101101000111001111000000010101000111001011000111110110001111001011010111010111111111010001010000110010011 e780a8e58fb1e5aebfe8a19368e780a8e58fb1e5aebfe8a193
UHC 瀨叱宿術h瀨叱宿術 1101011011101110111100101110101011100010110101101110001011111010011010001101011011101110111100101110101011100010110101101110001011111010 d6eef2eae2d6e2fa68d6eef2eae2d6e2fa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)