To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 螻具スろ嘗蛟墫作ュ 11100101101100011000101111101111101111011000001011101011100011111010011011100101100000001001101011010001100011011110110010101101 e5b18befbd82eb8fa6e5809ad18decad
EUC-JP 螻具スろ嘗蛟墫作ュ 111010101011001110110110111100011000111010111101101001001110110110111110101010001110100111100000110101001101001110111010111011101000111010101101 eab3b6f18ebda4edbea8e9e0d4d3baee8ead
UTF-8 螻具スろ嘗蛟墫作ュ 111010001001111010111011111001011000010110110111111011111011110110111101111000111000001010001101111001011001100010010111111010001001101110011111111001011010001010101011111001001011110110011100111011111011110110101101 e89ebbe585b7efbdbde3828de59897e89b9fe5a2abe4bd9cefbdad
UHC ?具?ろ嘗蛟?作? 0011111111001110111111010011111110101010111011011101111111000100110011101111000100111111111011011100001000111111 3fcefd3faaeddfc4cef13fedc23f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)