To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 爾裙辞裔フ凜 11110000101011011000111010100010111001011110001110001110101010111110010111100001110011001110101010100011 f0ad8ea2e5e38eabe5e1cceaa3
EUC-JP ?爾裙辞裔フ凜 00111111101111001010010011101010111001011011110010101101111010101110001110001110110011001111010010100101 3fbca4eae5bcadeae38eccf4a5
UTF-8 爾裙辞裔フ凜 111011101000000110101100111001111000100010111110111010001010001110011001111010001011111010011110111010001010001110010100111011111011111010001100111001011000011110011100 ee81ace788bee8a399e8be9ee8a394efbe8ce5879c
UHC ?爾裙?裔?凜 0011111111101100101100111100111111011001001111111110011111100000001111111101011111001111 3fecb3cfd93fe7e03fd7cf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)