To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 雍?ぃ???ぃ? 1110100010110100001111111000001010100001001111110011111100111111100000101010000100111111 e8b43f82a13f3f3f82a13f
EUC-JP 雍?ぃ???ぃ? 1111000010110110001111111010010010100011001111110011111100111111101001001010001100111111 f0b63fa4a33f3f3fa4a33f
UTF-8 雍퀭ぃ펜렭퀭ぃ탓 111010011001101110001101111011011000000010101101111000111000000110000011111011011000111010011100111010111010000010101101111011011000000010101101111000111000000110000011111011011000001110010011 e99b8ded80ade38183ed8e9ceba0aded80ade38183ed8393
UHC 雍퀭ぃ펜렭퀭ぃ탓 11101000101111001100010011111010101010101010001111000110111001101000111010111010110001001111101010101010101000111100010110111111 e8bcc4faaaa3c6e68ebac4faaaa3c5bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)