To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??揖??醫?? 111000101010001100111111001111111001011101001011001111110011111111100111110011100011111100111111 e2a33f3f974b3f3fe7ce3f3f
EUC-JP 筌??揖??醫?? 111001001010010100111111001111111100110110101100001111110011111111101110110100000011111100111111 e4a53f3fcdac3f3feed03f3f
UTF-8 筌듬쪈揖삼㎗醫롪갸 111001111010110110001100111010111001001110101100111011001010101010001000111001101000111110010110111011001000001010111100111000111000111010010111111010011000011010101011111010111010000110101010111010101011000010111000 e7ad8ceb93acecaa88e68f96ec82bce38e97e986abeba1aaeab0b8
UHC 筌듬쪈揖삼㎗醫롪갸 111011111010011110110101111010111010010110000010111010111110011110111011111011111010011110100011111011001010001010001110111010101011000010111100 efa7b5eba582ebe7bbefa7a3eca28eeab0bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)