To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲???よ?幽?? 111000011001111100111111001111110011111110000010111001100011111110010111010010000011111100111111 e19f3f3f3f82e63f97483f3f
EUC-JP 癲???よ?幽?? 111000101010000100111111001111110011111110100100111010000011111111001101101010010011111100111111 e2a13f3f3fa4e83fcda93f3f
UTF-8 癲뽰뇯痢よ쯁幽뚰뭲 111001111001100110110010111010111011110110110000111010111000011110101111111011111010011110100101111000111000001010001000111011001010111110000001111001011011100110111101111010111001101010110000111010111010110110110010 e799b2ebbdb0eb87afefa7a5e38288ecaf81e5b9bdeb9ab0ebadb2
UHC 癲뽰뇯痢よ쯁幽뚰뭲 111011111010011010010110111011001000011110010100111011001011100010101010111010001010100010011101111010101110101110001100111011011001001010000001 efa696ec8794ecb8aae8a89deaeb8ced9281

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)