To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 吾??爾よ?猷?? 10001100111000010011111100111111100011101010001010000010111001100011111110010111010100010011111100111111 8ce13f3f8ea282e63f97513f3f
EUC-JP 吾??爾よ?猷?? 10111000111000110011111100111111101111001010010010100100111010000011111111001101101100100011111100111111 b8e33f3fbca4a4e83fcdb23f3f
UTF-8 吾멸퍓爾よ눧猷몄낫 111001011001000010111110111010111010100110111000111011011000110110010011111001111000100010111110111000111000001010001000111010111000100010100111111001111000110010110111111010111010101010000100111010111000001010101011 e590beeba9b8ed8d93e788bee38288eb88a7e78cb7ebaa84eb82ab
UHC 吾멸퍓爾よ눧猷몄낫 111001111110111010111000111010101011101110001010111011001011001110101010111010001000011110111110111010111010001110111000111011001011001110110100 e7eeb8eabb8aecb3aae887beeba3b8ecb3b4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)