Which bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????????? | 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | 正?地?矣??地?矣? | 10010000101100110011111110010010011011100011111111100001111000010011111100111111100100100110111000111111111000011110000100111111 | 90b33f926e3fe1e13f3f926e3fe1e13f |
| EUC-JP | 正?地?矣??地?矣? | 11000000101101010011111111000011110011110011111111100010111000110011111100111111110000111100111100111111111000101110001100111111 | c0b53fc3cf3fe2e33f3fc3cf3fe2e33f |
| UTF-8 | 正렰地렟矣얘♧地렟矣슴 | 111001101010110110100011111010111010000010110000111001011001110010110000111010111010000010011111111001111001111110100011111011001001011010011000111000101001100110100111111001011001110010110000111010111010000010011111111001111001111110100011111011001000101010110100 | e6ada3eba0b0e59cb0eba09fe79fa3ec9698e299a7e59cb0eba09fe79fa3ec8ab4 |
| UHC | 正렰地렟矣얘♧地렟矣슴 | 11101111111000011000111010111101111100101010001010001110101100001110101111111000101111101110101010100010101111111111001010100010100011101011000011101011111110001011110110111111 | efe18ebdf2a28eb0ebf8beeaa2bff2a28eb0ebf8bdbf |

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
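
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's codecs `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets listed above, and that characters a charset cannot represent are replaced with `?` (0x3f), as the table suggests. The names `CHARSETS` and `encode_row` are hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the codec cannot represent become "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survives the round trip.
    shown = raw.decode(codec, errors="replace")
    # Each byte is rendered as eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    for name, codec in CHARSETS.items():
        shown, bits, hexed = encode_row("正렰", codec)
        print(name, shown, bits, hexed)
```

Passing a different string to `encode_row` produces the corresponding row for any codec that Python supports. Results for other implementations may differ where vendor mappings disagree.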