To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?k??i?韋??鴨 0011111110000010100010110011111100111111100000101000100100111111111010001110100000111111001111111000101010011011 3f828b3f3f82893fe8e83f3f8a9b
EUC-JP 渶k??i?韋??鴨 10001111110001111110110110100011111010110011111100111111101000111110100100111111111100001110101000111111001111111011001111111011 8fc7eda3eb3f3fa3e93ff0ea3f3fb3fb
UTF-8 渶k굞杻i렟韋용룆鴨 111001101011100010110110111011111011110110001011111010101011010110011110111011111010011110001000111011111011110110001001111010111010000010011111111010011001111110001011111011001001101010101001111010111010001110000110111010011011010010101000 e6b8b6efbd8beab59eefa788efbd89eba09fe99f8bec9aa9eba386e9b4a8
UHC 渶k굞杻i렟韋용룆鴨 1110011110110111101000111110101110000010100001101110101011110100101000111110100110001110101100001110101011011111101111111110101110001111100001011110010011100101 e7b7a3eb8286eaf4a3e98eb0eadfbfeb8f85e4e5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)