To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8宜h?幽??巍ル?日?┃怨λ?亦 11100001100111110011111110000010010101111000101101011000100000101000100000111111100101110100100000111111001111111001101111011001100000111000101100111111100100111111101000111111100001001010101110001001100001011000001111001001001111111001011010010010 e19f3f82578b5882883f97483f3f9bd9838b3f93fa3f84ab898583c93f9692
EUC-JP 癲?8宜h?幽??巍ル?日?┃怨λ?亦 11100010101000010011111110100011101110001011010110111001101000111110100000111111110011011010100100111111001111111101011011011011101001011110101100111111110001101111110000111111101010001010110110110001111001011010011011001011001111111100101111110010 e2a13fa3b8b5b9a3e83fcda93f3fd6dba5eb3fc6fc3fa8adb1e5a6cb3fcbf2
UTF-8 癲쒕8宜h뙲幽귦뫛巍ル뵃日딃┃怨λ젦亦 1110011110011001101100101110110010010010100101011110111110111100100110001110010110101110100111001110111110111101100010001110101110011001101100101110010110111001101111011110101010110111101001101110101110101011100110111110010110110111100011011110001110000011101010111110101110110101100000111110011010010111101001011110101110010100100000111110001010010100100000111110011010000000101010001100111010111011111011001010000010100110111001001011101010100110 e799b2ec9295efbc98e5ae9cefbd88eb99b2e5b9bdeab7a6ebab9be5b78de383abebb583e697a5eb9483e29483e680a8cebbeca0a6e4baa6
UHC 癲쒕8宜h뙲幽귦뫛巍ル뵃日딃┃怨λ젦亦 1110111110100110100111001110101110100011101110001110101111110001101000111110100010001100101101011110101011101011100000101110110110010001101110111110100011100100101010111110101110010100100010011110110011101101100010101110100110100110101011011110101010110011101001011110101110100000100111101110011010110010 efa69ceba3b8ebf1a3e88cb5eaeb82ed91bbe8e4abeb9489eced8ae9a6adeab3a5eba09ee6b2

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text, SJIS-Win (= CP932) on Windows and EUC-JP on UNIX.
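
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, as are the names `CHARSETS` and `encode_table`. Characters that a charset cannot represent are replaced with `?` (hex 3f), which matches the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("日"):
        print(label, bits, hex_string)
```

Python's codecs may differ from the tool's conversion tables for a few vendor-specific characters, so results for unusual input are not guaranteed to match the table exactly.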