To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8意??受?????日?┃怨λ?亦 11100001100111110011111110000010010101111000100011010011001111110011111110001110111100110011111100111111001111110011111100111111100100111111101000111111100001001010101110001001100001011000001111001001001111111001011010010010 e19f3f825788d33f3f8ef33f3f3f3f3f93fa3f84ab898583c93f9692
EUC-JP 癲?8意??受?????日?┃怨λ?亦 11100010101000010011111110100011101110001011000011010101001111110011111110111100111101010011111100111111001111110011111100111111110001101111110000111111101010001010110110110001111001011010011011001011001111111100101111110010 e2a13fa3b8b0d53f3fbcf53f3f3f3f3fc6fc3fa8adb1e5a6cb3fcbf2
UTF-8 癲쒕8意덌쭫受쇳뫛銳얜㉡日딃┃怨λ젦亦 1110011110011001101100101110110010010010100101011110111110111100100110001110011010000100100011111110101110001101100011001110110010101101101010111110010110001111100101111110110010000111101100111110101110101011100110111110100110001010101100111110110010010110100111001110001110001001101000011110011010010111101001011110101110010100100000111110001010010100100000111110011010000000101010001100111010111011111011001010000010100110111001001011101010100110 e799b2ec9295efbc98e6848feb8d8cecadabe58f97ec87b3ebab9be98ab3ec969ce389a1e697a5eb9483e29483e680a8cebbeca0a6e4baa6
UHC 癲쒕8意덌쭫受쇳뫛銳얜㉡日딃┃怨λ젦亦 1110111110100110100111001110101110100011101110001110101111110010100010001110111110100111100111111110000111110100101111001110110110010001101110111110011111100101101111101110101110101000101100101110110011101101100010101110100110100110101011011110101010110011101001011110101110100000100111101110011010110010 efa69ceba3b8ebf288efa79fe1f4bced91bbe7e5beeba8b2eced8ae9a6adeab3a5eba09ee6b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)