To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??猷??矣???ル?裕???μ?俉 1110000110011111001111110011111110010111010100010011111100111111111000011110000100111111001111110011111110000011100010110011111110010111010101000011111100111111001111111000001111001010001111111111101001100001 e19f3f3f97513f3fe1e13f3f3f838b3f97543f3f3f83ca3ffa61
EUC-JP 癲??猷??矣???ル?裕??洹μ?俉 1110001010100001001111110011111111001101101100100011111100111111111000101110001100111111001111110011111110100101111010110011111111001101101101010011111100111111100011111100011110111010101001101100110000111111100011111011000110111011 e2a13f3fcdb23f3fe2e33f3f3fa5eb3fcdb53f3f8fc7baa6cc3f8fb1bb
UTF-8 癲숆낄猷쀩걬矣꾩쒜曆ル슢裕든솻洹μ쪙俉 1110011110011001101100101110110010001000100001101110101110000010100001001110011110001100101101111110110010000000101010011110101010110001101011001110011110011111101000111110101010111110101010011110110010010010100111001110111110100110100010111110001110000011101010111110110010001010101000101110100010100011100101011110101110010011101000001110110010000110101110111110011010110100101110011100111010111100111011001010101010011001111001001011111110001001 e799b2ec8886eb8284e78cb7ec80a9eab1ace79fa3eabea9ec929cefa68be383abec8aa2e8a395eb93a0ec86bbe6b4b9cebcecaa99e4bf89
UHC 癲숆낄猷쀩걬矣꾩쒜曆ル슢裕든솻洹μ쪙俉 1110111110100110100110011110101010110011101001011110101110100011100101111110100110000001100101011110101111111000100001001110110010111110101011101110011010110111101010111110101110011010101011101110101110101110101101011110011110011001101100001110101010110111101001011110110010100101100100101110011111101011 efa699eab3a5eba397e98195ebf884ecbeaee6b7abeb9aaeebaeb5e799b0eab7a5eca592e7eb

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
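
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The codec names in `CODECS` (for example `cp932` for SJIS-WIN and `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions chosen for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the 3f bytes in the table above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("あ").items():
        print(label, bits, hexa)
```

For example, `convert("あ")["UTF-8"]` yields the hex string `e38182`, while the ISO-8859-1 entry is `3f` because that charset has no hiragana.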