To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?‥猷??矣?????揖?????潁??怡 1110000110011111001111111000000101100100100101110101000100111111001111111110000111100001001111110011111100111111001111110011111110010111010010110011111100111111001111110011111100111111100111111111000100111111001111111001110001111101 e19f3f816497513f3fe1e13f3f3f3f3f974b3f3f3f3f3f9ff13f3f9c7d
EUC-JP 癲?‥猷??矣?????揖?????潁??怡 1110001010100001001111111010000111000101110011011011001000111111001111111110001011100011001111110011111100111111001111110011111111001101101011000011111100111111001111110011111100111111110111101111001100111111001111111101011111011110 e2a13fa1c5cdb23f3fe2e33f3f3f3f3fcdac3f3f3f3f3fdef33f3fd7de
UTF-8 癲용‥猷싨샍矣놁쑹銳녿뵃揖썲럳戮⑸뻽潁뺢퇍怡 111001111001100110110010111011001001101010101001111000101000000010100101111001111000110010110111111011001000101110101000111011001000001110001101111001111001111110100011111010111000011010000001111011001001000110111001111010011000101010110011111010111000010110111111111010111011010110000011111001101000111110010110111011001000110110110010111010111001111110110011111011111010011110010010111000101001000110111000111010111011101110111101111001101011110110000001111010111011101010100010111011011000011110001101111001101000000010100001 e799b2ec9aa9e280a5e78cb7ec8ba8ec838de79fa3eb8681ec91b9e98ab3eb85bfebb583e68f96ec8db2eb9fb3efa792e291b8ebbbbde6bd81ebbaa2ed878de680a1
UHC 癲용‥猷싨샍矣놁쑹銳녿뵃揖썲럳戮⑸뻽潁뺢퇍怡 1110111110100110101111111110101110100001101001011110101110100011100110101110011010011000101110111110101111111000100001101110110010111110101010111110011111100101100001101110101110010100100010011110101111100111101111011110010110001110100100111110101110111101101010011110101110010110100010001110011110111000100101011110101010110111100111101110110010101110 efa6bfeba1a5eba39ae698bbebf886ecbeabe7e586eb9489ebe7bde58e93ebbda9eb9688e7b895eab79eecae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)