What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????v??????????vB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101110110001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f7642
SJIS-WIN 造??迦?市????v造??迦?市????vB 1001000110100010001111110011111110001001110111100011111110001110011100110011111100111111001111110011111101110110100100011010001000111111001111111000100111011110001111111000111001110011001111110011111100111111001111110111011001000010 91a23f3f89de3f8e733f3f3f3f7691a23f3f89de3f8e733f3f3f3f7642
EUC-JP 造??迦?市????v造??迦?市????vB 1100001010100100001111110011111110110010111000000011111110111011110101000011111100111111001111110011111101110110110000101010010000111111001111111011001011100000001111111011101111010100001111110011111100111111001111110111011001000010 c2a43f3fb2e03fbbd43f3f3f3f76c2a43f3fb2e03fbbd43f3f3f3f7642
UTF-8 造섦뤋迦욹市킃샅렿롚v造섦뤋迦욹市킃샅렿롚vB 111010011000000010100000111011001000010010100110111010111010010010001011111010001011111110100110111011001001101010111001111001011011100010000010111011011000001010000011111011001000001110000101111010111010000010111111111010111010000110011010011101101110100110000000101000001110110010000100101001101110101110100100100010111110100010111111101001101110110010011010101110011110010110111000100000101110110110000010100000111110110010000011100001011110101110100000101111111110101110100001100110100111011001000010 e980a0ec84a6eba48be8bfa6ec9ab9e5b882ed8283ec8385eba0bfeba19a76e980a0ec84a6eba48be8bfa6ec9ab9e5b882ed8283ec8385eba0bfeba19a7642
UHC 造섦뤋迦욹市킃샅렿롚v造섦뤋迦욹市킃샅렿롚vB 11110000111000111011110010110100100011111011101111001010101111001011111111110000111000111011110010110100100011111011101111110100100011101100011110001110110111100111011011110000111000111011110010110100100011111011101111001010101111001011111111110000111000111011110010110100100011111011101111110100100011101100011110001110110111100111011001000010 f0e3bcb48fbbcabcbff0e3bcb48fbbf48ec78ede76f0e3bcb48fbbcabcbff0e3bcb48fbbf48ec78ede7642

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP), respectively.
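
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that Python's built-in codecs "iso-8859-1", "cp932", "euc_jp", "utf-8" and "cp949" correspond to the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8 and UHC rows, and that characters a charset cannot represent are replaced with "?" (0x3f), as the table suggests. The function name encode_table and the CHARSETS list are hypothetical names chosen for illustration.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for characters the codec cannot encode.
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("造vB"):
        print(label, bits, hexstr)
```

Because the replacement step discards information, a row containing 3f bytes generally cannot be decoded back to the original input.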