What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??意??喩??艶c????恂??猥??? 1110000110011111001111110011111110001000110100110011111100111111100110100110011100111111001111111000100110010000100000101000001100111111001111110011111100111111100111001001011000111111001111111110000011001110001111110011111100111111 e19f3f3f88d33f3f9a673f3f899082833f3f3f3f9c963f3fe0ce3f3f3f
EUC-JP 癲??意??喩??艶c????恂??猥??? 1110001010100001001111110011111110110000110101010011111100111111110100111100100000111111001111111011000111110000101000111110001100111111001111110011111100111111110101111111011000111111001111111110000011010000001111110011111100111111 e2a13f3fb0d53f3fd3c83f3fb1f0a3e33f3f3f3fd7f63f3fe0d03f3f3f
UTF-8 癲⑸뜄意욇윀喩볝걶艶c끇琉귞툞恂ⓦ걶猥됰굞留 111001111001100110110010111000101001000110111000111010111001110010000100111001101000010010001111111011001001101010000111111011001001110010000000111001011001011010101001111010111011001110011101111010101011000110110110111010001000100110110110111011111011110110000011111010111000000110000111111011111010011110001100111010101011011110011110111011011000100010011110111001101000000110000010111000101001001110100110111010101011000110110110111001111000110010100101111010111001000010110000111010101011010110011110111011111010011110001101 e799b2e291b8eb9c84e6848fec9a87ec9c80e596a9ebb39deab1b6e889b6efbd83eb8187efa78ceab79eed889ee68182e293a6eab1b6e78ca5eb90b0eab59eefa78d
UHC 癲⑸뜄意욇윀喩볝걶艶c끇琉귞툞恂ⓦ걶猥됰굞留 1110111110100110101010011110101110001101100010001110101111110010100111101110100110011111100010111110101011100111100100111110001110000001100111001110011011111101101000111110001110000101101110111110101110100100100000101110011110111000100101011110001011100001101010001110001110000001100111001110100011100101100010011110101110000010100001101110101110100111 efa6a9eb8d88ebf29ee99f8beae793e3819ce6fda3e385bbeba482e7b895e2e1a8e3819ce8e589eb8286eba7

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
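
The conversion shown in the table can be approximated with a short Python sketch. The page's actual implementation is not shown, so everything here is an assumption: the function names `encode_bits` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is a best guess that may differ from the page in edge cases. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in the table."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

For example, `convert("あ")["UTF-8"]` yields the hexadecimal string `e38182`, while the ISO-8859-1 entry is `3f` because that charset has no encoding for the character.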