To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?溢e?怨??語⑤?悠ヨ?袁⑤㎜壤 1110000110011111100000111000101100111111100010001110110010000010100001010011111110001001100001010011111100111111100011001110101010000111010001000011111110010111010010011000001110001000001111111110010111001101100001110100010010000111011011111001101011011111 e19f838b3f88ec82853f89853f3f8cea87443f974983883fe5cd8744876f9adf
EUC-JP 癲ル?溢e?怨??語??悠ヨ?袁??壤 1110001010100001101001011110101100111111101100001110111010100011111001010011111110110001111001010011111100111111101110001110110000111111001111111100110110101010101001011110100000111111111010101100111100111111001111111101010011100001 e2a1a5eb3fb0eea3e53fb1e53f3fb8ec3f3fcdaaa5e83feacf3f3fd4e1
UTF-8 癲ル슪溢e뭣怨댿봼語⑤똾悠ヨ짅袁⑤㎜壤 111001111001100110110010111000111000001110101011111011001000101010101010111001101011101010100010111011111011110110000101111010111010110110100011111001101000000010101000111010111000110010111111111010111011010010111100111010001010101010011110111000101001000110100100111010111001100010111110111001101000001010100000111000111000001110101000111011001010011110000101111010001010001010000001111000101001000110100100111000111000111010011100111001011010001110100100 e799b2e383abec8aaae6baa2efbd85ebada3e680a8eb8cbfebb4bce8aa9ee291a4eb98bee682a0e383a8eca785e8a281e291a4e38e9ce5a3a4
UHC 癲ル슪溢e뭣怨댿봼語⑤똾悠ヨ짅袁⑤㎜壤 1110111110100110101010111110101110011010101100111110110011101110101000111110010110111001101111011110101010110011100010001110001010010100100000111110010111011110101010001110101110001100100001001110101011101101101010111110100010100011100101001110101010111110101010001110101110100111101011101110010110111101 efa6abeb9ab3eceea3e5b9bdeab388e29483e5dea8eb8c84eaedabe8a394eabea8eba7aee5bd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)