To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 烏l?諛?。????↑?冗?????擾 1000100101000111100000101000110000111111111001101000011100111111100000010100001000111111001111110011111100111111100000011010101000111111100011111110011100111111001111110011111100111111001111111000111111101111 8947828c3fe6873f81423f3f3f3f81aa3f8fe73f3f3f3f3f8fef
EUC-JP 烏l?諛?。彛???↑?冗?????擾 10110001101010001010001111101100001111111110101111100111001111111010000110100011100011111011110011111010001111110011111100111111101000101010110000111111101111101110100100111111001111110011111100111111001111111011111011110001 b1a8a3ec3febe73fa1a38fbcfa3f3f3fa2ac3fbee93f3f3f3f3fbef1
UTF-8 烏l츦諛김。彛몃졁廬↑닞冗밴엽隸욅꼳擾 111001111000001110001111111011111011110110001100111011001011100010100110111010001010101110011011111010101011100110000000111000111000000010000010111001011011110110011011111010111010101010000011111011001010000110000001111011111010011010000010111000101000011010010001111010111000101110011110111001011000011010010111111010111011000010110100111011001001011110111101111011111010011010111000111011001001101010000101111010101011110010110011111001101001001110111110 e7838fefbd8cecb8a6e8ab9beab980e38082e5bd9bebaa83eca181efa682e28691eb8b9ee58697ebb0b4ec97bdefa6b8ec9a85eabcb3e693be
UHC 烏l츦諛김。彛몃졁廬↑닞冗밴엽隸욅꼳擾 1110100010100001101000111110110010101110100111001110101110110000101100011110100010100001101000111110110010101101101110001110101110100000101100101110010111111110101000011110100010001000100111101110100110110111101110011110101010111111101100011110011111100110100111101110011110000100100011001110100011110110 e8a1a3ecae9cebb0b1e8a1a3ecadb8eba0b2e5fea1e8889ee9b7b9eabfb1e7e69ee7848ce8f6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)