To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猥????ぜ純?ザ艶l?異ょ???????? 111000001100111000111111001111110011111100111111100000101011101010001111100000110011111110000011010101011000100110010000100000101000110000111111100010001101100110000010111001010011111100111111001111110011111100111111001111110011111100111111 e0ce3f3f3f3f82ba8f833f83558990828c3f88d982e53f3f3f3f3f3f3f3f
EUC-JP 猥????ぜ純?ザ艶l?異ょ???????彛 1110000011010000001111110011111100111111001111111010010010111100101111011110001100111111101001011011011010110001111100001010001111101100001111111011000011011011101001001110011100111111001111110011111100111111001111110011111100111111100011111011110011111010 e0d03f3f3f3fa4bcbde33fa5b6b1f0a3ec3fb0dba4e73f3f3f3f3f3f3f8fbcfa
UTF-8 猥롢끇栒뤺ぜ純놁ザ艶l꼷異ょ솾流껎맂若쒖궪彛 111001111000110010100101111010111010000110100010111010111000000110000111111001101010000010010010111010111010010010111010111000111000000110011100111001111011010010010100111010111000011010000001111000111000001010110110111010001000100110110110111011111011110110001100111010101011110010110111111001111001010110110000111000111000001010000111111011001000011010111110111011111010011110001010111010101011101110001110111010111010011110000010111011111010010110110100111011001001001010010110111010101011011010101010111001011011110110011011 e78ca5eba1a2eb8187e6a092eba4bae3819ce7b494eb8681e382b6e889b6efbd8ceabcb7e795b0e38287ec86beefa78aeabb8eeba782efa5b4ec9296eab6aae5bd9b
UHC 猥롢끇栒뤺ぜ純놁ザ艶l꼷異ょ솾流껎맂若쒖궪彛 1110100011100101100011101110001110000101101110111110001011100011100011111110100010101010101111001110001011101101100001101110110010101011101101101110011011111101101000111110110010000100100011111110110010110110101010101110011110011001101100101110101011111100100000111110110110010000100111001110010110101110100111001110110010000010101111001110110010101101 e8e58ee385bbe2e38fe8aabce2ed86ecabb6e6fda3ec848fecb6aae799b2eafc83ed909ce5ae9cec82bcecad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)