To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????R??????o??????]^ 00111111001111110011111100111111001111110011111101010010001111110011111100111111001111110011111100111111011011110011111100111111001111110011111100111111001111110101110101011110 3f3f3f3f3f3f523f3f3f3f3f3f6f3f3f3f3f3f3f5d5e
SJIS-WIN 臾るし臾るЧR臾るし臾るЧo臾るし臾るЧ]^ 11100100011010111000001011101001100000101011010111100100011010111000001011101001100001000101100001010010111001000110101110000010111010011000001010110101111001000110101110000010111010011000010001011000011011111110010001101011100000101110100110000010101101011110010001101011100000101110100110000100010110000101110101011110 e46b82e982b5e46b82e9845852e46b82e982b5e46b82e984586fe46b82e982b5e46b82e984585d5e
EUC-JP 臾るし臾るЧR臾るし臾るЧo臾るし臾るЧ]^ 11100111110011001010010011101011101001001011011111100111110011001010010011101011101001111011100101010010111001111100110010100100111010111010010010110111111001111100110010100100111010111010011110111001011011111110011111001100101001001110101110100100101101111110011111001100101001001110101110100111101110010101110101011110 e7cca4eba4b7e7cca4eba7b952e7cca4eba4b7e7cca4eba7b96fe7cca4eba4b7e7cca4eba7b95d5e
UTF-8 臾るし臾るЧR臾るし臾るЧo臾るし臾るЧ]^ 11101000100001111011111011100011100000101000101111100011100000011001011111101000100001111011111011100011100000101000101111010000101001110101001011101000100001111011111011100011100000101000101111100011100000011001011111101000100001111011111011100011100000101000101111010000101001110110111111101000100001111011111011100011100000101000101111100011100000011001011111101000100001111011111011100011100000101000101111010000101001110101110101011110 e887bee3828be38197e887bee3828bd0a752e887bee3828be38197e887bee3828bd0a76fe887bee3828be38197e887bee3828bd0a75d5e
UHC 臾るし臾るЧR臾るし臾るЧo臾るし臾るЧ]^ 11101011101011001010101011101011101010101011011111101011101011001010101011101011101011001011100101010010111010111010110010101010111010111010101010110111111010111010110010101010111010111010110010111001011011111110101110101100101010101110101110101010101101111110101110101100101010101110101110101100101110010101110101011110 ebacaaebaab7ebacaaebacb952ebacaaebaab7ebacaaebacb96febacaaebaab7ebacaaebacb95d5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)