To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 霆丈ケ槭$髢呵ョ史U}霆丈ケ槭$髢呵ョ史U{^ 11101000101110111000111111100100101110011001111011100011100000011001000011101001100101101001100111101000101011101000111001101010010101010111110111101000101110111000111111100100101110011001111011100011100000011001000011101001100101101001100111101000101011101000111001101010010101010111101101011110 e8bb8fe4b99ee38190e99699e8ae8e6a557de8bb8fe4b99ee38190e99699e8ae8e6a557b5e
EUC-JP 霆丈ケ槭$髢呵ョ史U}霆丈ケ槭$髢呵ョ史U{^ 1111000010111101101111101110011010001110101110011101110011100101101000011111000011110001111101101101001011101010100011101010111010111011110010110101010101111101111100001011110110111110111001101000111010111001110111001110010110100001111100001111000111110110110100101110101010001110101011101011101111001011010101010111101101011110 f0bdbee68eb9dce5a1f0f1f6d2ea8eaebbcb557df0bdbee68eb9dce5a1f0f1f6d2ea8eaebbcb557b5e
UTF-8 霆丈ケ槭$髢呵ョ史U}霆丈ケ槭$髢呵ョ史U{^ 1110100110011100100001101110010010111000100010001110111110111101101110011110011010100111101011011110111110111100100001001110100110101011101000101110010110010001101101011110111110111101101011101110010110001111101100100101010101111101111010011001110010000110111001001011100010001000111011111011110110111001111001101010011110101101111011111011110010000100111010011010101110100010111001011001000110110101111011111011110110101110111001011000111110110010010101010111101101011110 e99c86e4b888efbdb9e6a7adefbc84e9aba2e591b5efbdaee58fb2557de99c86e4b888efbdb9e6a7adefbc84e9aba2e591b5efbdaee58fb2557b5e
UHC 霆丈??$?呵?史U}霆丈??$?呵?史U{^ 111011111111110111101101110110110011111100111111101000111010010000111111110010101010011100111111110111101100100001010101011111011110111111111101111011011101101100111111001111111010001110100100001111111100101010100111001111111101111011001000010101010111101101011110 effdeddb3f3fa3a43fcaa73fdec8557deffdeddb3f3fa3a43fcaa73fdec8557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)