Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????[???????????[^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 淨??私?屯???私?[淨??私?屯???私?[^ 100111111100010000111111001111111000111010000100001111111001001111010100001111110011111100111111100011101000010000111111010110111001111111000100001111110011111110001110100001000011111110010011110101000011111100111111001111111000111010000100001111110101101101011110 9fc43f3f8e843f93d43f3f3f8e843f5b9fc43f3f8e843f93d43f3f3f8e843f5b5e
EUC-JP 淨?焌私?屯??焌私?[淨?焌私?屯??焌私?[^ 1101111011000110001111111000111111001001111010001011101111100100001111111100011011010110001111110011111110001111110010011110100010111011111001000011111101011011110111101100011000111111100011111100100111101000101110111110010000111111110001101101011000111111001111111000111111001001111010001011101111100100001111110101101101011110 dec63f8fc9e8bbe43fc6d63f3f8fc9e8bbe43f5bdec63f8fc9e8bbe43fc6d63f3f8fc9e8bbe43f5b5e
UTF-8 淨렠焌私렎屯렟렩焌私볕[淨렠焌私렎屯렟렩焌私볕[^ 111001101011011110101000111010111010000010100000111001111000010010001100111001111010011110000001111010111010000010001110111001011011000110101111111010111010000010011111111010111010000010101001111001111000010010001100111001111010011110000001111010111011001110010101010110111110011010110111101010001110101110100000101000001110011110000100100011001110011110100111100000011110101110100000100011101110010110110001101011111110101110100000100111111110101110100000101010011110011110000100100011001110011110100111100000011110101110110011100101010101101101011110 e6b7a8eba0a0e7848ce7a781eba08ee5b1afeba09feba0a9e7848ce7a781ebb3955be6b7a8eba0a0e7848ce7a781eba08ee5b1afeba09feba0a9e7848ce7a781ebb3955b5e
UHC 淨렠焌私렎屯렟렩焌私볕[淨렠焌私렎屯렟렩焌私볕[^ 1110111111100100100011101011000111110001111000001101111011100111100011101010010011010100111010101000111010110000100011101011011111110001111000001101111011100111101110101011010101011011111011111110010010001110101100011111000111100000110111101110011110001110101001001101010011101010100011101011000010001110101101111111000111100000110111101110011110111010101101010101101101011110 efe48eb1f1e0dee78ea4d4ea8eb08eb7f1e0dee7bab55befe48eb1f1e0dee78ea4d4ea8eb08eb7f1e0dee7bab55b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
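
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the helper name `encode_to_bits` and the `CHARSETS` mapping are hypothetical, and the mapping of the labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "私[^"
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_to_bits(sample, codec)
        print(label, bits, hex_string)
```

For the single character 私, the UTF-8 result is `e7a781`, the SJIS-WIN result is `8e84`, and the EUC-JP result is `bbe4`, matching the corresponding byte sequences in the table.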