To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????|^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110001011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7c5e
SJIS-WIN 探遜旦但探遜丹谷探遜旦但探遜探袖|^ 10010010010101001001000110111011100100100101010110010010010000011001001001010100100100011011101110010010010011111001001001001010100100100101010010010001101110111001001001010101100100100100000110010010010101001001000110111011100100100101010010010001101100110111110001011110 925491bb92559241925491bb924f924a925491bb92559241925491bb925491b37c5e
EUC-JP 探遜旦但探遜丹谷探遜旦但探遜探袖|^ 11000011101101011100001010111101110000111011011011000011101000101100001110110101110000101011110111000011101100001100001110101011110000111011010111000010101111011100001110110110110000111010001011000011101101011100001010111101110000111011010111000010101101010111110001011110 c3b5c2bdc3b6c3a2c3b5c2bdc3b0c3abc3b5c2bdc3b6c3a2c3b5c2bdc3b5c2b57c5e
UTF-8 探遜旦但探遜丹谷探遜旦但探遜探袖|^ 1110011010001110101000101110100110000001100111001110011010010111101001101110010010111101100001101110011010001110101000101110100110000001100111001110010010111000101110011110100010110000101101111110011010001110101000101110100110000001100111001110011010010111101001101110010010111101100001101110011010001110101000101110100110000001100111001110011010001110101000101110100010100010100101100111110001011110 e68ea2e9819ce697a6e4bd86e68ea2e9819ce4b8b9e8b0b7e68ea2e9819ce697a6e4bd86e68ea2e9819ce68ea2e8a2967c5e
UHC 探遜旦但探遜丹谷探遜旦但探遜探袖|^ 11110111101011101110000111100001110100111010100111010011101000111111011110101110111000011110000111010011101000011100110111011011111101111010111011100001111000011101001110101001110100111010001111110111101011101110000111100001111101111010111011100010110000000111110001011110 f7aee1e1d3a9d3a3f7aee1e1d3a1cddbf7aee1e1d3a9d3a3f7aee1e1f7aee2c07c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)