To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 畏??節??張ц????玉??阿?????^ 10001000110110000011111100111111100100001101111100111111001111111001001010100011100001001000100000111111001111110011111100111111100010111100101000111111001111111000100010100010001111110011111100111111001111110011111101011110 88d83f3f90df3f3f92a384883f3f3f3f8bca3f3f88a23f3f3f3f3f5e
EUC-JP 畏??節??張ц????玉??阿??旿??^ 101100001101101000111111001111111100000011100001001111110011111111000100101001011010011111101000001111110011111100111111001111111011011011001100001111110011111110110000101001000011111100111111100011111100000111110100001111110011111101011110 b0da3f3fc0e13f3fc4a5a7e83f3f3f3fb6cc3f3fb0a43f3f8fc1f43f3f5e
UTF-8 畏묋쵌節삡튋張ц쐥嶺묌닊玉롳슥阿잝눐旿딁븳^ 111001111001010110001111111010111010110010001011111011001011010110001100111001111010111110000000111011001000001010100001111011011000101010001011111001011011110010110101110100011000011011101100100100001010010111101111101001101010101111101011101011001000110011101011100010111000101011100111100011101000100111101011101000011011001111101100100010101010010111101001100110001011111111101100100111101001110111101011100010001001000011100110100101111011111111101011100101001000000111101011101110001011001101011110 e7958febac8becb58ce7af80ec82a1ed8a8be5bcb5d186ec90a5efa6abebac8ceb8b8ae78e89eba1b3ec8aa5e998bfec9e9deb8890e697bfeb9481ebb8b35e
UHC 畏묋쵌節삡튋張ц쐥嶺묌닊玉롳슥阿잝눐旿딁븳^ 11101000111001101001000111101000101011001000111011101111101111011011101111100100101110011001111111101101111001011010110011101000100111001000101011100111101011011001000111101001100010001001000111101000101011001000111011101111101111011011101111100100101110011001111111101110100001111010110011100111111110101000101011100111100101011001110001011110 e8e691e8ac8eefbdbbe4b99fede5ace89c8ae7ad91e98891e8ac8eefbdbbe4b99fee87ace7fa8ae7959c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)