To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 阿?????昻???ヤ????逆??慂??^ 100010001010001000111111001111110011111100111111001111111111101011010000001111110011111100111111100000111000010000111111001111110011111100111111100010110111010000111111001111111001110011001000001111110011111101011110 88a23f3f3f3f3ffad03f3f3f83843f3f3f3f8b743f3f9cc83f3f5e
EUC-JP 阿?????????ヤ????逆??慂??^ 1011000010100100001111110011111100111111001111110011111100111111001111110011111100111111101001011110010000111111001111110011111100111111101101011101010100111111001111111101100011001010001111110011111101011110 b0a43f3f3f3f3f3f3f3f3fa5e43f3f3f3fb5d53f3fd8ca3f3f5e
UTF-8 阿뗣끃溜욄퐯昻숅퐢列ヤ퉳栒밸젦逆섇떃慂뗦씟^ 11101001100110001011111111101011100101111010001111101011100000011000001111101111101001111000101111101100100110101000010011101101100100001010111111100110100110001011101111101100100010001000010111101101100100001010001011101111101001101001110011100011100000111010010011101101100010011011001111100110101000001001001011101011101100001011100011101100101000001010011011101001100000001000011011101100100001001000011111101011100101101000001111100110100001011000001011101011100101111010011011101100100101001001111101011110 e998bfeb97a3eb8183efa78bec9a84ed90afe698bbec8885ed90a2efa69ce383a4ed89b3e6a092ebb0b8eca0a6e98086ec8487eb9683e68582eb97a6ec949f5e
UHC 阿뗣끃溜욄퐯昻숅퐢列ヤ퉳栒밸젦逆섇떃慂뗦씟^ 11100100101110011000101111100011100001011011100111101010111111101001111011100110101111011001100011100100111010011001100111101001101111011000101111100110111010101010101111100100101110011000101111100010111000111011100111101011101000001001111011100110101111011001100011100101100010111001100111101001101111011000101111100110100111011011001101011110 e4b98be385b9eafe9ee6bd98e4e999e9bd8be6eaabe4b98be2e3b9eba09ee6bd98e58b99e9bd8be69db35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)