To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ???曜??縊i?}???曜??縊i?{^ 001111110011111100111111100101110110101000111111001111111110001101101111100000101000100100111111011111010011111100111111001111111001011101101010001111110011111111100011011011111000001010001001001111110111101101011110 3f3f3f976a3f3fe36f82893f7d3f3f3f976a3f3fe36f82893f7b5e
EUC-JP ???曜??縊i?}???曜??縊i?{^ 001111110011111100111111110011011100101100111111001111111110010111010000101000111110100100111111011111010011111100111111001111111100110111001011001111110011111111100101110100001010001111101001001111110111101101011110 3f3f3fcdcb3f3fe5d0a3e93f7d3f3f3fcdcb3f3fe5d0a3e93f7b5e
UTF-8 閱곕젘曜쒕젡縊i풙}閱곕젘曜쒕젡縊i풙{^ 111010011001011010110001111010101011001110010101111011001010000010011000111001101001101110011100111011001001001010010101111011001010000010100001111001111011100010001010111011111011110110001001111011011001001010011001011111011110100110010110101100011110101010110011100101011110110010100000100110001110011010011011100111001110110010010010100101011110110010100000101000011110011110111000100010101110111110111101100010011110110110010010100110010111101101011110 e996b1eab395eca098e69b9cec9295eca0a1e7b88aefbd89ed92997de996b1eab395eca098e69b9cec9295eca0a1e7b88aefbd89ed92997b5e
UHC 閱곕젘曜쒕젡縊i풙}閱곕젘曜쒕젡縊i풙{^ 111001101111001110110000111010111010000010010100111010001111100010011100111010111010000010011010111001001111110010100011111010011011111010011100011111011110011011110011101100001110101110100000100101001110100011111000100111001110101110100000100110101110010011111100101000111110100110111110100111000111101101011110 e6f3b0eba094e8f89ceba09ae4fca3e9be9c7de6f3b0eba094e8f89ceba09ae4fca3e9be9c7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)