Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????A?????????AB 001111110011111100111111001111110011111100111111001111110011111100111111010000010011111100111111001111110011111100111111001111110011111100111111001111110100000101000010 3f3f3f3f3f3f3f3f3f413f3f3f3f3f3f3f3f3f4142
SJIS-WIN ??邵??堯??縟A??邵??堯??縟AB 001111110011111111100111101110000011111100111111111010101001111100111111001111111110001101110100010000010011111100111111111001111011100000111111001111111110101010011111001111110011111111100011011101000100000101000010 3f3fe7b83f3fea9f3f3fe374413f3fe7b83f3fea9f3f3fe3744142
EUC-JP ??邵??堯??縟A??邵??堯??縟AB 001111110011111111101110101110100011111100111111111101001010000100111111001111111110010111010101010000010011111100111111111011101011101000111111001111111111010010100001001111110011111111100101110101010100000101000010 3f3feeba3f3ff4a13f3fe5d5413f3feeba3f3ff4a13f3fe5d54142
UTF-8 쐛숰邵쐛了堯쐛숰縟A쐛숰邵쐛了堯쐛숰縟AB 111011001001000010011011111011001000100010110000111010011000001010110101111011001001000010011011111011111010011010111010111001011010000010101111111011001001000010011011111011001000100010110000111001111011100010011111010000011110110010010000100110111110110010001000101100001110100110000010101101011110110010010000100110111110111110100110101110101110010110100000101011111110110010010000100110111110110010001000101100001110011110111000100111110100000101000010 ec909bec88b0e982b5ec909befa6bae5a0afec909bec88b0e7b89f41ec909bec88b0e982b5ec909befa6bae5a0afec909bec88b0e7b89f4142
UHC 쐛숰邵쐛了堯쐛숰縟A쐛숰邵쐛了堯쐛숰縟AB 100111001000000110011010010010001110000111010000100111001000000111101000111001111110100011101011100111001000000110011010010010001110100110110010010000011001110010000001100110100100100011100001110100001001110010000001111010001110011111101000111010111001110010000001100110100100100011101001101100100100000101000010 9c819a48e1d09c81e8e7e8eb9c819a48e9b2419c819a48e1d09c81e8e7e8eb9c819a48e9b24142

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
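
The conversion shown in the table can be roughly reproduced with a short Python sketch. This is only an illustration, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed to be the closest Python equivalents, and the helper name `to_bit_strings` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the table. Byte sequences for some characters may differ between implementations.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "縟AB"  # sample input; any short string works
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For instance, `to_bit_strings("AB", "utf-8")` returns `("0100000101000010", "4142")`, the same trailing bytes that every row of the table ends with.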