To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ???娃??晤??v???娃??晤??vB 00111111001111110011111110001000101000010011111100111111100111011110101100111111001111110111011000111111001111110011111110001000101000010011111100111111100111011110101100111111001111110111011001000010 3f3f3f88a13f3f9deb3f3f763f3f3f88a13f3f9deb3f3f7642
EUC-JP ???娃??晤??v???娃??晤??vB 00111111001111110011111110110000101000110011111100111111110110101110110100111111001111110111011000111111001111110011111110110000101000110011111100111111110110101110110100111111001111110111011001000010 3f3f3fb0a33f3fdaed3f3f763f3f3fb0a33f3fdaed3f3f7642
UTF-8 遼⑼쉠娃됵슨晤ⓩ츎v遼⑼쉠娃됵슨晤ⓩ츎vB 111011111010011110000011111000101001000110111100111011001000100110100000111001011010100010000011111010111001000010110101111011001000101010101000111001101001100110100100111000101001001110101001111011001011100010001110011101101110111110100111100000111110001010010001101111001110110010001001101000001110010110101000100000111110101110010000101101011110110010001010101010001110011010011001101001001110001010010011101010011110110010111000100011100111011001000010 efa783e291bcec89a0e5a883eb90b5ec8aa8e699a4e293a9ecb88e76efa783e291bcec89a0e5a883eb90b5ec8aa8e699a4e293a9ecb88e7642
UHC 遼⑼쉠娃됵슨晤ⓩ츎v遼⑼쉠娃됵슨晤ⓩ츎vB 111010011010110010101001111011111011110110101010111010001101111110001001111011111011110110111100111001111111101110101000111001101010111010001001011101101110100110101100101010011110111110111101101010101110100011011111100010011110111110111101101111001110011111111011101010001110011010101110100010010111011001000010 e9aca9efbdaae8df89efbdbce7fba8e6ae8976e9aca9efbdaae8df89efbdbce7fba8e6ae897642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)