Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 嚥?????也??}v嚥?????也??}vB 100110101000101100111111001111110011111100111111001111111001011011100111001111110011111101111101011101101001101010001011001111110011111100111111001111110011111110010110111001110011111100111111011111010111011001000010 9a8b3f3f3f3f3f96e73f3f7d769a8b3f3f3f3f3f96e73f3f7d7642
EUC-JP 嚥??濚??也??}v嚥??濚??也??}vB 11010011111010110011111100111111100011111100100110100001001111110011111111001100111010010011111100111111011111010111011011010011111010110011111100111111100011111100100110100001001111110011111111001100111010010011111100111111011111010111011001000010 d3eb3f3f8fc9a13f3fcce93f3f7d76d3eb3f3f8fc9a13f3fcce93f3f7d7642
UTF-8 嚥띰숴濚욑슥也쏉숴}v嚥띰숴濚욑슥也쏉숴}vB 1110010110011010101001011110101110011101101100001110110010001000101101001110011010111111100110101110110010011010100100011110110010001010101001011110010010111001100111111110110010001111100010011110110010001000101101000111110101110110111001011001101010100101111010111001110110110000111011001000100010110100111001101011111110011010111011001001101010010001111011001000101010100101111001001011100110011111111011001000111110001001111011001000100010110100011111010111011001000010 e59aa5eb9db0ec88b4e6bf9aec9a91ec8aa5e4b99fec8f89ec88b47d76e59aa5eb9db0ec88b4e6bf9aec9a91ec8aa5e4b99fec8f89ec88b47d7642
UHC 嚥띰숴濚욑슥也쏉숴}v嚥띰숴濚욑슥也쏉숴}vB 1110011010111111101101101110111110111101101001001110011110111001100111101110111110111101101110111110010110100101100110111110111110111101101001000111110101110110111001101011111110110110111011111011110110100100111001111011100110011110111011111011110110111011111001011010010110011011111011111011110110100100011111010111011001000010 e6bfb6efbda4e7b99eefbdbbe5a59befbda47d76e6bfb6efbda4e7b99eefbdbbe5a59befbda47d7642
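
The conversion tabulated above can be approximated with a minimal Python sketch. This is not the page's own implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the function names `to_bit_strings` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` visible in the table.

```python
# Display label -> Python codec name. This mapping is an assumption of the sketch.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset listed above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset for a sample hiragana character.
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

Because every listed charset is ASCII-compatible, an ASCII character such as `A` produces the same single byte (`41`) in each row, while non-ASCII characters differ from charset to charset.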

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).