To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ???瀛?????v???瀛?????vB 0011111100111111001111111110000001101001001111110011111100111111001111110011111101110110001111110011111100111111111000000110100100111111001111110011111100111111001111110111011001000010 3f3f3fe0693f3f3f3f3f763f3f3fe0693f3f3f3f3f7642
EUC-JP ???瀛??瘦??v???瀛??瘦??vB 001111110011111100111111110111111100101000111111001111111000111111001101111101110011111100111111011101100011111100111111001111111101111111001010001111110011111110001111110011011111011100111111001111110111011001000010 3f3f3fdfca3f3f8fcdf73f3f763f3f3fdfca3f3f8fcdf73f3f7642
UTF-8 拾곥괦瀛랁뢆瘦꼷뢗v拾곥괦瀛랁뢆瘦꼷뢗vB 111011111010010110110011111010101011001110100101111010101011010010100110111001111000000010011011111010111001111010000001111010111010001010000110111001111001100010100110111010101011110010110111111010111010001010010111011101101110111110100101101100111110101010110011101001011110101010110100101001101110011110000000100110111110101110011110100000011110101110100010100001101110011110011000101001101110101010111100101101111110101110100010100101110111011001000010 efa5b3eab3a5eab4a6e7809beb9e81eba286e798a6eabcb7eba29776efa5b3eab3a5eab4a6e7809beb9e81eba286e798a6eabcb7eba2977642
UHC 拾곥괦瀛랁뢆瘦꼷뢗v拾곥괦瀛랁뢆瘦꼷뢗vB 111001001010100110000001111000111000001001010000111001111011101010001101111011011000111101000010111000101011000110000100100011111000111101010010011101101110010010101001100000011110001110000010010100001110011110111010100011011110110110001111010000101110001010110001100001001000111110001111010100100111011001000010 e4a981e38250e7ba8ded8f42e2b1848f8f5276e4a981e38250e7ba8ded8f42e2b1848f8f527642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)