To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????zv?????????zvB 0011111100111111001111110011111100111111001111110011111100111111001111110111101001110110001111110011111100111111001111110011111100111111001111110011111100111111011110100111011001000010 3f3f3f3f3f3f3f3f3f7a763f3f3f3f3f3f3f3f3f7a7642
SJIS-WIN 凋??奉∝除???zv凋??奉∝除???zvB 10010010100111000011111100111111100101011111001010000001111001011000111110011100001111110011111100111111011110100111011010010010100111000011111100111111100101011111001010000001111001011000111110011100001111110011111100111111011110100111011001000010 929c3f3f95f281e58f9c3f3f3f7a76929c3f3f95f281e58f9c3f3f3f7a7642
EUC-JP 凋??奉∝除???zv凋??奉∝除???zvB 11000011111111000011111100111111110010101111010010100010111001111011110111111100001111110011111100111111011110100111011011000011111111000011111100111111110010101111010010100010111001111011110111111100001111110011111100111111011110100111011001000010 c3fc3f3fcaf4a2e7bdfc3f3f3f7a76c3fc3f3fcaf4a2e7bdfc3f3f3f7a7642
UTF-8 凋얏렫奉∝除쿰렰렦zv凋얏렫奉∝除쿰렰렦zvB 1110010110000111100010111110110010010110100011111110101110100000101010111110010110100101100010011110001010001000100111011110100110011001101001001110110010111111101100001110101110100000101100001110101110100000101001100111101001110110111001011000011110001011111011001001011010001111111010111010000010101011111001011010010110001001111000101000100010011101111010011001100110100100111011001011111110110000111010111010000010110000111010111010000010100110011110100111011001000010 e5878bec968feba0abe5a589e2889de999a4ecbfb0eba0b0eba0a67a76e5878bec968feba0abe5a589e2889de999a4ecbfb0eba0b0eba0a67a7642
UHC 凋얏렫奉∝除쿰렰렦zv凋얏렫奉∝除쿰렰렦zvB 1111000010111101101111101110011010001110101110011101110011100101101000011111000011110000101101101100010011110001100011101011110110001110101101010111101001110110111100001011110110111110111001101000111010111001110111001110010110100001111100001111000010110110110001001111000110001110101111011000111010110101011110100111011001000010 f0bdbee68eb9dce5a1f0f0b6c4f18ebd8eb57a76f0bdbee68eb9dce5a1f0f0b6c4f18ebd8eb57a7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)