To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 厭??一℡?寃??v厭??一℡?寃??vB 1000100101111101001111110011111110001000111010101000011110000100001111111001101110000011001111110011111101110110100010010111110100111111001111111000100011101010100001111000010000111111100110111000001100111111001111110111011001000010 897d3f3f88ea87843f9b833f3f76897d3f3f88ea87843f9b833f3f7642
EUC-JP 厭??一??寃??v厭??一??寃??vB 101100011101111000111111001111111011000011101100001111110011111111010101111000110011111100111111011101101011000111011110001111110011111110110000111011000011111100111111110101011110001100111111001111110111011001000010 b1de3f3fb0ec3f3fd5e33f3f76b1de3f3fb0ec3f3fd5e33f3f7642
UTF-8 厭얜쨸一℡뀻寃썼솊v厭얜쨸一℡뀻寃썼솊vB 111001011000111010101101111011001001011010011100111011001010100010111000111001001011100010000000111000101000010010100001111010111000000010111011111001011010111110000011111011001000110110111100111011001000011010001010011101101110010110001110101011011110110010010110100111001110110010101000101110001110010010111000100000001110001010000100101000011110101110000000101110111110010110101111100000111110110010001101101111001110110010000110100010100111011001000010 e58eadec969ceca8b8e4b880e284a1eb80bbe5af83ec8dbcec868a76e58eadec969ceca8b8e4b880e284a1eb80bbe5af83ec8dbcec868a7642
UHC 厭얜쨸一℡뀻寃썼솊v厭얜쨸一℡뀻寃썼솊vB 111001101111010010111110111010111010010010010010111011001110100110100010111001011000010110110001111010101011001010111101111010001001100110001110011101101110011011110100101111101110101110100100100100101110110011101001101000101110010110000101101100011110101010110010101111011110100010011001100011100111011001000010 e6f4beeba492ece9a2e585b1eab2bde8998e76e6f4beeba492ece9a2e585b1eab2bde8998e7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
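
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the function name encode_table. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes seen in the table, but individual mappings may differ from the converter's.

```python
# Charset labels from the table mapped to Python codec names.
# These pairings are assumptions; the converter may use different mapping tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("一B"):
        print(label, bits, hex_string)
```

For the input "一B", the UTF-8 row should read e4b88042, consistent with the e4b880 and 42 byte sequences that appear in the table above.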