To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 潁??泣ч?節??v潁??泣ч?節??vB 1001111111110001001111110011111110001011100000111000010010001001001111111001000011011111001111110011111101110110100111111111000100111111001111111000101110000011100001001000100100111111100100001101111100111111001111110111011001000010 9ff13f3f8b8384893f90df3f3f769ff13f3f8b8384893f90df3f3f7642
EUC-JP 潁??泣ч?節??v潁??泣ч?節??vB 1101111011110011001111110011111110110101111000111010011111101001001111111100000011100001001111110011111101110110110111101111001100111111001111111011010111100011101001111110100100111111110000001110000100111111001111110111011001000010 def33f3fb5e3a7e93fc0e13f3f76def33f3fb5e3a7e93fc0e13f3f7642
UTF-8 潁뺣굢泣ч쪛節덊뱿v潁뺣굢泣ч쪛節덊뱿vB 11100110101111011000000111101011101110101010001111101010101101011010001011100110101100111010001111010001100001111110110010101010100110111110011110101111100000001110101110001101100010101110101110110001101111110111011011100110101111011000000111101011101110101010001111101010101101011010001011100110101100111010001111010001100001111110110010101010100110111110011110101111100000001110101110001101100010101110101110110001101111110111011001000010 e6bd81ebbaa3eab5a2e6b3a3d187ecaa9be7af80eb8d8aebb1bf76e6bd81ebbaa3eab5a2e6b3a3d187ecaa9be7af80eb8d8aebb1bf7642
UHC 潁뺣굢泣ч쪛節덊뱿v潁뺣굢泣ч쪛節덊뱿vB 111001111011100010010101111010111000001010001001111010111110100010101100111010011010010110010100111011111011110110001000111011011001001110100101011101101110011110111000100101011110101110000010100010011110101111101000101011001110100110100101100101001110111110111101100010001110110110010011101001010111011001000010 e7b895eb8289ebe8ace9a594efbd88ed93a576e7b895eb8289ebe8ace9a594efbd88ed93a57642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)