To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 葯?????僥??v葯?????僥??vB 11100100110111100011111100111111001111110011111100111111100110010100011000111111001111110111011011100100110111100011111100111111001111110011111100111111100110010100011000111111001111110111011001000010 e4de3f3f3f3f3f99463f3f76e4de3f3f3f3f3f99463f3f7642
EUC-JP 葯??獒??僥??v葯??獒??僥??vB 1110100011100000001111110011111110001111110010111011101100111111001111111101000110100111001111110011111101110110111010001110000000111111001111111000111111001011101110110011111100111111110100011010011100111111001111110111011001000010 e8e03f3f8fcbbb3f3fd1a73f3f76e8e03f3f8fcbbb3f3fd1a73f3f7642
UTF-8 葯볡ㅊ獒붻쾷僥㏝눀v葯볡ㅊ獒붻쾷僥㏝눀vB 111010001001000110101111111010111011001110100001111000111000010110001010111001111000110110010010111010111011011010111011111011001011111010110111111001011000001110100101111000111000111110011101111010111000100010000000011101101110100010010001101011111110101110110011101000011110001110000101100010101110011110001101100100101110101110110110101110111110110010111110101101111110010110000011101001011110001110001111100111011110101110001000100000000111011001000010 e891afebb3a1e3858ae78d92ebb6bbecbeb7e583a5e38f9deb888076e891afebb3a1e3858ae78d92ebb6bbecbeb7e583a5e38f9deb88807642
UHC 葯볡ㅊ獒붻쾷僥㏝눀v葯볡ㅊ獒붻쾷僥㏝눀vB 111001011011010110010011111001111010010010111010111010001010001110010100111010001011001010001101111010001110100110100111111010011000011110100001011101101110010110110101100100111110011110100100101110101110100010100011100101001110100010110010100011011110100011101001101001111110100110000111101000010111011001000010 e5b593e7a4bae8a394e8b28de8e9a7e987a176e5b593e7a4bae8a394e8b28de8e9a7e987a17642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)