To what bit string is a character (or a string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ???偃??暗??v???偃??暗??vB 00111111001111110011111110011000111011100011111100111111100010001100001100111111001111110111011000111111001111110011111110011000111011100011111100111111100010001100001100111111001111110111011001000010 3f3f3f98ee3f3f88c33f3f763f3f3f98ee3f3f88c33f3f7642
EUC-JP ???偃??暗??v???偃??暗??vB 00111111001111110011111111010000111100000011111100111111101100001100010100111111001111110111011000111111001111110011111111010000111100000011111100111111101100001100010100111111001111110111011001000010 3f3f3fd0f03f3fb0c53f3f763f3f3fd0f03f3fb0c53f3f7642
UTF-8 女앸젙偃띾젽暗싳텋v女앸젙偃띾젽暗싳텋vB 111011111010011010000001111011001001010110111000111011001010000010011001111001011000000110000011111010111001110110111110111011001010000010111101111001101001101010010111111011001000101110110011111011011000010110001011011101101110111110100110100000011110110010010101101110001110110010100000100110011110010110000001100000111110101110011101101111101110110010100000101111011110011010011010100101111110110010001011101100111110110110000101100010110111011001000010 efa681ec95b8eca099e58183eb9dbeeca0bde69a97ec8bb3ed858b76efa681ec95b8eca099e58183eb9dbeeca0bde69a97ec8bb3ed858b7642
UHC 女앸젙偃띾젽暗싳텋v女앸젙偃띾젽暗싳텋vB 111001011111110010011101111010111010000010010101111001011110011110001101111010111010000010101111111001001101111010011010111011001011011010001000011101101110010111111100100111011110101110100000100101011110010111100111100011011110101110100000101011111110010011011110100110101110110010110110100010000111011001000010 e5fc9deba095e5e78deba0afe4de9aecb68876e5fc9deba095e5e78deba0afe4de9aecb6887642

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
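
The same kind of conversion can be reproduced offline. The following is a minimal Python sketch, not the code behind this page. The function name encode_bits and the CHARSETS mapping are hypothetical, and the codec choices (cp932 for SJIS-WIN, cp949 for UHC) are assumptions about which standard-library codecs correspond to the charsets in the table. Replacing unencodable characters with "?" (0x3f) is likewise an assumption, chosen because it is consistent with the 3f bytes shown in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-WIN is treated as Windows code page 932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is treated as Windows code page 949
}


def encode_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "暗v"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column, as in the table above.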