To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 五????????}v五????????}vB 10001100110111000011111100111111001111110011111100111111001111110011111100111111011111010111011010001100110111000011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 8cdc3f3f3f3f3f3f3f3f7d768cdc3f3f3f3f3f3f3f3f7d7642
EUC-JP 五??濚?????}v五??濚?????}vB 1011100011011110001111110011111110001111110010011010000100111111001111110011111100111111001111110111110101110110101110001101111000111111001111111000111111001001101000010011111100111111001111110011111100111111011111010111011001000010 b8de3f3f8fc9a13f3f3f3f3f7d76b8de3f3f8fc9a13f3f3f3f3f7d7642
UTF-8 五욁꺈濚띹몘掠끿쭓}v五욁꺈濚띹몘掠끿쭓}vB 1110010010111010100101001110110010011010100000011110101010111010100010001110011010111111100110101110101110011101101110011110101110101010100110001110111110100101101101011110101110000001101111111110110010101101100100110111110101110110111001001011101010010100111011001001101010000001111010101011101010001000111001101011111110011010111010111001110110111001111010111010101010011000111011111010010110110101111010111000000110111111111011001010110110010011011111010111011001000010 e4ba94ec9a81eaba88e6bf9aeb9db9ebaa98efa5b5eb81bfecad937d76e4ba94ec9a81eaba88e6bf9aeb9db9ebaa98efa5b5eb81bfecad937d7642
UHC 五욁꺈濚띹몘掠끿쭓}v五욁꺈濚띹몘掠끿쭓}vB 1110011111101001100111101110001110000011101011111110011110111001100011011110100010010001100001101110010110110001100001011110011110100111100010110111110101110110111001111110100110011110111000111000001110101111111001111011100110001101111010001001000110000110111001011011000110000101111001111010011110001011011111010111011001000010 e7e99ee383afe7b98de89186e5b185e7a78b7d76e7e99ee383afe7b98de89186e5b185e7a78b7d7642

SJIS-Win, EUC-JP: Legacy character encodings for Japanese, mainly used on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
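
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the function name `encode_report` is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the table appears to do.

```python
# Assumed mapping from the labels used in the table to Python codec names.
# Python's codecs may differ in edge cases from the converter on this page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_report("五B"):
        print(label, bits, hex_string)
```

For the input "五B", the SJIS-WIN row should begin with 8cdc and the EUC-JP row with b8de, as in the table above. The ISO-8859-1 row should begin with 3f because that charset has no code for 五.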