To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????F}v???????F}vB 001111110011111100111111001111110011111100111111001111110100011001111101011101100011111100111111001111110011111100111111001111110011111101000110011111010111011001000010 3f3f3f3f3f3f3f467d763f3f3f3f3f3f3f467d7642
SJIS-WIN 誾゙骼ェ式竟識F}v誾゙骼ェ式竟識F}vB 11111011101001111101111011101001100011101010101010001110101011101110100011101101100011101010111101000110011111010111011011111011101001111101111011101001100011101010101010001110101011101110100011101101100011101010111101000110011111010111011001000010 fba7dee98eaa8eaee8ed8eaf467d76fba7dee98eaa8eaee8ed8eaf467d7642
EUC-JP 誾゙骼ェ式竟識F}v誾゙骼ェ式竟識F}vB 10001111110111101010010010001110110111101111000111101110100011101010101010111100101100001111000011101111101111001011000101000110011111010111011010001111110111101010010010001110110111101111000111101110100011101010101010111100101100001111000011101111101111001011000101000110011111010111011001000010 8fdea48edef1ee8eaabcb0f0efbcb1467d768fdea48edef1ee8eaabcb0f0efbcb1467d7642
UTF-8 誾゙骼ェ式竟識F}v誾゙骼ェ式竟識F}vB 11101000101010101011111011101111101111101001111011101001101010101011110011101111101111011010101011100101101111001000111111100111101010111001111111101000101011011001100001000110011111010111011011101000101010101011111011101111101111101001111011101001101010101011110011101111101111011010101011100101101111001000111111100111101010111001111111101000101011011001100001000110011111010111011001000010 e8aabeefbe9ee9aabcefbdaae5bc8fe7ab9fe8ad98467d76e8aabeefbe9ee9aabcefbdaae5bc8fe7ab9fe8ad98467d7642
UHC 誾???式竟識F}v誾???式竟識F}vB 1110101111011101001111110011111100111111111000111101001011001100111001011110001111011011010001100111110101110110111010111101110100111111001111110011111111100011110100101100110011100101111000111101101101000110011111010111011001000010 ebdd3f3f3fe3d2cce5e3db467d76ebdd3f3f3fe3d2cce5e3db467d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)