To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???泣??袁ъ?[???泣??袁ъ?[^ 001111110011111100111111100010111000001100111111001111111110010111001101100001001000110000111111010110110011111100111111001111111000101110000011001111110011111111100101110011011000010010001100001111110101101101011110 3f3f3f8b833f3fe5cd848c3f5b3f3f3f8b833f3fe5cd848c3f5b5e
EUC-JP ???泣??袁ъ?[???泣??袁ъ?[^ 001111110011111100111111101101011110001100111111001111111110101011001111101001111110110000111111010110110011111100111111001111111011010111100011001111110011111111101010110011111010011111101100001111110101101101011110 3f3f3fb5e33f3feacfa7ec3f5b3f3f3fb5e33f3feacfa7ec3f5b5e
UTF-8 黎싰쒀泣됪콢袁ъ뒌[黎싰쒀泣됪콢袁ъ뒌[^ 11101111101001101000100111101100100010111011000011101100100100101000000011100110101100111010001111101011100100001010101011101100101111011010001011101000101000101000000111010001100010101110101110010010100011000101101111101111101001101000100111101100100010111011000011101100100100101000000011100110101100111010001111101011100100001010101011101100101111011010001011101000101000101000000111010001100010101110101110010010100011000101101101011110 efa689ec8bb0ec9280e6b3a3eb90aaecbda2e8a281d18aeb928c5befa689ec8bb0ec9280e6b3a3eb90aaecbda2e8a281d18aeb928c5b5e
UHC 黎싰쒀泣됪콢袁ъ뒌[黎싰쒀泣됪콢袁ъ뒌[^ 111001101011000110011010111010101011111010101100111010111110100010001001111001101011000110011010111010101011111010101100111011001000101010001001010110111110011010110001100110101110101010111110101011001110101111101000100010011110011010110001100110101110101010111110101011001110110010001010100010010101101101011110 e6b19aeabeacebe889e6b19aeabeacec8a895be6b19aeabeacebe889e6b19aeabeacec8a895b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
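
The binary and hexadecimal columns of a table like the one above can be reproduced roughly as follows. This is a minimal sketch, not the converter's actual implementation: the `CODECS` mapping and the `encode_row` helper are hypothetical names, and the Python codec names are assumed to be the closest equivalents of the charset labels (for example, `cp932` for SJIS-WIN and `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These are approximations; the original tool may use slightly different tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # errors="replace" turns unencodable characters into "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        bits, hexa = encode_row("泣[^", codec)
        print(label, bits, hexa)
```

For example, `encode_row("泣", "utf-8")` gives the hex string `e6b3a3`, while the same character yields `8b83` under `cp932` and `b5e3` under `euc_jp`, matching the byte sequences that appear in the corresponding rows.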