To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 弱??鎰?┐???[弱??鎰?┐???[^ 100011101110001100111111001111111110100001001100001111111000010010100010001111110011111100111111010110111000111011100011001111110011111111101000010011000011111110000100101000100011111100111111001111110101101101011110 8ee33f3fe84c3f84a23f3f3f5b8ee33f3fe84c3f84a23f3f3f5b5e
EUC-JP 弱??鎰?┐???[弱??鎰?┐???[^ 101111001110010100111111001111111110111110101101001111111010100010100100001111110011111100111111010110111011110011100101001111110011111111101111101011010011111110101000101001000011111100111111001111110101101101011110 bce53f3fefad3fa8a43f3f3f5bbce53f3fefad3fa8a43f3f3f5b5e
UTF-8 弱뉗빢鎰륅┐紐뚰뮇[弱뉗빢鎰륅┐紐뚰뮇[^ 111001011011110010110001111010111000100110010111111010111011100110100010111010011000111010110000111010111010010110000101111000101001010010010000111011111010011110001111111010111001101010110000111010111010111010000111010110111110010110111100101100011110101110001001100101111110101110111001101000101110100110001110101100001110101110100101100001011110001010010100100100001110111110100111100011111110101110011010101100001110101110101110100001110101101101011110 e5bcb1eb8997ebb9a2e98eb0eba585e29490efa78feb9ab0ebae875be5bcb1eb8997ebb9a2e98eb0eba585e29490efa78feb9ab0ebae875b5e
UHC 弱뉗빢鎰륅┐紐뚰뮇[弱뉗빢鎰륅┐紐뚰뮇[^ 111001011011000010000111111011001001010110111110111011001111000010001111111011111010011010100100111010111010101010001100111011011001001010010110010110111110010110110000100001111110110010010101101111101110110011110000100011111110111110100110101001001110101110101010100011001110110110010010100101100101101101011110 e5b087ec95beecf08fefa6a4ebaa8ced92965be5b087ec95beecf08fefa6a4ebaa8ced92965b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)