To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 鰐?鰐碩鰐??鰐?}鰐?鰐碩鰐??鰐?{^ 10011000011010110011111110011000011010111001000011010111100110000110101100111111001111111001100001101011001111110111110110011000011010110011111110011000011010111001000011010111100110000110101100111111001111111001100001101011001111110111101101011110 986b3f986b90d7986b3f3f986b3f7d986b3f986b90d7986b3f3f986b3f7b5e
EUC-JP 鰐?鰐碩鰐??鰐絪}鰐?鰐碩鰐??鰐絪{^ 1100111111001100001111111100111111001100110000001101100111001111110011000011111100111111110011111100110010001111110100111110110001111101110011111100110000111111110011111100110011000000110110011100111111001100001111110011111111001111110011001000111111010011111011000111101101011110 cfcc3fcfccc0d9cfcc3f3fcfcc8fd3ec7dcfcc3fcfccc0d9cfcc3f3fcfcc8fd3ec7b5e
UTF-8 鰐溺鰐碩鰐솽솰鰐絪}鰐溺鰐碩鰐솽솰鰐絪{^ 111010011011000010010000111011111010011110101100111010011011000010010000111001111010001010101001111010011011000010010000111011001000011010111101111011001000011010110000111010011011000010010000111001111011010110101010011111011110100110110000100100001110111110100111101011001110100110110000100100001110011110100010101010011110100110110000100100001110110010000110101111011110110010000110101100001110100110110000100100001110011110110101101010100111101101011110 e9b090efa7ace9b090e7a2a9e9b090ec86bdec86b0e9b090e7b5aa7de9b090efa7ace9b090e7a2a9e9b090ec86bdec86b0e9b090e7b5aa7b5e
UHC 鰐溺鰐碩鰐솽솰鰐絪}鰐溺鰐碩鰐솽솰鰐絪{^ 111001001100101011101100110010101110010011001010111000001011010111100100110010101011110011100001101111001110000011100100110010101110110011011111011111011110010011001010111011001100101011100100110010101110000010110101111001001100101010111100111000011011110011100000111001001100101011101100110111110111101101011110 e4caeccae4cae0b5e4cabce1bce0e4caecdf7de4caeccae4cae0b5e4cabce1bce0e4caecdf7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)