To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??洵→?幽??馭??幽??野??乳?? 1110101001011111001111110011111110011111101010111000000110101000001111111001011101001000001111110011111111101001011001100011111100111111100101110100100000111111001111111001011011101100001111110011111110010011111110110011111100111111 ea5f3f3f9fab81a83f97483f3fe9663f3f97483f3f96ec3f3f93fb3f3f
EUC-JP 鸚??洵→?幽??馭??幽??野??乳?? 1111001111000000001111110011111111011110101011011010001010101010001111111100110110101001001111110011111111110001110001110011111100111111110011011010100100111111001111111100110011101110001111110011111111000110111111010011111100111111 f3c03f3fdeada2aa3fcda93f3ff1c73f3fcda93f3fccee3f3fc6fd3f3f
UTF-8 鸚쒓퍓洵→룚幽뚯춪馭앮끽幽껋뜪野껋럩乳삣럡 111010011011100010011010111011001001001010010011111011011000110110010011111001101011010010110101111000101000011010010010111010111010001110011010111001011011100110111101111010111001101010101111111011001011011010101010111010011010011010101101111011001001010110101110111010111000000110111101111001011011100110111101111010101011101110001011111010111001110010101010111010011000011110001110111010101011101110001011111010111001111110101001111001001011100110110011111011001000001010100011111010111001111110100001 e9b89aec9293ed8d93e6b4b5e28692eba39ae5b9bdeb9aafecb6aae9a6adec95aeeb81bde5b9bdeabb8beb9caae9878eeabb8beb9fa9e4b9b3ec82a3eb9fa1
UHC 鸚쒓퍓洵→룚幽뚯춪馭앮끽幽껋뜪野껋럩乳삣럡 111001011010010010011100111010101011101110001010111000101110011110100001111001101000111110010110111010101110101110001100111011001010110110000111111001011101111110011101111001101011001110100011111010101110101110000011111011001000110110101011111001011010111110000011111011001000111010001100111010101110000110111011111001011000111010000100 e5a49ceabb8ae2e7a1e68f96eaeb8cecad87e5df9de6b3a3eaeb83ec8dabe5af83ec8e8ceae1bbe58e84

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)