Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??幽??馭??????洵→?柔?? 00111111001111110011111111100010100001100011111100111111100101110100100000111111001111111110100101100110001111110011111100111111001111110011111100111111100111111010101110000001101010000011111110001111010111110011111100111111 3f3f3fe2863f3f97483f3fe9663f3f3f3f3f3f9fab81a83f8f5f3f3f
EUC-JP ???竊??幽??馭??????洵→?柔?? 00111111001111110011111111100011111001100011111100111111110011011010100100111111001111111111000111000111001111110011111100111111001111110011111100111111110111101010110110100010101010100011111110111101110000000011111100111111 3f3f3fe3e63f3fcda93f3ff1c73f3f3f3f3f3fdeada2aa3fbdc03f3f
UTF-8 捻뀁뮆竊섉꼷幽낅뼀馭궽쎈쳸僚녹뼔洵→뿥柔⑺겫 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010001001111010101011110010110111111001011011100110111101111010111000001010000101111010111011110010000000111010011010011010101101111010101011011010111101111011001000111010001000111011001011001110111000111011111010011010111011111010111000010110111001111010111011110010010100111001101011010010110101111000101000011010010010111010111011111110100101111001101001111110010100111000101001000110111010111010101011001010101011 efa6a4eb8081ebae86e7ab8aec8489eabcb7e5b9bdeb8285ebbc80e9a6adeab6bdec8e88ecb3b8efa6bbeb85b9ebbc94e6b4b5e28692ebbfa5e69f94e291baeab2ab
UHC 捻뀁뮆竊섉꼷幽낅뼀馭궽쎈쳸僚녹뼔洵→뿥柔⑺겫 1110011011110111101100101110110010010010100101011110111110111100100110001110011010000100100011111110101011101011100001011110101110010110100010111110010111011111100000101100111010111101111010111010101110011011111010001110100010110011111011001001011010011100111000101110011110100001111001101001011110100101111010101111010110101001111011011000000110111010 e6f7b2ec9295efbc98e6848feaeb85eb968be5df82cebdebab9be8e8b3ec969ce2e7a1e697a5eaf5a9ed81ba

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
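
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The codec names in `CODECS` are assumptions (Python's `cp932` stands in for SJIS-Win and `cp949` for UHC), and the converter behind this page may use slightly different mapping tables. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what produces the runs of `3f` in the rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 approximates SJIS-Win and cp949 approximates UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" turns unencodable characters into "?" (0x3f).
        raw = text.encode(codec, errors="replace")
        # Render every byte as exactly eight binary digits.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in bit_strings("日").items():
        print(label, binary, hexadecimal)
```

With an ASCII input such as "A", every charset listed here yields the same single byte (`01000001`, `41`). The rows only diverge for characters outside ASCII.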