To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??幽??娃??溢g?儀??如??油 001111110011111100111111111000101000011000111111001111111001011101001000001111110011111110001000101000010011111100111111100010001110110010000010100001110011111110001011010101100011111100111111100101000100000000111111001111111001011011111011 3f3f3fe2863f3f97483f3f88a13f3f88ec82873f8b563f3f94403f3f96fb
EUC-JP ???竊??幽??娃??溢g?儀??如??油 001111110011111100111111111000111110011000111111001111111100110110101001001111110011111110110000101000110011111100111111101100001110111010100011111001110011111110110101101101110011111100111111110001111010000100111111001111111100110011111101 3f3f3fe3e63f3fcda93f3fb0a33f3fb0eea3e73fb5b73f3fc7a13f3fccfd
UTF-8 捻뀁뮆竊섉꼷幽꾩낄娃븐슦溢g솾儀뺧폍如싳궚油 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010001001111010101011110010110111111001011011100110111101111010101011111010101001111010111000001010000100111001011010100010000011111010111011100010010000111011001000101010100110111001101011101010100010111011111011110110000111111011001000011010111110111001011000010010000000111010111011101010100111111011011000111110001101111001011010011010000010111011001000101110110011111010101011011010011010111001101011001010111001 efa6a4eb8081ebae86e7ab8aec8489eabcb7e5b9bdeabea9eb8284e5a883ebb890ec8aa6e6baa2efbd87ec86bee58480ebbaa7ed8f8de5a682ec8bb3eab69ae6b2b9
UHC 捻뀁뮆竊섉꼷幽꾩낄娃븐슦溢g솾儀뺧폍如싳궚油 1110011011110111101100101110110010010010100101011110111110111100100110001110011010000100100011111110101011101011100001001110110010110011101001011110100011011111101110101110110010011010101100001110110011101110101000111110011110011001101100101110101111110000100101011110111110111100100110001110010111111101100110101110110010000010101011111110101011111010 e6f7b2ec9295efbc98e6848feaeb84ecb3a5e8dfbaec9ab0eceea3e799b2ebf095efbc98e5fd9aec82afeafa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)