To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??幽??娃??溢g?儀??暗??爾 001111110011111100111111111000101000011000111111001111111001011101001000001111110011111110001000101000010011111100111111100010001110110010000010100001110011111110001011010101100011111100111111100010001100001100111111001111111000111010100010 3f3f3fe2863f3f97483f3f88a13f3f88ec82873f8b563f3f88c33f3f8ea2
EUC-JP ???竊??幽??娃??溢g?儀??暗??爾 001111110011111100111111111000111110011000111111001111111100110110101001001111110011111110110000101000110011111100111111101100001110111010100011111001110011111110110101101101110011111100111111101100001100010100111111001111111011110010100100 3f3f3fe3e63f3fcda93f3fb0a33f3fb0eea3e73fb5b73f3fb0c53f3fbca4
UTF-8 捻뀁뮆竊섉꼷幽꾩낄娃븐슦溢g솾儀뺧폍暗싲쵆爾 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010001001111010101011110010110111111001011011100110111101111010101011111010101001111010111000001010000100111001011010100010000011111010111011100010010000111011001000101010100110111001101011101010100010111011111011110110000111111011001000011010111110111001011000010010000000111010111011101010100111111011011000111110001101111001101001101010010111111011001000101110110010111011001011010110000110111001111000100010111110 efa6a4eb8081ebae86e7ab8aec8489eabcb7e5b9bdeabea9eb8284e5a883ebb890ec8aa6e6baa2efbd87ec86bee58480ebbaa7ed8f8de69a97ec8bb2ecb586e788be
UHC 捻뀁뮆竊섉꼷幽꾩낄娃븐슦溢g솾儀뺧폍暗싲쵆爾 1110011011110111101100101110110010010010100101011110111110111100100110001110011010000100100011111110101011101011100001001110110010110011101001011110100011011111101110101110110010011010101100001110110011101110101000111110011110011001101100101110101111110000100101011110111110111100100110001110010011011110100110101110101110101100100010001110110010110011 e6f7b2ec9295efbc98e6848feaeb84ecb3a5e8dfbaec9ab0eceea3e799b2ebf095efbc98e4de9aebac88ecb3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)