To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 絶??臟??掌???⑧?臟??掌???⑧?帳 100100001110001000111111001111111110010001100110001111110011111110001111101101100011111100111111001111111000011101000111001111111110010001100110001111110011111110001111101101100011111100111111001111111000011101000111001111111001001010100000 90e23f3fe4663f3f8fb63f3f3f87473fe4663f3f8fb63f3f3f87473f92a0
EUC-JP 絶??臟??掌?????臟??掌?????帳 11000000111001000011111100111111111001111100011100111111001111111011111010111000001111110011111100111111001111110011111111100111110001110011111100111111101111101011100000111111001111110011111100111111001111111100010010100010 c0e43f3fe7c73f3fbeb83f3f3f3f3fe7c73f3fbeb83f3f3f3f3fc4a2
UTF-8 絶앾풙臟랂궘掌싵맖狀⑧닖臟묔맖掌싵맖狀⑧닖帳 111001111011010110110110111011001001010110111110111011011001001010011001111010001000011110011111111010111001111010000010111010101011011010011000111001101000111010001100111011001000101110110101111010111010011110010110111011111010011110111010111000101001000110100111111010111000101110010110111010001000011110011111111010111010110010010100111010111010011110010110111001101000111010001100111011001000101110110101111010111010011110010110111011111010011110111010111000101001000110100111111010111000101110010110111001011011100010110011 e7b5b6ec95beed9299e8879feb9e82eab698e68e8cec8bb5eba796efa7bae291a7eb8b96e8879febac94eba796e68e8cec8bb5eba796efa7bae291a7eb8b96e5b8b3
UHC 絶앾풙臟랂궘掌싵맖狀⑧닖臟묔맖掌싵맖狀⑧닖帳 1110111110111110100111011110111110111110100111001110110111110100100011011110111010000010101011011110110111100110100110101110111010010000101010001110110111101110101010001110111010001000100110101110110111110100100100011110111010010000101010001110110111100110100110101110111010010000101010001110110111101110101010001110111010001000100110101110110111100011 efbe9defbe9cedf48dee82adede69aee90a8edeea8ee889aedf491ee90a8ede69aee90a8edeea8ee889aede3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)