To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???夜??弱?ア節ョア節わ?厓э?B 0011111100111111001111111001011011101001001111110011111110001110111000110011111110000011010000011001000011011111100000111000011110000011010000011001000011011111100000101110110100111111111110101000110110000100100011110011111101000010 3f3f3f96e93f3f8ee33f834190df8387834190df82ed3ffa8d848f3f42
EUC-JP ???夜??弱?ア節ョア節わ?厓э?B 001111110011111100111111110011001110101100111111001111111011110011100101001111111010010110100010110000001110000110100101111001111010010110100010110000001110000110100100111011110011111110001111101101001100011110100111111011110011111101000010 3f3f3fcceb3f3fbce53fa5a2c0e1a5e7a5a2c0e1a4ef3f8fb4c7a7ef3f42
UTF-8 若듸슈夜쇽쉼弱녺ア節ョア節わ풊厓э푵B 111011111010010110110100111010111001001110111000111011001000101010001000111001011010010010011100111011001000011110111101111011001000100110111100111001011011110010110001111010111000010110111010111000111000001010100010111001111010111110000000111000111000001110100111111000111000001010100010111001111010111110000000111000111000001010001111111011011001001010001010111001011000111010010011110100011000110111101101100100011011010101000010 efa5b4eb93b8ec8a88e5a49cec87bdec89bce5bcb1eb85bae382a2e7af80e383a7e382a2e7af80e3828fed928ae58e93d18ded91b542
UHC 若듸슈夜쇽쉼弱녺ア節ョア節わ풊厓э푵B 11100101101011101011010111101111101111011011010011100101101010001011110011101111101111011011000011100101101100001000011011100111101010111010001011101111101111011010101111100111101010111010001011101111101111011010101011101111101111101001000011100100111011011010110011101111101111101000001101000010 e5aeb5efbdb4e5a8bcefbdb0e5b086e7aba2efbdabe7aba2efbdaaefbe90e4edacefbe8342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)