To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 橈??撓??節ヨ?徇??節??訝??敖??B 100111101111010000111111001111111001110110011010001111110011111110010000110111111000001110001000001111111001110001101101001111110011111110010000110111110011111100111111111001100110001000111111001111111001110111000010001111110011111101000010 9ef43f3f9d9a3f3f90df83883f9c6d3f3f90df3f3fe6623f3f9dc23f3f42
EUC-JP 橈??撓??節ヨ?徇??節??訝??敖??B 110111001111011000111111001111111101100111111010001111110011111111000000111000011010010111101000001111111101011111001110001111110011111111000000111000010011111100111111111010111100001100111111001111111101101011000100001111110011111101000010 dcf63f3fd9fa3f3fc0e1a5e83fd7ce3f3fc0e13f3febc33f3fdac43f3f42
UTF-8 橈놅슛撓듾말節ヨ쎗徇뚪쐠節면뿂訝삼쉥敖얇걣B 11100110101010011000100011101011100001101000010111101100100010101001101111100110100100101001001111101011100100111011111011101011101001111001000011100111101011111000000011100011100000111010100011101100100011101001011111100101101111101000011111101011100110101010101011101100100100001010000011100111101011111000000011101011101010011011010011101011101111111000001011101000101010001001110111101100100000101011110011101100100010011010010111100110100101011001011011101100100101101000011111101010101100011010001101000010 e6a988eb8685ec8a9be69293eb93beeba790e7af80e383a8ec8e97e5be87eb9aaaec90a0e7af80eba9b4ebbf82e8a89dec82bcec89a5e69596ec9687eab1a342
UHC 橈놅슛撓듾말節ヨ쎗徇뚪쐠節면뿂訝삼쉥敖얇걣B 11101000111110101000011011101111101111011011100011101000111101011000101011100100101110001011101111101111101111011010101111101000100110111011111011100010110111111000110011101001100111001000011011101111101111011011100011101001100101111000101011100100101110001011101111101111101111011010101111100111111110011011111011100011100000011000110001000010 e8fa86efbdb8e8f58ae4b8bbefbdabe89bbee2df8ce99c86efbdb8e9978ae4b8bbefbdabe7f9bee3818c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)