To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????v??????????vB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101110110001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f7642
SJIS-WIN 畯???雨企??醫?v畯???雨企??醫?vB 11111011011011110011111100111111001111111000100101001010100010101110100100111111001111111110011111001110001111110111011011111011011011110011111100111111001111111000100101001010100010101110100100111111001111111110011111001110001111110111011001000010 fb6f3f3f3f894a8ae93f3fe7ce3f76fb6f3f3f3f894a8ae93f3fe7ce3f7642
EUC-JP 畯???雨企??醫?v畯???雨企??醫?vB 100011111100110110111011001111110011111100111111101100011010101110110100111010110011111100111111111011101101000000111111011101101000111111001101101110110011111100111111001111111011000110101011101101001110101100111111001111111110111011010000001111110111011001000010 8fcdbb3f3f3fb1abb4eb3f3feed03f768fcdbb3f3f3fb1abb4eb3f3feed03f7642
UTF-8 畯얜렰렋雨企렲렪醫렡v畯얜렰렋雨企렲렪醫렡vB 111001111001010110101111111011001001011010011100111010111010000010110000111010111010000010001011111010011001101110101000111001001011110010000001111010111010000010110010111010111010000010101010111010011000011010101011111010111010000010100001011101101110011110010101101011111110110010010110100111001110101110100000101100001110101110100000100010111110100110011011101010001110010010111100100000011110101110100000101100101110101110100000101010101110100110000110101010111110101110100000101000010111011001000010 e795afec969ceba0b0eba08be99ba8e4bc81eba0b2eba0aae986abeba0a176e795afec969ceba0b0eba08be99ba8e4bc81eba0b2eba0aae986abeba0a17642
UHC 畯얜렰렋雨企렲렪醫렡v畯얜렰렋雨企렲렪醫렡vB 11110001111000011011111011101011100011101011110110001110101000101110100111101011110100001110101010001110101111111000111010111000111011001010001010001110101100100111011011110001111000011011111011101011100011101011110110001110101000101110100111101011110100001110101010001110101111111000111010111000111011001010001010001110101100100111011001000010 f1e1beeb8ebd8ea2e9ebd0ea8ebf8eb8eca28eb276f1e1beeb8ebd8ea2e9ebd0ea8ebf8eb8eca28eb27642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)