To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????v???????????vB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f3f7642
SJIS-WIN 鉤門ッゆ訷驍譴籵丞豐イv鉤門ッゆ訷驍譴籵丞豐イvB 11100111111010101001011011100101101011111000001011100100111110111010010011101001100000101110011010100010111000101110000010001111111001011110011010110010101100100111011011100111111010101001011011100101101011111000001011100100111110111010010011101001100000101110011010100010111000101110000010001111111001011110011010110010101100100111011001000010 e7ea96e5af82e4fba4e982e6a2e2e08fe5e6b2b276e7ea96e5af82e4fba4e982e6a2e2e08fe5e6b2b27642
EUC-JP 鉤門ッゆ訷驍譴籵丞豐イv鉤門ッゆ訷驍譴籵丞豐イvB 11101110111011001100110011100111100011101010111110100100111001101000111111011101110101001111000111100010111011001010010011100100111000101011111011100111111011001011010010001110101100100111011011101110111011001100110011100111100011101010111110100100111001101000111111011101110101001111000111100010111011001010010011100100111000101011111011100111111011001011010010001110101100100111011001000010 eeeccce78eafa4e68fddd4f1e2eca4e4e2bee7ecb48eb276eeeccce78eafa4e68fddd4f1e2eca4e4e2bee7ecb48eb27642
UTF-8 鉤門ッゆ訷驍譴籵丞豐イv鉤門ッゆ訷驍譴籵丞豐イvB 111010011000100110100100111010011001011010000000111011111011110110101111111000111000001010000110111010001010100010110111111010011010100110001101111010001010110110110100111001111011000110110101111001001011100010011110111010001011000110010000111011111011110110110010011101101110100110001001101001001110100110010110100000001110111110111101101011111110001110000010100001101110100010101000101101111110100110101001100011011110100010101101101101001110011110110001101101011110010010111000100111101110100010110001100100001110111110111101101100100111011001000010 e989a4e99680efbdafe38286e8a8b7e9a98de8adb4e7b1b5e4b89ee8b190efbdb276e989a4e99680efbdafe38286e8a8b7e9a98de8adb4e7b1b5e4b89ee8b190efbdb27642
UHC 鉤門?ゆ?驍譴?丞??v鉤門?ゆ?驍譴?丞??vB 11001111110010011101101010100110001111111010101011100110001111111111110110100100110011001011100100111111111000111010101000111111001111110111011011001111110010011101101010100110001111111010101011100110001111111111110110100100110011001011100100111111111000111010101000111111001111110111011001000010 cfc9daa63faae63ffda4ccb93fe3aa3f3f76cfc9daa63faae63ffda4ccb93fe3aa3f3f7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)