Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN ??詐漿ぜ?竊?}v??詐漿ぜ?竊?}vB 0011111100111111100011011011110010011111111101111000001010111010001111111110001010000110001111110111110101110110001111110011111110001101101111001001111111110111100000101011101000111111111000101000011000111111011111010111011001000010 3f3f8dbc9ff782ba3fe2863f7d763f3f8dbc9ff782ba3fe2863f7d7642
EUC-JP ??詐漿ぜ?竊?}v??詐漿ぜ?竊?}vB 0011111100111111101110101011111011011110111110011010010010111100001111111110001111100110001111110111110101110110001111110011111110111010101111101101111011111001101001001011110000111111111000111110011000111111011111010111011001000010 3f3fbabedef9a4bc3fe3e63f7d763f3fbabedef9a4bc3fe3e63f7d7642
UTF-8 뤰쨴詐漿ぜ곧竊켉}v뤰쨴詐漿ぜ곧竊켉}vB 1110101110100100101100001110110010101000101101001110100010101001100100001110011010111100101111111110001110000001100111001110101010110011101001111110011110101011100010101110110010111100100010010111110101110110111010111010010010110000111011001010100010110100111010001010100110010000111001101011110010111111111000111000000110011100111010101011001110100111111001111010101110001010111011001011110010001001011111010111011001000010 eba4b0eca8b4e8a990e6bcbfe3819ceab3a7e7ab8aecbc897d76eba4b0eca8b4e8a990e6bcbfe3819ceab3a7e7ab8aecbc897d7642
UHC 뤰쨴詐漿ぜ곧竊켉}v뤰쨴詐漿ぜ곧竊켉}vB 10001111110111101010010010001110110111101111000111101101111011001010101010111100101100001111000011101111101111001011000101000110011111010111011010001111110111101010010010001110110111101111000111101101111011001010101010111100101100001111000011101111101111001011000101000110011111010111011001000010 8fdea48edef1edecaabcb0f0efbcb1467d768fdea48edef1edecaabcb0f0efbcb1467d7642

SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
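
The conversion shown in the table can be approximated with a minimal Python sketch. It is not this page's actual implementation. The codec names are assumptions: Python's cp932 stands in for SJIS-WIN and cp949 for UHC. The helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the ISO-8859-1 row.

```python
# Map the table's charset labels to Python codec names.
# Assumption: cp932 corresponds to SJIS-WIN and cp949 to UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("ぜ}vB").items():
        print(name, bits, hex_string)
```

Calling encode_row("}vB", "utf-8") yields the same trailing bytes, 7d7642, that end every row of the table, because those three ASCII characters are encoded identically in all five charsets.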