To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌Q??筌Q??筌Q??筌??爰??醫?? 111000101010001110000010011100000011111100111111111000101010001110000010011100000011111100111111111000101010001110000010011100000011111100111111111000101010001100111111001111111110000010100111001111110011111111100111110011100011111100111111 e2a382703f3fe2a382703f3fe2a382703f3fe2a33f3fe0a73f3fe7ce3f3f
EUC-JP 筌Q??筌Q??筌Q??筌??爰??醫?? 111001001010010110100011110100010011111100111111111001001010010110100011110100010011111100111111111001001010010110100011110100010011111100111111111001001010010100111111001111111110000010101001001111110011111111101110110100000011111100111111 e4a5a3d13f3fe4a5a3d13f3fe4a5a3d13f3fe4a53f3fe0a93f3feed03f3f
UTF-8 筌Q딆퐲筌Q뗭뒏筌Q딆퐲筌듐룂爰쇽㎖醫륁돸 111001111010110110001100111011111011110010110001111010111001010010000110111011011001000010110010111001111010110110001100111011111011110010110001111010111001011110101101111010111001001010001111111001111010110110001100111011111011110010110001111010111001010010000110111011011001000010110010111001111010110110001100111010111001001110010000111010111010001110000010111001111000100010110000111011001000011110111101111000111000111010010110111010011000011010101011111010111010010110000001111010111000111110111000 e7ad8cefbcb1eb9486ed90b2e7ad8cefbcb1eb97adeb928fe7ad8cefbcb1eb9486ed90b2e7ad8ceb9390eba382e788b0ec87bde38e96e986abeba581eb8fb8
UHC 筌Q딆퐲筌Q뗭뒏筌Q딆퐲筌듐룂爰쇽㎖醫륁돸 111011111010011110100011110100011000101011101100101111011001101111101111101001111010001111010001100010111110110010001010100011001110111110100111101000111101000110001010111011001011110110011011111011111010011110110101111000111000111110000011111010101011101010111100111011111010011110100010111011001010001010001111111011001000100110111011 efa7a3d18aecbd9befa7a3d18bec8a8cefa7a3d18aecbd9befa7b5e38f83eababcefa7a2eca28fec89bb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)